Principal Data Engineer
2 days ago
About Us

The Ksquare Group was established in 2014 in Dallas, TX. We are an international company with a presence in Mexico, the USA, India, and the Dominican Republic, with more than 250 Ksquarians and growing. We offer technological solutions through multidisciplinary Pods specifically designed to meet our customers' needs. Depending on the skills each project requires, the best professionals are selected to bring their talent together and achieve mutual goals. We genuinely care about the comfort, wellbeing, and career growth of all our employees, and we make sure that integrity, innovation, leadership, accountability, and collaboration guide each and every one of us. We believe in and trust all our outstanding professionals and recognize how special they are, but what makes us even more special is the family we have become through the years.

We are looking for a Principal Data Engineer, a senior technical leader and hands-on architect, to own the design and evolution of an AWS + Databricks enterprise lakehouse spanning ingestion (batch/streaming), transformation (canonical/Silver), and curated insights (Gold), with strong governance, security, performance, and cost controls. This role sets engineering standards, mentors teams, and drives scalable delivery across multiple domains and source systems.

Key Responsibilities:

Architecture & Platform Leadership
- Own end-to-end lakehouse architecture across Bronze/Silver/Gold and ensure best practices for:
  - Raw landing on S3 + Glue Catalog
  - Canonical modeling and transformations
  - Curated datasets for analytics/AI/consumption
- Define standards for data layout, partitioning, file sizing, compaction, and Iceberg table management.
- Establish platform patterns for batch + streaming ingestion, orchestration, and automated deployment.

Data Engineering Delivery (Hands-On + Leadership)
- Lead implementation of scalable pipelines in Databricks (PySpark/Spark SQL) and AWS (Glue, Lambda, Step Functions where needed).
- Design and maintain robust data models (canonical/Silver) and curated marts (Gold) optimized for analytics and downstream use.
- Ensure pipelines meet SLA, reliability, security, and cost objectives.

Streaming and Near Real-Time Enablement
- Define the approach for streaming ingestion and processing (e.g., Kafka/Kinesis patterns if applicable, Structured Streaming/micro-batch).
- Ensure correctness, idempotency, late-arriving data handling, and replay strategies.

Governance, Security & Compliance
- Implement security architecture: IAM, least privilege, encryption (KMS), secrets management, network controls.
- Integrate catalog/lineage/governance (e.g., Atlan) with standardized metadata practices.
- Establish data access patterns including RBAC/ABAC and controlled data sharing with partners.

Performance, Reliability & FinOps
- Drive optimization of Databricks clusters, job tuning, caching strategies, and query performance.
- Implement observability: pipeline metrics, logs, lineage, and incident runbooks.
- Own cost optimization strategy: autoscaling, cluster policies, workload isolation, storage optimization.

Engineering Excellence & Team Enablement
- Create reference architectures, coding standards, reusable libraries, and delivery playbooks.
- Mentor data engineers and reviewers; lead design reviews, code reviews, and production readiness checks.
- Collaborate with stakeholders across Analytics/BI, AI/ML, and product teams.

Must-Have Skills & Experience:
- 15+ years in Data Engineering with proven leadership owning enterprise-scale platforms.
- Expert-level Databricks: Spark architecture, PySpark optimization, Spark SQL, workflows, job orchestration, cluster policies.
- Deep AWS expertise: S3, Glue, Lake Formation (if used), IAM, CloudWatch, KMS, VPC/security controls.
- Strong experience with lakehouse table formats (Iceberg preferred; Delta, Hudi) and Parquet optimization.
- Strong architecture skills for data ingestion, canonical modeling, and curated layer design.
- Strong hands-on coding in Python and advanced SQL.
- Experience implementing CI/CD for data (Git branching, deployment automation, environment promotion).
- Experience designing for analytics consumption: semantic layer readiness, BI/Power BI integration patterns.

Nice-to-Have:
- Experience with Unity Catalog, multi-workspace governance, data sharing, and fine-grained access controls.
- Exposure to data virtualization patterns (semantic/virtualization layer) and federation strategies.
- AI/ML enablement experience (feature datasets, training data pipelines, governance for GenAI/LLM apps).
- Experience integrating enterprise apps (ERP, ServiceNow, Workday, factory systems like MES).

The Ksquare Group does not and shall not discriminate based on race, color, religion, gender, gender expression, age, national origin, disability, marital status, sexual orientation, or military status in any of its activities or operations. These activities include, but are not limited to, hiring and firing of staff, selection of volunteers and vendors, and provision of services. We are committed to providing an inclusive and welcoming environment for all members of our staff, clients, volunteers, subcontractors, and vendors. As an equal-opportunity employer, we will not discriminate and will take affirmative action measures to ensure against discrimination in employment, recruitment, advertisements for employment, compensation, termination, upgrading, promotions, and other conditions of employment against any employee or job applicant on the basis of the conditions previously mentioned.
-
Principal Data Engineer
2 weeks ago
Juárez, Juárez, Chih., México Motivus Full-time About Motivus: At Motivus, we believe in unlocking human potential through innovative, cutting-edge solutions. With over 1,600 team members across 5 countries, we provide a full spectrum of software and digital services. Our teams are dedicated to driving sustainability, pushing the boundaries of technology, and building the next generation of world-class...
-
Data Engineer: SQL/ETL
3 weeks ago
Juárez, Juárez, Chih., México NTT DATA Europe & Latam Full-time About NTT DATA At NTT DATA we are more than a technology company: we are a global team of more than 190,000 professionals with a presence in more than 50 countries. We work in key sectors such as telecommunications, financial services, industry, energy, the public sector, and healthcare, providing innovative solutions that drive digital transformation. ...
-
Principal Data Scientist
1 week ago
Juárez, Juárez, Chih., México Atos Full-time Principal Data Scientist Location: mainly Monterrey or Mexico City (remote position, but must be willing to come to the office occasionally when required, e.g. when upper management or clients from the US visit Mexico, etc.) Language: Fluent English Job Description: We are seeking an exceptional Principal Data Scientist with 10+ years of combined experience in...
-
Principal Data Scientist
3 days ago
Juárez, Juárez, Chih., México Atos Full-time Principal Data Scientist Location: mainly Monterrey or Mexico City (remote position, but must be willing to come to the office occasionally when required, e.g. when upper management or clients from the US visit Mexico, etc.) Language: Fluent English Job Description: We are seeking an exceptional Principal Data Scientist with 10+ years of combined experience in...
-
Data Engineer
6 days ago
Benito Juárez, CDMX, México IDS Comercial, S.A. de C.V. Full-time IDS is a Mexican company with 30 years of experience in the market. We are accredited at Level 5 of CMMI (Capability Maturity Model Integration), the quality model established by the Software Engineering Institute (SEI). We specialize in IT consulting, development, and training services. More than 700 consultants work with a great...
-
Data Engineer
13 hours ago
Benito Juárez, CDMX, México IDS Comercial, S.A. de C.V. Full-time IDS is a Mexican company with 30 years of experience in the market. We are accredited at Level 5 of CMMI (Capability Maturity Model Integration), the quality model established by the Software Engineering Institute (SEI). We specialize in IT consulting, development, and training services. More than 700 consultants work with a great...
-
Principal Java Engineer
3 days ago
Juárez, Juárez, Chih., México Persistent Systems Full-time About Persistent We are an AI-led, platform-driven Digital Engineering and Enterprise Modernization partner, combining deep technical expertise and industry experience to help our clients anticipate what's next. We work with many industry-leading organizations across the world, including 20 Fortune 50 companies and 4 of the 5 top banks in both the US and...
-
Principal Java Engineer
7 days ago
Juárez, Juárez, Chih., México Persistent Systems Full-time About Persistent We are an AI-led, platform-driven Digital Engineering and Enterprise Modernization partner, combining deep technical expertise and industry experience to help our clients anticipate what's next. We work with many industry-leading organizations across the world, including 20 Fortune 50 companies and 4 of the 5 top banks in both the US and...
-
Data Analyst
4 weeks ago
Juárez, Juárez, Chih., México Nfq Advisory, Solutions, Outsourcing Full-time Nfq is still growing! We are looking to add to our Data & Analytics (D&A) team a Data Analyst / Data Scientist focused on monitoring and tracking the performance of credit risk parameters. If you are passionate about data, advanced analytics, and the financial world, this is your opportunity to be part of a company that drives the making of...
-
Data Engineer
6 days ago
Benito Juárez, México Colektia Full-time Join the **Colektia** team as a _**Data Engineer**_, where your main objectives will be to facilitate data availability at **Colektia**, applying advanced tools to support strategic and operational decisions and ensuring the accuracy and accessibility of information. **Activities** - Design and build data pipelines -...