Data Engineer Senior

hace 5 días


Zapopan, México Derevo A tiempo completo

We are looking for your talent ✋

*** Data Engineer Senior**

**‍**

**El perfil deseado debe tener al menos 5 años de experiência práctica en el diseño, establecimiento y mantenimiento de sistemas de gestión y almacenamiento de datos. Hábil en la recopilación, procesamiento, limpieza y despliegue de grandes conjuntos de datos, la comprensión de los modelos de datos ER, y la integración con múltiples fuentes de datos. Eficaz en el análisis, la comunicación y la propuesta de diferentes formas de crear almacenes de datos, lagos de datos, conductos de extremo a extremo y soluciones de Big Data para los clientes, ya sea en estrategias por lotes o en streaming.**

Será muy importante que tengas los siguientes conocimientos/experiência:

- ** Inglés B2+ o más** (llevarás proyectos 100% con el idioma, por lo que será indispensable el dominio hablado y escrito)

**Technical Proficiencies**:

- **SQL**:
Data Definition Language, Data Manipulation Language, Intermediate/advanced queries for analytical purpose, Subqueries, CTEs, Data types, Joins with business rules applied, Grouping and Aggregates for business metrics, Indexing and optimizing queries for efficient ETL process, Stored Procedures for transforming and preparing data, SSMS, DBeaver
- **Python**:
Experience in object-oriented programming, Management and processing datasets, Use of variables, lists, dictionaries and tuples, Conditional and iterating functions, Optimization of memory consumption, Structures and data types, Data ingestion through various structured and semi-structured data sources, Knowledge of libraries such as pandas, numpy, sqlalchemy, Must have good practices when writing code
- **Databricks / Pyspark**:
Intermediate knowledge in

Understanding of narrow and wide transformations, actions, and lazy evaluations

How DataFrames are transformed, executed, and optimized in Spark

Use DataFrame API to explore, preprocess, join, and ingest data in Spark

Use Delta Lake to improve the quality and performance of data pipelines

Use SQL and Python to write production data pipelines to extract, transform, and load data into

tables and views in the Lakehouse

Understand the most common performance problems associated with data ingestion and how to

mitigate them

Monitor Spark UI: Jobs, Stages, Tasks, Storage, Environment, Executors, and Execution Plans

Configure a Spark cluster for maximum performance given specific job requirements

Configure Databricks to access Blob, ADL, SAS, user tokens, Secret Scopes and Azure Key Vault

Configure governance solutions through Unity Catalog and Delta Sharing

Use Delta Live Tables to manage an end-to-end pipeline with unit and integrations test
- **Azure**:
Intermediate/Advanced knowledge in

**Azure Storage Account**:
Provision Azure Blob Storage or Azure Data Lake instances

Build efficient file systems for storing data into folders with static or parametrized names, considering possible security rules and risks

Experience identifying use cases for open-source file formats like parquet, AVRO, ORC

Understanding optimized column-oriented file formats vs optimized row-oriented file formats

Implementing security configurations through Access Keys, SAS, AAD, RBAC, ACLs

**Azure Data Factory**:
Provision Azure Data Factory instances

Use Azure IR, Self-Hosted IR, Azure-SSIS to establish connections to distinct data sources

Use of Copy or Polybase activities for loading data

Build efficient and optimized ADF Pipelines using linked services, datasets, parameters, triggers, data movement activities, data transformation activities, control flow activities and mapping data flows

Build Incremental and Re-Processing Loads
- **Apache Kafka, Azure Event Hubs or AWS Kinesis**

Intermediate/Advanced knowledge in

Architecture and fundamental concepts of event streaming platforms, including producers, consumers, topics, partitions, and consumer groups

Configuration, deployment, and management of event streaming clusters/services for high availability, scalability, and fault tolerance

Performance tuning and optimization of event streaming clusters, including message retention, partition sizing, and data replication

Implementing common usage patterns such as asynchronous messaging, real-time stream processing, and end-to-end data pipelines for real-time data ingestion and processing

Security best practices for event streaming platforms, including encryption, authentication, and access control mechanisms

**Además, valoramos mucho es que a nível personal encajes con la cultura de Derevo**:

- Capacidad de adaptación y superación. Buscamos personas que se quieran comer el mundo, proactivas y flexibles, a las que no les importe adaptarse a los cambios tecnológicos y metodologías existentes.
- Capacidad analítica y capaz de transmitir confianza en entornos de incertidumbre: debes tener capacidad para gestionar los problemas y verlos como punto de partida para la mejora. Tener y generar


  • Data Engineer Senior

    hace 18 horas


    Zapopan, México Derevo A tiempo completo

    ¡¡We are looking for your talent!! ✋*** Data Engineer Senior****‍****El perfil deseado debe tener al menos 5 años de experiência práctica en el diseño, establecimiento y mantenimiento de sistemas de gestión y almacenamiento de datos. Hábil en la recopilación, procesamiento, limpieza y despliegue de grandes conjuntos de datos, la comprensión de...

  • Sr Data Engineer

    hace 7 días


    Zapopan, México Ryscode A tiempo completo

    Description Looking for a Sr. Data Engineer located in GUADALAJARA, México or willing to relocate for a full-time on-site work scheme. Required Technologies Python, AWS, Kafka, snowflake Good English communication GDL Position: ON-SITE Tipo de puesto: Tiempo completo Sueldo: $60,000.00 - $68,400.00 al mes Beneficios: - Caja de ahorro - Seguro de...

  • Data Engineer- Onsite

    hace 18 horas


    Zapopan, México GSB Solutions A tiempo completo

    Important IT company At the Latin American level, growth requires:**Data Engineer****Job description**:- Advanced Google Cloud development skills.- Experience with Google Cloud FHIR data stores.**Schedule**: 9:00 am a 6:00 pm.**Job type**: Onsite.**Location**:Av. Acueducto 4851 Piso 14, Puerta de Hierro, 45116 Zapopan, Jal.**Salary**: Open- Salary according...

  • Data Engineer Jr

    hace 18 horas


    Zapopan, México Derevo A tiempo completo

    ¡¡Buscamos tu talento!! ✋**DATA ENGINEER JR**En **Derevo** buscamos empoderar a las empresas y a las personas para liberar el valor de los datos en las organizaciones, a través de la implementación de procesos y plataformas de analítica con un enfoque que cubre el ciclo completo que necesitan llevar a cabo para lograrlo.Derevo inicio en 2010 con una...


  • Zapopan, México Oracle A tiempo completo

    **Our Team : Oracle GoldenGate for BigData** Oracle GoldenGate (OGG) is a comprehensive software package for real-time data integration and replication in heterogeneous IT environments. Oracle GoldenGate for Big Data streams transactional data into big data systems in real time, raising the quality and timeliness of business insights. For more information,...


  • Zapopan, México SISTEMA BEA A tiempo completo

    BEA-TT designs, develops, and supports a state-of-the-art Automated Fare Collection solutions for transit agencies in North America which currently serve over 15,000,000 passengers per day. At BEA-TT we are looking for new engineering team members that will focus on the creation of a new, unique platform that will serve the North American market as a first...


  • Zapopan, Jalisco, México Solidigm A tiempo completo

    Company DescriptionJoin a multibillion-dollar global company that brings together amazing technology, people, and operational scale to become a powerhouse in the memory industry. Headquartered in Rancho Cordova, California, Solidigm combines elements of an established, successful technology company with the spirit, agility, and entrepreneurial mindset of a...

  • Big Data Engineer

    hace 18 horas


    Zapopan, México Intugo A tiempo completo

    **Position Summary**:Our team is looking for a **Data Engineer** with a diverse background in data integration to join the Data Services team.Some data are small, some data are very large (1 trillion+ rows), some data is structured, some data is not. Our data comes in all kinds of sizes, shapes and formats.Traditional RDBMS like PostgreSQL, Oracle, SQL...

  • Data Engineer, Science

    hace 2 semanas


    Zapopan, México Servicios Comerciales Amazon Mexico S. de R.L. de C.V. - D44 A tiempo completo

    3+ years of data engineering experience - Experience with data modeling, warehousing and building ETL pipelines The Amazon Devices team designs and engineers high-profile consumer electronics, including the best-selling Kindle family of products. We have also produced groundbreaking devices like Fire tablets, Fire TV, Amazon Dash, and Amazon Echo. What...

  • Senior Data Analyst

    hace 3 días


    Zapopan, México BairesDev A tiempo completo

    At BairesDev®, we've been leading the way in technology projects for over 15 years. We deliver cutting-edge solutions to giants like Google and the most innovative startups in Silicon Valley. Our diverse 4,000+ team, composed of the world's Top 1% of tech talent, works remotely on roles that drive significant impact worldwide. Senior Data Analyst at...