Azure Data Engineer

hace 2 semanas


Guadalajara, México Derevo A tiempo completo

Become a Derevian and release your Superpower

Derevo

Science, art & human power applied to your information.

We are a consulting firm specialized in data discovery projects adding meaninig to your data through our disruptive methodology which combines elements of design thinking + user experience with technological platforms for information analytics.

**Summary**:
The desired profile should have hands-on experience in designing, establishing, and maintaining data management and storing systems. Skilled in collecting, processing, cleaning, and deploying large datasets, understanding ER data models, and integrating with multiple data sources. Efficient in analyzing, communicating, and proposing different ways of building Data Warehouses, Data Lakes, End-to-End Pipelines, and Big Data solutions to clients, either in batch or streaming strategies.

**Technical Proficiencies**:

- SQL:
Data Definition Language, Data Manipulation Language, Intermediate/advanced queries for analytical purpose, Subqueries, CTEs, Data types, Joins with business rules applied, Grouping and Aggregates for business metrics, Indexing and optimizing queries for efficient ETL process, Stored Procedures for transforming and preparing data, SSMS, DBeaver
- Python:
Experience with object-oriented, manage and processing large datasets, optimizing memory footprint, data structures, ingesting data through multiple structured or unstructured data sources, best practices for writing efficient code. Knowledge in pandas, numpy, pyspark libraries
- Azure:
Intermediate/Advanced knowledge in

Azure Storage Account:
Provision Azure Blob Storage or Azure Data Lake instances

Build efficient file systems for storing data into folders with static or parametrized names, considering possible security rules and risks

Experience identifying use cases for open-source file formats like parquet, AVRO, ORC

Understanding optimized column-oriented file formats vs optimized row-oriented file formats

Implementing security configurations through Access Keys, SAS, AAD, RBAC, ACLs

Azure Data Factory / Azure Synapse Pipelines:
Provision Azure Data Factory instances

Create ETL/ELT processes ingesting on-premises or cloud data sources

Use Azure IR, Self-Hosted IR, Azure-SSIS to establish connections to distinct data sources

Use of Copy or Polybase activities for loading data

Build efficient and optimized ADF Pipelines using linked services, datasets, parameters, triggers, data movement activities, data transformation activities, control flow activities and mapping data flows

Build Incremental and Re-Processing Loads

Azure Synapse Analytics:
Experience identifying and implementing use cases for dedicated and serverless SQL Pools

Experience identifying and implementing use cases for Spark Pools

Understanding MPP architectures and how to work with big data workflows

Experience identifying use cases for Copy or Polybase loads using different structured and semi-structured data sources

Create and manage internal, external and temporary tables in Azure Synapse Analytics

Use of staging, bronze, silver and gold layers. Load structured and semi-structured data in batch using incremental or reprocess strategies

Optimizing the Data Warehouse: strong knowledge of best practices to optimize query performance using replicate, round robin and hash distributions, data partitions, table indexing and table statistics

Optimizing Apache Spark cluster: strong knowledge of best practices to optimize spark job executions, using optimal data format, caching, partitioning, and bucketing data, optimizing joins and shuffles, broadcast variables, hyperspace indexes. Understanding of lazy executions, narrow and wide transformations. Work with the correct executor size in Spark cluster

Investigate optimal data skews in dedicated SQL and Spark pools

Monitoring and analyzing query execution plans to optimize response times and credit consumption

Manage workloads classes to assign different compute resource levels

Understanding of star and snowflake schemas, Watermark date columns, Slowly changing dimensions, ACID compliance and big data massively parallel processing architectures
- Desirable

Azure Databricks, Azure Purview, Azure Event Hubs, Azure Streaming Analytics
- Desired Certifications

Microsoft Certified: DP203 Azure Data Engineer Associate

Databricks Certified Developer

La contratación es inmediata, los beneficios que ofrecemos son superiores a los de ley, esquema 100% nominal, trabajo 100% remoto, trabajamos con diferentes clientes internacionales y nacionales por lo que los proyectos son muy interesantes para el desarrollo profesional, adicional apoyamos en certificaciones de nuestros colaboradores para continuar con el crecimiento.


  • Azure Data Engineer

    hace 2 días


    Guadalajara, México Slalom Consulting A tiempo completo

    **Who You’ll Work With** As a modern technology company, our Slalom Technologists are disrupting the market and bringing to life the art of the possible for our clients. We have passion for building strategies, solutions, and creative products to help our clients solve their most complex and interesting business problems. We surround our technologists...

  • Azure Data Engineer

    hace 19 horas


    Guadalajara, Jalisco, México Infosys A tiempo completo

    Microsoft –– Azure Data Engineer- Ingeniero de datos en AzureIn the role ofAzureData Engineer, you will be responsible for designing, building, and optimizing secure, scalable data pipelines and architectures on Microsoft Azure, enabling efficient data ingestion, transformation, and analytics for business insights. You will work with big data tools,...

  • Azure Data Engineer

    hace 4 semanas


    Guadalajara, México The Brick Soluciones A tiempo completo

    **Acerca del puesto Azure Data Engineer (Inglés Avanzado)**:Buscamos **Azure Data Engineer **para trabajar en modalidad Híbrida en una importante empresa transnacional de tecnología ubicada en la ciudad de Guadalajara Jalisco.**OFRECEMOS****Sueldo: Abierto**Esquema de bonos por objetivosPrestaciones de LeyEsquema de trabajo; Híbrido (2 días de Home...

  • Data Engineer

    hace 3 semanas


    Guadalajara, México NTT DATA A tiempo completo

    **Req ID**: We are currently seeking a Data Engineer to join our team in GDL, Jalisco (MX-JAL), Mexico (MX).**Data Engineer (Based in Guadalajara)****Data Engineers**are responsible for designing, building, and maintaining the systems and processes that collect, store, and analyze data. They are responsible for gathering data from various sources and...

  • Lead Cloud Engineer Azure

    hace 4 semanas


    Guadalajara, México NTT DATA A tiempo completo

    **Req ID**: We are currently seeking a Lead Cloud Engineer Azure to join our team in Guadalajara, Jalisco (MX-JAL), Mexico (MX).**Preferred Experience**:- Solid understanding of cloud computing, networking, and storage principles with focus on Azure. AWS experience a plus.- Cloud administration, OS/server administration, patching, maintenance, and...

  • Azure Data Architect

    hace 2 días


    Guadalajara, México Slalom Consulting A tiempo completo

    **Who You’ll Work With** As a modern technology company, our Slalom Technologists are disrupting the market and bringing to life the art of the possible for our clients. We have passion for building strategies, solutions, and creative products to help our clients solve their most complex and interesting business problems. We surround our technologists...

  • Senior Data Engineer

    hace 2 semanas


    Guadalajara, México Brillio A tiempo completo

    **Senior Data Engineer - R **:**Senior Data Engineer****Primary Skills**:- AKS, Event Hub, Azure DevOps, Cosmos DB, Azure Functions**Job requirements**:- Looking for a Data Engineer with Azure Data Stack expertise to design, build, and optimize large-scale data pipelines and analytics solutions.- Key Responsibilities:- Develop and maintain data pipelines...

  • Data Engineer

    hace 1 semana


    Guadalajara, México NTT DATA, Inc. A tiempo completo

    NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now.We are currently seeking a Data Engineer to join our team in GDL, Jalisco (MX-JAL), Mexico (MX).Data Engineer (Based in Guadalajara)Data Engineers are responsible...

  • Data Engineer

    hace 7 días


    Guadalajara, México NTT DATA, Inc. A tiempo completo

    NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now.We are currently seeking a Data Engineer to join our team in GDL, Jalisco (MX-JAL), Mexico (MX).Data Engineer (Based in Guadalajara)Data Engineers are responsible...


  • Guadalajara, México Training Talent A tiempo completo

    **Vacante para la empresa Training Talent en Guadalajara, Jalisco**:Somos una consultora de servicios de desarrollo de software con integraciones en nube, soluciones IoT y Machine Learning con presencia en Latam y EE. UU. socios de Microsoft, Amazon, buscamos en México un Arquitecto o ingeniero de datos Sr en Azure bilingüe en esquema remoto:Equipo...