Data Engineer Senior

hace 1 semana


Zapopan, México Derevo A tiempo completo

We are looking for your talent ✋*** Data Engineer Senior****‍****El perfil deseado debe tener al menos 5 años de experiência práctica en el diseño, establecimiento y mantenimiento de sistemas de gestión y almacenamiento de datos. Hábil en la recopilación, procesamiento, limpieza y despliegue de grandes conjuntos de datos, la comprensión de los modelos de datos ER, y la integración con múltiples fuentes de datos. Eficaz en el análisis, la comunicación y la propuesta de diferentes formas de crear almacenes de datos, lagos de datos, conductos de extremo a extremo y soluciones de Big Data para los clientes, ya sea en estrategias por lotes o en streaming.**Será muy importante que tengas los siguientes conocimientos/experiência:- ** Inglés B2+ o más** (llevarás proyectos 100% con el idioma, por lo que será indispensable el dominio hablado y escrito)**Technical Proficiencies**:- **SQL**:Data Definition Language, Data Manipulation Language, Intermediate/advanced queries for analytical purpose, Subqueries, CTEs, Data types, Joins with business rules applied, Grouping and Aggregates for business metrics, Indexing and optimizing queries for efficient ETL process, Stored Procedures for transforming and preparing data, SSMS, DBeaver- **Python**:Experience in object-oriented programming, Management and processing datasets, Use of variables, lists, dictionaries and tuples, Conditional and iterating functions, Optimization of memory consumption, Structures and data types, Data ingestion through various structured and semi-structured data sources, Knowledge of libraries such as pandas, numpy, sqlalchemy, Must have good practices when writing code- **Databricks / Pyspark**:Intermediate knowledge inUnderstanding of narrow and wide transformations, actions, and lazy evaluationsHow DataFrames are transformed, executed, and optimized in SparkUse DataFrame API to explore, preprocess, join, and ingest data in SparkUse Delta Lake to improve the quality and performance of data pipelinesUse SQL and Python to write production data pipelines to extract, transform, and load data intotables and views in the LakehouseUnderstand the most common performance problems associated with data ingestion and how tomitigate themMonitor Spark UI: Jobs, Stages, Tasks, Storage, Environment, Executors, and Execution PlansConfigure a Spark cluster for maximum performance given specific job requirementsConfigure Databricks to access Blob, ADL, SAS, user tokens, Secret Scopes and Azure Key VaultConfigure governance solutions through Unity Catalog and Delta SharingUse Delta Live Tables to manage an end-to-end pipeline with unit and integrations test- **Azure**:Intermediate/Advanced knowledge in**Azure Storage Account**:Provision Azure Blob Storage or Azure Data Lake instancesBuild efficient file systems for storing data into folders with static or parametrized names, considering possible security rules and risksExperience identifying use cases for open-source file formats like parquet, AVRO, ORCUnderstanding optimized column-oriented file formats vs optimized row-oriented file formatsImplementing security configurations through Access Keys, SAS, AAD, RBAC, ACLs**Azure Data Factory**:Provision Azure Data Factory instancesUse Azure IR, Self-Hosted IR, Azure-SSIS to establish connections to distinct data sourcesUse of Copy or Polybase activities for loading dataBuild efficient and optimized ADF Pipelines using linked services, datasets, parameters, triggers, data movement activities, data transformation activities, control flow activities and mapping data flowsBuild Incremental and Re-Processing Loads- **Apache Kafka, Azure Event Hubs or AWS Kinesis**Intermediate/Advanced knowledge inArchitecture and fundamental concepts of event streaming platforms, including producers, consumers, topics, partitions, and consumer groupsConfiguration, deployment, and management of event streaming clusters/services for high availability, scalability, and fault tolerancePerformance tuning and optimization of event streaming clusters, including message retention, partition sizing, and data replicationImplementing common usage patterns such as asynchronous messaging, real-time stream processing, and end-to-end data pipelines for real-time data ingestion and processingSecurity best practices for event streaming platforms, including encryption, authentication, and access control mechanisms**Además, valoramos mucho es que a nível personal encajes con la cultura de Derevo**:- Capacidad de adaptación y superación. Buscamos personas que se quieran comer el mundo, proactivas y flexibles, a las que no les importe adaptarse a los cambios tecnológicos y metodologías existentes.- Capacidad analítica y capaz de transmitir confianza en entornos de incertidumbre: debes tener capacidad para gestionar los problemas y verlos como punto de partida para la mejora. Tener y generar


  • Sr Data Engineer

    hace 3 semanas


    Zapopan, México Ryscode A tiempo completo

    DescriptionLooking for a Sr. Data Engineer located in GUADALAJARA, México or willing to relocate for a full-time on-site work scheme.Required TechnologiesPython, AWS, Kafka, snowflakeGood English communicationGDL Position: ON-SITETipo de puesto: Tiempo completoSueldo: $60,000.00 - $68,400.00 al mesBeneficios:- Caja de ahorro- Seguro de gastos médicos-...

  • Big Data Engineer

    hace 3 días


    Zapopan, México BairesDev A tiempo completo

    Join to apply for the Big Data Engineer - Remote Work | REF#281311 role at BairesDevContinue with Google Continue with Google5 months ago Be among the first 25 applicantsJoin to apply for the Big Data Engineer - Remote Work | REF#281311 role at BairesDevGet AI-powered advice on this job and more exclusive features.Continue with Google Continue with...

  • Big Data Engineer

    hace 5 días


    Zapopan, México BairesDev A tiempo completo

    Join to apply for the Big Data Engineer - Remote Work | REF#281311 role at BairesDevContinue with Google Continue with Google5 months ago Be among the first 25 applicantsJoin to apply for the Big Data Engineer - Remote Work | REF#281311 role at BairesDevGet AI-powered advice on this job and more exclusive features.Continue with Google Continue with...

  • QA Data Engineer

    hace 2 semanas


    Zapopan, Jalisco, México ECUACIÓN A tiempo completo

    QA Data Engineer (BI / Data Quality)About the RoleWe are looking for aQA Data Engineerwith prior experience inBusiness Intelligence (BI) quality teams. This role focuses on validating, testing, and ensuring the accuracy of data and dashboards used for business reporting. You will work closely with BI developers, data engineers, and stakeholders to guarantee...

  • Senior Data Analyst

    hace 4 semanas


    Zapopan, México Bizee A tiempo completo

    We are seeking a highly skilled and experienced Senior Data Analyst to join our growing team. As a Senior Data Analyst, you will play a pivotal role in driving data-driven decision-making across our organization. The primary focus of this position will be on analyzing marketing efficacy, product efficiency, Google Analytics, and other first-party data to...

  • Data Engineer

    hace 3 semanas


    Zapopan, México Dresden Partners A tiempo completo

    **Descripción**:Dresden Partnersexpertos en tecnología mobile, aplicaciones web, servicios near-shore staffing, tech international y local tech sourcing; está en busca de:**Data Engineer**Experiência- +3 años de experiência- Experiência con IBM Datastage- Conocimiento en SQL y UNIX- Deseable tener conocimiento en Python y Snowflake- Inglés...

  • Senior Platform Engineer

    hace 1 semana


    Zapopan, México UST A tiempo completo

    **Senior Platform Engineer** **Lead I - DevOps Engineering **Who we are**: Born digital, UST transforms lives through the power of technology. We walk alongside our clients and partners, embedding innovation and agility into everything they do. We help them create transformative experiences and human-centered solutions for a better world. UST is a...

  • Senior Software Engineer

    hace 3 semanas


    Zapopan, México SISTEMA BEA A tiempo completo

    BEA-TT designs, develops, and supports a state-of-the-art Automated Fare Collection solutions for transit agencies in North America which currently serve over 15,000,000 passengers per day. At BEA-TT we are looking for new engineering team members that will focus on the creation of a new, unique platform that will serve the North American market as a first...


  • Zapopan, México Canonical A tiempo completo

    Join to apply for the Senior/Staff/Principal Engineer role at Canonical3 days ago Be among the first 25 applicantsJoin to apply for the Senior/Staff/Principal Engineer role at CanonicalCanonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is very widely used in...


  • Zapopan, México Canonical A tiempo completo

    Join to apply for the Senior/Staff/Principal Engineer role at Canonical3 days ago Be among the first 25 applicantsJoin to apply for the Senior/Staff/Principal Engineer role at CanonicalCanonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is very widely used in...