Principal Data Engineer for Enterprise AI/ML

4 weeks ago


Zapopan, México Oracle Full-time

**Responsibilities**:

- Design, develop, and implement data pipelines for ingesting, pre-processing, and transforming diverse data types (HTML, image, PDF, audio, video) for Generative AI model training and inference.
- Engineer data for vector databases (e.g., Pinecone, Redis, Chroma) and Large Language Models (LLMs, e.g., GPT-4, Claude 2.0) for tasks like text summarization, entity extraction, and classification.
- Build and maintain efficient data storage solutions, including data lakes, warehouses, and databases suitable for large-scale generative AI datasets. Implement data security and governance policies to safeguard the privacy and integrity of sensitive data used in Generative AI projects.
- Collaborate with data scientists and engineers to understand data requirements for Generative AI models and translate them into efficient data pipelines.
- Monitor and optimize data pipelines for performance, scalability, and cost-effectiveness.
- Build analytical tools to utilize the data pipeline, providing actionable insight into key business performance metrics, including operational efficiency and customer acquisition.
- Collaborate with stakeholders, including the Executive, Product, Data, and Design teams, to support their data infrastructure needs and assist them with data-related technical issues.

**Qualifications**:

- Bachelor's degree in computer science, data science, statistics, or a related field, or equivalent experience.
- 6+ years of proven experience in data engineering, ETL, SQL, databases, JSON data, data pipeline development, building data platforms, and data storage technologies.
- 2+ years of experience building and maintaining data pipelines for machine learning projects.
- Strong understanding of data structures, data modeling principles, data quality measures, and data security best practices, with experience in transforming, cleaning, and organizing unstructured data.
- High proficiency in Python, SQL, and scripting languages.
- Experience in continuous integration/deployment for large data pipelines and familiarity with containerization technologies (e.g., Docker) and orchestration tools (e.g., Kubernetes) for scalable and efficient model deployment.
- Familiarity with implementing data and/or machine learning algorithms in production systems (e.g., AWS SageMaker, GCP Datalab, or custom implementation).
- Hands-on experience with cloud platforms (e.g., OCI, AWS, GCP, Azure) for data storage and processing, along with Gen AI services like OCI Gen AI, Azure AI Services, or AWS Bedrock.
- Strong problem-solving skills and the ability to analyze data and design solutions to complex data issues.
- Familiarity with the modern ETL stack (Airflow, dbt, Snowflake), data stream frameworks (Kafka, Kinesis), vector databases (e.g., Pinecone, Redis, Chroma), and OpenSearch / Elasticsearch.
- Understanding of Large Language Models (LLMs, e.g., GPT-4, Claude 2.0) for tasks like text summarization, entity extraction, and classification.
- Excellent communication skills and the ability to convey complex technical concepts to non-technical stakeholders.
- Ability to work independently and collaboratively in a fast-paced environment.
- Practical knowledge of Agile project management and software development methodologies such as Scrum and SAFe.
- Experience working with globally distributed teams.



  • Zapopan, México Oracle Full-time

    Oracle’s Forward Deployed Engineer (FDE) team is hiring a Principal Solutions Architect - AI Data Platform to help global customers unlock the full potential of their data. You will provide expert architectural guidance focused on designing, optimizing, and scaling modern AI/ML-centric data platforms. As a key member of Oracle’s Analytics and AI Service...


  • Zapopan, México Canonical Full-time

    Python and Kubernetes Software Engineer - Data, AI/ML & Analytics at Canonical. Posted 4 months ago.


  • Zapopan, México Canonical Full-time

    Python and Kubernetes Software Engineer - Data, Workflows, AI/ML & Analytics at Canonical. Posted 3 days ago.


  • Zapopan, México AstraZeneca Full-time

    **Location**: Zapopan, Jalisco, Mexico. **Job ID**: R- **Date posted**: 05/06/2025. **AI Support Engineer**. **Location**: Hybrid (3 days on-site in Guadalajara, MX; 2 days remote). **About AstraZeneca**: AstraZeneca is a global, science-led biopharmaceutical company focused on the discovery, development and commercialisation of life-changing medicines. Our...

  • AI Expert Engineer

    2 weeks ago


    Zapopan, México Incfile Full-time

    **Key Responsibilities**: - Develop, implement, and maintain AI tools tailored to business and/or customer needs. - Collaborate with cross-functional teams to understand requirements and integrate AI solutions into existing workflows. - Perform data preprocessing, feature engineering, and model evaluation to ensure high accuracy and performance. - Create...

  • Senior Machine Learning

    2 weeks ago


    Zapopan, México BairesDev Full-time

    Senior Machine Learning & LLM Engineer - Remote Work | REF# at BairesDev. Posted 2 months ago. At BairesDev, we've been leading the way in...


  • Zapopan, México Oracle Full-time

    **About the Company and Team**: The Infrastructure Industries Global Industry Unit's (IGIU) mission is to build and deliver technology and solutions that improve the lives of global citizens. For Energy & Water, that's striving to ensure that every global citizen has access to clean and affordable energy and water. For Construction & Engineering, it's...


  • Zapopan, México Oracle Full-time

    Key Responsibilities: Design and implement end-to-end Generative AI solutions using LLMs, Agents, and RAG workflows. Build and optimize AI/ML pipelines using OCI Generative AI services. Develop Gen AI Agents for enterprise use cases and workflow automation. Develop AI solutions that adhere to security, compliance, and governance requirements. Contribute to code...

  • Data Engineer

    3 weeks ago


    Zapopan, México AgileEngine Full-time

    AgileEngine is an Inc. **** company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among the leaders in areas like application development and AI/ML, and our people-first culture has earned us multiple Best Place to Work awards. WHY JOIN US: If you're looking for a place to grow, make an...