Principal Data Engineer for Enterprise Ai/ml
hace 3 días
**Responsibilities**:
- Design, develop, and implement data pipelines for ingesting, pre-processing, and transforming diverse data types (html, Image,.pdf, Audio, video) for Generative AI model training and inference.
- Engineer data for vector databases (e.g., Pinecone, Redis, Chroma) and Large Language Models (LLM, GPT-4, Claude 2.0) and for tasks like text summarization, entity extraction, and classification.
- Build and maintain efficient data storage solutions, including data lakes, warehouses, and databases suitable for large-scale generative AI datasets. Implement data security and governance policies to safeguard the privacy and integrity of sensitive data used in Generative AI projects.
- Collaborate with data scientists and engineers to understand data requirements for Generative AI models and translate them into efficient data pipelines.
- Monitor and optimize data pipelines for performance, scalability, and cost-effectiveness.
- Build analytical tools to utilize the data pipeline, providing actionable insight into key business performance metrics, including operational efficiency and customer acquisition.
- Work with stakeholders, including data, design, product, and executive teams, and assisting them with data-related technical issues
- Collaborate with stakeholders, including the Executive, Product, Data, and Design teams, to support their data infrastructure needs while assisting with data-related technical issues.
**Qualifications**:
- Bachelor's degree in computer science, data science, statistics, or a related field, or equivalent experience.
- 6+ years of proven experience in data engineering, ETL, SQL, database, JSON data, data pipeline development, building data platforms, and data storage technologies.
- 2+ years of experience building and maintaining data pipelines for machine learning projects.
- Strong understanding of data structures, data modeling principles, data quality measures, and data security best practices, with experience in transforming, cleaning, and organizing unstructured data
- High proficiency in Python, SQL, and scripting languages.
- Experience in continuous integration/deployments for large data pipelines and familiarity with containerization technologies (e.g., Docker) and orchestration tools (e.g., Kubernetes) for scalable and efficient model deployment.
- Familiarity with implementing data and/or machine learning algorithms in production systems (e.g. AWS Sagemaker, GCP Datalab, or custom implementation);
- Hands-on experience with cloud platforms (e.g., OCI, AWS, GCP, Azure) for data storage and processing, along with Gen AI services like OCI Gen AI, Azure AI Services, or AWS BedRock.
- Strong problem-solving skills and the ability to analyze data and design solutions to complex data issues.
- Familiarity with modern ETL stack (Airflow, DBT, Snowflake), data stream frameworks (Kafka, Kinesis), vector databases (e.g., Pinecone, Redis, Chroma) and OpenSearch / Elasticsearch.
- Understanding of Large Language Models (LLM, GPT-4, Claude 2.0) for tasks like text summarization, entity extraction, and classification.
- Excellent communication skills and the ability to convey complex technical concepts to non-technical stakeholders.
- Ability to work independently and collaboratively in a fast-paced environment.
- Practical knowledge of Agile project management and software development methodologies such as Scrum and SAFe.
- Experiencing working with globally distributed teams.
-
Principal Data Engineer for Enterprise Ai/ml
hace 4 semanas
Zapopan, México Oracle A tiempo completo**Responsibilities**:- Design, develop, and implement data pipelines for ingesting, pre-processing, and transforming diverse data types (html, Image,.pdf, Audio, video) for Generative AI model training and inference.- Engineer data for vector databases (e.g., Pinecone, Redis, Chroma) and Large Language Models (LLM, GPT-4, Claude 2.0) and for tasks like text...
-
Principal Solutions Architect
hace 2 semanas
Zapopan, México Oracle A tiempo completoOracle’s Forward Deployed Engineer (FDE) team is hiring a Principal Solutions Architect - AI Data Platform to help global customers unlock the full potential of their data. You will provide expert architectural guidance focused on designing, optimizing, and scaling modern AI/ML-centric data platforms. As a key member of Oracle’s Analytics and AI Service...
-
Python and Kubernetes Software Engineer
hace 2 semanas
zapopan, México Canonical A tiempo completoPython and Kubernetes Software Engineer - Data, AI/ML & Analytics Join to apply for the Python and Kubernetes Software Engineer - Data, AI/ML & Analytics role at Canonical Python and Kubernetes Software Engineer - Data, AI/ML & Analytics 4 months ago Be among the first 25 applicants Join to apply for the Python and Kubernetes Software Engineer - Data, AI/ML...
-
Python and Kubernetes Software Engineer
hace 2 semanas
zapopan, México Canonical A tiempo completoPython and Kubernetes Software Engineer - Data, Workflows, AI/ML & Analytics Join to apply for the Python and Kubernetes Software Engineer - Data, Workflows, AI/ML & Analytics role at Canonical Python and Kubernetes Software Engineer - Data, Workflows, AI/ML & Analytics 3 days ago Be among the first 25 applicants Join to apply for the Python and Kubernetes...
-
Ai Support Engineer Zapopan, Jalisco, Mexico
hace 2 semanas
Zapopan, México AstraZeneca A tiempo completo**Location** Zapopan, Jalisco, Mexico**Job ID** R- **Date posted** 05/06/2025**AI Support Engineer**:**Location: Hybrid (3 days on-site in Guadalajara, MX; 2 days remote)***:**About AstraZeneca**:AstraZeneca is a global, science-led biopharmaceutical company focused on the discovery, development and commercialisation of life-changing medicines. Our...
-
Ai Expert Engineer
hace 2 semanas
Zapopan, México Incfile A tiempo completo**Key Responsibilities**:- Develop, implement, and maintain AI tools tailored to business and/or customer needs.- Collaborate with cross-functional teams to understand requirements and integrate AI solutions into existing workflows.- Perform data preprocessing, feature engineering, and model evaluation to ensure high accuracy and performance.- Create...
-
Senior Machine Learning
hace 2 semanas
zapopan, México BairesDev A tiempo completoJoin to apply for the Senior Machine Learning & LLM Engineer - Remote Work | REF# role at BairesDev 2 months ago Be among the first 25 applicants Join to apply for the Senior Machine Learning & LLM Engineer - Remote Work | REF# role at BairesDev Get AI-powered advice on this job and more exclusive features. At BairesDev, we've been leading the way in...
-
Senior Data Engineer- Ai Accelerator
hace 3 semanas
Zapopan, México Oracle A tiempo completo**About the Company and Team **The Infrastructure Industries Global Industry Unit's (IGIU) mission is to build and deliver technology and solutions that improve the lives of global citizens.For Energy & Water, that's striving to ensure that every global citizen has access to clean and affordable energy and water.For Construction & Engineering, it's...
-
Full Stack Data Engineer
hace 1 día
Zapopan, México AstraZeneca A tiempo completo**Location** Zapopan, Jalisco, Mexico**Job ID** R- **Date posted** 16/06/2025**Location: Hybrid (3 days on-site in Guadalajara, MX; 2 days remote)***:**About AstraZeneca**:AstraZeneca is a global, science-led biopharmaceutical company focused on the discovery, development and commercialisation of life-changing medicines. Our Enterprise AI team delivers the...
-
Principal Member of Technical Staff
hace 4 semanas
Zapopan, México Oracle A tiempo completoKey Responsibilities:Design and implement end-to-end Generative AI solutions using LLMs, Agents, and RAG workflows.Build and optimize AI/ML pipelines using OCI Generative AI services.Develop Gen AI Agents for enterprise use cases and workflow automation.Develop AI solutions that adhere to security, compliance, and governance requirements.Contribute to code...