Principal Data Engineer for Enterprise AI/ML
5 days ago
**Responsibilities**:
- Design, develop, and implement data pipelines for ingesting, pre-processing, and transforming diverse data types (HTML, images, PDF, audio, video) for Generative AI model training and inference.
- Engineer data for vector databases (e.g., Pinecone, Redis, Chroma) and Large Language Models (LLMs, e.g., GPT-4 and Claude 2.0) for tasks like text summarization, entity extraction, and classification.
- Build and maintain efficient data storage solutions, including data lakes, warehouses, and databases suitable for large-scale generative AI datasets. Implement data security and governance policies to safeguard the privacy and integrity of sensitive data used in Generative AI projects.
- Collaborate with data scientists and engineers to understand data requirements for Generative AI models and translate them into efficient data pipelines.
- Monitor and optimize data pipelines for performance, scalability, and cost-effectiveness.
- Build analytical tools to utilize the data pipeline, providing actionable insight into key business performance metrics, including operational efficiency and customer acquisition.
- Collaborate with stakeholders, including the Executive, Product, Data, and Design teams, to support their data infrastructure needs and assist them with data-related technical issues.
**Qualifications**:
- Bachelor's degree in computer science, data science, statistics, or a related field, or equivalent experience.
- 6+ years of proven experience in data engineering, including ETL, SQL, databases, JSON data, data pipeline development, building data platforms, and data storage technologies.
- 2+ years of experience building and maintaining data pipelines for machine learning projects.
- Strong understanding of data structures, data modeling principles, data quality measures, and data security best practices, with experience in transforming, cleaning, and organizing unstructured data.
- High proficiency in Python, SQL, and scripting languages.
- Experience in continuous integration/deployments for large data pipelines and familiarity with containerization technologies (e.g., Docker) and orchestration tools (e.g., Kubernetes) for scalable and efficient model deployment.
- Familiarity with implementing data and/or machine learning algorithms in production systems (e.g., AWS SageMaker, GCP Datalab, or a custom implementation).
- Hands-on experience with cloud platforms (e.g., OCI, AWS, GCP, Azure) for data storage and processing, along with Gen AI services like OCI Gen AI, Azure AI Services, or AWS Bedrock.
- Strong problem-solving skills and the ability to analyze data and design solutions to complex data issues.
- Familiarity with a modern ETL stack (Airflow, dbt, Snowflake), data streaming frameworks (Kafka, Kinesis), vector databases (e.g., Pinecone, Redis, Chroma), and OpenSearch / Elasticsearch.
- Understanding of Large Language Models (LLMs, e.g., GPT-4 and Claude 2.0) for tasks like text summarization, entity extraction, and classification.
- Excellent communication skills and the ability to convey complex technical concepts to non-technical stakeholders.
- Ability to work independently and collaboratively in a fast-paced environment.
- Practical knowledge of Agile project management and software development methodologies such as Scrum and SAFe.
- Experience working with globally distributed teams.
-
Principal Solutions Architect
2 weeks ago
Zapopan, México | Oracle | Full-time
Oracle’s Forward Deployed Engineer (FDE) team is hiring a Principal Solutions Architect - AI Data Platform to help global customers unlock the full potential of their data. You will provide expert architectural guidance focused on designing, optimizing, and scaling modern AI/ML-centric data platforms. As a key member of Oracle’s Analytics and AI Service...
-
Principal Solutions Architect
7 days ago
Zapopan, Jalisco, México | Oracle | Full-time
Oracle's Forward Deployed Engineer (FDE) team is hiring a Principal Solutions Architect - AI Data Platform to help global customers unlock the full potential of their data. You will provide expert architectural guidance focused on designing, optimizing, and scaling modern AI/ML-centric data platforms. As a key member of Oracle's Analytics and AI...
-
Zapopan, Jalisco, México | Oracle | Full-time
Oracle's Database Development organization is seeking seasoned developers passionate about building innovative AI and data systems. As a Principal Member of Technical Staff, you'll play a pivotal role in designing, developing, and debugging Oracle's next-generation, massively distributed, and highly scalable data storage and query processing...
-
Principal Member of Technical Staff
20 hours ago
Zapopan, México | Oracle | Full-time
Key Responsibilities: Design and implement end-to-end Generative AI solutions using LLMs, Agents, and RAG workflows. Build and optimize AI/ML pipelines using OCI Generative AI services. Develop Gen AI Agents for enterprise use cases and workflow automation. Develop AI solutions that adhere to security, compliance, and governance requirements. Contribute to code...
-
Full Stack Data Engineer
4 days ago
Zapopan, México | AstraZeneca | Full-time
**Location** Zapopan, Jalisco, Mexico **Job ID** R-229315 **Date posted** 16/06/2025 **Location**: Hybrid (3 days on-site in Guadalajara, MX; 2 days remote) **About AstraZeneca**: AstraZeneca is a global, science-led biopharmaceutical company focused on the discovery, development and commercialisation of life-changing medicines. Our Enterprise AI team...
-
AI Research Engineer/Scientist
4 hours ago
Zapopan, Jalisco, México | Intel Corporation | Full-time
Job Details: Job Description: This position requires candidates to upload a resume in English. You are welcome to upload multiple versions of your resume if you prefer, but an English version will be required to be considered for this position. The AI/ML Software Engineer supports the development and engineering of new artificial intelligence,...
-
Python Developer
2 weeks ago
Zapopan, México | Oracle | Full-time
**Oracle Cloud Infrastructure (OCI)** is a strategic growth area for Oracle. It is a comprehensive cloud service offering in the enterprise software industry, spanning Infrastructure as a Service (IaaS), Platform as a Service (PaaS) and Software as a Service (SaaS). OCI is currently building a future-ready Gen2 cloud Data Science service platform. At the...
-
AI Engineer
1 week ago
Zapopan, México | Oracle | Full-time
**Experienced professionals** - 5+ years of experience in the Artificial Intelligence field. - Experience in a management or technical leadership position in AI / Data Science / Machine Learning. - Experience in pre-sales at tech companies (Oracle, Azure, AWS, GCP, Databricks, Cloudera, Salesforce, IBM, Dell,...
-
Senior Principal DevOps Engineer
2 weeks ago
Zapopan, México | Oracle | Full-time
The Tools and Innovation (TINNO) team is a high-performing engineering organization within Oracle Product Development. Our mission is to design, secure, and scale the intelligent automation platforms that power Oracle Fusion SaaS. We build tools that drive operational efficiency, improve reliability, and ensure compliance across Oracle’s global cloud...