Sr AI Engineer
hace 2 semanas
Your Mission
To architect a secure, hybrid AI ecosystem that combines the power of Large Language Models (LLMs) with the privacy of local inference. You will build the "brain" of the platform: a sophisticated RAG (Retrieval-Augmented Generation) engine that indexes millions of words, understands user intent, and delivers personalized answers without ever exposing sensitive data.
Role Overview
We are seeking a pragmatic and security-minded AI Engineer to lead the development of our Generative AI and Personalization modules. This is not a role for someone who simply writes prompts for ChatGPT.
You will be engineering a Hybrid AI Architecture: deploying open-source models (e.g., Llama, Mistral) for private, local data processing, while securely orchestrating external LLM APIs (OpenAI/Anthropic) for public-facing tasks. You will own the Vector Database, the Semantic Search algorithms, and the critical Data Safety Layer that prevents PII leakage. You will leverage an AI-first workflow (using tools like Cursor and Claude) to rapidly prototype and deploy these complex systems.
Key Responsibilities
- Engineer RAG Pipelines: Architect and build the Retrieval-Augmented Generation system that powers the site's search. This involves chunking, embedding, and retrieving context from a dataset of ~38 million words.
- Build Private Inference Models: specific content classification tasks (e.g., auto-tagging legacy documents), deploy and optimize open-source Small Language Models (SLMs) and LLM to run entirely within our private cloud infrastructure to ensure data privacy.
- Vector Database Management: Manage the vector search infrastructure (e.g., Pinecone, Weaviate, or Milvus), optimizing embedding models to ensure high relevance in search results.
- Implement AI Safety & PII Scrubbing: Build the "airlock" systems that automatically detect and redact Personally Identifiable Information (PII) from user queries before they are sent to any third-party model .
- Vector Database Management: You are responsible for managing the vector search infrastructure (e.g., Pinecone, Weaviate, or Milvus) and optimizing embedding models to ensure high relevance.
- Develop Personalization Algorithms: Engineer privacy-safe recommendation engines that analyze anonymized behavioral data (collaborative filtering) to surface relevant content based on user roles .
- MLOps & Model Governance: Implement monitoring to detect "Model Drift" and automated pipelines for re-indexing content when new documents are published .
- AI-Accelerated Development: Actively utilize AI coding assistants (Cursor, Claude CLI, Gemini) to write boilerplate Python code, generate complex SQL/Vector queries, and unit test data pipelines.
Core Skills and Competencies
- Tech Stack: Expert-level Python skills. Proficiency with AI frameworks like PyTorch or TensorFlow.
- Generative AI Frameworks: Deep experience with orchestration libraries like LangChain or LlamaIndex.
- Vector Search: Hands-on experience with Vector Databases and embedding models (e.g., OpenAI text-embedding-3, HuggingFace models).
- Hybrid Architecture: Ability to work with both 3rd Party APIs (OpenAI, Anthropic) AND Local/Open Source Models (Ollama, vLLM, HuggingFace Transformers).
- Data Engineering: Experience building ETL pipelines to ingest unstructured data (PDFs, HTML) for AI processing .
- Security: Understanding of data redaction techniques and secure API handling.
Required Tech Stack
This role requires a "Hybrid Polyglot" who can work with both commercial APIs and raw open-source model weights.
- Core Language: Expert-level Python.
- AI Frameworks: Deep experience with PyTorch or TensorFlow.
- Orchestration: Proficiency with LangChain, LangGraph or LlamaIndex.
- Vector Databases: Hands-on experience with Pinecone, Weaviate, or Milvus.
- Model Ecosystem (Hybrid):
- Public APIs: OpenAI, Anthropic.
- Local/Open Source: Ollama, vLLM, Hugging Face Transformers.
- Data Engineering: Experience building ETL pipelines for unstructured data (PDFs, HTML).
- AI Workflow Tools: Proficiency in Cursor and Claude CLI.
- Code versioning: Git
Domains of Mastery
- Retrieval Optimization: The ability to tune a search engine so it doesn't just find keywords, but understands concepts (Semantic Search).
- Privacy-Preserving ML: Knowing how to build personalization systems that track behaviors without tracking individuals (anonymization techniques) .
- Latency Optimization: Balancing the trade-off between the "smartness" of a model and the speed of the response (using caching and lightweight models where possible).
- Prompt Engineering (System Level): Writing robust system prompts that prevent hallucinations and enforce brand tone constraints.
Job Type: Full-time
Pay: $30, $45,000.00 per month
Work Location: Remote
-
Ai Engineer
hace 7 días
Desde casa, México Mechanized AI A tiempo completo**Title**:AI Engineer **Job Type**:Full-Time **Location**:Remote **Company Description**: Mechanized AI is at the forefront of AI innovation, leveraging cutting-edge technology to transform legacy systems into modern, efficient, and scalable solutions. We work with today's fast-paced, digital landscape. Our team thrives on solving complex...
-
Sr. Ai Engineer
hace 2 semanas
Desde casa, México Sezzle A tiempo completo**The salary range for this role is $50,000 - $120,000 per year (Gross in USD)** **About Sezzle**: With a mission to financially empower the next generation, Sezzle is revolutionizing the shopping experience beyond payments, blending cutting-edge tech with seamless, interest-free installment plans that make shopping smarter and more accessible. We’re not...
-
Sr. Ai Engineer
hace 1 semana
Desde casa, México Sezzle A tiempo completo**The salary range for this role is $50,000 - $120,000 per year (Gross in USD)****About Sezzle**:With a mission to financially empower the next generation, Sezzle is revolutionizing the shopping experience beyond payments, blending cutting-edge tech with seamless, interest-free installment plans that make shopping smarter and more accessible. We’re not...
-
Sr Data Engineer
hace 4 días
Desde casa, México Go Sinergia A tiempo completoSR DATA ENGINEER - REMOTEJob Objective:We're hiring a Senior Data Engineer to architect and scale the systems that power our AI-driven valuation tools, market analytics, and collector insights. You'll lead development of our data ingestion pipelines, enrichment workflows, and metadata infrastructure—from scraping and parsing messy real-world sources to...
-
Senior Ai Ml Engineer
hace 2 semanas
Desde casa, México Datalogics A tiempo completo**Senior AI ML Engineer**- 100% Remote- Full-time- CoE / B2B / EoR- Up to USD per yearWe are looking for a skilled **Senior AI ML Engineer** to join one of our clients with a global footprint. In this role, you will develop advanced machine-learning models and AI-driven solutions to support the platform's key features. You will play a significant part in...
-
Ai/ml Engineer
hace 1 semana
Desde casa, México OneSeven Tech A tiempo completoDetails As an AI-First AI/ML Engineer, you'll be architecting and deploying intelligent systems that leverage cutting-edge AI technologies including LangChain orchestration, autonomous AI agents, and robust AWS cloud infrastructure. We are seeking expertise in modern AI/ML frameworks, agentic systems, and scalable backend development using Node.js and...
-
Ai Engineer
hace 2 días
Desde casa, México Icalia Labs A tiempo completoWe value **practical experience** over theory, and we're committed to building **cutting-edge AI solutions** for our clients. This is an opportunity to work on **high-impact projects** that push the boundaries of what AI can achieve.**Key Responsibilities**:- Work with **Machine Learning** and **Deep Learning** models, leveraging frameworks like...
-
Ai DevOps Engineer
hace 4 semanas
Desde casa, México Fast Dolphin A tiempo completo**Job Title: AI DevOps Engineer****Location**: hybrid (trips to West Palm Beach, FL)**Start Date**: ASAP**Duration**: 6+ Months**Requirement Details**:- Excellent understanding of Ruby, Python, Perl, and Java- Configuration and managing databases such as MySQL, Mongo- Working knowledge of various tools, open-source technologies, and cloud services-...
-
Ai DevOps Engineer
hace 9 horas
Desde casa, México Fast Dolphin A tiempo completo**Job Title: AI DevOps Engineer** **Location**: hybrid (trips to West Palm Beach, FL) **Start Date**: ASAP **Duration**: 6+ Months **Requirement Details**: - Excellent understanding of Ruby, Python, Perl, and Java - Configuration and managing databases such as MySQL, Mongo - Working knowledge of various tools, open-source technologies, and cloud...
-
Lead Ai Engineer
hace 4 semanas
Desde casa, México EPAM Systems, Inc. A tiempo completoWe are seeking a **Lead AI Engineer** with expertise in Python to drive the development and deployment of innovative AI solutions across critical projects.**Responsibilities**- Design AI models and systems using Python, agentic frameworks, and MCP- Build scalable data agents with Google ADK and other targeted tools- Integrate robust AI frameworks to automate...