Empleos actuales relacionados con MCP & Tools Python Developer - Agent Evaluation Infrastructure - Ciudad de México, Ciudad de México - Mindrift


  • Ciudad de México, Ciudad de México spacedev A tiempo completo

     Join SpaceDev Welcome to one of Latin America's leading companies providing premium development services. We work with: USA & Canada Clients Global teams on challenging projects Remote-First Culture Always Growing Innovative Full Stack Solutions Senior Full Stack Python DeveloperWe're looking for a hands-on developer ready to take ownership of high-impact...

  • Python Developer

    hace 2 semanas


    Ciudad de México, Ciudad de México Multiplica Talent A tiempo completo

    We need you talent to do.....Contribute to architecture and design of a frontend-backend-IoT system designed for massively parallel users and devicesWork closely with the product, design and engineering teams to conceive and implement new features for users across our stackInterface with customers to obtain feedback and improve the productDevelop code...

  • Senior Python Developer

    hace 14 horas


    Ciudad de México, Ciudad de México Salve Consulting A tiempo completo

    Type: Freelance/Contract | Remote | PST overlap requiredFully remote, short-term contract role (5-8 weeks) requiring 40 hours/week with 4 hours of overlap with PST. You will work as a contractor on advanced LLM-focused AI projects for a leading global AI research partner.About the project:This project supports the development and evaluation of...

  • Cloud Ops Developer

    hace 6 días


    Ciudad de México, Ciudad de México Interaxon A tiempo completo

    Interaxon (Muse) is a neurotechnology company building brain-computer interfaces (BCIs). Our Muse EEG headbands measure brain activity and deliver real-time feedback on mental states like focus, calm, and stress. Muse is used by clinicians, researchers, athletes, and individuals to support mindfulness, sleep, and mental health. In the role of CloudOps...

  • Full-Stack Developer

    hace 2 semanas


    Ciudad de México, Ciudad de México Sur A tiempo completo

    As the Full-stack Developer you will help lead the charge of the mission to turn big ideas into blazing realities. In this role, you won't just write code, you'll shape dreams, building custom apps that delight clients and advancing the AI tools that power magic. We're searching for a curious learner, a bold challenger, and a collaborative...


  • Ciudad de México, Ciudad de México Workana A tiempo completo

    Workana is the largest remote work platform for talents in Latin America. Our new segment, Workana Premium, focuses on matching the most exceptional professionals with leading and innovative companies around the globe. Enjoy competitive compensation, dedicated support, and the flexibility of remote work within a dynamic environment that fosters collaboration...

  • AI/ML Evaluation Engineer

    hace 2 semanas


    Ciudad de México, Ciudad de México Truelogic A tiempo completo

    About TruelogicAt Truelogic we are a leading provider of nearshore staff augmentation services headquartered in New York. For over two decades, we've been delivering top-tier technology solutions to companies of all sizes, from innovative startups to industry leaders, helping them achieve their digital transformation goals.Our team of 600+ highly skilled...

  • Digital CM Developer

    hace 4 días


    Ciudad de México, Ciudad de México Sequoia Connect A tiempo completo

    Our client is a rapidly growing, automation-led service provider specializing in IT, business process outsourcing (BPO), and consulting services. With a strong focus on digital transformation, cloud solutions, and AI-driven automation, they help businesses optimize operations and enhance customer experiences. Backed by a global workforce of over 32,000...

  • Golang Developer

    hace 2 semanas


    Ciudad de México, Ciudad de México GFT A tiempo completo

    What are we are looking for?Golang DeveloperResponsabilities:Complete peer code reviews and provide written and verbal feedback in EnglishReview AI generated feedback generated during the code review processUse GenAI tools to generate recommendations to improve user codeProvide feedback and participate in discussions to determine how to improve the...

  • Golang Developer

    hace 4 días


    Ciudad de México, Ciudad de México GFT A tiempo completo

    What are we are looking for?Golang DeveloperResponsabilities:Complete peer code reviews and provide written and verbal feedback in EnglishReview AI generated feedback generated during the code review processUse GenAI tools to generate recommendations to improve user codeProvide feedback and participate in discussions to determine how to improve the...

MCP & Tools Python Developer - Agent Evaluation Infrastructure

hace 2 semanas


Ciudad de México, Ciudad de México Mindrift A tiempo completo

This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.

At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. 

What we do

The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe.

Who we're looking for

Calling all security researchers, engineers, and penetration testers with a strong foundation in problem-solving, offensive security, and AI-related risk assessment.

If you thrive on digging into complex systems, uncovering hidden vulnerabilities, and thinking creatively under constraints, join us 

We're looking for someone who can bring a hands-on approach to technical challenges, whether breaking into systems to expose weaknesses or building secure tools and processes. We value contributors with a passion for continuous learning, experimentation, and adaptability. 

About the project

We're on the hunt for hands-on Python engineers for a new project focused on developing Model Context Protocol (MCP) servers and internal tools for running and evaluating agent behavior. You'll implement base methods for agent action verification, integrate with internal and client infrastructures, and help fill tooling gaps across the team.

What you'll be doing:

  • Developing and maintaining MCP-compatible evaluation servers
  • Implementing logic to check agent actions against scenario definitions
  • Creating or extending tools that writers and QAs use to test agents
  • Working closely with infrastructure engineers to ensure compatibility
  • Occasionally helping with test writing or debug sessions when needed

Although we're only looking for experts for this current project, contributors with consistent high-quality submissions may receive an invitation for ongoing collaboration across future projects. 

How to get started:

Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.

Requirements

The ideal contributor will have:

  • 4+ years of Python development experience, ideally in backend or tools
  • Solid experience building APIs, testing frameworks, or protocol-based interfaces
  • Understanding of Docker, Linux CLI, and HTTP-based communication
  • Ability to integrate new tools into existing infrastructures
  • Familiarity with how LLM agents are prompted, executed, and evaluated
  • Clear documentation and communication skills - you'll work with QA and writers

We also value applicants who have:

  • Experience with Model Context Protocol (MCP) or similar structured agent-server interfaces
  • Knowledge of FastAPI or similar async web frameworks
  • Experience working with LLM logs, scoring functions, or sandbox environments
  • Ability to support dev environments (devcontainers, CI configs, linters)
  • JS experience

Benefits

  • Get paid for your expertise, with rates that can go up to $21/hour depending on your skills, experience, and project needs.
  • Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.
  • Participate in an advanced AI project and gain valuable experience to enhance your portfolio.
  • Influence how future AI models understand and communicate in your field of expertise.