Site Reliability Engineer

7 days ago


Ciudad de México, Ciudad de México · Sur · Full-time

As the Site Reliability Engineer, you will support and scale the infrastructure powering our secure, mission-critical SaaS platform.

You must be confident in operating and debugging both modern infrastructure (cloud-native, containerized services) and classic Windows production environments (IIS, SQL Server AlwaysOn, Service Broker), with the ability to respond to incidents quickly, support ongoing automation, and scale systems reliably.

Responsibilities
  1. Be part of the team that owns the uptime and performance of our core backend infrastructure (Windows + Linux)
  2. Maintain and enhance observability across systems using Kibana, CloudWatch, and custom telemetry
  3. Manage CI/CD pipelines, infrastructure as code (Terraform, Ansible), and deployment automation
  4. Support and maintain production Windows environments:
     • .NET Framework/Core apps running in IIS
     • SQL Server with AlwaysOn replication and Service Broker-based messaging
  5. Support and operate cloud-native services:
     • AWS Lambdas, DynamoDB, Postgres/Aurora, Redshift, Redis, and containerized workloads in Docker
  6. Participate in on-call rotation and incident response
  7. Collaborate closely with engineering teams to improve system reliability and deployment workflows
Requirements
  1. 5+ years of SRE, DevOps, or WebOps experience supporting production SaaS systems
  2. Strong experience with Windows Server, IIS, and .NET applications in production
  3. Hands-on experience with SQL Server administration, including AlwaysOn and Service Broker
  4. Proficiency in AWS operations, including Lambda, DynamoDB, CloudWatch, and IAM
  5. Familiarity with Postgres, Redis, Kibana/Elasticsearch, and centralized logging
  6. Experience with Docker, Terraform, and Ansible for infrastructure management
  7. Strong scripting skills (PowerShell, Python)
  8. Experience running and debugging containerized and distributed systems in production
  9. Excellent incident response and debugging skills
Benefits

Salary: $6,000 USD/month + Holidays

Unlimited PTO


