Site Reliability Engineer

hace 2 semanas


Ciudad de México, Ciudad de México Sur A tiempo completo

As the Site Reliability Engineer you will support and scale the infrastructure powering their secure, mission-critical SaaS platform. 

You must be confident in operating and debugging both modern infrastructure (cloud-native, containerized services) and classic Windows production environments (IIS, SQL Server AlwaysOn, Service Broker), with the ability to respond to incidents quickly, support ongoing automation, and scale systems reliably.

Responsibilities
  1. Be part of the team that owns the uptime and performance of our core backend infrastructure (Windows + Linux)
  2. Maintain and enhance observability across systems using Kibana, CloudWatch, and custom telemetry
  3. Manage CI/CD pipelines, infrastructure as code (Terraform, Ansible), and deployment automation
  4. Support and maintain production Windows environments:
  • .NET Framework/Core apps running in IIS
  • SQL Server with AlwaysOn replication and Service Broker-based messaging
Support and operate cloud-native services:
  • AWS Lambdas, DynamoDB, Postgres/Aurora, Redshift, Redis, and containerized workloads in Docker
Participate in on-call rotation and incident responseCollaborate closely with engineering teams to improve system reliability and deployment workflows

Requirements

  1. 5+ years of SRE, DevOps, or WebOps experience supporting production SaaS systems
  2. Strong experience with Windows Server, IIS, and .NET applications in production
  3. Hands-on experience with SQL Server administration, including AlwaysOn and Service Broker
  4. Proficiency in AWS operations, including Lambda, DynamoDB, CloudWatch, and IAM
  5. Familiarity with Postgres, Redis, Kibana/ElasticSearch, and centralized logging
  6. Experience with Docker, Terraform, and Ansible for infrastructure management
  7. Strong scripting skills (PowerShell, Python)
  8. Experience running and debugging containerized and distributed systems in production
  9. Excellent incident response and debugging skills

Benefits

Salary: $6,000 USD/month + Holidays

Unlimited PTO


  • Site Reliability Engineer

    hace 2 semanas


    Ciudad de México, Ciudad de México Maven Workforce Inc. A tiempo completo

    Site Reliability Engineers (SREs)Engineers with a strongJava development backgroundand hands-on experience inproduction operations, monitoring, alerting, and incident management. Proficiency withSplunk, system reliability, performance tuning, and operational excellence is critical.


  • Ciudad de México, Ciudad de México Mastercard A tiempo completo

    Our PurposeMastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we're helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships...

  • Site Reliability Engineer

    hace 2 semanas


    Ciudad de México, Ciudad de México Mastercard A tiempo completo

    Our PurposeMastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we're helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships...

  • Site Reliability Engineer

    hace 2 semanas


    Ciudad de México, Ciudad de México Azkait A tiempo completo

    AZKAITes una empresa mexicana que busca y conecta el mejor talento IT con empresas Latinoamericanas y de Estados Unidos.Estamos en la búsqueda de tu talento comoSite Reliability Engineer (SRE)Requisitos:Licenciatura o Ingeniería en Sistemas, Informática o afín.+5 años de experiencia en roles de SRE, DevOps o Ingeniería de Software.Experiencia...

  • Site Reliability Engineer

    hace 2 semanas


    Ciudad de México, Ciudad de México Sur A tiempo completo

    As the Site Reliability Engineer you will support and scale the infrastructure powering their secure, mission-critical SaaS platform. You must be confident in operating and debugging both modern infrastructure (cloud-native, containerized services) and classic Windows production environments (IIS, SQL Server AlwaysOn, Service Broker), with the ability to...


  • Ciudad de México, Ciudad de México Azkait A tiempo completo

    AZKAITes una empresa mexicana que busca y conecta el mejor talento IT con empresas Latinoamericanas y de Estados Unidos.Estamos en la búsqueda de tu talento comoSRE / Site Reliability Engineer.Requisitos:Licenciatura en Sistemas, Computación o afín+4 años de experiencia profesionalSoporte y troubleshooting en entornos .NET (producción)ServiceNow...

  • Site Reliability Engineer

    hace 2 semanas


    Ciudad de México, Ciudad de México itD Website A tiempo completo

    itD is seeking a Site Reliability Engineer who will report to the Sr. Engineering Manager for a client in the gaming and entertainment space.  As a Site Reliability Engineer, you will focus on designing, deploying, and operating resilient, secure, and globally scalable services in AWS, with , TypeScript, Kubernetes, GitLab, Argo CD (CI/CD).    This...

  • Site Reliability Engineer

    hace 2 semanas


    Ciudad de México, Ciudad de México itD Tech A tiempo completo

    itD is seeking a Site Reliability Engineer who will report to the Sr. Engineering Manager for a client in the gaming and entertainment space. As a Site Reliability Engineer, you will focus on designing, deploying, and operating resilient, secure, and globally scalable services in AWS, with , TypeScript, Kubernetes, GitLab, Argo CD (CI/CD).This long-term W2...

  • Site Reliability Engineer

    hace 2 semanas


    Ciudad de México, Ciudad de México Sur A tiempo completo

    As the Site Reliability Engineer you will support and scale the infrastructure powering their secure, mission-critical SaaS platform.You must be confident in operating and debugging both modern infrastructure (cloud-native, containerized services) and classic Windows production environments (IIS, SQL Server AlwaysOn, Service Broker), with the ability to...

  • Site Reliability Engineer

    hace 2 semanas


    Ciudad de México, Ciudad de México Tech Mahindra A tiempo completo

    We're Hiring We are seeking a talented Site Reliability Engineer (SRE) CDMX with robust experience in Azure environments, Kubernetes, and DevOps practices.Your mission will be to ensure the reliability, scalability, and automation of our critical platforms. If you thrive on solving complex challenges, automating processes, and ensuring seamless operations,...