Lead Site Reliability Engineer

hace 4 días


Ciudad de México, Ciudad de México Pathlock A tiempo completo

About Pathlock:

Pathlock is a leader in application security, access governance, and compliance automation. Our cloud-based solutions help organizations secure critical applications, mitigate risk, and enforce policies across a diverse IT landscape.

Job Summary:

Join Pathlock, a fast-growing leader in Governance, Access and Compliance, where you'll help shape the reliability and scalability of mission-critical platforms running on Azure, Kubernetes, and modern DevOps toolchains. If you thrive on solving complex infrastructure challenges, automating everything, and ensuring systems run flawlessly — this is your place.

Key Responsibilities:

· Lead a DevOps team in executing CI/CD, cloud infrastructure, and site readiness activities with a focus on performance, reliability, and continuous improvement.

Design, build, and enhance CI/CD pipelines to accelerate application deployments and infrastructure provisioning.

· Develop and maintain automation frameworks that simplify operations, reduce manual effort, and improve consistency across environments.

· Configure, manage, and optimize cloud infrastructure — ensuring alignment with security, scalability, and performance best practices.

· Collaborate closely with development teams to resolve deployment challenges, ensure site readiness, and improve delivery workflows.

· Monitor system performance and reliability, proactively identify issues, and implement data-driven improvements to enhance uptime and efficiency.

· Participate in on-call rotations, providing reliable production deployment support and incident resolution to ensure continuous operations.

· Maintain comprehensive technical documentation for CI/CD processes, configurations, and troubleshooting guidelines.

· Perform readiness assessments and validation tests to confirm application and infrastructure stability before production rollouts.

· Implement Infrastructure as Code (IaC) using tools like Terraform and ARM templates, ensuring version-controlled, reproducible infrastructure.

· Troubleshoot and resolve complex issues related to deployments, provisioning, and performance across multi-cloud or containerized environments.

Qualifications:

· 5-7 years of hands-on experience in Site Reliability Engineering (SRE), DevOps, or similar technical roles.

· Strong expertise with CI/CD tools such as GitHub Actions, Jenkins, or equivalent.

· Solid working experience with Microsoft Azure, including Azure Kubernetes Service (AKS), databases, and networking components.

· Proven ability to automate deployment workflows and infrastructure provisioning.

· Experience implementing Infrastructure as Code (IaC) using Terraform, ARM templates, or similar tools.

· Strong scripting knowledge in PowerShell, Bash, or Python.

· Experience with GitOps frameworks like ArgoCD or Flux.

· Hands-on experience with containerization technologies such as Docker and Kubernetes.

· Strong understanding of networking fundamentals and Azure network security practices.

· Knowledge of cloud security and compliance best practices.

· Excellent verbal and written communication skills in English.

· Flexibility to collaborate across US or EU time zones.

Preferred Qualifications:

· Microsoft Azure certifications (Azure Developer Associate, Azure DevOps Engineer Expert, or Azure Administrator).

· Experience with observability and monitoring tools such as Application Insights, Elastic Stack (ELK), or Grafana.

· Knowledge of log aggregation and analysis using Elastic and Prometheus.

· Understanding of high availability, scalability, and disaster recovery strategies.

· Experience managing containerized Windows-based applications.

Why Join Pathlock?

· Work on a cutting-edge cloud security and automation platform used by global enterprises.

· Gain hands-on experience with modern SRE technologies — from Kubernetes and Elastic to Terraform and GitOps automation.

· Be part of a fast-paced, growth-oriented environment where innovation, reliability, and performance matter.

· Enjoy competitive compensation, benefits, and equity options.

· Collaborate in an inclusive, knowledge-driven culture that values continuous learning, ownership, and technical excellence.



  • Ciudad de México, Ciudad de México Azkait A tiempo completo

    AZKAITes una empresa mexicana que busca y conecta el mejor talento IT con empresas Latinoamericanas y de Estados Unidos.Estamos en la búsqueda de tu talento comoSite Reliability Engineer (SRE)Requisitos:Licenciatura o Ingeniería en Sistemas, Informática o afín.+5 años de experiencia en roles de SRE, DevOps o Ingeniería de Software.Experiencia...

  • Site Reliability Engineer

    hace 2 semanas


    Ciudad de México, Ciudad de México Sur A tiempo completo

    As the Site Reliability Engineer you will support and scale the infrastructure powering their secure, mission-critical SaaS platform. You must be confident in operating and debugging both modern infrastructure (cloud-native, containerized services) and classic Windows production environments (IIS, SQL Server AlwaysOn, Service Broker), with the ability to...


  • Ciudad de México, Ciudad de México Sur A tiempo completo

    As the Site Reliability Engineer you will support and scale the infrastructure powering their secure, mission-critical SaaS platform. You must be confident in operating and debugging both modern infrastructure (cloud-native, containerized services) and classic Windows production environments (IIS, SQL Server AlwaysOn, Service Broker), with the ability to...

  • Site Reliability Engineer

    hace 2 semanas


    Ciudad de México, Ciudad de México Sur A tiempo completo

    As the Site Reliability Engineer you will support and scale the infrastructure powering their secure, mission-critical SaaS platform.You must be confident in operating and debugging both modern infrastructure (cloud-native, containerized services) and classic Windows production environments (IIS, SQL Server AlwaysOn, Service Broker), with the ability to...


  • Ciudad de México, Ciudad de México Tech Mahindra A tiempo completo

    We're Hiring We are seeking a talented Site Reliability Engineer (SRE) CDMX with robust experience in Azure environments, Kubernetes, and DevOps practices.Your mission will be to ensure the reliability, scalability, and automation of our critical platforms. If you thrive on solving complex challenges, automating processes, and ensuring seamless operations,...


  • Ciudad de México, Ciudad de México Thomson Reuters México A tiempo completo

    Are you passionate about the chance to bring your experience to a world-class company that is market-leading or both content and technology? If yes, we're looking for you.Join our team Senior Site Reliability Engineer (SRE) will be implement Site Reliability Engineering and DevOps best practices. Feed non-functional requirements into the product backlog,...


  • Ciudad de México, Ciudad de México Capgemini A tiempo completo

    Our Client is one of the United States' largest insurers, providing a wide range of insurance and financial services products with gross written premiums well over US$25 Billion (P&C). They proudly serve more than 10 million U.S. households with more than 19 million individual policies across all 50 states through the efforts of over 48,000 exclusive and...


  • Ciudad de México, Ciudad de México AXA Group Operations A tiempo completo

    Main missionsBeing part of our global team as a Linux Engineer and become a key member of the SRO Squad (Site Reliability Operations), collaborating with a diverse group of experts to ensure robust and secure Linux (RHEL) infrastructure worldwide.Engineer (Build) and test solutions, document accordingly and handover to operations team. Provide 3rd level...


  • Ciudad de México, Ciudad de México Oracle A tiempo completo

    DescriptionSolve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Design and develop designs, architectures, standards, and methods for large-scale distributed systems....


  • Santiago de Querétaro, Querétaro de Arteaga, México RELEX Solutions A tiempo completo

    Technical Service Consultant/Site Reliability EngineerBased at: RELEX office in MexicoEmployment type: Permanent, full-timeTravel: Some ad hoc travel to client sites and the Atlanta office may be requiredThe RELEX team in the Americas is growing, and we're now looking for a Technical Consultant/Site Reliability Engineer. You'll join our global continuous...