DevOps / SRE Engineer – Azure Platform

hace 3 semanas


Mexico City Pinnacle Talent Placement A tiempo completo

Are you legally eligible to work where you live? We are not able to sponsor VISAs. Who We Are We are a worldwide technology consulting firm providing advanced software and cloud services to enterprise organizations. Our experts design and deliver scalable systems, transform legacy infrastructure, and deploy automation within complex, distributed ecosystems. With engineering teams based across LATAM, Europe, the United States, and South Africa, we blend strong technical capability with a collaborative mindset, ongoing innovation, and dependable execution. Overview The SRE Technical Team is seeking a Site Reliability Engineer (SRE) – Technical Member to support the engineering, operational, and reliability needs of a major cloud-based platform in a Microsoft Azure environment. This role will provide direct daily support and reliability engineering for production systems, working closely with SRE, DevOps, InfoSec, and Agile development teams to maintain platform stability, scalability, and performance. Role Summary The SRE Technical Member will: Deliver engineering, operational, and administrative support for the application and its technology landscape. Address reliability and operational challenges such as application failures, production issues, infrastructure performance (disk, memory), monitoring, and security. Serve as a mid-level subject matter expert, integrating with multiple teams to develop and evolve SRE practices for Azure-based environments. Participate in production support activities, including deployments, upgrades, and critical issue resolution. This role is central to designing, implementing, and maintaining monitoring, alerting, and reporting solutions across servers, containers, databases, and cloud infrastructure components. Project Context The platform is a distributed, cloud-based system serving hundreds of geographically dispersed clients. It operates on Microsoft Azure using a microservices architecture, combining open-source, licensed, and internally developed tools for provisioning, deployment, monitoring, and logging. SREs own the entire production stack — from application functionality to infrastructure resilience — ensuring availability, reliability, and scalability in a 24/7 operational environment. This role requires problem-solving through data, collaboration, and technical expertise, maintaining a balance between engineering innovation and practical delivery. Key Responsibilities Collaborate with SRE, DevOps, and InfoSec teams on new projects, platform builds, and deployments. Contribute to the design, implementation, and operation of large-scale, Azure-based platforms. Apply industry best practices in monitoring, alerting, reporting, and cloud architecture. Participate in infrastructure, application, and security planning, focusing on scalability, redundancy, and data preservation. Support high-availability topologies with development teams. Produce documentation and weekly operational status reports, detailing project progress and key metrics. Provide engineering and support for technical infrastructure, cloud, databases, and application performance. Manage incident response, change management, and user permissions following SRE best practices (Google SRE model). Maintain close collaboration between Application, SRE, DevOps, InfoSec, and business units. Assist in configuring and onboarding new applications into the Azure DevOps (ADO) platform. Core Technical Skills Operational Skills Strong understanding of SRE fundamentals: monitoring, alerting, reporting, performance, availability, and incident response. Hands-on experience with CI/CD tools (Git, Azure Pipelines, Ansible, etc.). Infrastructure as Code (IaC) design, scripting, and setup. Deep knowledge of Azure Web Services — installation, configuration, and management. Experience administering Microsoft applications (.NET, C#, Angular) with focus on automation, optimization, and security. Proficiency in Cosmos DB and MS SQL operational tasks. Excellent troubleshooting, root-cause analysis, and problem-solving skills. Experience with disaster recovery, scalability testing, and capacity planning. Automation Skills Expertise with cloud deployment and automation tools (Git, Azure DevOps, Ansible, etc.). Ability to automate routine deployment, monitoring, and administrative tasks. Write and maintain documentation and custom tools for monitoring and performance optimization. Scripting & Development Proficiency in Shell scripting and API troubleshooting for production support. Experience designing, authoring, and maintaining .NET / C# code. Capability to deliver hotfixes and operational patches (.NET & Angular). Working knowledge of automation scripting languages for operational tools development. Qualifications Bachelor’s degree in a technical discipline (Computer Science, Engineering, or related field). 5+ years of industry experience in SRE, DevOps, or related technical operations roles. Proven experience in cloud infrastructure, automation, and application reliability engineering within large-scale, enterprise environments. Summary This is a dynamic and hands-on role within a global, collaborative SRE environment. The SRE Technical Member will contribute to building resilient systems, automating operations, and ensuring the platform meets high standards for performance, reliability, and security. #J-18808-Ljbffr


  • Azure SRE

    hace 4 semanas


    Mexico City Pinnacle Talent Placement A tiempo completo

    Are you legally eligible to work where you live? We are not able to sponsor VISAs. Who We Are We are a global technology consulting company that delivers innovative software and cloud solutions for enterprise clients. Our teams specialize in building scalable platforms, and implementing automation across large distributed environments. With engineering...

  • Azure SRE

    hace 4 semanas


    Mexico City Pinnacle Talent Placement A tiempo completo

    Are you legally eligible to work where you live? We are not able to sponsor VISAs. Who We Are We are a global technology consulting company that delivers innovative software and cloud solutions for enterprise clients. Our teams specialize in building scalable platforms, and implementing automation across large distributed environments. With engineering...

  • Power Platform SRE

    hace 2 semanas


    Mexico City HSBC A tiempo completo

    A global banking and financial services company is seeking a Microsoft Power Platform Site Reliability Engineer in Mexico City. This role involves serving as an SRE champion, monitoring platform performance, and conducting root cause analysis to ensure system availability. Candidates should have strong automation skills and a good understanding of Azure...


  • Mexico City Pinnacle Talent Placement A tiempo completo

    A global technology consulting firm is seeking a Site Reliability Engineer (SRE) – Technical Member in Mexico City. The ideal candidate will have 5+ years of experience in SRE or DevOps roles, with strong skills in cloud infrastructure and automation tools. Key responsibilities include providing operational support for a cloud platform, collaborating with...


  • Mexico City Tech Mahindra A tiempo completo

    A leading technology firm is seeking a Site Reliability Engineer in Mexico City. This role involves designing and optimizing CI/CD pipelines, managing cloud infrastructure on Azure, and automating processes to enhance performance. Candidates should have 3–5 years of experience in SRE or DevOps, strong knowledge of Azure, containers, and scripting, and must...


  • Mexico City Tech Mahindra A tiempo completo

    A leading tech company is seeking a Site Reliability Engineer (SRE) in Mexico City. The role requires solid experience with Azure environments, Kubernetes, and DevOps practices. Key responsibilities include optimizing CI/CD pipelines, managing cloud infrastructure, and automating processes. Advanced English proficiency is essential. The position offers...


  • Mexico City Tech Mahindra A tiempo completo

    A leading tech company is seeking a Site Reliability Engineer (SRE) in Mexico City. The role requires solid experience with Azure environments, Kubernetes, and DevOps practices. Key responsibilities include optimizing CI / CD pipelines, managing cloud infrastructure, and automating processes. Advanced English proficiency is essential. The position offers...

  • Site Reliability Engineer

    hace 3 semanas


    Mexico City Azka IT A tiempo completo

    Site Reliability Engineer (SRE) AZKAIT es una empresa mexicana que busca y conecta el mejor talento IT con empresas Latinoamericanas y de Estados Unidos. Requisitos Licenciatura o Ingeniería en Sistemas, Informática o afín. +5 años de experiencia en roles de SRE, DevOps o Ingeniería de Software. Experiencia programando en Python. Experiencia con Docker...

  • Site Reliability Engineer

    hace 3 semanas


    Mexico City Azka IT A tiempo completo

    Site Reliability Engineer (SRE) AZKAIT es una empresa mexicana que busca y conecta el mejor talento IT con empresas Latinoamericanas y de Estados Unidos. Requisitos Licenciatura o Ingeniería en Sistemas, Informática o afín. +5 años de experiencia en roles de SRE, DevOps o Ingeniería de Software. Experiencia programando en Python. Experiencia con Docker...

  • DevOps Engineer

    hace 3 días


    Mexico City Gravity IT Resources A tiempo completo

    We’re seeking an experienced DevOps Engineer with intimate knowledge and experience of the Azure platform. Hands‑on experience and the ability to communicate clearly while delivering solutions that meet stringent requirements are necessary. The DevOps engineer will be responsible for implementing and maintaining Azure DevOps pipelines to deploy and...