DevOps / SRE Engineer – Azure Platform

23 hours ago


Mexico City · Pinnacle Talent Placement · Full-time

Are you legally eligible to work where you live? We are not able to sponsor visas.

Who We Are

We are a worldwide technology consulting firm providing advanced software and cloud services to enterprise organizations. Our experts design and deliver scalable systems, transform legacy infrastructure, and deploy automation within complex, distributed ecosystems. With engineering teams based across LATAM, Europe, the United States, and South Africa, we blend strong technical capability with a collaborative mindset, ongoing innovation, and dependable execution.

Overview

The SRE Technical Team is seeking a Site Reliability Engineer (SRE) – Technical Member to support the engineering, operational, and reliability needs of a major cloud-based platform in a Microsoft Azure environment. This role will provide direct daily support and reliability engineering for production systems, working closely with SRE, DevOps, InfoSec, and Agile development teams to maintain platform stability, scalability, and performance.

Role Summary

The SRE Technical Member will:

  • Deliver engineering, operational, and administrative support for the application and its technology landscape.
  • Address reliability and operational challenges such as application failures, production issues, infrastructure performance (disk, memory), monitoring, and security.
  • Serve as a mid-level subject matter expert, integrating with multiple teams to develop and evolve SRE practices for Azure-based environments.
  • Participate in production support activities, including deployments, upgrades, and critical issue resolution.

This role is central to designing, implementing, and maintaining monitoring, alerting, and reporting solutions across servers, containers, databases, and cloud infrastructure components.

Project Context

The platform is a distributed, cloud-based system serving hundreds of geographically dispersed clients. It operates on Microsoft Azure using a microservices architecture, combining open-source, licensed, and internally developed tools for provisioning, deployment, monitoring, and logging.

SREs own the entire production stack, from application functionality to infrastructure resilience, ensuring availability, reliability, and scalability in a 24/7 operational environment. This role requires problem-solving through data, collaboration, and technical expertise, maintaining a balance between engineering innovation and practical delivery.

Key Responsibilities

  • Collaborate with SRE, DevOps, and InfoSec teams on new projects, platform builds, and deployments.
  • Contribute to the design, implementation, and operation of large-scale, Azure-based platforms.
  • Apply industry best practices in monitoring, alerting, reporting, and cloud architecture.
  • Participate in infrastructure, application, and security planning, focusing on scalability, redundancy, and data preservation.
  • Support high-availability topologies with development teams.
  • Produce documentation and weekly operational status reports, detailing project progress and key metrics.
  • Provide engineering and support for technical infrastructure, cloud, databases, and application performance.
  • Manage incident response, change management, and user permissions following SRE best practices (Google SRE model).
  • Maintain close collaboration between Application, SRE, DevOps, InfoSec, and business units.
  • Assist in configuring and onboarding new applications into the Azure DevOps (ADO) platform.

Core Technical Skills

Operational Skills

  • Strong understanding of SRE fundamentals: monitoring, alerting, reporting, performance, availability, and incident response.
  • Hands-on experience with CI/CD tools (Git, Azure Pipelines, Ansible, etc.).
  • Infrastructure as Code (IaC) design, scripting, and setup.
  • Deep knowledge of Azure Web Services: installation, configuration, and management.
  • Experience administering Microsoft applications (.NET, C#, Angular) with a focus on automation, optimization, and security.
  • Proficiency in Cosmos DB and MS SQL operational tasks.
  • Excellent troubleshooting, root-cause analysis, and problem-solving skills.
  • Experience with disaster recovery, scalability testing, and capacity planning.

Automation Skills

  • Expertise with cloud deployment and automation tools (Git, Azure DevOps, Ansible, etc.).
  • Ability to automate routine deployment, monitoring, and administrative tasks.
  • Write and maintain documentation and custom tools for monitoring and performance optimization.

Scripting & Development

  • Proficiency in Shell scripting and API troubleshooting for production support.
  • Experience designing, authoring, and maintaining .NET / C# code.
  • Capability to deliver hotfixes and operational patches (.NET & Angular).
  • Working knowledge of automation scripting languages for operational tools development.

Qualifications

  • Bachelor’s degree in a technical discipline (Computer Science, Engineering, or related field).
  • 5+ years of industry experience in SRE, DevOps, or related technical operations roles.
  • Proven experience in cloud infrastructure, automation, and application reliability engineering within large-scale, enterprise environments.

Summary

This is a dynamic and hands-on role within a global, collaborative SRE environment. The SRE Technical Member will contribute to building resilient systems, automating operations, and ensuring the platform meets high standards for performance, reliability, and security.


  • Azure SRE

    1 week ago


    Mexico City · Pinnacle Talent Placement · Full-time

    Are you legally eligible to work where you live? We are not able to sponsor visas. Who We Are We are a global technology consulting company that delivers innovative software and cloud solutions for enterprise clients. Our teams specialize in building scalable platforms and implementing automation across large distributed environments. With engineering...

  • Power Platform SRE

    4 weeks ago


    Mexico City · HSBC · Full-time

    A global banking and financial services company is seeking a Microsoft Power Platform Site Reliability Engineer in Mexico City. This role involves serving as an SRE champion, monitoring platform performance, and conducting root cause analysis to ensure system availability. Candidates should have strong automation skills and a good understanding of Azure...

  • Senior DevOps Engineer

    2 weeks ago


    Mexico · Minacs · Full-time

    A global technology and services leader in Ontario, Mexico, is seeking an experienced Technical Engineer DevOps to optimize and design scalable cloud environments with Azure services. The ideal candidate will have over 8 years of SRE experience, solid skills in Kubernetes, Docker, and Python, as well as expertise in GitOps-based CI/CD pipelines....

  • Senior DevOps Engineer

    2 weeks ago


    Mexico City · Stefanini Group · Full-time

    Our company is looking for a Senior DevOps Engineer / Cloud Platform Engineer to lead the construction, automation, and operation of critical infrastructure in cloud environments. We are looking for a highly technical profile with strategic vision and a passion for best practices. Deploy and operate highly complex infrastructure on AWS, GCP, or...


  • Mexico City · Pinnacle Talent Placement · Full-time

    A global technology consulting firm is seeking a Site Reliability Engineer (SRE) – Technical Member in Mexico City. The ideal candidate will have 5+ years of experience in SRE or DevOps roles, with strong skills in cloud infrastructure and automation tools. Key responsibilities include providing operational support for a cloud platform, collaborating with...


  • Mexico City · Tech Mahindra · Full-time

    A leading tech company is seeking a Site Reliability Engineer (SRE) in Mexico City. The role requires solid experience with Azure environments, Kubernetes, and DevOps practices. Key responsibilities include optimizing CI/CD pipelines, managing cloud infrastructure, and automating processes. Advanced English proficiency is essential. The position offers...

  • Site Reliability Engineer

    2 weeks ago


    Mexico City · Coforge · Full-time

    Job Title / Role: SRE Lead. Key Skills: Azure, AWS, Terraform, ARM templates. Experience: 10+. Location: Mexico City, Mexico. Shift: General. Mode: On-Site. We at Coforge are seeking an "SRE Lead" with the following skill set: Role Overview We are seeking an experienced SRE and DevOps Lead to drive reliability, scalability, and automation across multi-cloud...

  • Site Reliability Engineer

    3 weeks ago


    Mexico City · Coforge · Full-time

    Job Title / Role: SRE Lead. Key Skills: Azure, AWS, Terraform, ARM templates. Experience: 10+. Location: Mexico City, Mexico. Shift: General. Mode: On-Site. We at Coforge are seeking an "SRE Lead" with the following skill set: Role Overview We are seeking an experienced SRE and DevOps Lead to drive reliability, scalability, and automation across multi-cloud and...