Site Reliability Engineer

3 hours ago


Mexico City Mastercard Full-time

Overview
Mastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we’re helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships and networks combine to deliver a unique set of products and services that help people, businesses and governments realize their greatest potential.

Title And Summary
Site Reliability Engineer (Automation & virtualization)

About The Role
We’re looking for a passionate and skilled Site Reliability Engineer (SRE) to join our Platform Engineering team. This role is pivotal in automating and managing VMware ESXi hypervisors across Dell and Cisco UCS platforms, ensuring high reliability, scalability, and performance of our infrastructure.
You’ll work at the intersection of infrastructure and software, driving automation, observability, and operational excellence across our virtualization stack.

Key Responsibilities
Hypervisor & Infrastructure Management: Deploy, configure, and patch ESXi hosts using tools like VMware Update Manager, iDRAC, and UCS Central. Validate host readiness and enforce consistency across environments.
Automation & Infrastructure as Code: Build and maintain automation pipelines using PowerCLI, Python, Terraform, and Ansible. Develop Infrastructure-as-Code (IaC) templates for scalable provisioning.
NSX & Network Integration: Administer NSX-T/V for logical switching, routing, and micro-segmentation. Troubleshoot endpoint tagging and network performance issues between NSX and ESXi.
Monitoring & Observability: Implement observability stacks using Prometheus, Grafana, Splunk, and Dynatrace. Define and track SLOs, SLIs, and error budgets.
Security & Compliance: Ensure security and compliance requirements are integrated into design, deployment, and operations.
Planning & Optimization: Lead modernization efforts including UCS blade decommissioning and Dell R760 upgrades. Optimize cluster and VM sizing for performance and cost efficiency.
Collaboration & Stakeholder Engagement: Partner with application, storage, and network teams to align infrastructure with workload needs. Communicate upgrade plans and maintenance schedules across teams.
Documentation & Knowledge Sharing: Maintain build guides, validation checklists, and operational runbooks. Contribute to internal wikis and onboarding materials.

Required Skills
5+ years in SRE, DevOps, or Platform Engineering roles.
Strong scripting in PowerCLI, Python, or Go.
Experience with VMware ESXi, vCenter, NSX, and UCS Manager.
Proficiency in Terraform, Ansible, and CI/CD pipeline tools.
Familiarity with observability platforms and incident response workflows.

Preferred Qualifications
Experience with REST API integration for ESXi and vCenter.
Knowledge of GitOps, AIOps, and chaos engineering practices.
Certifications: VMware VCP, CKA/CKAD, or equivalent.

Corporate Security Responsibility
All activities involving access to Mastercard assets, information, and networks come with an inherent risk to the organization and, therefore, it is expected that every person working for, or on behalf of, Mastercard is responsible for information security and must:
Abide by Mastercard’s security policies and practices;
Ensure the confidentiality and integrity of the information being accessed;
Report any suspected information security violation or breach; and
Complete all periodic mandatory security trainings in accordance with Mastercard’s guidelines.



  • Mexico City W3Global Full-time

    Site Reliability Engineer at W3Global. Required qualifications: AWS experience; GitLab; Terraform or AWS CDK; Python; familiarity with Go; Linux OS administration and advanced scripting (Bash); Windows OS administration and advanced scripting (PowerShell). Seniority level: Entry level. Employment type: Full-time. Job function...



  • Site Reliability Engineer

    3 weeks ago


    Mexico City Royal Caribbean Group Full-time

    Talent Acquisition @Royal Caribbean Group Journey with us! Combine your career goals and sense of adventure by joining our incredible team of employees at Royal Caribbean Group. We are proud to offer a competitive compensation and benefits package, and excellent career development opportunities, each offering unique ways to explore the world. We are proud to...

  • Site Reliability Engineer

    3 weeks ago


    Mexico City Tata Consultancy Services Full-time

    We are looking for a Site Reliability Engineer (SRE) to join our team and help us ensure seamless, high-performing, and reliable technology operations.
    What you’ll work with:
    Azure DevOps - Pipelines, repositories, and automation
    ServiceNow - Incident, change, and problem management
    AppDynamics - Application performance monitoring and alerting
    Microsoft Azure...



  • Site Reliability Engineer

    2 weeks ago


    Mexico City The Functionary Full-time

    Senior Site Reliability Engineer We are looking for a Senior Site Reliability Engineer to build and maintain reliable, high‑capacity, and high‑performing systems that support our mission to protect and improve customer platforms, with a strong focus on reliability, security, performance, cost, and operational excellence. As a Site Reliability Engineer on...




  • Mexico City Royal Caribbean Group Full-time

    Senior Site Reliability Engineer at Royal Caribbean Group. 1 week ago. Journey with us! Combine your career goals and sense of adventure by joining our incredible team at Royal Caribbean Group. We offer a competitive compensation and benefits package, along with excellent career development...




  • Mexico City BairesDev Full-time

    Site Reliability Engineer - Remote Work | REF#282116 at BairesDev. At...