Site Reliability Engineer
hace 1 semana
Our Purpose Mastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we’re helping build asustainableeconomy where everyone can prosper. We support a wide range of digital payments choices, making transactionssecure, simple, smart and accessible. Our technology and innovation, partnerships and networks combine to deliver a unique set of products and services that help people, businesses and governments realize their greatest potential. Title and Summary Site Reliability Engineer (Automation & virtualization) Site Reliability Engineer About the Role We’re looking for a passionate and skilled Site Reliability Engineer (SRE) to join our Platform Engineering team. This role is pivotal in automating and managing VMware ESXi hypervisors across Dell and Cisco UCS platforms, ensuring high reliability, scalability, and performance of our infrastructure. You’ll work at the intersection of infrastructure and software, driving automation, observability, and operational excellence across our virtualization stack. Key Responsibilities Hypervisor & Infrastructure ManagementDeploy, configure, and patch ESXi hosts using tools like VMware Update Manager, iDRAC, and UCS Central.Validate host readiness and enforce consistency across environments. Automation & Infrastructure as CodeBuild and maintain automation pipelines using PowerCLI, Python, Terraform, and Ansible.Develop Infrastructure-as-Code (IaC) templates for scalable provisioning. NSX & Network IntegrationAdminister NSX-T/V for logical switching, routing, and micro-segmentation.Troubleshoot endpoint tagging and network performance issues between NSX and ESXi. Monitoring & ObservabilityImplement observability stacks using Prometheus, Grafana, Splunk, and Dynatrace.Define and track SLOs, SLIs, and error budgets. Security & Compliance Planning & OptimizationLead modernization efforts including UCS blade decommissioning and Dell R760 upgrades.Optimize cluster and VM sizing for performance and cost efficiency. Collaboration & Stakeholder EngagementPartner with application, storage, and network teams to align infrastructure with workload needs.Communicate upgrade plans and maintenance schedules across teams. Documentation & Knowledge SharingMaintain build guides, validation checklists, and operational runbooks.Contribute to internal wikis and onboarding materials. Required Skills 5+ years in SRE, DevOps, or Platform Engineering roles. Strong scripting in PowerCLI, Python, or Go. Experience with VMware ESXi, vCenter, NSX, and UCS Manager. Proficiency in Terraform, Ansible, and CI/CD pipeline tools. Familiarity with observability platforms and incident response workflows. Preferred Qualifications Experience with REST API integration for ESXi and vCenter. Knowledge of GitOps, AIOps, and chaos engineering practices. Certifications: VMware VCP, CKA/CKAD, or equivalent. Corporate Security Responsibility Abide by Mastercard’s security policies and practices; Ensure the confidentiality and integrity of the information being accessed; Report any suspected information security violation or breach, and Complete all periodic mandatory security trainings in accordance with Mastercard’s guidelines. #J-18808-Ljbffr
-
Site Reliability Engineer
hace 2 semanas
Ciudad de México Atos A tiempo completo**Job Applicant Privacy Notice**: **Site Reliability Engineer**: - Publication Date: Jan 8, 2025 - Ref. No: 523940 - Location: Mexico City, MX **_Site Reliability Engineer_** Certain Scripting experience in languages like Java or Python or Shell scripting. - +3 years of significant experience in working as Site Reliability Engineer - Strong in Terraform,...
-
Site Reliability Engineer
hace 1 semana
Ciudad de México UST A tiempo completoJoin to apply for the Site Reliability Engineer role at UST Continue with Google Continue with Google Join to apply for the Site Reliability Engineer role at UST Get AI-powered advice on this job and more exclusive features. Sign in to access AI-powered advices Continue with Google Continue with Google Continue with Google Continue with Google Continue with...
-
Site Reliability Engineer
hace 1 semana
Ciudad de México Atos A tiempo completo**Job Applicant Privacy Notice**:**Site Reliability Engineer**:- Publication Date: Jan 14, 2025- Ref. No: - Location: Mexico City, MXEviden, part of the Atos Group, with an annual revenue of circa € 5 billion is a global leader in data-driven, trusted and sustainable digital transformation. As a next generation digital business with worldwide leading...
-
Site Reliability Engineer
hace 1 semana
Ciudad de México Zenta group A tiempo completo**Site Reliability Engineer | Presencial - CDMX** **Resumen del Rol**: Como **Site Reliability Engineer (SRE)** en Zenta Group, serás el puente entre desarrollo y operaciones, asegurando que los servicios sean **escalables, confiables y resilientes**. Diseñarás e implementarás soluciones que mejoren la estabilidad y el rendimiento de la infraestructura,...
-
Site Reliability Engineer
hace 2 semanas
Estado de México BairesDev A tiempo completoSite Reliability Engineer - Remote Work | REF# Join to apply for the Site Reliability Engineer - Remote Work | REF# role at BairesDev Site Reliability Engineer - Remote Work | REF# 6 months ago Be among the first 25 applicants Join to apply for the Site Reliability Engineer - Remote Work | REF# role at BairesDev At BairesDev, we've been leading the way in...
-
Site Reliability Engineer
hace 3 semanas
Ciudad de México Quantum World Technologies Inc. A tiempo completoRole: Site Reliability Engineer (SRE) – Database Services Location: Mexico / Costa Rica / Argentina preferred (Open to LATAM) Availability: Immediate About the Role We are looking for a Site Reliability Engineer (SRE) to join the Database Engineering team and contribute to the reliability, resilience, and automation of mission-critical PostgreSQL...
-
Site Reliability Engineer
hace 3 semanas
Ciudad de México Quantum World Technologies Inc. A tiempo completoRole: Site Reliability Engineer (SRE) – Database Services Location: Open to LATAM About the Role We are looking for a Site Reliability Engineer (SRE) to join the Database Engineering team and contribute to the reliability, resilience, and automation of mission-critical PostgreSQL environments. This role is ideal for an SRE who wants to grow into database...
-
Site Reliability Engineer
hace 2 semanas
Ciudad de México Royal Caribbean Group A tiempo completo**Journey with us!** Combine your career goals and sense of adventure by joining our incredible team of employees at **Royal Caribbean Group** We are proud to offer a competitive compensation and benefits package and excellent career development opportunities each offering unique ways to explore the world We are proud to be the vacation-industry leader with...
-
Site Reliability Engineer
hace 2 semanas
Ciudad de México Royal Caribbean Group A tiempo completo**Journey with us!** Combine your career goals and sense of adventure by joining our incredible team of employees at **Royal Caribbean Group** We are proud to offer a competitive compensation and benefits package and excellent career development opportunities each offering unique ways to explore the worldWe are proud to be the vacation-industry leader with...
-
Site Reliability Engineer
hace 1 semana
México ITJ A tiempo completoPosition OverviewOur customer is revolutionizing the cancer diagnostics space and is now looking for another Site Reliability Engineer (SRE) to join its incredible team. SREs support our mission by pushing out new features and applications every day. The Site Reliability Engineering team constantly practices the DevOps mindset to build and deploy...