Director, Site Reliability Engineering

hace 24 horas


Ciudad de México, Ciudad de México Mastercard A tiempo completo

Our Purpose
Mastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we're helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships and networks combine to deliver a unique set of products and services that help people, businesses and governments realize their greatest potential.
Title And Summary
Director, Site Reliability Engineering

Who is Mastercard?

Mastercard is a global technology company in the payments industry. Our mission is to connect and power an inclusive, digital economy that benefits everyone, everywhere by making transactions safe, simple, smart, and accessible. Using secure data and networks, partnerships and passion, our innovations and solutions help individuals, financial institutions, governments, and businesses realize their greatest potential.

Our decency quotient, or DQ, drives our culture and everything we do inside and outside of our company. With connections across more than 210 countries and territories, we are building a sustainable world that unlocks priceless possibilities for all.

Overview
Are you a visionary leader who thrives on driving transformation in complex infrastructure environments? Do you excel at building high-performing teams, fostering innovation, and aligning technology with business outcomes? The Distributed Platform Operations team is seeking a Director of Site Reliability Engineering (SRE) to lead strategic initiatives that ensure the reliability, scalability, and performance of our VMware and Oracle Linux platforms.

This role is ideal for a seasoned leader who combines deep technical expertise with a passion for operational excellence, automation, and cross-functional collaboration.

Skills
Strategic Leadership & Vision

  • Define and execute the strategic roadmap for Site Reliability Engineering across distributed platforms.
  • Lead modernization efforts including hardware lifecycle management, virtualization upgrades, and infrastructure optimization.
  • Champion a culture of automation, resilience, and continuous improvement.
  • Build, mentor, and scale a high-impact SRE organization with a focus on technical excellence and career development.
  • Establish clear objectives, performance metrics, and development plans for team members.
  • Promote knowledge sharing and operational maturity through documentation and onboarding programs.
  • Oversee the health and performance of VMware clusters, ESXi hosts, and Oracle Linux environments.
  • Ensure robust disaster recovery and high availability strategies are in place and tested.
  • Drive incident management and root cause analysis for critical infrastructure issues.
  • Lead the adoption of Infrastructure-as-Code and automation frameworks using tools like Chef, Ansible, PowerCLI, Python, and Jenkins.
  • Reduce operational toil through scalable automation and self-healing systems.
  • Align engineering practices with DevOps principles and agile methodologies.
  • Architect observability solutions using Prometheus, Grafana, Splunk, and Dynatrace.
  • Define and enforce Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets.
  • Optimize alerting and telemetry to support proactive incident response.
  • Ensure infrastructure compliance with security baselines, OS configurations, and regulatory standards.
  • Collaborate with InfoSec and audit teams to maintain a secure and compliant environment.
  • Partner with application, network, and storage teams to align infrastructure capabilities with business needs.
  • Communicate technical strategies, upgrade plans, and operational impacts to executive stakeholders.
  • Influence enterprise architecture and platform engineering decisions.

Experience

  • 10+ years of experience in infrastructure, SRE, or platform engineering roles, with 5+ years in leadership.
  • Proven success in leading large-scale infrastructure modernization and automation initiatives.
  • Deep expertise in VMware, Linux systems, and SRE practices.
  • Strong executive communication, strategic thinking, and stakeholder management skills.

Corporate Security Responsibility
All activities involving access to Mastercard assets, information, and networks comes with an inherent risk to the organization and, therefore, it is expected that every person working for, or on behalf of, Mastercard is responsible for information security and must:

  • Abide by Mastercard's security policies and practices;
  • Ensure the confidentiality and integrity of the information being accessed;
  • Report any suspected information security violation or breach, and
  • Complete all periodic mandatory security trainings in accordance with Mastercard's guidelines.


  • Ciudad de México, Ciudad de México Mastercard A tiempo completo

    Our PurposeMastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we're helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships...


  • Ciudad de México, Ciudad de México Pathlock A tiempo completo

    About Pathlock:Pathlock is a leader in application security, access governance, and compliance automation. Our cloud-based solutions help organizations secure critical applications, mitigate risk, and enforce policies across a diverse IT landscape.Job Summary:Join Pathlock, a fast-growing leader in Governance, Access and Compliance, where you'll help shape...

  • Site Reliability Engineer

    hace 2 semanas


    Ciudad de México, Ciudad de México Azkait A tiempo completo

    AZKAITes una empresa mexicana que busca y conecta el mejor talento IT con empresas Latinoamericanas y de Estados Unidos.Estamos en la búsqueda de tu talento comoSite Reliability Engineer (SRE)Requisitos:Licenciatura o Ingeniería en Sistemas, Informática o afín.+5 años de experiencia en roles de SRE, DevOps o Ingeniería de Software.Experiencia...


  • Ciudad de México, Ciudad de México itD A tiempo completo

    itD is seeking aSite Reliability Engineerwho will report to the Sr. Engineering Manager for a client in the gaming and entertainment space.As a Site Reliability Engineer, you will focus on designing, deploying, and operating resilient, secure, and globally scalable services in AWS, with , TypeScript, Kubernetes, GitLab, Argo CD (CI/CD).This long-term W2...


  • Ciudad de México, Ciudad de México Mastercard A tiempo completo

    Our PurposeMastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we're helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships...

  • Site Director

    hace 1 semana


    Ciudad de México, Ciudad de México The QualiFind Group A tiempo completo

    POSITION SUMMARYAs the Site Director, you will be responsible for leading the successful launch, growth, and ongoing performance of the center. This role requires an experienced operations leader with a strong BPO and telecom background who can build, inspire, and run a high-performing customer engagement operation. The ideal candidate blends strategic...

  • Director of Engineering

    hace 2 semanas


    Ciudad de México, Ciudad de México TransNetwork LLC A tiempo completo

    We are seeking an accomplished engineering leader to scale and mature a high-performance technology organization across multiple product lines and platforms. The ideal candidate is a strategic, hands-on leader who excels in building and guiding modern engineering teams, driving delivery excellence, and partnering cross-functionally to execute an ambitious...

  • Engineering Director

    hace 6 días


    Ciudad de México, Ciudad de México Nubank A tiempo completo

    About NubankNu is the world's largest digital banking platform outside of Asia, serving over 123 million customers across Brazil, Mexico, and Colombia. The company has been leading an industry transformation by leveraging data and proprietary technology to develop innovative products and services. Guided by its mission to fight complexity and empower people,...


  • Ciudad de México, Ciudad de México Thomson Reuters México A tiempo completo

    Are you passionate about the chance to bring your experience to a world-class company that is market-leading or both content and technology? If yes, we're looking for you.Join our team Senior Site Reliability Engineer (SRE) will be implement Site Reliability Engineering and DevOps best practices. Feed non-functional requirements into the product backlog,...


  • Ciudad de México, Ciudad de México Capgemini A tiempo completo

    Our Client is one of the United States' largest insurers, providing a wide range of insurance and financial services products with gross written premiums well over US$25 Billion (P&C). They proudly serve more than 10 million U.S. households with more than 19 million individual policies across all 50 states through the efforts of over 48,000 exclusive and...