Site Reliability Engineer

hace 3 días


Guadalajara, México Valce Talent Solutions A tiempo completo

We are looking for a Lead Site Reliability Engineer who takes the initiative on developing and maintain the system and services for our Cash Management Platform, automating the deployment process, ensuring system scaling, investigating and resolving outdates, identifying and implementing preventive measures proactively, collaborating with key stakeholders, continuously looking for ways to provide real-time visual feedback for all the metrics and statuses.

**What you will do**:
Proactively build and implement services to make IT and support better at their jobs.

Design and implement dashboard that provide valuable real-time insights of platform key metrics.

Leads engagement with software developers, DevOps and other infrastructure engineers to integrate software development and delivery from inception to full operation, ensuring robust released software and systems.

Optimizing on-call rotations & processes.

Ensure Incidents assigned to the team are being managed within agreed SLAs

Ensure alarms are documented in up to date Knowledge Base Articles.

Conduct pot-incident reviews to identify platform status.

**What we’re looking for**:
Bachelor’s degree in computer science or equivalent relevant to SR or Automation/development experience.

7+ years’ experience focussed on Site Reliability Engineering or related position in some of the majors Cloud Platforms.

Involved in the automation of multi-tenant systems, preferably in a cloud environment.

Good understanding of Site Reliability Engineering (SRE) philosophies, technologies, platforms and tools, SLO management, incident resolution, and automation;
Ability to explain technical concepts in clear, non-technical language

Experience building Infrastructure-As-Code.

Experience in Docker and Kubernetes and networking concepts.

Experience with Graphana and Prometeus.

Integration experience with Pager-Duty, ServiceNow, Datadog.

Expertise with system and performance monitoring tools (Dynatrace, Splunk, etc.).

**Hybrid position based in Mexico City, Monterrey or Guadalajara.



  • Guadalajara, Jalisco, México NTT DATA A tiempo completo

    SRE - Site Reliability Engineer We are currently seeking a Site Reliability Engineer to join our team in GDL, Jalisco (MX-JAL), Mexico (MX). Perform L1.5 activities such as monitoring, deployment, rollback. Monitor the efficiency of the Azure cloud systems to prevent outages and initiate an Incident Management bridge in case of an outage. Troubleshoot Azure...


  • Guadalajara, Jalisco, México NTT DATA North America A tiempo completo

    SRE – Site Reliability EngineerWe are currently seeking a Site Reliability Engineer to join our team in GDL, Jalisco (MX-JAL), Mexico (MX).Perform L1.5 activities such as monitoring, deployment, rollback. Monitor the efficiency of the Azure cloud systems to prevent outages and initiate an Incident Management bridge in case of an outage. Troubleshoot Azure...


  • Guadalajara, Jalisco, México NTT DATA A tiempo completo

    SRE - Site Reliability EngineerWe are currently seeking a Site Reliability Engineer to join our team in GDL, Jalisco (MX-JAL), Mexico (MX). Perform L1.5 activities such as monitoring, deployment, rollback. Monitor the efficiency of the Azure cloud systems to prevent outages and initiate an Incident Management bridge in case of an outage. Troubleshoot Azure...


  • Guadalajara, México Finastra USA Corporation A tiempo completo

    **Responsibilities**:**What will you contribute?**As a Site Reliability Engineer your mission is to protect and advance the software & systems behind Finastra’s Cloud hosted services running on Fusion Operate. Finastra believes in a blameless culture where the primary objective is continuous improvement. You’ll be treating operations as a software...


  • Guadalajara, México f5 A tiempo completo

    Everything we do centers around people. That means we obsess over how to make the lives of our customers, and their customers, better. And it means we prioritize a diverse F5 community where each individual can thrive.Software engineering is a core discipline at F5 for many roles. As a software engineer specializing in site reliability, you will bring a...

  • Site Reliability Engineer

    hace 3 semanas


    Guadalajara, México f5 A tiempo completo

    Everything we do centers around people. That means we obsess over how to make the lives of our customers, and their customers, better. And it means we prioritize a diverse F5 community where each individual can thrive.Business/Job Title: Site Reliability Engineer - IAM - IIIPosition Summary:Software engineering is a core discipline at F5 for many roles. As a...

  • Site Reliability Engineer

    hace 3 semanas


    Guadalajara, México Careers at SunDevs A tiempo completo

    **Descripción del puesto**:Como Site Reliability Engineer en SunDevs, colaborarás con otros ingenieros de software senior y Platform Engineers para diseñar y desarrollar sistemas y plataformas en la nube altamente disponibles, escalables, seguras y mantenibles para resolver grandes desafíos.Brindarás asesoramiento y guía a nuestros ingenieros de...


  • Guadalajara, México f5 A tiempo completo

    Everything we do centers around people. That means we obsess over how to make the lives of our customers, and their customers, better. And it means we prioritize a diverse F5 community where each individual can thrive.Position SummarySoftware engineering is a core discipline at F5 for many roles. As a software engineer specializing in site reliability, you...


  • Guadalajara, México Wizeline A tiempo completo

    **The Company**:Wizeline is a global digital services company helping mid-size to Fortune 500 companies build, scale, and deliver high-quality digital products and services. We thrive in solving our customer’s challenges through human-centered experiences, digital core modernization, and intelligence everywhere (AI/ML and data). We help them succeed in...


  • Guadalajara, Jalisco, México ValorH A tiempo completo

    Conceivable Life Sciencesis pioneering the world's first AI-powered, automated IVF laboratory, revolutionizing reproductive healthcare through cutting-edge robotics and artificial intelligence. We are seeking a passionate and dedicatedSite Reliability Cloud Engineerto design, implement, and maintain the entire cloud infrastructure of our growing company (~60...