Site Reliability Engineer
hace 2 semanas
We are looking for a Lead Site Reliability Engineer who takes the initiative on developing and maintain the system and services for our Cash Management Platform, automating the deployment process, ensuring system scaling, investigating and resolving outdates, identifying and implementing preventive measures proactively, collaborating with key stakeholders, continuously looking for ways to provide real-time visual feedback for all the metrics and statuses.What you will do: Proactively build and implement services to make IT and support better at their jobs. Design and implement dashboard that provide valuable real-time insights of platform key metrics. Leads engagement with software developers, DevOps and other infrastructure engineers to integrate software development and delivery from inception to full operation, ensuring robust released software and systems. Optimizing on-call rotations & processes. Ensure Incidents assigned to the team are being managed within agreed SLAs Ensure alarms are documented in up to date Knowledge Base Articles. Conduct pot-incident reviews to identify platform status. What we’re looking for: Bachelor’s degree in computer science or equivalent relevant to SR or Automation/development experience. 7+ years’ experience focussed on Site Reliability Engineering or related position in some of the majors Cloud Platforms. Involved in the automation of multi-tenant systems, preferably in a cloud environment. Good understanding of Site Reliability Engineering (SRE) philosophies, technologies, platforms and tools, SLO management, incident resolution, and automation; Ability to explain technical concepts in clear, non-technical language Experience building Infrastructure-As-Code. Experience in Docker and Kubernetes and networking concepts. Experience with Graphana and Prometeus. Integration experience with Pager-Duty, ServiceNow, Datadog. Expertise with system and performance monitoring tools (Dynatrace, Splunk, etc.). Hybrid position based in Mexico City, Monterrey or Guadalajara.
-
Site Reliability Engineer
hace 1 semana
Guadalajara, Jalisco, México NTT DATA A tiempo completoSRE - Site Reliability EngineerWe are currently seeking a Site Reliability Engineer to join our team in GDL, Jalisco (MX-JAL), Mexico (MX). Perform L1.5 activities such as monitoring, deployment, rollback. Monitor the efficiency of the Azure cloud systems to prevent outages and initiate an Incident Management bridge in case of an outage. Troubleshoot Azure...
-
Site Reliability Engineer
hace 1 semana
Guadalajara, Jalisco, México NTT DATA North America A tiempo completoSRE – Site Reliability EngineerWe are currently seeking a Site Reliability Engineer to join our team in GDL, Jalisco (MX-JAL), Mexico (MX).Perform L1.5 activities such as monitoring, deployment, rollback. Monitor the efficiency of the Azure cloud systems to prevent outages and initiate an Incident Management bridge in case of an outage. Troubleshoot Azure...
-
Site Reliability Engineer
hace 3 semanas
Guadalajara, México f5 A tiempo completoEverything we do centers around people. That means we obsess over how to make the lives of our customers, and their customers, better. And it means we prioritize a diverse F5 community where each individual can thrive.- Site Reliability Engineer IIIWhy do you want to join our team?- Everything we do centers around people. That means we obsess over how to...
-
Site Reliability Engineer
hace 2 semanas
Guadalajara, México F5 A tiempo completoEverything we do centers around people.That means we obsess over how to make the lives of our customers, and their customers, better.And it means we prioritize a diverse F5 community where each individual can thrive.- Site Reliability Engineer IIIWhy do you want to join our team?- Everything we do centers around people.That means we obsess over how to make...
-
Site Reliability Engineer
hace 1 semana
Guadalajara, México Valce Talent Solutions A tiempo completoWe are looking for a Lead Site Reliability Engineer who takes the initiative on developing and maintain the system and services for our Cash Management Platform, automating the deployment process, ensuring system scaling, investigating and resolving outdates, identifying and implementing preventive measures proactively, collaborating with key stakeholders,...
-
Sr Site Reliability Engineer
hace 3 días
Guadalajara, México f5 A tiempo completoEverything we do centers around people. That means we obsess over how to make the lives of our customers, and their customers, better. And it means we prioritize a diverse F5 community where each individual can thrive. Business/Job Title: Senior Site Reliability Engineer Position Summary Software engineering is a core discipline at F5 for many roles. As a...
-
Associate Site Reliability Engineer/Site
hace 2 semanas
Guadalajara, México C3 Ai A tiempo completoWe are looking for **Associate Site Reliability Engineer**/**Site Reliability Engineer** to join our team in Guadalajara, Mexico.**Responsibilities**:- Maximize system uptime and availability, ensuring functional and performance SLAs.- Establish end-to-end monitoring and alerting on all critical aspects.- Solve complex problems for critical services and...
-
Graphite - Site Reliability Engineer (SRE)
hace 1 semana
Guadalajara, Jalisco, México rctsglobal A tiempo completoSite Reliability Engineer (SRE)Overview We're looking for a passionate and hands-on Site Reliability Engineer (SRE) to join our team. This role is critical for ensuring the stability, performance, and scalability of our production services. You'll be the bridge between development and operations, with a strong focus on using code to manage infrastructure and...
-
Site Reliability Engineer
hace 2 días
Guadalajara, México f5 A tiempo completoEverything we do centers around people. That means we obsess over how to make the lives of our customers, and their customers, better. And it means we prioritize a diverse F5 community where each individual can thrive.Business/Job Title: Site Reliability Engineer - IAM - IIIPosition Summary:Software engineering is a core discipline at F5 for many roles. As a...
-
Site Reliability Engineer Iii
hace 2 semanas
Guadalajara, México F5 A tiempo completoEverything we do centers around people.That means we obsess over how to make the lives of our customers, and their customers, better.And it means we prioritize a diverse F5 community where each individual can thrive.Position SummarySoftware engineering is a core discipline at F5 for many roles.As a software engineer specializing in site reliability, you will...