Principal Site Reliability Engineer
3 weeks ago
Principal Site Reliability Engineer (SRE)
Zapopan, Jalisco, Mexico
Job Category: Product Development
Posting Date: 10/17/2025, 03:31 AM
Job Type: Regular Employee
Does this position require a security clearance? No
Years: 3 to 5+ years
Applicants are required to read, write, and speak the following languages: English

Job Description
As a senior member of the Site Reliability Engineering (SRE) team, you'll take ownership of highly available systems, influence service design, and work across teams to drive resiliency, automation, and operational excellence. This is a hands-on engineering role where deep infrastructure knowledge meets software engineering expertise, ideal for experienced SREs ready to take the lead.

Responsibilities
What You'll Do:
Lead the design, automation, and support of OCI services with a focus on resiliency, security, scalability, and performance.
Own and improve the end-to-end reliability metrics (SLOs, SLAs, KPIs) for your services.
Design and implement high-availability architectures and standards for large-scale distributed systems.
Serve as the ultimate escalation point for complex operational issues, using a deep understanding of service topologies and interdependencies.
Architect and build automation and orchestration tools that reduce manual work and prevent problem recurrence.
Collaborate with development teams to improve service designs, optimize deployments, and implement best practices for operational efficiency.
Guide technical decision-making and mentor junior SREs and developers across teams.
Participate in and lead postmortems, root cause analysis, and preventative design changes.
Contribute to capacity planning, demand forecasting, and long-term service scalability strategies.
Participate in a rotational on-call schedule to ensure the health and availability of production services.
What We're Looking For:
Advanced experience with Linux systems administration
Strong programming skills in Python (with automation libraries)
Advanced Bash/Shell scripting
Deep understanding of distributed systems, networking, and service architecture
Solid knowledge of databases and how they behave in production (SQL or NoSQL)
Strong understanding of CI/CD pipelines, Agile methodologies, and DevOps best practices
Experience writing and maintaining unit tests and production-grade software
Proven ability to lead cross-functional efforts and technical problem-solving in live environments

Nice to Have:
Hands-on experience with monitoring and observability tools (Grafana, Prometheus, New Relic, etc.)
Familiarity with Oracle Cloud Infrastructure (OCI) or other cloud platforms (AWS, Azure, GCP)
Experience with Infrastructure-as-Code (Terraform, Ansible) and container orchestration (Kubernetes)

About Us
As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's challenges. We've partnered with industry leaders in almost every sector, and continue to thrive after 40+ years of change by operating with integrity. We know that true innovation starts when everyone is empowered to contribute. That's why we're committed to growing an inclusive workforce that promotes opportunities for all. Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs. We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing or by calling in the United States. Oracle is an Equal Employment Opportunity Employer.
All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
-
Site Reliability Engineer
2 days ago
Central Region, Mexico | NTT DATA | Full-time
SRE – Site Reliability Engineer
We are currently seeking a Site Reliability Engineer to join our team in GDL, Jalisco (MX-JAL), Mexico (MX). Responsibilities: Perform L1.5 activities such as monitoring, deployment, and rollback. Monitor the efficiency of the Azure cloud systems to prevent outages and initiate an Incident Management bridge in case of an outage....
-
Site Reliability Engineer
4 days ago
Central Region, Mexico | NTT DATA, Inc. | Full-time
Site Reliability Engineer – GDL, Jalisco, Mexico
We are currently seeking a Site Reliability Engineer to join our team in GDL, Jalisco (MX-JAL), Mexico (MX). Responsibilities: Perform L1.5 activities such as monitoring, deployment, and rollback. Monitor the efficiency of the Azure cloud systems to prevent outages and initiate an Incident Management bridge in...
-
Site Reliability Engineer
2 days ago
Central Region, Mexico | NTT DATA North America | Full-time
Job Overview
SRE – Site Reliability Engineer
We are currently seeking a Site Reliability Engineer to join our team in Guadalajara, Jalisco, Mexico. In this role you will perform L1.5 activities including monitoring, deployment, and rollback. You will monitor the efficiency of Azure cloud systems to prevent outages and initiate an Incident Management bridge...
-
Site Reliability Engineer
3 weeks ago
Central Region, Mexico | Oracle | Full-time
A leading cloud solutions provider in Mexico is seeking a skilled Cloud Region Build Site Reliability Engineer to join its team. This full-time role focuses on ensuring the performance, availability, and scalability of cloud infrastructure services. Responsibilities include building and maintaining OCI infrastructure, responding to incidents, and improving...
-
Site Reliability Engineer
2 weeks ago
Central Region, Jalisco, Mexico | GrainChain Inc | Full-time
We are looking for new talent! GrainChain is a technology company dedicated to closing the digital divide in the agricultural industry. Our platforms make transactions fast, secure, and simple for our users. We are looking for a Site Reliability Engineer able to integrate and automate the areas of...
-
Site Reliability Engineer
1 week ago
Central Region, Mexico | F5 | Full-time
Site Reliability Engineer – Incident Management
At F5, we strive to bring a better digital world to life. Our teams empower organizations across the globe to create, secure, and run applications that enhance how we experience our evolving digital world. We are passionate...
-
Site Reliability Engineer: Infra as Code
2 weeks ago
Central Region, Jalisco, Mexico | GrainChain Inc | Full-time
A technology company in the agricultural sector is looking for a Site Reliability Engineer to integrate and automate its development and operations areas, ensuring the quality and delivery of software solutions. The ideal candidate will have experience in scripting, Linux infrastructure, and CI/CD tools. An inclusive environment is offered...
-
Senior Site Reliability Test Engineer
3 weeks ago
Central Region, Mexico | Jabil | Full-time
A global product solutions company in Jalisco is seeking a Site Reliability Test Engineer to maintain and improve their Cloud Test Platform. This role involves supporting manufacturing server operations, responding to production issues, and enhancing usability of test applications. The ideal candidate has a BS degree in a related field, 5-8 years of relevant...
-
Site Reliability Engineer
2 weeks ago
Central Region, Jalisco, Mexico | FICO | Full-time
Site Reliability Engineer - Engineer I
Locations: Guadalajara, Mexico
Time type: Full time
Posted on: Posted Yesterday
Job requisition ID: 31193
FICO (NYSE: FICO) is a leading global analytics software company, helping businesses in 100+ countries make better decisions. Join our world-class team today...
-
Site Reliability Engineer
7 days ago
Central Region, Jalisco, Mexico | ValorH | Full-time
Conceivable Life Sciences is pioneering the world's first AI-powered, automated IVF laboratory, revolutionizing reproductive healthcare through cutting-edge robotics and artificial intelligence. We are seeking a passionate and dedicated Site Reliability Cloud Engineer to design, implement, and maintain the entire cloud infrastructure of our growing company...