Reliability Engineer

hace 2 semanas


Xico, México Bebeecloud A tiempo completo

Job Role Summary Seeking an experienced SRE Lead to drive reliability, scalability, and automation across multi‑cloud and application platforms. Job DescriptionAs a seasoned SRE and DevOps Lead, you will combine leadership, hands‑on engineering, and strategic thinking to ensure high availability and performance of mission‑critical systems. Key Responsibilities Design and implement SRE best practices for monitoring, alerting, and incident response. Define and track SLIs, SLOs, and SLAs to improve system reliability. Lead CI/CD pipeline design and optimization for multi‑cloud environments (Azure & AWS). Automate infrastructure provisioning and deployments using Infrastructure as Code (IaC). Own incident response processes leveraging PagerDuty and Datadog for alerting and observability. Conduct post‑mortems and implement preventive measures. Architect and manage hybrid cloud environments (Azure, AWS). Optimize cost, performance, and security across cloud services. Ensure high availability and performance of MongoDB clusters. Implement backup, recovery, and disaster recovery strategies. Mentor SRE / DevOps engineers and foster a culture of reliability and automation. Collaborate with development, QA, and product teams to embed reliability into the SDLC. Required Skills & Qualifications The ideal candidate should possess the following skills: Strong experience with Datadog, PagerDuty, Azure, AWS, and MongoDB. Proficiency in scripting (Python, Bash) and Infrastructure as Code (Terraform, ARM templates). Hands‑on experience with containerization (Docker, Kubernetes). Deep understanding of SLIs / SLOs, error budgets, and reliability engineering practices. Expertise in CI/CD tools (Azure DevOps, Jenkins, GitHub Actions). Strong automation mindset and experience with configuration management tools (Ansible, Chef, or similar). Soft Skills Excellent communication and leadership skills. Ability to work in a fast‑paced, collaborative environment. Preferred Qualifications Experience in regulated industries (Healthcare, Finance, etc.). AWS Solutions Architect, Azure Administrator, or Datadog Certified Professional. #J-18808-Ljbffr


  • Site Reliability Engineer

    hace 3 semanas


    Xico, México Quantum World Technologies Inc. A tiempo completo

    Role: Site Reliability Engineer (SRE) – Database Services Location: Open to LATAM About the Role We are looking for a Site Reliability Engineer (SRE) to join the Database Engineering team and contribute to the reliability, resilience, and automation of mission-critical PostgreSQL environments.This role is ideal for an SRE who wants to grow into database...


  • Xico, México Coderoad Inc A tiempo completo

    OverviewSenior Site Reliability Engineer / Observability Engineer At CodeRoad, we're more than just a software development company—we're your gateway to the global tech world. We offer end-to-end software development services and give you the opportunity to work on exciting, real-world projects in a supportive environment. Whether it's staff augmentation,...


  • Xico, México Coderoad Inc A tiempo completo

    OverviewSenior Site Reliability Engineer / Observability Engineer At CodeRoad, we're more than just a software development company—we're your gateway to the global tech world. We offer end-to-end software development services and give you the opportunity to work on exciting, real-world projects in a supportive environment. Whether it's staff augmentation,...

  • Site Reliability Engineer

    hace 3 semanas


    Xico, México Quantum World Technologies Inc. A tiempo completo

    Role: Site Reliability Engineer (SRE) – Database Services. Location: Open to LATAM. About the Role We are looking for a Site Reliability Engineer (SRE) to join the Database Engineering team and contribute to the reliability, resilience, and automation of mission‑critical PostgreSQL environments. This role is ideal for an SRE who wants to grow into...

  • Site Reliability Engineer

    hace 3 semanas


    Xico, México Quantum World Technologies Inc. A tiempo completo

    Role: Site Reliability Engineer (SRE) – Database Services. Location: Open to LATAM. About the Role We are looking for a Site Reliability Engineer (SRE) to join the Database Engineering team and contribute to the reliability, resilience, and automation of mission‑critical PostgreSQL environments. This role is ideal for an SRE who wants to grow into...


  • Xico, México Royal Caribbean Group A tiempo completo

    Combine your career goals and sense of adventure by joining our incredible team of employees at Royal Caribbean Group.We are proud to be the vacation-industry leader with global brands — including Royal Caribbean International, Celebrity Cruises and Silversea Cruises — the most innovative fleet and private destinations, and the best people.Royal...

  • Reliability Engineer

    hace 2 días


    Xico, México Bsb-Jll - Lasalle Services, Mex A tiempo completo

    Regional Reliability Engineer At JLL we have a great commitment to diversity, which is why we promote the inclusion of all people on equal terms; that is, we do not discriminate based on disability, sexual orientation, gender identity, sex, race, ethnic group, religion, and / or physical appearance. What this job involves : Responsible for implementing a...

  • Reliability Engineer

    hace 1 día


    Xico, México Bsb-Jll - Lasalle Services, Mex A tiempo completo

    Regional Reliability Engineer At JLL we have a great commitment to diversity, which is why we promote the inclusion of all people on equal terms; that is, we do not discriminate based on disability, sexual orientation, gender identity, sex, race, ethnic group, religion, and / or physical appearance. What this job involves : Responsible for implementing a...

  • Reliability Engineer

    hace 1 día


    Xico, México Bsb-Jll - Lasalle Services, Mex A tiempo completo

    Regional Reliability Engineer At JLL we have a great commitment to diversity, which is why we promote the inclusion of all people on equal terms; that is, we do not discriminate based on disability, sexual orientation, gender identity, sex, race, ethnic group, religion, and / or physical appearance. What this job involves : Responsible for implementing a...


  • Xico, México Thales Group A tiempo completo

    Customer Reliability Engineer (CRE) page is loaded## Customer Reliability Engineer (CRE)remote type: Hybridlocations: Mexico Citytime type: Full timeposted on: Posted Todaytime left to apply: End Date: December 30, **** (27 days left to apply)job requisition id: R*******Thales people architect identity management and data protection solutions at the heart of...