Site Reliability Engineer

hace 1 semana


Xico, México Coderoad Inc A tiempo completo

OverviewSenior Site Reliability Engineer / Observability Engineer At CodeRoad, we're more than just a software development company—we're your gateway to the global tech world. We offer end-to-end software development services and give you the opportunity to work on exciting, real-world projects in a supportive environment. Whether it's staff augmentation, dedicated IT teams, or general software engineering, we have opportunities for everyone to challenge themselves and take their career to the next levelAbout the role: We are looking for a Senior Site Reliability Engineer (SRE) with strong experience in observability, metrics, logging, and reliability engineering. This role will lead the design and implementation of our monitoring and observability strategy across multiple services, ensuring system performance, resiliency, and operational excellence. The ideal candidate combines deep expertise in SRE practices, strong understanding of software engineering, and hands-on experience with modern observability stacks.ResponsibilitiesDefine and implement SLIs, SLOs, and error budgets for critical services.Design and maintain dashboards and alerting systems using tools like Prometheus, Grafana, ELK, OpenTelemetry, or equivalents.Standardize logging, tracing, and metrics across all applications and services.Continuously improve the system's visibility and health tracking to support high availability.Drive incident response, post-mortems, and root-cause analyses.Identify performance bottlenecks and propose architectural improvements.Implement chaos testing and resilience strategies where applicable.Develop CI / CD improvements that support reliability and quality.Automate operational workflows, deployments, and monitoring pipelines.Collaborate with development teams to ensure reliability is built into every service.Work closely with software engineers to establish observability best practices.Create internal standards for logs, metrics, and distributed tracing.Provide technical mentorship and help shape long-term reliability roadmaps.Qualifications5-7+ years of experience in SRE, DevOps, or Platform Engineering roles.Strong experience with observability tools such as Prometheus, Grafana, ELK Stack, OpenTelemetry, Jaeger, Datadog, New Relic, etc.Solid understanding of Kubernetes, Docker, cloud platforms (AWS / GCP / Azure).Proficiency in at least one programming language (e.g., Java, Go, Python, Node.js).Experience implementing SLIs, SLOs, alerting strategies, and incident response.Ability to work cross-functionally and drive technical decisions.Experience with service mesh technologies (e.g., Istio).Background in performance testing, load testing, or capacity planning.Experience with infrastructure as code (Terraform, Ansible).What you’ll loveUSA Contractor100% RemoteHolidays OffPaid Time OffHealth insurance assistance programCompetitive Pay (USD)Excellent teamwork and work environmentTrainingSeniority level: Mid-Senior levelEmployment type: ContractJob function: Consulting and Business DevelopmentIndustry: IT Services and IT Consulting #J-18808-Ljbffr


  • Site Reliability Engineer

    hace 4 semanas


    Xico, México Quantum World Technologies Inc. A tiempo completo

    Role: Site Reliability Engineer (SRE) – Database Services Location: Open to LATAM About the Role We are looking for a Site Reliability Engineer (SRE) to join the Database Engineering team and contribute to the reliability, resilience, and automation of mission-critical PostgreSQL environments.This role is ideal for an SRE who wants to grow into database...


  • Xico, México Royal Caribbean Group A tiempo completo

    Combine your career goals and sense of adventure by joining our incredible team of employees at Royal Caribbean Group.We are proud to be the vacation-industry leader with global brands — including Royal Caribbean International, Celebrity Cruises and Silversea Cruises — the most innovative fleet and private destinations, and the best people.Royal...

  • Site Reliability Engineer

    hace 4 semanas


    Xico, México Quantum World Technologies Inc. A tiempo completo

    Role: Site Reliability Engineer (SRE) – Database Services. Location: Open to LATAM. About the Role We are looking for a Site Reliability Engineer (SRE) to join the Database Engineering team and contribute to the reliability, resilience, and automation of mission‑critical PostgreSQL environments. This role is ideal for an SRE who wants to grow into...

  • Site Reliability Engineer

    hace 4 semanas


    Xico, México Quantum World Technologies Inc. A tiempo completo

    Role: Site Reliability Engineer (SRE) – Database Services. Location: Open to LATAM. About the Role We are looking for a Site Reliability Engineer (SRE) to join the Database Engineering team and contribute to the reliability, resilience, and automation of mission‑critical PostgreSQL environments. This role is ideal for an SRE who wants to grow into...

  • Site Reliability Engineer

    hace 4 semanas


    Xico, México Royal Caribbean Group A tiempo completo

    Press Tab to Move to Skip to Content LinkSelect how often (in days) to receive an alert:Site Reliability EngineerJourney with us!Combine your career goals and sense of adventure by joining our incredible team of employees at Royal Caribbean Group.We are proud to offer a competitive compensation and benefits package, and excellent career development...


  • Xico, México Royal Caribbean Group A tiempo completo

    Join to apply for the Senior Site Reliability Engineer role at Royal Caribbean Group.1 week ago Be among the first 25 applicants.Journey with us!Combine your career goals and sense of adventure by joining our incredible team at Royal Caribbean Group.We offer a competitive compensation and benefits package, along with excellent career development...

  • Site Reliability Engineer

    hace 4 semanas


    Xico, México Jaak-It S.A.P.I. De C.V. A tiempo completo

    **JAAK-IT somos la mejor empresa de tecnología especializada en reconocimiento facial.****¡Te estamos buscando! como Site Reliability Engineer (SRE)****Indispensable**:- **Ingeniería en Informática o afín**:- **2 años de experiência en Sistemas y Desarrollo de Software**:- **Disponibilidad Lunes a Viernes 9 a 6****Herramientas**:- **Sistemas Linux**:-...


  • Xico, México Royal Caribbean Group A tiempo completo

    A leading cruise company in Xico is seeking a full-time Lead Site Reliability Engineer. This role involves supporting the website's performance, managing incidents, and collaborating within teams. Ideal candidates have 10+ years in Site Reliability Engineering, are adept at using monitoring tools, and possess strong communication skills. A Bachelor's degree...


  • Xico, México Royal Caribbean Group A tiempo completo

    A leading cruise company in Xico is seeking a full-time Lead Site Reliability Engineer. This role involves supporting the website's performance, managing incidents, and collaborating within teams. Ideal candidates have 10+ years in Site Reliability Engineering, are adept at using monitoring tools, and possess strong communication skills. A Bachelor's degree...


  • Xico, México Royal Caribbean Group A tiempo completo

    Journey with us! Combine your career goals and sense of adventure by joining our incredible team of employees at Royal Caribbean Group We are proud to offer a competitive compensation and benefits package, and excellent career development opportunities, each offering unique ways to explore the world. We are proud to be the vacation‑industry leader with...