Senior Site Reliability Engineer

hace 4 semanas


Mexico City Royal Caribbean Group A tiempo completo

Senior Site Reliability Engineer – Royal Caribbean Group Journey with us Combine your career goals and sense of adventure by joining our incredible team of employees at Royal Caribbean Group. We are proud to offer a competitive compensation and benefits package, and excellent career development opportunities, each offering unique ways to explore the world. We are proud to be the vacation‑industry leader with global brands — including Royal Caribbean International, Celebrity Cruises and Silversea Cruises — the most innovative fleet and private destinations, and the best people. Together, we are dedicated to turning the vacation of a lifetime into a lifetime of vacations for our guests. Royal Caribbean Group’s Global eCommerce has an exciting career opportunity for a full time Senior Site Reliability Engineer reporting to the Sr. Manager, Site Reliability Engineer. This position will work on‑site in Mexico City. Position Summary Senior Site Reliability Engineer (Sr SRE) will assist the SRE team in support of the Royal Caribbean website ($183M gross revenue in 2021) using application and user performance data to guide informed decision making. The Sr SRE will use site performance metrics collected by various sources and tools to support the following tasks: the initial triage of critical production incidents, analysis of bugs, implementing best practices in site reliability engineering, optimizing infrastructure, ensuring seamless collaboration between internal teams and external service providers, among other operational initiatives. Essential Duties and Responsibilities Critical Incident Support Review ticket analysis and approve closure of tickets/incidents Understand architecture of Royal website and elevate incidents as needed to the appropriate team for further triage. Synthesizes and communicates incident details to the production team, stakeholders, including documentation of the incident, post‑mortem and next steps. Review post‑mortem / RCA document and follow up. Monitor and Optimize Systems Provide insight into application performance metrics (errors, exceptions, baseline violations, etc.) to identify technical impacts of bugs and enhancements. Build a case for prioritizing bug and enhancement tickets by identifying the business value of fixes. Create reports on new deployment build performance for product teams to ensure build quality. Ensure System Reliability and Performance Adjust health thresholds and other monitoring settings based on historical performance. Creates and maintains performance dashboards used by support and product teams. Maintains alerting, communication, and documentation tool chain to ensure it is up to date. Collaboration with Cross‑Functional Teams Establish and maintain clear communication channels (e.g., Slack, Teams) with scrum and marketing teams. Ensure all team members are informed about relevant updates and changes that may affect the website. Qualifications, Knowledge and Skills Experience Minimum 6–10 years in Site Reliability Engineering (SRE), DevOps, QA, or a related IT operations role. Skills and Abilities Technical Expertise Proficiency in cloud platforms such as AWS, AWS Elastic Beanstalk. Understanding of API design principles: REST, SOAP, Graph. Advanced knowledge of monitoring and logging tools (AppDynamics, DataDog, Splunk, New Relic, etc.). Problem‑Solving Skills Strong analytical and troubleshooting skills to diagnose and resolve complex production issues swiftly. Ability to develop and implement effective incident response plans. Communication and Collaboration Excellent written and verbal communication skills for effective interaction with cross‑functional teams and documentation. Ability to collaborate with Development, QA, IT, and external managed service providers to ensure seamless operations. It is the policy of the Company to ensure equal employment and promotion opportunity to qualified candidates without discrimination or harassment on the basis of race, color, religion, sex, age, national origin, disability, sexual orientation, sexuality, gender identity or expression, marital status, or any other characteristic protected by law. Royal Caribbean Group and each of its subsidiaries prohibit and will not tolerate discrimination or harassment. #J-18808-Ljbffr



  • Mexico City Royal Caribbean Group A tiempo completo

    A leading cruise company in Mexico City is seeking a Senior Site Reliability Engineer to support its website operations. The ideal candidate should have 6–10 years of experience in Site Reliability Engineering, with expertise in cloud platforms like AWS and strong problem-solving skills. This full-time position emphasizes collaboration with various teams...

  • Site Reliability Engineer

    hace 2 semanas


    Mexico City W3Global A tiempo completo

    Site Reliability Engineer Join to apply for the Site Reliability Engineer role at W3Global Required qualifications: AWS experience Gitlab Terraform or AWS CDK Python Familiarity with GO Linux OS administration advanced scripting - bash Windows OS administration advanced scripting - powershell Seniority level Entry level Employment type Full-time Job function...

  • Site Reliability Engineer

    hace 2 semanas


    Mexico City W3Global A tiempo completo

    Site Reliability Engineer Join to apply for the Site Reliability Engineer role at W3Global Required qualifications: AWS experience Gitlab Terraform or AWS CDK Python Familiarity with GO Linux OS administration advanced scripting - bash Windows OS administration advanced scripting - powershell Seniority level Entry level Employment type Full-time Job function...

  • Site Reliability Engineer

    hace 2 semanas


    Mexico City Yochana A tiempo completo

    We’re Hiring | Site Reliability Engineer (SRE) Hybrid | Mexico City Full-Time We are looking for a highly skilled Site Reliability Engineer (SRE) to join our team in Mexico City. This role is ideal for a proactive engineer with strong AWS expertise , a passion for automation, and a solid background in systems reliability, scalability, and performance. Key...

  • Senior Azure DevOps

    hace 2 semanas


    Mexico Tata Consultancy Services A tiempo completo

    A global consulting firm is seeking a Senior Site Reliability / Gitops Engineer in Mexico. The ideal candidate will possess a Bachelor's degree in Computer Science or a related field, along with proven experience as a DevOps Engineer. Key skills include Azure services, scripting in PowerShell and Bash, and familiarity with Git and monitoring tools like...

  • Site Reliability Engineer

    hace 3 semanas


    Mexico City CodeRoad Inc A tiempo completo

    Senior Site Reliability Engineer / Observability Engineer At CodeRoad, we’re more than just a software development company—we’re your gateway to the global tech world. We offer end‑to‑end software development services and give you the opportunity to work on exciting, real‑world projects in a supportive environment. Whether it’s staff...

  • Site Reliability Engineer

    hace 3 semanas


    Mexico City CodeRoad Inc A tiempo completo

    Senior Site Reliability Engineer / Observability Engineer At CodeRoad, we’re more than just a software development company—we’re your gateway to the global tech world. We offer end‑to‑end software development services and give you the opportunity to work on exciting, real‑world projects in a supportive environment. Whether it’s staff...


  • Mexico City The Functionary A tiempo completo

    Direct message the job poster from The Functionary Experienced Technical recruiter with 6+ years of experience. Now hiring for LATAM, India and US. Must-Haves: Looking for a Senior Site Reliability Engineer with strong experience in Terraform, EKS, and Kubernetes. Ability to work with stakeholders and has experience leading P1 and P2 teams. Experience...


  • Mexico City The Functionary A tiempo completo

    Direct message the job poster from The Functionary Experienced Technical recruiter with 6+ years of experience. Now hiring for LATAM, India and US. Must-Haves: Looking for a Senior Site Reliability Engineer with strong experience in Terraform, EKS, and Kubernetes. Ability to work with stakeholders and has experience leading P1 and P2 teams. Experience...

  • Site Reliability Engineer

    hace 3 semanas


    Mexico City Royal Caribbean Group A tiempo completo

    Combine your career goals and sense of adventure by joining our incredible team of employees at Royal Caribbean Group . We are proud to be the vacation-industry leader with global brands — including Royal Caribbean International, Celebrity Cruises and Silversea Cruises — the most innovative fleet and private destinations, and the best people. Royal...