Lead, Site Reliability Engineer

2 weeks ago


Mexico City · Royal Caribbean Group · Full-time

**Journey with us!** Combine your career goals and sense of adventure by joining our incredible team of employees at **Royal Caribbean Group**. We are proud to offer a competitive compensation and benefits package and excellent career development opportunities, each offering unique ways to explore the world.

We are proud to be the vacation-industry leader with global brands (including Royal Caribbean International, Celebrity Cruises, and Silversea Cruises), the most innovative fleet and private destinations, and the best people. Together we are dedicated to turning the vacation of a lifetime into a lifetime of vacations for our guests.

Royal Caribbean Group’s **Global eCommerce** has an exciting career opportunity for a full-time **Lead Site Reliability Engineer** reporting to the **Sr. Manager Site Reliability Engineer**.

**This position will work on-site in Mexico City.**

**Position Summary**:

**Essential Duties and Responsibilities**:

**Critical Incident Support**
- Review ticket analysis and approve closure of tickets/incidents
- Understand the architecture of the Royal website and escalate incidents as needed to the appropriate team for further triage
- Synthesize and communicate incident details to the production team stakeholders, including executive-level stakeholders
- Review postmortem / RCA document and follow up

**Monitor and Optimize Systems**
- Build the case for prioritizing bug and enhancement tickets
- Create reports on new deployment build performance for product teams to ensure quality

**Ensure System Reliability and Performance**
- Adjust health thresholds and other monitoring settings based on historical performance
- Create and maintain performance dashboards used by support and product teams
- Maintain the alerting, communication, and documentation tool chain to ensure it is up to date and efficient

**Collaboration with Cross-Functional Teams**
- Establish and maintain clear communication channels (e.g., Slack, Teams) with the scrum and marketing teams
- Ensure all team members are informed about relevant updates and changes that may affect the website

**Qualifications, Knowledge, and Skills**:

**Experience**
- **Minimum Years of Experience**: 10+ years in Site Reliability Engineering (SRE), DevOps, or a related IT operations role
- **Management Experience**: At least 3 years of experience managing teams and collaborating with external service providers

**Skills and Abilities**
- **Technical Expertise**:
  - Proficiency in cloud platforms such as AWS, AWS Elastic Beanstalk
  - Understanding of API design principles: REST, SOAP, Graph
  - Advanced knowledge of monitoring and logging tools (AppDynamics, DataDog, Splunk, New Relic, etc.)
- **Problem-Solving Skills**:
  - Strong analytical and troubleshooting skills to diagnose and resolve complex production issues swiftly
  - Ability to develop and implement effective incident response plans
- **Communication and Collaboration**:
  - Excellent written and verbal communication skills for effective interaction with cross-functional teams and documentation
  - Ability to collaborate with Development, QA, IT, and external managed service providers to ensure seamless operations

**Education**
- **Bachelor’s Degree**: In Computer Science, Information Technology, Engineering, or a related field

**Certifications**
- **Preferred Certifications**:
  - Any monitoring and alerting tool certification or equivalent
  - Any certification or equivalent knowledge of IT service management


  • Site Reliability Engineer

    2 weeks ago


    Mexico City · Atos · Full-time

    **Job Applicant Privacy Notice**: **Site Reliability Engineer**: - Publication Date: Jan 8, 2025 - Ref. No: - Location: Mexico City, MX **_Site Reliability Engineer_** Certain scripting experience in languages like Java or Python or Shell scripting. - +3 years of significant experience in working as Site Reliability Engineer - Strong in Terraform, Ansible, Packer,...

  • Site Reliability Engineer

    2 weeks ago


    Mexico City · Atos · Full-time

    **Job Applicant Privacy Notice**: **Site Reliability Engineer**: - Publication Date: Jan 8, 2025 - Ref. No: 523940 - Location: Mexico City, MX **_Site Reliability Engineer_** Certain Scripting experience in languages like Java or Python or Shell scripting. - +3 years of significant experience in working as Site Reliability Engineer - Strong in Terraform,...


  • Mexico City · Royal Caribbean Group · Full-time

    **Journey with us!** Combine your career goals and sense of adventure by joining our incredible team of employees at **Royal Caribbean Group**. We are proud to offer a competitive compensation and benefits package and excellent career development opportunities, each offering unique ways to explore the world. We are proud to be the vacation-industry leader with...

  • Site Reliability Engineer

    3 weeks ago


    Mexico City · Royal Caribbean Group · Full-time

    Journey with us! Combine your career goals and sense of adventure by joining our incredible team...

  • Site Reliability Engineer

    2 weeks ago


    Mexico City · Atos · Full-time

    **Job Applicant Privacy Notice**: **Site Reliability Engineer**: - Publication Date: Jan 14, 2025 - Ref. No: 523941 - Location: Mexico City, MX Eviden, part of the Atos Group, with an annual revenue of circa € 5 billion is a global leader in data-driven, trusted and sustainable digital transformation. As a next generation digital business with worldwide...

  • Site Reliability Engineer

    3 weeks ago


    Mexico City · Royal Caribbean Group · Full-time

    Journey with us! Combine your career goals and sense of adventure by joining our incredible team of employees at Royal Caribbean Group. We are proud to offer a competitive compensation and benefits package, and excellent career development...


  • Mexico City · Zenta group · Full-time

    **Site Reliability Engineer | On-site - CDMX** **Role Summary**: As a **Site Reliability Engineer (SRE)** at Zenta Group, you will be the bridge between development and operations, ensuring that services are **scalable, reliable, and resilient**. You will design and implement solutions that improve the stability and performance of the infrastructure,...




  • Mexico City · The Functionary · Full-time

    Senior Site Reliability Engineer We are looking for a Senior Site Reliability Engineer to build and maintain reliable, high‑capacity, and high‑performing systems that support our mission to protect and improve customer platforms, with a strong focus on reliability, security, performance, cost, and operational excellence. As a Site Reliability Engineer on...

  • Site Reliability Engineer

    3 weeks ago


    Mexico City · Tata Consultancy Services · Full-time

    We are looking for a Site Reliability Engineer (SRE) to join our team and help us ensure seamless, high-performing, and reliable technology operations. What you’ll work with: Azure DevOps - Pipelines, repositories, and automation ServiceNow - Incident, change, and problem management AppDynamics - Application performance monitoring and alerting Microsoft...