Site Reliability Engineer
3 days ago
Journey with us
Combine your career goals and sense of adventure by joining our incredible team of employees at Royal Caribbean Group. We are proud to offer a competitive compensation and benefits package, and excellent career development opportunities, each offering unique ways to explore the world.
We are proud to be the vacation-industry leader with global brands — including Royal Caribbean International, Celebrity Cruises and Silversea Cruises — the most innovative fleet and private destinations, and the best people. Together, we are dedicated to turning the vacation of a lifetime into a lifetime of vacations for our guests.
Royal Caribbean Group's Global eCommerce has an exciting career opportunity for a full-time Site Reliability Engineer reporting to the Sr. Manager, Site Reliability Engineer. This position will work on-site in Mexico City.
Position Summary
The Site Reliability Engineer (Senior SRE) will report to the SRE Manager in support of the Royal Caribbean website by utilizing application and user performance data to guide informed decision-making. The SRE will use application and user performance metrics collected from various sources and tools to support tasks such as initial triage of critical production incidents, bug analysis, implementation of best practices in site reliability engineering, infrastructure optimization, and seamless collaboration between internal teams and external service providers, among other operational initiatives.
The ideal candidate will have a deep understanding of, and a proven track record in, an IT support role. The ideal candidate will also keep an eye on the rapidly evolving technology landscape and implement proactive, preventative measures that avoid technical incidents.
S/he must be able to work with multiple product and project teams simultaneously, thrive in a fast-paced and dynamic environment and connect unexpected threads across disparate teams.
Essential Duties And Responsibilities
At a high-level, responsibilities for this role will include:
- Product Health: Responsible for the Incident Management, Application Performance, Configuration Management and Operational Readiness of the products within her/his ownership. Partners and collaborates closely with stakeholders from the various teams within IT to ensure that performance tools, configuration tools and monitoring tools meet the needs of her/his products.
- Incident Management: Responsible for the initial response, triage, and communication of key (customer-impacting) production incidents that occur on the site, with the goal of restoring systems/applications to normal service operation as quickly as possible and minimizing the impact on guest/crew experience or business operations, thus ensuring the best possible service levels and availability are maintained. Analyzes the impact of incidents on the site to determine the root cause by reviewing performance data, including end-user experience, application metrics, and infrastructure metrics. Supports product team initiatives and releases. Synthesizes and communicates incident details to the production team and stakeholders, including executive-level stakeholders. Documents incidents, performs postmortems, and creates next steps as needed.
- Application Performance Management (APM): Ensures the proactive monitoring and management of performance and availability of the software applications within the products s/he is responsible for. Strives to detect and diagnose complex application performance problems to maintain the expected level of service. Provides insight into application performance metrics (errors, exceptions, baseline violations, etc.) to identify the technical impact of bugs and enhancements. Understands key performance metrics (traffic volumes, booking volumes, response times, etc.) to identify the business value of bug fixes and enhancements.
- Configuration Management: Understands the high-level view of website operations to identify performance trends between business processes. Performs daily governance of application monitoring software.
- Change Control Governance: Ensures all production changes required by the product teams are carried out in a planned and authorized manner, within established change control policies and procedures, and that all changes are thoroughly tested and validated from the monitoring perspective.
- Production Operations Readiness: Ensures all product implementations go through an operational readiness review. Establishes and maintains clear communication channels (e.g., Slack, Teams) with the scrum and marketing teams. Ensures all team members are informed about relevant updates and changes that may affect the website.
Qualifications, Knowledge And Skills
- 3-6 years in Site Reliability Engineering (SRE), DevOps, QA, or a related IT operations role.
- Bachelor's degree in Computer Science, Information Technology, Computer Engineering, or other relevant advanced degree preferred.
- Technical Expertise:
  - Proficiency in cloud platforms such as AWS, including AWS Elastic Beanstalk.
  - Understanding of API design principles: REST, SOAP, GraphQL.
  - Advanced knowledge of monitoring and logging tools (AppDynamics, Datadog, Splunk, New Relic, etc.).
  - Familiarity with Adobe AEM Cloud is preferred to enhance system performance and reliability.
- Problem-Solving Skills:
  - Strong analytical and troubleshooting skills to diagnose and resolve complex production issues swiftly.
  - Ability to develop and implement effective incident response plans.
- Communication and Collaboration:
  - Excellent written and verbal communication skills for effective interaction with cross-functional teams and documentation.
  - Ability to collaborate with Development, QA, IT, and external managed service providers to ensure seamless operations.
Work Environment:
- The SRE may be required to participate in an on-call rotation to handle urgent incidents and ensure 24x7 system reliability.
- On-call duties may include evenings, weekends, and holidays as needed.
We know there's a lot to consider.
As you go through the application process, our recruiters will be glad to provide guidance and more relevant details to answer any additional questions. Thank you again for your interest in Royal Caribbean Group. We hope to see you onboard soon.
It is the policy of the Company to ensure equal employment and promotion opportunity to qualified candidates without discrimination or harassment on the basis of race, color, religion, sex, age, national origin, disability, sexual orientation, sexuality, gender identity or expression, marital status, or any other characteristic protected by law. Royal Caribbean Group and each of its subsidiaries prohibit and will not tolerate discrimination or harassment.