Site Reliability Engineer

hace 1 mes

Ciudad de México, Ciudad de México Thales A tiempo completo

Thales is a global leader in digital security. Our solutions empower organizations to securely interact with people, objects, and services. As a Site Reliability Engineer, you will contribute to the development and maintenance of our large-scale ODC services. Your focus will be on ensuring the reliability, availability, and performance of these systems. This role requires close collaboration with development teams to design, build, and maintain scalable infrastructure, automate processes, and monitor system health. You will adopt ITIL and Agile methodologies, coaching and mentoring teams on best practices, and ensuring the full lifecycle of Public Cloud services meets external customer SLA and internal OLAs.

Responsibilities

Develop and maintain infrastructure as code and automation tools
Ensure 7x24 mission-critical services with 5x9 availability on public cloud
Review technical products and understand customer requirements
Work with distributed teams worldwide
Define business continuity strategy for operated services over public cloud
Continuously improve service reliability, performance, and security
Design and implement changes into the systems
Participate in presales, deployment, and integration of solutions from the support perspective

Qualifications & Experience

Bachelor's degree in information technology, systems engineering, software engineering, or related fields
+5 years of experience in design, development, and implementation of applications and public cloud (AWS or GCP)
Strong experience in CI/CD using Terraform, Kubernetes, Datadog, and GitHub
Apache Http Server and embedding agile performance metrics
Working experience with scripting languages (Python)
Experience with SOAP and Rest API
Fluent in Spanish and English Language (B2)

Position Requirements

Mexican citizenship or work permit
Hybrid role, office-based in Mexico City and/or surroundings

As a global leader in digital security, Thales empowers organizations to securely interact with people, objects, and services. We are currently looking for a Site Reliability Engineer to contribute to the development and maintenance of our large-scale ODC services. The successful candidate will be responsible for ensuring the reliability, availability, and performance of our systems, working closely with development teams to design, build, and maintain scalable infrastructure, automate processes, and monitor system health.

Key Responsibilities

Develop and maintain infrastructure as code and automation tools to ensure high availability and reliability of our services
Collaborate with development teams to design, build, and maintain scalable infrastructure and automate processes
Monitor system health and performance, identifying areas for improvement and implementing changes to enhance reliability and security
Work with distributed teams worldwide to ensure seamless delivery of our services

Staff Site Reliability Engineer

hace 1 mes

Ciudad de México, Ciudad de México Crunchyroll, LLC A tiempo completo

About CrunchyrollAt Crunchyroll, we're committed to delivering the art and culture of anime to our global community. As a Staff Site Reliability Engineer on our Data Engineering team, you'll play a pivotal role in ensuring the reliability, scalability, and performance of our data infrastructure.About the RoleWe're looking for a highly skilled engineer to...
Site Reliability Engineer

hace 1 mes

Ciudad de México, Ciudad de México Trax A tiempo completo

About TraxAt Trax, we empower brands and retailers to harness the power of digital technologies and create exceptional shopping experiences. Our retail platform provides real-time insights into in-store activities, enabling businesses to focus on what matters most – delighting customers.As a pioneer in computer vision, Trax continues to innovate and lead...
Service Reliability Engineer

hace 1 mes

Ciudad de México, Ciudad de México Thales A tiempo completo

Position SummaryThales is seeking a Service Reliability Engineer to ensure the best customer experience by assuring services reliability and resolving incidents in the shortest timeframe. This position requires a strong technical background and excellent communication skills.Key ResponsibilitiesManage incidents and service requests within the Service Level...
Site Reliability Engineer

hace 1 mes

Ciudad de México, Ciudad de México Thales A tiempo completo

At Thales, we rely on talented individuals to architect digital security solutions. As a Site Reliability Engineer, you will play a vital role in ensuring the reliability, availability, and performance of our large-scale services. Collaborating closely with development teams, you will design, build, and maintain scalable infrastructure, automate processes,...
Cloud Engineer with Expertise in Site Reliability

hace 4 semanas

Ciudad de México, Ciudad de México Thomson Reuters A tiempo completo

About the RoleWe are seeking a highly skilled Cloud Engineer to join our team as a Site Reliability Engineer. This role involves driving technical excellence and ensuring the reliability of our cloud-based systems.The ideal candidate will have extensive experience in AWS, a strong understanding of cloud-native applications, and excellent problem-solving...
Reliability Engineer for Cloud Infrastructure

hace 4 semanas

Ciudad de México, Ciudad de México Sequoia Connect A tiempo completo

Sequoia Connect is a USD 6 billion company with 163,000+ professionals across 90 countries, helping 1279 global customers, including Fortune 500 companies.We are currently searching for a Site Reliability Engineer (SRE) to join our team in Mexico. This position plays a critical role in ensuring the scalability and reliability of our Cash Management...
Cloud Reliability Engineer for Enterprise Transformation

hace 4 semanas

Ciudad de México, Ciudad de México Sequoia Connect A tiempo completo

We are Sequoia Connect, a leading provider of innovative IT solutions, and we're seeking a highly skilled Cloud Reliability Engineer to join our team.This role is part of our DevOps team, responsible for designing, implementing, and maintaining scalable and efficient cloud-based systems. Our client represents the connected world, offering cutting-edge...
Senior Cloud Reliability Engineer

hace 1 mes

Ciudad de México, Ciudad de México Thomson Reuters A tiempo completo

About the RoleWe are seeking a skilled Cloud Reliability Engineer to join our team at Thomson Reuters ONESOURCE Platform. As a Cloud Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud-based services. Your primary focus will be on designing, implementing, and maintaining scalable and highly available...
Data Infrastructure Reliability Engineer

hace 3 semanas

Ciudad de México, Ciudad de México Crunchyroll A tiempo completo

About the RoleWe are seeking a highly skilled Staff Site Reliability Engineer to join our Data Engineering team at Crunchyroll. This is an exceptional opportunity for an experienced professional to shape the future of anime by maintaining and enhancing the reliability of our data infrastructure.The successful candidate will be responsible for ensuring the...
Reliability Engineer Position

hace 1 mes

Ciudad de México, Ciudad de México Svitla Systems A tiempo completo

About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team. This is an exciting opportunity to work with cutting-edge technologies and be part of a dynamic organization.Key ResponsibilitiesWork on service resiliency, performance tuning, and design to ensure high-quality systems.Drive resolution of critical incidents and...
Cloud Reliability Engineer

hace 2 semanas

Ciudad de México, Ciudad de México Thomson Reuters A tiempo completo

Cloud Reliability EngineerWe are seeking a Cloud Reliability Engineer to join our team at Thomson Reuters, a world-class company that is market-leading for both content and technology.This role will allow you to expand your technical skills while networking with professionals in Cloud operations, technology development, and project management teams.The Cloud...
Highly Available Systems Specialist

hace 4 semanas

Ciudad de México, Ciudad de México Azka IT Consulting A tiempo completo

Azka IT Consulting is a dynamic company connecting top talent with businesses in Latin America and the US.We are seeking an exceptional professional to fill the role of Site Reliability Engineer.Job Overview:The Site Reliability Engineer (SRE) plays a pivotal role in designing, implementing, and maintaining scalable and highly available systems.
Data Infrastructure Reliability Specialist

hace 4 semanas

Ciudad de México, Ciudad de México Crunchyroll A tiempo completo

About CrunchyrollWe're a global entertainment company dedicated to delivering anime and manga experiences to our fans.As a leading platform, we serve over 100 million users across 200+ countries, providing an extensive library of content, merchandise, events, and more.This role is part of our Data Engineering team, which ensures seamless data operations and...
Cloud Infrastructure Reliability Engineer

hace 2 semanas

Ciudad de México, Ciudad de México Thales A tiempo completo

Company OverviewThales is a global leader in digital security and identity management, trusted by over 30,000 organizations to provide secure solutions for billions of digital interactions.Job DescriptionAs a Cloud Infrastructure Reliability Engineer at Thales, you will play a crucial role in ensuring the reliability, availability, and performance of...
Global Command Center Reliability Engineer

hace 4 semanas

Ciudad de México, Ciudad de México Thomson Reuters A tiempo completo

About the RoleAs a Senior Service Reliability Engineer for the Global Command Center, you will be responsible for running the production environment, monitoring availability, and taking a holistic view of system health.You will build software and systems to manage platform infrastructure and applications, improving reliability, quality, and time-to-market of...
Reliability Solutions Expert

hace 1 mes

Ciudad de México, Ciudad de México Wipro A tiempo completo

Job Title: Reliability EngineerJob Description:We are seeking a highly skilled Reliability Engineer to join our team at Wipro. In this role, you will be responsible for designing, analyzing, developing, and troubleshooting highly distributed large-scale production systems and event-driven, cloud-based services. You will also ensure repeatability,...
Cloud Reliability Expert

hace 4 semanas

Ciudad de México, Ciudad de México Thomson Reuters A tiempo completo

About the RoleIn this opportunity as a Cloud Reliability Expert, you will be responsible for ensuring the stability, scalability, and supportability of our enterprise-level applications.You will develop, deliver, and support high-quality solutions by applying modern SRE operational & development practices. This includes monitoring, automation, building, and...
Cloud Engineer

hace 4 semanas

Ciudad de México, Ciudad de México Sequoia Connect A tiempo completo

Sequoia Connect is a USD 6 billion company with 163,000+ professionals across 90 countries.We are currently searching for a Cloud Engineer who will play a key role in our Azure SRE team.About the Role:Automate multi-tenant systems, preferably in Azure environment.Implement Site Reliability Engineering (SRE) practices, ensuring system reliability,...
Cloud Infrastructure Engineer

hace 4 semanas

Ciudad de México, Ciudad de México Thomson Reuters A tiempo completo

About the RoleIn this opportunity as a Cloud Infrastructure Engineer - Service Reliability Specialist, you will be responsible for delivering high-quality solutions for SRE team.Provides skilled technical support/delivery capability, with minimal supervision, for the current and future design, testing, delivery, support, and maintenance of production...
Cloud-Native Infrastructure Architect

hace 4 semanas

Ciudad de México, Ciudad de México Wiser Solutions A tiempo completo

Senior DevOps EngineerWe are looking for a seasoned Cloud-Native Infrastructure Architect to lead our engineering teams in delivering top-notch quality of service. As a key member of our infrastructure team, you will help set and drive the technical vision for our infrastructure, observability, site reliability, and software release pipeline.

Américas

Europa

Asia / Oceanía

África

Site Reliability Engineer