Site Reliability Engineer

hace 1 mes


Ciudad de México, Ciudad de México Thales A tiempo completo

Thales is a global leader in digital security. Our solutions empower organizations to securely interact with people, objects, and services. As a Site Reliability Engineer, you will contribute to the development and maintenance of our large-scale ODC services. Your focus will be on ensuring the reliability, availability, and performance of these systems. This role requires close collaboration with development teams to design, build, and maintain scalable infrastructure, automate processes, and monitor system health. You will adopt ITIL and Agile methodologies, coaching and mentoring teams on best practices, and ensuring the full lifecycle of Public Cloud services meets external customer SLA and internal OLAs.

Responsibilities

  • Develop and maintain infrastructure as code and automation tools
  • Ensure 7x24 mission-critical services with 5x9 availability on public cloud
  • Review technical products and understand customer requirements
  • Work with distributed teams worldwide
  • Define business continuity strategy for operated services over public cloud
  • Continuously improve service reliability, performance, and security
  • Design and implement changes into the systems
  • Participate in presales, deployment, and integration of solutions from the support perspective

Qualifications & Experience

  • Bachelor's degree in information technology, systems engineering, software engineering, or related fields
  • +5 years of experience in design, development, and implementation of applications and public cloud (AWS or GCP)
  • Strong experience in CI/CD using Terraform, Kubernetes, Datadog, and GitHub
  • Apache Http Server and embedding agile performance metrics
  • Working experience with scripting languages (Python)
  • Experience with SOAP and Rest API
  • Fluent in Spanish and English Language (B2)

Position Requirements

  • Mexican citizenship or work permit
  • Hybrid role, office-based in Mexico City and/or surroundings

As a global leader in digital security, Thales empowers organizations to securely interact with people, objects, and services. We are currently looking for a Site Reliability Engineer to contribute to the development and maintenance of our large-scale ODC services. The successful candidate will be responsible for ensuring the reliability, availability, and performance of our systems, working closely with development teams to design, build, and maintain scalable infrastructure, automate processes, and monitor system health.

Key Responsibilities

  • Develop and maintain infrastructure as code and automation tools to ensure high availability and reliability of our services
  • Collaborate with development teams to design, build, and maintain scalable infrastructure and automate processes
  • Monitor system health and performance, identifying areas for improvement and implementing changes to enhance reliability and security
  • Work with distributed teams worldwide to ensure seamless delivery of our services


  • Ciudad de México, Ciudad de México Crunchyroll, LLC A tiempo completo

    About CrunchyrollAt Crunchyroll, we're committed to delivering the art and culture of anime to our global community. As a Staff Site Reliability Engineer on our Data Engineering team, you'll play a pivotal role in ensuring the reliability, scalability, and performance of our data infrastructure.About the RoleWe're looking for a highly skilled engineer to...


  • Ciudad de México, Ciudad de México Trax A tiempo completo

    About TraxAt Trax, we empower brands and retailers to harness the power of digital technologies and create exceptional shopping experiences. Our retail platform provides real-time insights into in-store activities, enabling businesses to focus on what matters most – delighting customers.As a pioneer in computer vision, Trax continues to innovate and lead...


  • Ciudad de México, Ciudad de México Thales A tiempo completo

    Position SummaryThales is seeking a Service Reliability Engineer to ensure the best customer experience by assuring services reliability and resolving incidents in the shortest timeframe. This position requires a strong technical background and excellent communication skills.Key ResponsibilitiesManage incidents and service requests within the Service Level...


  • Ciudad de México, Ciudad de México Thales A tiempo completo

    At Thales, we rely on talented individuals to architect digital security solutions. As a Site Reliability Engineer, you will play a vital role in ensuring the reliability, availability, and performance of our large-scale services. Collaborating closely with development teams, you will design, build, and maintain scalable infrastructure, automate processes,...


  • Ciudad de México, Ciudad de México Thomson Reuters A tiempo completo

    About the RoleWe are seeking a highly skilled Cloud Engineer to join our team as a Site Reliability Engineer. This role involves driving technical excellence and ensuring the reliability of our cloud-based systems.The ideal candidate will have extensive experience in AWS, a strong understanding of cloud-native applications, and excellent problem-solving...


  • Ciudad de México, Ciudad de México Sequoia Connect A tiempo completo

    Sequoia Connect is a USD 6 billion company with 163,000+ professionals across 90 countries, helping 1279 global customers, including Fortune 500 companies.We are currently searching for a Site Reliability Engineer (SRE) to join our team in Mexico. This position plays a critical role in ensuring the scalability and reliability of our Cash Management...


  • Ciudad de México, Ciudad de México Sequoia Connect A tiempo completo

    We are Sequoia Connect, a leading provider of innovative IT solutions, and we're seeking a highly skilled Cloud Reliability Engineer to join our team.This role is part of our DevOps team, responsible for designing, implementing, and maintaining scalable and efficient cloud-based systems. Our client represents the connected world, offering cutting-edge...


  • Ciudad de México, Ciudad de México Thomson Reuters A tiempo completo

    About the RoleWe are seeking a skilled Cloud Reliability Engineer to join our team at Thomson Reuters ONESOURCE Platform. As a Cloud Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud-based services. Your primary focus will be on designing, implementing, and maintaining scalable and highly available...


  • Ciudad de México, Ciudad de México Crunchyroll A tiempo completo

    About the RoleWe are seeking a highly skilled Staff Site Reliability Engineer to join our Data Engineering team at Crunchyroll. This is an exceptional opportunity for an experienced professional to shape the future of anime by maintaining and enhancing the reliability of our data infrastructure.The successful candidate will be responsible for ensuring the...


  • Ciudad de México, Ciudad de México Svitla Systems A tiempo completo

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team. This is an exciting opportunity to work with cutting-edge technologies and be part of a dynamic organization.Key ResponsibilitiesWork on service resiliency, performance tuning, and design to ensure high-quality systems.Drive resolution of critical incidents and...


  • Ciudad de México, Ciudad de México Thomson Reuters A tiempo completo

    Cloud Reliability EngineerWe are seeking a Cloud Reliability Engineer to join our team at Thomson Reuters, a world-class company that is market-leading for both content and technology.This role will allow you to expand your technical skills while networking with professionals in Cloud operations, technology development, and project management teams.The Cloud...


  • Ciudad de México, Ciudad de México Azka IT Consulting A tiempo completo

    Azka IT Consulting is a dynamic company connecting top talent with businesses in Latin America and the US.We are seeking an exceptional professional to fill the role of Site Reliability Engineer.Job Overview:The Site Reliability Engineer (SRE) plays a pivotal role in designing, implementing, and maintaining scalable and highly available systems.


  • Ciudad de México, Ciudad de México Crunchyroll A tiempo completo

    About CrunchyrollWe're a global entertainment company dedicated to delivering anime and manga experiences to our fans.As a leading platform, we serve over 100 million users across 200+ countries, providing an extensive library of content, merchandise, events, and more.This role is part of our Data Engineering team, which ensures seamless data operations and...


  • Ciudad de México, Ciudad de México Thales A tiempo completo

    Company OverviewThales is a global leader in digital security and identity management, trusted by over 30,000 organizations to provide secure solutions for billions of digital interactions.Job DescriptionAs a Cloud Infrastructure Reliability Engineer at Thales, you will play a crucial role in ensuring the reliability, availability, and performance of...


  • Ciudad de México, Ciudad de México Thomson Reuters A tiempo completo

    About the RoleAs a Senior Service Reliability Engineer for the Global Command Center, you will be responsible for running the production environment, monitoring availability, and taking a holistic view of system health.You will build software and systems to manage platform infrastructure and applications, improving reliability, quality, and time-to-market of...


  • Ciudad de México, Ciudad de México Wipro A tiempo completo

    Job Title: Reliability EngineerJob Description:We are seeking a highly skilled Reliability Engineer to join our team at Wipro. In this role, you will be responsible for designing, analyzing, developing, and troubleshooting highly distributed large-scale production systems and event-driven, cloud-based services. You will also ensure repeatability,...

  • Cloud Reliability Expert

    hace 4 semanas


    Ciudad de México, Ciudad de México Thomson Reuters A tiempo completo

    About the RoleIn this opportunity as a Cloud Reliability Expert, you will be responsible for ensuring the stability, scalability, and supportability of our enterprise-level applications.You will develop, deliver, and support high-quality solutions by applying modern SRE operational & development practices. This includes monitoring, automation, building, and...

  • Cloud Engineer

    hace 4 semanas


    Ciudad de México, Ciudad de México Sequoia Connect A tiempo completo

    Sequoia Connect is a USD 6 billion company with 163,000+ professionals across 90 countries.We are currently searching for a Cloud Engineer who will play a key role in our Azure SRE team.About the Role:Automate multi-tenant systems, preferably in Azure environment.Implement Site Reliability Engineering (SRE) practices, ensuring system reliability,...


  • Ciudad de México, Ciudad de México Thomson Reuters A tiempo completo

    About the RoleIn this opportunity as a Cloud Infrastructure Engineer - Service Reliability Specialist, you will be responsible for delivering high-quality solutions for SRE team.Provides skilled technical support/delivery capability, with minimal supervision, for the current and future design, testing, delivery, support, and maintenance of production...


  • Ciudad de México, Ciudad de México Wiser Solutions A tiempo completo

    Senior DevOps EngineerWe are looking for a seasoned Cloud-Native Infrastructure Architect to lead our engineering teams in delivering top-notch quality of service. As a key member of our infrastructure team, you will help set and drive the technical vision for our infrastructure, observability, site reliability, and software release pipeline.