Site Reliability Engineer

4 weeks ago


Ciudad de México Kunai Full-time

Production Support and Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to manage and support large-scale, massively distributed, fault-tolerant systems hosted in an external cloud environment. SRE engineers ensure that systems, both internally critical and externally visible, have reliability and uptime appropriate to customers’ needs and a fast rate of improvement, while keeping an ever-watchful eye on availability, capacity, and performance in a 24/7 environment.

Production Support/SRE engineers engage in automation and development work: reducing toil, developing self-service capabilities, automating manual tasks, and building support tools using common scripting languages (e.g., Python). When incidents arise, our production support engineers take charge of incident management. They assume ownership and collaborate, consult, and partner with Lines of Business to guide the team to a successful resolution.

Job Responsibilities:

  • Effectively manage troubleshooting and recovery of complex production incidents, ranging from low to critical impact
  • Lead incident resolution through a systematic problem-solving approach, coupled with a strong sense of ownership and drive
  • Actively participate in the team’s Agile stories (project work) to streamline and enhance the team’s day-to-day operations
  • Create, manage, and utilize appropriate technical procedural documentation (runbooks)
  • Proactively monitor applications and infrastructure behind external and internal customer-facing services, including their availability, latency, performance, and capacity
  • Influence resiliency and scalability in production environments in Amazon Web Services (AWS)
  • Identify opportunities and develop proactive automated monitoring and alerting solutions by leveraging available tools (Splunk, New Relic, etc.)
  • Assist with conducting Root Cause Analysis (RCA) on critical production outages to develop and implement future mitigation strategies
  • Utilize production support expertise to influence and support new designs, architectures, standards, and methods to maintain stability and availability for large-scale distributed systems
  • Proactively identify and implement automations for routine maintenance tasks, data gathering, and resolution of common issues
  • Continuously seek to develop new skills and technical expertise, and proactively share knowledge with others

Basic Qualifications:

  • Bachelor’s degree or equivalent certification
  • At least 2 years of experience managing and troubleshooting incident bridge calls
  • At least 2 years of experience with Python scripting
  • At least 2 years of experience using and supporting public cloud environments (AWS, Azure, or GCP)
  • At least 2 years of experience with Splunk, New Relic, or Datadog monitoring

Preferred Qualifications:

  • AWS Associate level certification (Solutions Architect, SysOps Administrator, or Developer)
  • 2+ years of experience with Linux, UNIX, Ruby, Go, JavaScript, or NoSQL
  • 3+ years of experience using and supporting public cloud environments (AWS, Azure, or GCP)
  • 2+ years of experience with networks, load balancers, firewalls, and web application firewalls (WAFs)
  • 2+ years of experience with web API services

This is a mostly remote role that requires you to work 40 hours per week during nights and weekends. The coverage schedule you will support is Monday through Friday, 11 p.m. to 6 a.m., and Saturday and Sunday, 24 hours. On-call support is highly unlikely.

  • Site Reliability Engineer

    3 weeks ago


    México Doyensys, Inc. Full-time

    About Doyensys: Doyensys is a Management & Technology Consulting company with expertise in Enterprise applications, Infrastructure Platform Support, and solutions. Doyensys helps clients to harness the power of innovation to thrive on change. The company leverages its technology expertise, global talent, and extensive industry experience to deliver powerful...

  • Site Reliability Engineer

    4 weeks ago


    Ciudad de México TraxRetail Full-time

    Description. The Position: Site Reliability Engineer. About Trax: Trax’s mission is to enable brands and retailers to harness the power of digital technologies to produce the best shopping experiences imaginable. Trax’s retail platform allows customers to understand what is happening on shelf, in every store, all the time so they can focus on what they...


  • México spekit Full-time

    Details: This is a contract position. Must be able to work PST hours. About the role: Spekit’s Infrastructure Team is looking for a highly skilled Senior Site Reliability Engineer (SRE). This role is critical to ensuring the reliability, scalability, and performance of our systems and services. This position requires a deep understanding of...

  • Site Reliability Engineer

    4 weeks ago


    Nuevo México Trax Full-time

    The Position: Site Reliability Engineer. City: Mexico City. About Trax: Trax’s mission is to enable brands and retailers to harness the power of digital technologies to produce the best shopping experiences imaginable. Trax’s retail platform allows customers to understand what is happening on shelf, in every store, all the time so they can focus on what they...


  • Ciudad de México SimCorp Full-time

    Sr. Site Reliability Engineer (Azure). Locations: Manila. Time type: Full time. Posted 30+ days ago. Job requisition ID: R-206416. Who we are: For over 50 years, we have worked closely with investment and asset managers to become the world’s leading provider of integrated investment...


  • Ciudad de México Acxiom Full-time

    Senior Site Reliability Engineer. Locations: Mexico City. Time type: Full time. Posted 30+ days ago. Job requisition ID: JR011205. Acxiom is seeking a well-rounded Senior Site Reliability Engineer, with a focus on Nutanix, to support customer-facing systems. This role will support critical systems...


  • Ciudad de México SimCorp Full-time

    Senior Site Reliability Engineer (SRE/Azure). Locations: Manila. Posted 30+ days ago. Job requisition ID: R-206253. Who we are: For over 50 years, we have worked closely with investment and asset managers to become the world’s leading...


  • Ciudad Juarez, México PCE TECHNOLOGY DE JUÁREZ/FOXCONN GROUP (SAN JERÓNIMO) Full-time

    **PCE TECHNOLOGY DE JUÁREZ/FOXCONN GROUP (SAN JERÓNIMO)** **Seeking**: **Reliability Quality Engineer (Tesla)** **Description and Requirements** - Guides efforts to ensure reliability and maintainability of equipment, processes, utilities, facilities, mechanical, electrical, control, and safety/security systems. - Guides efforts to ensure reliability and...

  • Reliability Engineer

    2 weeks ago


    Ciudad del Carmen, Camp., México LHR Américas Full-time

    *At this time, we are only receiving applications from candidates who are currently situated in the Americas, thank you. Who is our client and your future employer? Our client has a unique position in the energy industry worldwide, being the largest hydrocarbons producer with the least carbon intensity. With their considerable investment in technology and...


  • Ciudad de México Match Group Full-time

    At Tinder, our Engineering team is at the forefront of building innovative features and resilient systems that connect our members globally. We're constantly experimenting with new ideas and features to engage with our members and enhance their experience. Even though we're a large-scale tech company, our member-to-engineer ratio remains high, which means...


  • Ciudad de México McDermott Full-time

    **Company Overview** People power our future. That is why advancing a dynamic, inclusive environment, where everyone grows and thrives is critically important to us. Our ingenuity fuels daily life. Together, we’ve forged some of the most trusted partnerships across the energy value chain to make what was once just an idea a reality: laying subsea...


  • México Valeo Full-time

    Valeo is a global tech company, designing breakthrough solutions to reinvent mobility. We are an automotive supplier and partner to automakers and new mobility actors worldwide. Our vision? Invent a greener and more secure mobility, thanks to solutions focusing on intuitive driving and reducing CO2 emissions. We are a leader in our businesses, and recognized...

  • Site Engineer

    1 month ago


    Ciudad de México Ericsson Full-time



  • México Lapieza.io Full-time

    Requirements B.S. in Computer Engineering or Computer Science, or a related field. 8+ years of experience in software engineering with proficiency in one or more programming languages (e.g., Python, Go, Node) and the ability to write and review code, automate tasks, and develop tools to improve system reliability. 5+ years of extensive hands-on...

  • Network Engineer

    4 weeks ago


    Ciudad de México ITKAWA Full-time

    **Network Engineer - Santander** - **Scheme**: 100% remote. - **Salary**: Negotiable based on experience. - **Duration**: Possibility of extensions and/or full-time hiring. - **Language**: Bilingual English/Spanish. - **Experience**: More than 6 years. **Responsibilities**: - Participate as a technical resource in projects of the...


  • México, B.C. Allegion Canada Inc. Full-time

    Field Services Site Lead + Network Engineer. Locations: Tijuana, Mexico. Time type: Full time. Posted yesterday. Job requisition ID: JR28742. Creating Peace of Mind by Pioneering Safety and Security. At Allegion, we help keep the people you know and love safe and secure where they live,...

  • Lead Engineer Mexico

    4 weeks ago


    Ciudad de México Port Cities Full-time

    Requirements: Advanced Diploma/Bachelor’s Degree in Computer Science/IT/Math/Physics/Engineering. A minimum of 2 years of experience as a Software Engineer. Experience with Odoo is mandatory. Good communication and interpersonal skills. Have experience with an Object-Oriented Programming Language (Python). Have experience with Java/JavaScript/Mobile dev. Have...


  • Ciudad de México Siemens Mobility Full-time

    **Job Description**: **Job ID**: - 368940 **Company**: - Siemens, S.A. de C.V. **Organization**: - Smart Infrastructure **Job Family**: - Engineering **Experience Level**: - Early Professional **Full Time / Part Time**: - Full-time **Remote vs Office**: - Office/Site only **Contract Type**: - Permanent **Change the future with us!** We are looking for...

  • Applications Engineer

    3 weeks ago


    Ciudad de México Jobot Full-time

    **Growing Water Pump Manufacturer Seeking Applications Engineer for great new opportunity** This Jobot Job is hosted by: Leesa Purtzer **Salary**: $80,000 - $100,000 per year **A bit about us**: My client, a leading Manufacturer of fire protection pumps and water infrastructure systems, is seeking an Applications Engineer to join their growing...

  • Design Release Engineer

    4 weeks ago


    Ciudad de México Stellantis Full-time

    The Power Electronics Design and Release Engineer will be responsible for the activities related to Power Electronics component (Converters, Inverters, AC and DC charging devices) designs for Electrified Vehicle applications. Responsibilities include but are not limited to: Working with power electronics suppliers to ensure corporate requirements and...