Site Reliability Engineer

4 weeks ago


Mexico Trax Full time


City:
Mexico City

About Trax:

Trax’s mission is to enable brands and retailers to harness the power of digital technologies to produce the best shopping experiences imaginable. Trax’s retail platform allows customers to understand what is happening on shelf, in every store, all the time so they can focus on what they do best – delighting shoppers. Many of the world’s top CPG companies and retailers use Trax’s dynamic merchandising, in-store execution, shopper engagement, market measurement, analytics, and shelf monitoring solutions at scale to drive positive shopper experiences and unlock revenue opportunities at all points of sale. As pioneers in computer vision, Trax continues to lead the industry in innovation and excellence through development of advanced technologies and autonomous data collection methods. Trax is a global company with hubs in the United States, Singapore and Israel, serving customers in more than 90 countries worldwide. To learn more, visit www.traxretail.com.


Job Description

The Site Reliability Engineer (SRE) is responsible for implementing and maintaining the cloud infrastructure that runs services developed by Trax. SREs are responsible for the reliability and scalability of Trax services. This includes supporting both our production-critical systems and our internal tools for developer productivity. A strong candidate for this position is a generalist who can maintain our cloud infrastructure while advocating for DevOps principles throughout our organization.


Responsibilities:

  • Implement cost-effective and scalable solutions to complex cloud infrastructure problems.
  • Maintain the reliability of our cloud infrastructure while simultaneously improving and upgrading it. 
  • Perform low-level analysis and debugging of problems in both containerized and VM-based Linux workloads.
  • Automate manual processes to improve developer productivity.
  • Ensure stable and reliable releases by maintaining and improving our CI/CD systems.
  • Be an advocate for DevOps best practices in both the Infrastructure team and across the organization.
  • Manage and participate in a rotating on-call team that is responsible for handling high-priority bugs and issues.


Requirements:

  • 5+ years of experience managing Linux-based Server Operating Systems.
  • 5+ years of experience managing cloud infrastructure (GCP, AWS, or Azure).
  • 5+ years of experience managing large high-performance databases and data processing jobs for business-critical reporting applications.
  • 5+ years of experience managing environments using Infrastructure and Configuration-as-Code (Terraform/CloudFormation/Puppet/Chef/etc.).
  • 5+ years of experience with CI/CD and test automation systems (Jenkins/GitLab/Argo/Helm/etc.).
  • Excellent written and verbal communication skills and ability to communicate with stakeholders across the business.
  • Knowledge of monitoring systems including host/OS metrics, logging, and web application performance, using both SaaS products (Datadog/New Relic/etc.) and open-source solutions (syslog/Loki/Grafana/etc.).
  • Knowledge of container orchestration systems such as Kubernetes, including autoscaling, service mesh, rollout strategies, and cost management.
  • Knowledge of network protocols, including TCP/IP, HTTP/S, DNS, DHCP, and NAT.
  • Thorough understanding of web service fundamentals, such as caching, CDNs, load balancing, and traffic shaping.
  • Experience with MySQL database performance tuning and high availability.
  • Experience with security systems, including WAF, firewall rules, public key infrastructure, and cryptography.
  • Experience writing code in any programming language.
  • Experience writing optimized SQL queries.

Preferred Skills and Experience:

  • Production experience with Google Cloud Platform (GCP).
  • Ability to code modern, containerized web applications.
  • Strong understanding of the Python programming language.
  • Ability to perform low-level network debugging, including packet analysis and an understanding of the Linux network stack.


Trax is committed to a diverse, inclusive, and equitable workplace where all team members, whatever their gender, race, ethnicity, national origin, age, sexual orientation or identity, education, or disability, feel valued and respected. We are committed to a nondiscriminatory approach and to maintaining an inclusive environment with equitable treatment for all.


  • Site Reliability Engineer

    3 weeks ago


    Mexico Doyensys, Inc. Full time

    About Doyensys: Doyensys is a Management & Technology Consulting company with expertise in Enterprise applications, Infrastructure Platform Support, and solutions. Doyensys helps clients to harness the power of innovation to thrive on change. The company leverages its technology expertise, global talent, and extensive industry experience to deliver powerful...


  • Mexico Spekit Full time

    Details: This is a contract position. Must be able to work in PST hours. About the role: Spekit’s Infrastructure Team is looking for a highly skilled Senior Site Reliability Engineer (SRE). This role is critical to ensuring the reliability, scalability, and performance of our systems and services. This position requires a deep understanding of...

  • Site Reliability Engineer

    4 weeks ago


    Mexico City TraxRetail Full time

    Description The Position Site Reliability Engineer About Trax Trax’s mission is to enable brands and retailers to harness the power of digital technologies to produce the best shopping experiences imaginable. Trax’s retail platform allows customers to understand what is happening on shelf, in every store, all the time so they can focus on what they...


  • Mexico City SimCorp Full time

    Sr. Site Reliability Engineer (Azure). Location: Manila. Time type: Full time. Posted 30+ days ago. Job requisition ID: R-206416. Who we are: For over 50 years, we have worked closely with investment and asset managers to become the world’s leading provider of integrated investment...


  • Mexico City Acxiom Full time

    Senior Site Reliability Engineer. Location: Mexico City. Time type: Full time. Posted 30+ days ago. Job requisition ID: JR011205. Acxiom is seeking a well-rounded Senior Site Reliability Engineer, with a focus on Nutanix, to support customer-facing systems. This role will support critical systems...


  • Mexico City SimCorp Full time

    Senior Site Reliability Engineer (SRE/Azure). Location: Manila. Posted 30+ days ago. Job requisition ID: R-206253. Who we are: For over 50 years, we have worked closely with investment and asset managers to become the world’s leading...

  • Site Reliability Engineer

    4 weeks ago


    Mexico City Kunai Full time

    Production Support and Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to manage and support large-scale, massively distributed, fault-tolerant systems hosted in the external cloud environment. SRE engineers ensure that systems (both internally critical and externally visible) have...


  • Mexico Valeo Full time

    Valeo is a global tech company designing breakthrough solutions to reinvent mobility. We are an automotive supplier and partner to automakers and new mobility actors worldwide. Our vision? Invent a greener and more secure mobility, thanks to solutions focusing on intuitive driving and reducing CO2 emissions. We are leaders in our businesses, and recognized...


  • Mexico City Match Group Full time

    At Tinder, our Engineering team is at the forefront of building innovative features and resilient systems that connect our members globally. We're constantly experimenting with new ideas and features to engage with our members and enhance their experience. Even though we're a large-scale tech company, our member-to-engineer ratio remains high, which means...


  • Mexico Lapieza.io Full time

    Requirements B.S. in Computer Engineering or Computer Science, or a related field. 8+ years of experience in software engineering with proficiency in one or more programming languages (e.g., Python, Go, Node) and the ability to write and review code, automate tasks, and develop tools to improve system reliability. 5+ years of extensive hands-on...


  • Mexico, B.C. Allegion Canada Inc. Full time

    Field Services Site Lead + Network Engineer. Location: Tijuana, Mexico. Time type: Full time. Posted yesterday. Job requisition ID: JR28742. Creating Peace of Mind by Pioneering Safety and Security. At Allegion, we help keep the people you know and love safe and secure where they live,...


  • Mexico Bosch Group Full time

    Company Description Bosch was founded in Stuttgart in 1886 by Robert Bosch (1861-1942) and for over 130 years has been distinguished by a unique corporate culture based on solid values that drive us to improve every day. Our products, present in a wide variety of lands, contribute to improve the quality of life of millions of people **Job...


  • Mexico City McDermott Full time

    **Company Overview** People power our future. That is why advancing a dynamic, inclusive environment, where everyone grows and thrives is critically important to us. Our ingenuity fuels daily life. Together, we’ve forged some of the most trusted partnerships across the energy value chain to make what was once just an idea a reality: laying subsea...

  • Reliability Engineer

    4 weeks ago


    Mexico Advanced Technology Services Full time

    Founded in 1985, ATS is a company with a presence in the United States, Mexico and the United Kingdom. We are professionals in industrial maintenance and we make factories run better.


  • Mexico Visteon Corporation Full time

    **Hardware Engineer** Transforming how we interact with our vehicles to make the driving experience more enjoyable, connected, and safe. **Enabling a software-defined, electrified future.** **The Mission of the Role**: The Hardware Engineer's primary mission is to administrate the lifecycle of electronic components on the database and perform...

  • Site Engineer

    1 month ago


    Mexico City Ericsson Full time

  • Sr. SRE

    2 weeks ago


    Mexico NTD Software Full time

    The Site Reliability Engineer (SRE) is responsible for ensuring the reliability, scalability, and performance of production systems. The role focuses on monitoring, alerting, and dashboard creation with a strong emphasis on SRE tools like Grafana, Prometheus, and Datadog. The ideal candidate should have hands-on experience with Python scripting and be able...

  • Network Engineer

    4 weeks ago


    Mexico City ITKAWA Full time

    **Network Engineer - Santander** - **Work arrangement**: 100% remote. - **Salary**: Open to negotiation based on experience. - **Duration**: Possibility of extensions and/or full-time hiring. - **Language**: Bilingual English/Spanish. - **Experience**: More than 6 years. **Responsibilities**: - Participate as a technical resource in projects of the...

  • Lead DevOps Engineer

    4 weeks ago


    Mexico AgileEngine Full time

    **What you will do** - Designing, building, and deploying solutions that increase product reliability and organizational efficiency; - Motivating and guiding the creation of effective CI/CD pipelines; - Providing mentorship and insight into DevSecOps best practices; - Working with product teams to expose their requirements and support the above; - Improving...

  • Lead Engineer Mexico

    4 weeks ago


    Mexico City Port Cities Full time

    Requirements Advanced Diploma/Bachelor's Degree in Computer Science/IT/Math/Physics/Engineering. A minimum of 2 years of experience as a Software Engineer. Experience with Odoo is mandatory. Good communication and interpersonal skills. Experience with an object-oriented programming language (Python). Experience with Java/JavaScript/mobile development. Have...