Site Reliability Engineer

hace 5 meses


Mexico City Trax A tiempo completo

About The Position

The Position

Site Reliability Engineer

About Trax

Trax’s mission is to enable brands and retailers to harness the power of digital technologies to produce the best shopping experiences imaginable. Trax’s retail platform allows customers to understand what is happening on shelf, in every store, all the time so they can focus on what they do best – delighting shoppers. Many of the world’s top CPG companies and retailers use Trax’s dynamic merchandising, in-store execution, shopper engagement, market measurement, analytics, and shelf monitoring solutions at scale to drive positive shopper experiences and unlock revenue opportunities at all points of sale. As pioneers in computer vision, Trax continues to lead the industry in innovation and excellence through development of advanced technologies and autonomous data collection methods. Trax is a global company with hubs in the United States, Singapore and Israel, serving customers in more than 90 countries worldwide. To learn more, visit .

Job Description

The Site Reliability Engineer (SRE) is responsible for implementing and maintaining the Cloud Infrastructure which runs services developed by Trax. SREs are responsible for the reliability and scalability of Trax services. This includes supporting both our production-critical systems and our internal tools for developer productivity. A strong candidate for this position would be a generalist who can maintain our cloud infrastructure while being an advocate for DevOps principles throughout our organization.

Responsibilities:

Implement cost-effective and scalable solutions to complex cloud infrastructure problems. Maintain the reliability of our cloud infrastructure while simultaneously improving and upgrading it.  Perform low-level analysis and debugging of problems in both containerized and VM-based Linux workloads. Automate manual processes to improve developer productivity. Ensure stable and reliable releases by maintaining and improving our CI/CD systems. Be an advocate for DevOps best practices in both the Infrastructure team and across the organization. Manage and participate in a rotating On Call team which is responsible for handling high-priority bugs and issues. 

Requirements:

5+ years of experience managing Linux-based Server Operating Systems. 5+ years of experience managing cloud infrastructure (GCP, AWS, or Azure) 5+ years of experience managing large high-performance databases and data processing jobs for business-critical reporting applications. 5+ years of experience managing environments using Infrastructure and Configuration-as-Code (Terraform/CloudFormation/Puppet/Chef/Etc). 5+ years of experience with CI/CD and test automation systems (Jenkins/Gitlab/Argo/Helm/etc.) Excellent written and verbal communication skills and ability to communicate with stakeholders across the business. Knowledge of monitoring systems including host/OS metrics, logging, and web application performance, using both SaaS products (DataDog/NewRelic/etc.) and open-source solutions (syslog/Loki/Grafana/etc.). Knowledge of container orchestration systems such as Kubernetes, including autoscaling, service mesh, rollout strategies, and cost management. Knowledge of network protocols, including TCP/IP, HTTP/S, DNS, DHCP, and NAT. Thorough understanding of web service fundamentals, such as caching, CDNs, load balancing, and traffic shaping. MySQL Database performance tuning and high-availability experience. Experience with security systems, including WAF, firewall rules, public key infrastructure, and cryptography. Experience writing code in any programming language. Experience writing optimized SQL queries.

Preferred Skills and Experience:

Production experience with Google Cloud Platform (GCP). Ability to code modern, containerized web applications. Strong understanding of the Python programming language. Ability to perform low-level network debugging, including packet analysis and an understanding of the Linux network stack.

Trax is committed to a diverse, inclusive, and equitable workplace where all team members, whatever their gender, race, ethnicity, national origin, age, sexual orientation or identity, education, or disability, feels valued and respected. We are committed to a nondiscriminatory approach and maintaining an inclusive environment with equitable treatment for all. 



  • Mexico City 1210 Kyndryl Mexico S. de R.L. de C.V. A tiempo completo

    Who We Are At Kyndryl, we design, build, manage and modernize the mission-critical technology systems that the world depends on every day. So why work at Kyndryl? We are always moving forward – always pushing ourselves to go further in our efforts to build a more equitable, inclusive world for our employees, our customers and our communities. The...


  • Mexico City Virtualent A tiempo completo

    Site Reliability Engineer (SRE)VirtualentAbout Us:We’re a leading IT Staffing company, passionate about connecting top talent with the best opportunities. We are looking for a Site Reliability Engineer (SRE) to join our team.Responsibilities:• Design, implement, and maintain scalable and highly available infrastructures.• Monitor and ensure the...


  • Mexico City Thales A tiempo completo

    Thales people architect identity management and data protection solutions at the heart of digital security. Business and governments rely on us to bring trust to the billons of digital interactions they have with people. Our technologies and services help banks exchange funds, people cross borders, energy become smarter and much more. More than 30,000...


  • Mexico City F5 A tiempo completo

    At F5, we strive to bring a better digital world to life. Our teams empower organizations across the globe to create, secure, and run applications that enhance how we experience our evolving digital world. We are passionate about cybersecurity, from protecting consumers from fraud to enabling companies to focus on innovation.    Everything we do centers...


  • Mexico City Epam A tiempo completo

    Description DESCRIPTION Are you a DevOps expert with a passion for improving communication between operational and developmental sides of the software development process? Do you thrive in dynamic, collaborative environments? If so, we have an exciting opportunity for you! We're currently seeking a Site Reliability Engineer to join...


  • Mexico City Oracle A tiempo completo

    Responsibilities Solve complex problems related to Linux infrastructure and Oracle Cloud Infrastructure  Act as a partner concern point for critical issues that may not have a detailed procedure and provide Root Cause Analysis (RCA) Understand the end-to-end configuration, technical dependencies, characteristics of production infrastructure and...


  • Mexico City Crunchyroll A tiempo completo

    About CrunchyrollWE HELP EVERYONE BELONG. IT’S OUR PURPOSE.Founded by fans, Crunchyroll delivers the art and culture of anime to a passionate community. We super-serve over 100 million anime and manga fans across 200+ countries and territories, and help them connect with the stories and characters they crave. Whether that experience is online or in-person,...


  • Mexico City Thomson Reuters A tiempo completo

    About the Role In this opportunity as a Site Reliability Engineer, you will:  Provides skilled technical support/delivery capability, with minimal supervision, for the current and future design, testing, delivery, support, and maintenance of production services in the technical operations environment. Provides technical and procedural consistency...


  • Mexico City Thales A tiempo completo

    Thales people architect identity management and data protection solutions at the heart of digital security. Business and governments rely on us to bring trust to the billons of digital interactions they have with people. Our technologies and services help banks exchange funds, people cross borders, energy become smarter and much more. More than 30,000...


  • Mexico City Thomson Reuters A tiempo completo

    About the Role In this opportunity as a Senior Site Reliability Engineer , you will:  Develop, Deliver, and Support: By applying modern SRE operational & development practices, you will be involved in the entire operational support, Monitoring, automation, building, and delivering high-quality solutions for the team. Be a Team Player: Working in...


  • City, México Svitla Systems A tiempo completo

    - Requirements: - 5+ years of experience in a SRE or similar role. - 3+ years of experience supporting containerized production services using Kubernetes. - 2+ years of experience with Infrastructure as Code and configuration tools like Terraform and Ansible. - 1+ year of recent experience in the cloud (Google Cloud Platform preferred, AWS and Azure will...

  • SRE Engineer

    hace 4 meses


    Mexico City Azka IT Consulting A tiempo completo

    AZKA IT is a Mexican company that seeks and connects the best IT talent with Latin American and United States companies.We are looking for your talent as Site Reliability EngineerRequirements:The Site Reliability Engineer (SRE) plays a crucial role in the design, implementation and maintenance of highly available, scalable and reliable systems.  Technical...


  • Mexico City Servicios Comerciales Amazon Mexico S. de R.L. de C.V. - D44 A tiempo completo

    Reliability, Maintenance, and Engineering (RME) Central Services is hiring for Systems Engineers!At Amazon we believe that Every Day is still Day One! We’re working to be the most customer-centric company on earth. To get there, we need talented, bright and driven people.The System Development Engineer position provides proactive technical support for...


  • Mexico City Thales A tiempo completo

    Thales people architect identity management and data protection solutions at the heart of digital security. Business and governments rely on us to bring trust to the billons of digital interactions they have with people. Our technologies and services help banks exchange funds, people cross borders, energy become smarter and much more. More than 30,000...

  • Project Engineer

    hace 5 meses


    Mexico City Unilever A tiempo completo

    Unilever is currently hiring for Project Engineer Function: Project Enginner Work Level: 1C Reports to :ARACELI SOTO VAZQUEZ Scope :NUTRITION LATAM Location : LERMA. Terms & Conditions : Full time position.  ABOUT UNILEVER Unilever is the place where you can bring your purpose to life with the work that you do – creating a better...

  • Support Engineer

    hace 3 meses


    Mexico City Ericsson A tiempo completo

    Description Join our Team About this opportunity: At Ericsson, we are seeking a knowledgeable and dedicated Support Engineer to join our team. In this pivotal role, you will be charged with providing data-driven solutions to customer reported issues in line with set processes and Service Level Agreements. The role involves identifying,...


  • Mexico City KION Group A tiempo completo

    We are seeking a dedicated Mechatronics Engineers to join our team and provide solutions at client sites (Not experience required) You will work closely with customers and other engineers to effectively resolve customer issues. The ideal candidate must be willing to travel up to 90% of the time to the USA. As a Mechatronics Commissioning Engineer, you will...

  • VoIP Engineer

    hace 3 semanas


    Mexico City Alia Integrando Talento A tiempo completo

    VoIP Engineer (100% remote work)Company: VoiceSpinAbout UsAt Alia Integrando Talento, we pride ourselves on connecting talented professionals with innovative companies. We are currently seeking a skilled VoIP Engineer to join our dynamic team and support our client, VoiceSpin, a leading global provider of unified communications and Linux-based IP...

  • SRE Software Engineer

    hace 5 meses


    Mexico City Ford Motor Company A tiempo completo

    SRE Software Engineer is responsible for Designing, configuring, monitoring, implementing, and maintaining our observability solutions and troubleshooting Ford Credit IT systems and applications to ensure optimal performance and reliability  MAJOR RESPONSIBILITIES   They will be utilizing Observability and Monitoring tools to detect and resolves...


  • Mexico City Siemens Gamesa A tiempo completo

    It takes the brightest minds to be a technology leader. It takes imagination to create green energy for the generations to come. At Siemens Gamesa we make real what matters, join our global team. Siemens Gamesa has a vision for renewable energy: we believe in the power of nature and technology. Help us to be ready to face the energy challenges of...