Site Reliability Engineer
hace 4 semanas
Production Support and Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to manage and support large-scale, massively distributed, fault-tolerant systems hosted in the external cloud environment. SRE engineers ensure that systems—both internally critical and externally-visible —have reliability and uptime appropriate to customers’ needs and a fast rate of improvement while keeping an ever-watchful eye on availability, capacity, and performance in a 24/7 environment.
Production Support/SRE engineers will be engaged in automation and development work, reducing toil, developing self -service capabilities, automating manual tasks, and develop support tools, utilizing common scripting languages (i.e. Python). When incidents arise, our production support engineers take charge of incident management. They assume ownership, collaborate, consult, and partner with Lines of Business to guide the team to a successful resolution.
Job Responsibilities:
- Effectively manage troubleshooting and recovery of complex production incidents, ranging from low to critical impacts
- Drive incident resolution through a systematic problem solving approach, coupled with a strong sense of ownership and drive
- Actively participate in teams’ Agile stories (project work) to streamline and enhance day to day operations of the team
- Create, manage, and utilize appropriate technical procedural documentation (run books)
- Proactively monitor applications and infrastructure behind external and internal customer facing services including their availability, latency, performance, and capacity
- Influence resiliency and scalability in production environments in Amazon Web Services (AWS)
- Identify opportunities and develop proactive automated monitoring and alerting solutions by leveraging available tools (Splunk, New Relic, etc.)
- Assist with conducting Root Cause Analysis (RCA) on critical production outages to develop and implement future mitigation strategies
- Utilize production support expertise to influence and support new designs, architectures, standards, and methods to maintain stability and availability for large-scale distributed systems
- Proactively identify and implement automations for routine maintenance tasks, data gathering, and resolution of common issues
- Continuously seek to develop new skills and technical expertise, and proactively share knowledge with others
Basic Qualifications:
- Bachelors or equivalent certification
- At least 2 years of experience managing and troubleshooting incident bridge calls
- At least 2 years of experience with Python scripting
- At least 2 years of experience using and supporting public cloud environments (AWS, Azure, or GCP)
- At least 2 years of experience with Splunk, New Relic, or DataDog monitoring
Preferred Qualifications:
- AWS Associate level certification (Solutions Architect, SysOps Administrator, or Developer)
- 2+ years of experience with Linux, UNIX, Ruby, Go, JavaScript, or NoSQL
- 3+ years of experience using and supporting public cloud environments (AWS, Azure, or GCP)
- 2+ years of experience with networks, load balancers, firewalls and web application firewall (WAF)
- 2+ years experience with web API service
This is a mostly-remote role that requires you to work 40 hours per week during nights and weekends. The coverage schedule you will be supporting will happen during the following times: Monday - Friday, 11pm-6am, Saturday and Sunday 24 hours. On-call support is highly unlikely.
#J-18808-Ljbffr-
Site Reliability Engineer
hace 3 semanas
México Doyensys, Inc. A tiempo completoAbout Doyensys: Doyensys is a Management & Technology Consulting company with expertise in Enterprise applications, Infrastructure Platform Support, and solutions. Doyensys helps clients to harness the power of innovation to thrive on change. The company leverages its technology expertise, global talent, and extensive industry experience to deliver powerful...
-
Site Reliability Engineer
hace 4 semanas
Ciudad de México TraxRetail A tiempo completoDescription The Position Site Reliability Engineer About Trax Trax’s mission is to enable brands and retailers to harness the power of digital technologies to produce the best shopping experiences imaginable. Trax’s retail platform allows customers to understand what is happening on shelf, in every store, all the time so they can focus on what they...
-
Senior Site Reliability Engineer
hace 4 semanas
México spekit A tiempo completoDetails: This is a contract position. Must be able to work in PST hours About the role: Spekit’s Infrastructure Team is looking for a highly skilled Senior Site Reliability Engineer (SRE). This role plays a critical role in ensuring the reliability, scalability, and performance of our systems and services. This position requires a deep understanding of...
-
Site Reliability Engineer
hace 4 semanas
Nuevo México Trax A tiempo completoThe Position Site Reliability EngineerCity: Mexico CityAbout Trax: Trax’s mission is to enable brands and retailers to harness the power of digital technologies to produce the best shopping experiences imaginable. Trax’s retail platform allows customers to understand what is happening on shelf, in every store, all the time so they can focus on what they...
-
Sr. Site Reliability Engineer
hace 2 semanas
Ciudad de México SimCorp A tiempo completoSr. Site Reliability Engineer (Azure) page is loaded Sr. Site Reliability Engineer (Azure) Apply locations Manila time type Full time posted on Posted 30+ Days Ago job requisition id R-206416 Who we are: For over 50 years, we have worked closely with investment and asset managers to become the world’s leading provider of integrated investment...
-
Senior Site Reliability Engineer
hace 4 semanas
Ciudad de México Acxiom A tiempo completoSenior Site Reliability Engineer page is loaded Senior Site Reliability Engineer Apply locations Mexico City time type Full time posted on Posted 30+ Days Ago job requisition id JR011205 Acxiom is seeking a well-rounded Senior Site Reliability Engineer, with a focus in Nutanix, to support customer-facing systems. This role will support critical systems...
-
Senior Site Reliability Engineer
hace 2 semanas
Ciudad de México SimCorp A tiempo completoSenior Site Reliability Engineer (SRE/Azure) page is loaded Senior Site Reliability Engineer (SRE/Azure) Apply locations Manila posted on Posted 30+ Days Ago job requisition id R-206253 Senior Site Reliability Engineer (SRE/Azure) Who we are: For over 50 years, we have worked closely with investment and asset managers to become the world’s leading...
-
Reliability Quality Engineer
hace 4 semanas
Ciudad Juarez, México PCE TECHNOLOGY DE JUÁREZFOXCONN GROUP (SAN JERÓNIMO) A tiempo completo**PCE TECHNOLOGY DE JUÁREZ/FOXCONN GROUP (SAN JERÓNIMO)** **Solicita**: **Reliability Quality Engineer (Tesla)** **Descripción y Requisitos** - Guides efforts to ensure reliability and maintainability of equipment, processes, utilities, facilities, mechanical, electrical, control, and safety/security systems. - Guides efforts to ensure reliability and...
-
Reliability Engineer
hace 2 semanas
Ciudad del Carmen, Camp., México LHR Américas A tiempo completo*At this time, we are only receiving applications from candidates who are currently situated in the Americas, thank you. Who is our client and your future employer? Our client has a unique position in the energy industry worldwide, being the largest hydrocarbons producer with the least carbon intensity. With their considerable investment in technology and...
-
Software Engineer, Data Services
hace 4 semanas
Ciudad de México Match Group A tiempo completoAt Tinder, our Engineering team is at the forefront of building innovative features and resilient systems that connect our members globally. We're constantly experimenting with new ideas and features to engage with our members and enhance their experience. Even though we're a large-scale tech company, our member-to-engineer ratio remains high, which means...
-
Principal Telecom Engineer
hace 4 semanas
Ciudad de México McDermott A tiempo completo**Company Overview** People power our future. That is why advancing a dynamic, inclusive environment, where everyone grows and thrives is critically important to us. Our ingenuity fuels daily life. Together, we’ve forged some of the most trusted partnerships across the energy value chain to make what was once just an idea a reality: laying subsea...
-
Site Financial Controller
hace 6 días
México Valeo A tiempo completoValeo is a tech global company, designing breakthrough solutions to reinvent the mobility. We are an automotive supplier partner to automakers and new mobility actors worldwide. Our vision? Invent a greener and more secured mobility, thanks to solutions focusing on intuitive driving and reducing CO2 emissions. We are leader on our businesses, and recognized...
-
Site Engineer
hace 1 mes
Ciudad de México Ericsson A tiempo completoCMID Casale Media Collects visitor data related to the user's visits to the website, such as the number of visits, average time spent on the website and what pages have been loaded, with the purpose of displaying targeted ads. 1 year HTTP CMPRO Casale Media Collects data on visitor behaviour from multiple websites, in order to present more relevant...
-
México Lapieza.io A tiempo completoRequirements B.S. in Computer Engineering or Computer Science, or a related field. 8+ years of experience in software engineering with proficiency in one or more programming languages (e.g., Python, Go, Node) and the ability to write and review code, automate tasks, and develop tools to improve system reliability. 5+ years of extensive hands-on...
-
Network Engineer
hace 4 semanas
Ciudad de México ITKAWA A tiempo completo**Network Engineer - Santander** - **Esquema**:100% Remoto. - **Salario**:Abierto a negociar de acuerdo a experiência. - **Duración**:Posibilidad de prórrogas y/o contratación a tiempo completo. - **Idioma**: Bilingüe inglés/español. - **Experiência**:Mayor a 6 años. **Responsabilidades**: - Participar como recurso técnico en proyectos de las...
-
Field Services Site Lead + Network Engineer
hace 3 semanas
México, B.C. Allegion Canada Inc. A tiempo completoField Services Site Lead + Network Engineer page is loaded Field Services Site Lead + Network Engineer Apply locations Tijuana, Mexico time type Full time posted on Posted Yesterday job requisition id JR28742 Creating Peace of Mind by Pioneering Safety and Security At Allegion, we help keep the people you know and love safe and secure where they live,...
-
Lead Engineer Mexico
hace 4 semanas
Ciudad de México Port Cities A tiempo completoRequirements Advanced Diploma/Bachelors Degree in Computer Science/IT/Math/Physics/Engineering. A minimum of 2 years experience as a Software Engineer. Experience with Odoois mandatory. Good communication and interpersonal skills. Have experience with Object Oriented Programming Language (Python). Have experience with Java/Javascript/Mobile dev. Have...
-
Jr Electrical Product Engineer
hace 4 semanas
Ciudad de México Siemens Mobility A tiempo completo**Job Description**: **Job ID**: - 368940**Company**: - Siemens, S.A. de C.V.**Organization**: - Smart Infrastructure**Job Family**: - Engineering**Experience Level**: - Early Professional**Full Time / Part Time**: - Full-time**Remote vs Office**: - Office/Site only**Contract Type**: - Permanent**Change the future with us!** We are looking for...
-
Applications Engineer
hace 3 semanas
Ciudad de México Jobot A tiempo completo**Growing Water Pump Manufacturer Seeking Applications Engineer for great new opportunity** This Jobot Job is hosted by: Leesa Purtzer **Salary**: $80,000 - $100,000 per year **A bit about us**: My client, a leading Manufacturer of fire protection pumps and water infrastructure systems, is seeking an Applications Engineer to join their growing...
-
Design Release Engineer
hace 4 semanas
Ciudad de México Stellantis A tiempo completoThe Power Electronics Design and Release Engineer will be responsible for the activities related to Power Electronics components (Converters, Inverters, AC and DC charging devices) designs for Electrified Vehicle applications. Responsibilities include but not limited to: Working with power electronics suppliers to ensure corporate requirements and...