Site Reliability Engineer L2
4 months ago
Position Summary
The candidate will work as a member of the SRE team, helping the organization ensure the reliability, availability, and performance of large-scale ODC services.
The SRE will work closely with development teams to design, build, and maintain scalable and reliable infrastructure, automate processes, monitor system health, and respond to incidents effectively, with a focus on efficiency in day-to-day activities.
The SRE will apply ITIL and Agile methodologies and processes, and coach and mentor others on best practices. The role covers the whole service lifecycle on public cloud, ensuring that external customer SLAs and internal OLAs are met.
Essential Functions / Key Areas of Responsibility
• Develop and maintain Infrastructure as Code and automation tools.
• Integrate, operate, and support 24x7 mission-critical services with five-nines (5x9) availability on public cloud.
• Ensure Tier 1 / Platinum SLAs are met.
• Review technical products and understand customer requirements.
• Perform regular tuning.
• Work with distributed teams worldwide.
• Define the business continuity strategy for operated services on public cloud.
• Lead and motivate the team daily through Agile ceremonies (daily, refinement, planning...).
• Foster the team's self-organization.
• Suggest indicators for team monitoring.
• Facilitate exchanges with the many stakeholders.
• Continuously improve the reliability, performance, and security of the services.
• Collaborate with Service Delivery Managers on traffic trends and analyze the impact of mid-term business changes on capacity requirements.
• Participate in capacity management processes and security audits.
• Design and implement changes to the systems.
• Adapt solution parameters to support architecture evolutions.
• Maintain and enhance internal tools to improve service industrialization.
• Participate in presales, deployment, and integration of the solutions from the Support perspective.
• Define production requirements.
Minimum Requirements: Skills, Experience, Education, Technical/Specialized Knowledge, Certifications, Language
• Bachelor's degree in Information Technology or a related field.
• 5+ years of experience in the design, development, and implementation of applications.
• 5+ years of proven experience in public cloud (GCP or AWS).
• Minimum C1 English (advanced level).
• Strong experience with Kubernetes (certification).
• Strong experience with Apache HTTP Server.
• Strong experience with TLS 1.2 or later.
• Ability to work with SRE engineers during integration and operation project phases.
• Strong experience working in Agile teams.
• ITIL/Agile certification.
• Experience in embedding Agile performance metrics to drive accountability.
• Effective verbal and written communication skills.
• Strong working experience with at least one scripting language (Shell or Python) is required.
• Problem-solving and analytical ability.
• Strong results orientation with follow-up skills.
• Experience with SOAP and REST APIs.
• Experience in NoSQL and SQL query construction.
• Experience implementing and using the Datadog monitoring tool.
• Experience with GitHub.
• Experience with pipelines.
• Experience operating secrets in Vault.
• Experience with GCP Terraform.
At Thales we provide CAREERS and not only jobs. With Thales employing 80,000 employees in 68 countries, our mobility policy enables thousands of employees each year to develop their careers at home and abroad, in their existing areas of expertise or by branching out into new fields. Together we believe that embracing flexibility is a smarter way of working. Great journeys start here, apply now.
-
Site Reliability Engineer
7 months ago
Mexico City Trax Full-time
About the Position: Site Reliability Engineer. About Trax: Trax’s mission is to enable brands and retailers to harness the power of digital technologies to produce the best shopping experiences imaginable. Trax’s retail platform allows customers to understand what is happening on shelf, in every store, all the time so they...
-
Site Reliability Engineer
4 months ago
Mexico City Thales Full-time
Thales people architect identity management and data protection solutions at the heart of digital security. Businesses and governments rely on us to bring trust to the billions of digital interactions they have with people. Our technologies and services help banks exchange funds, people cross borders, energy become smarter and much more. More than 30,000...
-
Site Reliability Engineer
7 months ago
Mexico City Epam Full-time
Description: Are you a DevOps expert with a passion for improving communication between operational and developmental sides of the software development process? Do you thrive in dynamic, collaborative environments? If so, we have an exciting opportunity for you! We're currently seeking a Site Reliability Engineer to join...
-
Site Reliability Engineer
6 months ago
Mexico City Virtualent Full-time
Site Reliability Engineer (SRE)
Virtualent
About Us: We’re a leading IT Staffing company, passionate about connecting top talent with the best opportunities. We are looking for a Site Reliability Engineer (SRE) to join our team.
Responsibilities:
• Design, implement, and maintain scalable and highly available infrastructures.
• Monitor and ensure the...
-
Site Reliability Engineer III/Network
7 months ago
Mexico City F5 Full-time
At F5, we strive to bring a better digital world to life. Our teams empower organizations across the globe to create, secure, and run applications that enhance how we experience our evolving digital world. We are passionate about cybersecurity, from protecting consumers from fraud to enabling companies to focus on innovation. Everything we do centers...
-
Staff Site Reliability Engineer
2 months ago
Mexico City Crunchyroll Full-time
About Crunchyroll
WE HELP EVERYONE BELONG. IT’S OUR PURPOSE.
Founded by fans, Crunchyroll delivers the art and culture of anime to a passionate community. We super-serve over 100 million anime and manga fans across 200+ countries and territories, and help them connect with the stories and characters they crave. Whether that experience is online or in-person,...
-
Principal Site Reliability Engineer
8 months ago
Mexico City Oracle Full-time
Responsibilities:
• Solve complex problems related to Linux infrastructure and Oracle Cloud Infrastructure
• Act as a partner concern point for critical issues that may not have a detailed procedure and provide Root Cause Analysis (RCA)
• Understand the end-to-end configuration, technical dependencies, characteristics of production infrastructure and...
-
On-Site Technical Support L2
3 months ago
Mexico City Stefanini Full-time
JOB DESCRIPTION
Be part of Stefanini! At Stefanini we are more than 30,000 geniuses, connected from 41 countries, doing what they love and co-creating a better future. Surely you don't want to miss out, On-Site Technical Support L2! Why will we choose you? Because the challenges you will take on reflect your ambitions! RESPONSIBILITIES AND...
-
Site Reliability Engineer
7 months ago
Mexico City Thomson Reuters Full-time
About the Role: In this opportunity as a Site Reliability Engineer, you will: Provide skilled technical support/delivery capability, with minimal supervision, for the current and future design, testing, delivery, support, and maintenance of production services in the technical operations environment. Provide technical and procedural consistency...
-
Site Reliability Engineer
6 months ago
City, Mexico Svitla Systems Full-time
Requirements:
- 5+ years of experience in an SRE or similar role.
- 3+ years of experience supporting containerized production services using Kubernetes.
- 2+ years of experience with Infrastructure as Code and configuration tools like Terraform and Ansible.
- 1+ year of recent experience in the cloud (Google Cloud Platform preferred, AWS and Azure will...
-
Senior Site Reliability Engineer
7 months ago
Mexico City Thomson Reuters Full-time
About the Role: In this opportunity as a Senior Site Reliability Engineer, you will: Develop, Deliver, and Support: By applying modern SRE operational & development practices, you will be involved in the entire operational support, monitoring, automation, building, and delivering high-quality solutions for the team. Be a Team Player: Working in...
-
Service Reliability Engineer
4 months ago
Mexico City Thales Full-time
Thales people architect identity management and data protection solutions at the heart of digital security. Businesses and governments rely on us to bring trust to the billions of digital interactions they have with people. Our technologies and services help banks exchange funds, people cross borders, energy become smarter and much more. More than 30,000...
-
Mexico City Servicios Comerciales Amazon Mexico S. de R.L. de C.V. - D44 Full-time
Reliability, Maintenance, and Engineering (RME) Central Services is hiring for Systems Engineers! At Amazon we believe that Every Day is still Day One! We’re working to be the most customer-centric company on earth. To get there, we need talented, bright and driven people. The System Development Engineer position provides proactive technical support for...
-
Project Engineer
7 months ago
Mexico City Unilever Full-time
Unilever is currently hiring for Project Engineer
Function: Project Engineer
Work Level: 1C
Reports to: ARACELI SOTO VAZQUEZ
Scope: NUTRITION LATAM
Location: LERMA
Terms & Conditions: Full-time position
ABOUT UNILEVER
Unilever is the place where you can bring your purpose to life with the work that you do – creating a better...
-
SRE Engineer
6 months ago
Mexico City Azka IT Consulting Full-time
AZKA IT is a Mexican company that seeks and connects the best IT talent with Latin American and United States companies. We are looking for your talent as a Site Reliability Engineer.
Requirements: The Site Reliability Engineer (SRE) plays a crucial role in the design, implementation and maintenance of highly available, scalable and reliable systems. Technical...
-
Drupal Developer
4 weeks ago
Mexico City Cognizant Full-time
We’re hiring! At Cognizant we have an ideal opportunity for you to be part of one of the largest companies in the digital sector worldwide. A Great Place To Work where we look for people who contribute new ideas, experiencing a dynamic and growing environment. At Cognizant we promote an inclusive culture, where we value different perspectives providing...
-
Senior IT Support Engineer II
4 months ago
Mexico City KeepTruckin Full-time
Who we are: Motive empowers the people who run physical operations with tools to make their work safer, more productive, and more profitable. For the first time ever, safety, operations and finance teams can manage their drivers, vehicles, equipment, and fleet-related spend in a single system. Combined with industry-leading AI, the Motive platform gives...
-
Integration Engineer
4 months ago
Mexico City Blend Full-time
contribute:
• Design, develop, and maintain integration flows using the Mule Anypoint Platform using capabilities such as the Mule Runtime, Connectors, Design Center, and API management
• Collaborate and review code with other engineers on the team to ensure each integration maintains a consistent level of technical standards as set by the team ...
-
Drupal Developer
1 month ago
Mexico City Cognizant Full-time
We’re hiring! At Cognizant we have an ideal opportunity for you to be part of one of the largest companies in the digital sector worldwide. A Great Place To Work where we look for people who contribute new ideas, experiencing a dynamic and growing environment. At Cognizant we promote an inclusive culture, where we value different perspectives providing...
-
Senior Site Reliability Engineer
2 months ago
Veracruz, Veracruz de Ignacio de la Llave, Mexico Zillow Full-time
About the team
The Transformation Enablement Team (TE) at Zillow Group empowers ZG Product Teams to efficiently run “Zillow 2.0” services by reducing human error, aggressively focusing on automation, and providing deep insight into application behavior and health. We do that by incorporating aspects of software engineering and applying them to...