Site Reliability Engineer

4 days ago

Ciudad de México, Ciudad de México HCLTech Full-time

About the job

HCLTech is a global technology company, home to more than 218,000 people across 59 countries, delivering industry-leading capabilities centered around digital, engineering, cloud and AI, powered by a broad portfolio of technology services and products. We work with clients across all major verticals, providing industry solutions for Financial Services, Manufacturing, Life Sciences and Healthcare, Technology and Services, Telecom and Media, Retail and CPG, and Public Services. Consolidated revenues as of 12 months ending September 2024 totaled $13.7 billion.

We are looking for a highly motivated Site Reliability Engineer who thrives on troubleshooting complex systems, taking initiative, and driving solutions independently. Our infrastructure is built on AWS EKS, and we need someone who is passionate, motivated, and excited about the data around our Kubernetes clusters, Argo CD, and the many pieces that make up our Tier 1 Ordering application.

ESSENTIAL DUTIES AND RESPONSIBILITIES

Self-starter mentality: take ownership of problems and tasks, drive solutions, and continuously improve our processes.
Data-driven.
Strong focus on sharing what is working well, what is not, and what plans we have to improve the areas that are not working well.
Establish end-to-end monitoring and alerting for all critical aspects of supported pipelines.
Identify low-hanging fruit.
Manage and troubleshoot our AWS EKS clusters, ensuring reliability and performance.
Strong focus on improving team practices.
Ensure technical solutions meet quality, security, and compliance requirements; work directly with key stakeholders and technical teams to ensure solutions have passed the required quality checks.
Partner with other SREs on configuration management at scale.
Work with software engineers (SWEs) in product development and with SREs to define all the steps required to release software: how the software is stored in the source code repository; how artifacts are stored in artifact management repositories; how middleware is configured and patched; how build rules for compilation are enforced; automated testing and quality gates; packaging; and deployment processes.
Proactively work on toil reduction, efficiency, and capacity planning.
Promote a culture of shared responsibility.

MINIMUM QUALIFICATIONS

1+ years of experience in Site Reliability Engineering practices (required)
3+ years of experience in a DevOps-related field (required || Good to Have)
3+ years of experience with automation tools and Continuous Integration systems (e.g., Bamboo, Chef, Git) (required)
2+ years of experience with Kubernetes & Docker (required)

Education

B.S. in computer science or a related technical field required.

PREFERRED QUALIFICATIONS

Strong focus on Innovation (process & technology)
Experience working with business partners and vendors.
Coordination and planning with project management on resource allocations.
Focus on the holistic project level rather than on day-to-day interactions, unless specific details are needed.
Focus on project health and roadblock remediation.
Strong Soft Skills (negotiate, communicate, listen, solve problems, customer-first mindset, flexibility).
Advanced Infrastructure Knowledge (Data Center, Cloud, infrastructure as code, Containers [Docker & Kubernetes]).
Security Awareness.
Advanced knowledge of modern DevOps stack tools, for example but not limited to: Chef, Bamboo, Bitbucket, Containers [Docker & Kubernetes], and AWS expertise.
Big Picture Thinking.
Network Awareness.
Practice incident response and blameless postmortems.

We are actively working to identify additional strong profiles for this role and will coordinate to share the shortlisted profiles with you, focusing on:

Observability – Interaction with tools like New Relic, Datadog, Dynatrace, etc. The specific tool is not important; the practices behind observability are what we are interested in.

Team focused - Past work experience with product teams, or with teams where coordination among many people was necessary.

We offer a competitive package that includes

Life insurance
Major Medical Expenses Insurance
Minor Medical Expense Insurance
Savings Fund 13%
Food vouchers 10%
30 days as Christmas Bonus
12 days of vacation in the first year, increasing by 2 days as dictated by law.
Additionally, we provide continuous training and development opportunities to help our employees achieve their professional goals.

Fluent English is required.

Site Reliability Engineer

1 week ago

Ciudad de México, Ciudad de México Azkait Full-time

AZKAIT is a Mexican company that finds and connects the best IT talent with companies in Latin America and the United States. We are looking for your talent as a Site Reliability Engineer (SRE). Requirements: Bachelor's or engineering degree in Systems, Computer Science, or a related field. 5+ years of experience in SRE, DevOps, or Software Engineering roles. Experience...
Senior Site Reliability Engineer

6 days ago

Ciudad de México, Ciudad de México Thomson Reuters Full-time

Senior Site Reliability Engineer (SRE). Are you passionate about the chance to bring your experience to a world-class company that is market-leading for both content and technology? If yes, we're looking for you. Join our team. The Senior Site Reliability Engineer (SRE) will implement Site Reliability Engineering and DevOps best practices. Feed non-functional...
Site Reliability Engineer

1 week ago

Ciudad de México, Ciudad de México itD Tech Full-time

itD is seeking a Site Reliability Engineer who will report to the Sr. Engineering Manager for a client in the gaming and entertainment space. As a Site Reliability Engineer, you will focus on designing, deploying, and operating resilient, secure, and globally scalable services in AWS, with , TypeScript, Kubernetes, GitLab, Argo CD (CI/CD). This long-term W2...
Site Reliability Engineer

2 weeks ago

Ciudad de México, Ciudad de México Sur Full-time

As the Site Reliability Engineer you will support and scale the infrastructure powering their secure, mission-critical SaaS platform. You must be confident in operating and debugging both modern infrastructure (cloud-native, containerized services) and classic Windows production environments (IIS, SQL Server AlwaysOn, Service Broker), with the ability to...
Site Reliability Engineer

2 weeks ago

Ciudad de México, Ciudad de México Tech Mahindra Full-time

We're Hiring. We are seeking a talented Site Reliability Engineer (SRE) CDMX with robust experience in Azure environments, Kubernetes, and DevOps practices. Your mission will be to ensure the reliability, scalability, and automation of our critical platforms. If you thrive on solving complex challenges, automating processes, and ensuring seamless operations,...
Civil Engineer

2 weeks ago

Ciudad de México, Ciudad de México Job Bridge Global Full-time

Job Title: Civil Engineer – Concrete Specialism. Location: Idaho, USA. Employment Type: Full-time. Visa Sponsorship: TN Visa (for eligible candidates). Salary: To be discussed during the hiring process. Note: This job is based in the USA. All visa and relocation costs are covered by the employer. You must be able to effectively read, write and communicate in...
Site Reliability Engineer

2 weeks ago

Ciudad de México, Ciudad de México Encora Full-time

Important Information: Years of Experience: 5+ years. Job Mode: Full-time. Work Mode: Remote within Mexico. Job Summary: We are seeking a Site Reliability Engineer to ensure the reliability, scalability, and performance of custom platforms running on AWS infrastructure and Kubernetes. This role focuses on Tier 3 issue resolution, operational readiness for new...
FBS Site Reliability Engineer

2 weeks ago

Ciudad de México, Ciudad de México Capgemini Full-time

Our Client is one of the United States' largest insurers, providing a wide range of insurance and financial services products with gross written premiums well over US$25 Billion (P&C). They proudly serve more than 10 million U.S. households with more than 19 million individual policies across all 50 states through the efforts of over 48,000 exclusive and...
Remote Civil Engineer

2 weeks ago

Ciudad de México, Ciudad de México Uptalent Full-time

is currently hiring a Remote Civil Engineer with expertise in Site Plan Design and Grading to join our team. As a global platform that connects top talent with leading companies, is committed to providing exceptional service to our clients. In this role, you will have the opportunity to work for the most exciting Civil Engineering companies in the U.S. The...
Site Reliability Engineer

2 weeks ago

Ciudad de México, Ciudad de México Felix Technologies, Inc. Full-time

About Us: At Félix, we're building the financial ecosystem for Latin immigrants in the U.S., starting with a revolution in remittances. Our core product is an AI-powered chatbot built on WhatsApp, allowing our users to send money home as easily as sending a text message. We leverage cutting-edge technology like AI, blockchain, and stablecoins to make...
