Sre - Site Reliability Engineer
hace 2 semanas
Fecha de publicación: 05 Agosto 2025- Lugar:- México - Remote- Skills:**MINIMUM QUALIFICATIONS**- 1+ years of experience in Site Reliability Engineering practices required- 3+ years of experience in DevOps related field required- 3+ years of experience with automation tools and Continuous Integration systems (e.g., Bamboo, Chef, Git) required- 2+ years of experience with Kubernetes & Docker required- **English level: Advanced****PREFERRED QUALIFICATIONS**- Strong focus on Innovation (process & technology)- Experience working with business partners and vendors.- Coordination and planning with project management on resource allocations.- Focus on a more wholistic project level and less focus on the day-to-day interactions unless certain details are needed.- Focus on project health and roadblock remediation.- Strong Soft Skills (negotiate, communicate, listen, solve problems, customer-first mindset, flexibility).- Advanced Infrastructure Knowledge (Data Center, Cloud, infrastructure as code, Containers (Docker & Kubernetes)).- Security Awareness.- Advanced Knowledge of modern DevOps Stack Tools example but not limited to: Chef, Bamboo, Bitbucket, Containers (Docker & Kubernetes), and AWS Expertise).- Big Picture Thinking.- Network Awareness.- Practice incident response and blameless postmortems.- Actividades:**ESSENTIAL DUTIES AND RESPONSIBILITIES**- Self-starter mentality: take ownership of the problems or tasks, drive solutions, and continuously improve our processes- Data driven.- Strong focus on sharing what is working well, what is not, and what plans we have to improve the areas that are not working well.- Responsible for establishing end-to-end monitoring and alerting all critical aspects of supported pipelines.- Identify low hanging fruits.- Manage and troubleshoot our aws eks clusters, ensuring reliability and performance.- Strong focus on improving team practices.- Ensure technical solutions meet quality, security, and compliance requirements; work directly with key stakeholders and technical teams to ensure solutions have passed the required quality checks.- Partner with other SREs on configuration management at scale.- Work with software engineers (SWEs) in product development and SREs to define all the steps required to release software: how the software is stored in the source code repository; how artifacts are stored in artifact management repositories; how middleware is configuration and patched; enforcing build rules for compilation; automated testing and establishing quality gates; packaging; and deployment processes.- Proactively work on toil reduction, efficiency, and capacity planning.- Promote a culture of shared responsibility.- Deseable:- Beneficios:- Life insurance. 24 salary Months natural death / 48 salary months accidental death- Major Medical Expenses Insurance (For employee and dependents spouse and children)- Minor Medical Expense Insurance. 4,000 MXN for single people, 6,000MXN for family, (per year through reimbursement)- Savings Fund 13% capped by law (Max 4,449 MXN Monthly)- Pantry vouchers 10% capped by law (Max 3,439 MXN Monthly)- 30 days Christmas bonus- 12 days of vacation in the first year, increasing according to law.- 25% vacation bonus as per law**H
-
Senior Site Reliability Engineer
hace 3 semanas
Desde casa, México EPAM Systems, Inc. A tiempo completoWe are seeking an experienced **Senior Site Reliability Engineer**to join our team.As a key member of the Reliability Tooling team, you will be responsible for writing and reviewing code, contributing to critical technical decisions, and mentoring engineers within your squad. This role requires a deep understanding of SRE principles and best practices, as...
-
Site Reliability Engineer
hace 2 semanas
Desde casa, México Right Balance A tiempo completo**Overview** We're looking for a Site Reliability Engineer. Headquartered in Los Angeles, California, Right Balance provides top-tier technology talent for innovative companies in the US. We’re in the top 50 companies to watch in LA. **Engagement Details** Our client is a USA-based company producing video solutions with the mission to advance scientific...
-
Lead Site Reliability Engineer
hace 3 semanas
Desde casa, México EPAM Systems, Inc. A tiempo completoWe are looking for an experienced **Lead Site Reliability Engineer**to join our team.In this role, you will play a pivotal part in the Reliability Tooling team, taking responsibility for writing and reviewing code, making key technical decisions, and mentoring engineers within your squad. This position requires a strong grasp of SRE principles and best...
-
Sr Sre
hace 2 semanas
Desde casa, México Ryscode A tiempo completoJob Description: Required Technologies AWS, Docker, Kubernetes, Prometheus, JavaScript, Rust, Python On-Call Duties: Participate in on-call rotations to provide 24/7 support Be available to respond to critical incidents outside regular working hours. The Site Reliability Engineer (SRE) will support the maintenance and improvement of system reliability,...
-
Site Reliability Engineer with Java
hace 1 semana
Desde casa, México EPAM Systems A tiempo completo**DESCRIPTION**: Join EPAM as a remote **Site Reliability Engineer specializing in Java.** In this role, you'll provide 24/7 on-call support for Java backend services, prepare and deploy patches, and assist in establishing top-of-the-line metrics and dashboards. If you have 5-8 years of experience as a DevOps/SRE, proficiency in Java, and experience with...
-
Site Reliability Engineer
hace 3 semanas
Desde casa, México Synechron A tiempo completoSynechron is a self-funded, leading digital transformation Consulting firm focused on the financial services industry working to accelerate digital initiatives for Banks, Asset Managers and Insurance. We achieve this by providing our clients with innovative solutions that solve their most complex business challenges and combining Synechron’s unique,...
-
Site Reliability Engineer
hace 2 semanas
Desde casa, México EPAM Systems, Inc. A tiempo completoJoin our team as a **Site Reliability Engineer,** where you will focus on cloud infrastructure, containerization, and monitoring using Kubernetes and Microsoft Azure.**Responsibilities**- Deploy and maintain Kubernetes resource manifests in clusters such as Kind, GKE, or AKS- Troubleshoot and analyze logs to identify and resolve system events and issues-...
-
Senior Site Reliability Engineer
hace 3 semanas
Desde casa, México Zillow A tiempo completo**About the role**:As a member of the FUB+ Infrastructure & Security team you will architect, develop and deploy systems, processes and environments that support numerous services developed by engineers within the FUB+ engineering organization. You will help us deliver the most reliable and performant experience for our customers and keep our existing...
-
Senior Site Reliability Engineer
hace 2 semanas
Desde casa, México EPAM Systems, Inc. A tiempo completoJoin our team as a **Senior Site Reliability Engineer**, where you will maintain and improve our product monitoring system, manage incident responses, and facilitate collaboration between operations and development teams. **Responsibilities** - Maintain and improve the product monitoring system - Manage incident response including troubleshooting,...
-
Site Reliability Engineer Remote
hace 5 días
Desde casa, México Property Leads A tiempo completo**Location**: Remote (must work US time zone hours)**Language Requirement**: Fluent in spoken and written English**Technical requirement**: MongoDB experience**Employment Type**: Full-Time**About Property Leads**:Property Leads is a fast-growing, high-velocity lead generation company helping professionals acquire motivated, inbound leads through cutting-edge...