Lead Systems Engineer
3 days ago
We are seeking a **Lead Systems Engineer** to guide a team of Support/SRE engineers in managing and maintaining critical platforms such as Backstage, Grafana, and Kubernetes. As a hands-on leader, you will provide technical expertise, mentor team members, and ensure operational excellence while driving innovation and efficiency in our infrastructure. This position includes a scheduled weekend on-call duty once per month, compensated separately.

**Responsibilities**

- Lead and mentor a team of Support/SRE engineers working on Backstage, Grafana, and Kubernetes platforms
- Act as the technical authority, offering architectural direction, troubleshooting expertise, and best practices
- Ensure system performance, scalability, and reliability through effective operations management
- Collaborate with cross-functional teams to integrate systems and align them with organizational goals
- Drive SRE principles, including incident management, root cause analysis, and process improvement
- Ensure robust observability practices with Grafana, including dashboard development, alert configuration, and tool integrations
- Manage and scale Kubernetes environments (e.g., Amazon EKS) to meet performance and resource needs
- Oversee and optimize Backstage as a developer portal, ensuring usability and reliability
- Define and implement SLOs, SLIs, and SLAs for system performance and availability
- Automate repetitive tasks to improve operational efficiency and reduce manual workload
- Cultivate a culture of accountability, learning, and collaboration within the team
- Keep pace with industry trends and emerging technologies, proposing relevant enhancements to existing systems
- Participate in one weekend on-call shift per month (paid separately)

**Requirements**

- 5+ years of experience in Cloud Operations
- Expertise in managing Backstage platforms, including configuration and customization
- Proven experience in leading and mentoring technical teams
- Knowledge of observability toolsets (e.g., LGTM stack) for monitoring, visualization, and alerting
- Hands-on experience with Kubernetes platforms (e.g., Amazon EKS, OpenShift) and containerized environments
- Solid understanding of cloud platforms such as AWS and Azure, as well as infrastructure management
- Proficiency in scripting and automation with tools like Python or Bash
- Experience with CI/CD pipelines and Infrastructure as Code (IaC) tools (e.g., Terraform, CloudFormation, Ansible)
- Familiarity with Linux/Unix systems and system-level troubleshooting
- Excellent problem-solving skills with a proactive, detail-oriented mindset
- Strong communication skills to interact effectively with diverse technical and non-technical teams

**Nice to have**

- Background in leading teams in enterprise-scale or managed services environments
- Familiarity with incident management tools like PagerDuty or Opsgenie
- Knowledge of microservices architecture, distributed systems, and load balancing

**We offer**

- Career plan and real growth opportunities
- Unlimited access to LinkedIn learning solutions
- International Mobility Plan within 25 countries
- Constant training, mentoring, online corporate courses, eLearning, and more
- English classes with a certified teacher
- Support for employee initiatives (algorithms club, Toastmasters, agile club, and more)
- Enjoyable working environment (gaming room, napping area, amenities, events, sports teams, and more)
- Flexible work schedule and dress code
- Collaboration in a multicultural environment, sharing best practices from around the globe
- Direct hire by EPAM, 100% on payroll
- Benefits required by law (IMSS, INFONAVIT, 25% vacation bonus)
- Insurance: life and major medical expenses, with dental and vision coverage (for the employee and direct family members)
- 13% employee savings fund, capped at the legal limit
- Grocery coupons
- 30-day December bonus
- Employee Stock Purchase Plan
- 12 vacation days plus 4 floating days
- Official Mexican holidays, plus 5 extra holidays (Maundy Thursday and Good Friday, November 2nd, December 24th and 31st)
- Monthly non-taxable allowance for electricity and internet bills

EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.
-
Lead Systems Engineer
6 days ago
Remote, Mexico · EPAM Systems, Inc. · Full-time
We are looking for a **Lead Systems Engineer** to lead modernization and migration initiatives while delivering scalable, secure cloud-based solutions. This role is critical in managing AWS infrastructure, enhancing operational standards, and promoting team collaboration for seamless system and process integration. **Responsibilities** - Deploy AWS...
-
Lead Systems Engineer
4 days ago
Remote, Mexico · EPAM Systems, Inc. · Full-time
We are seeking a **Lead Systems Engineer** to guide a team of Support/SRE engineers in managing and maintaining critical platforms such as Backstage, Grafana, and Kubernetes. As a hands-on leader, you will provide technical expertise, mentor team members, and ensure operational excellence while driving innovation and efficiency in our infrastructure. This...
-
Lead Systems Engineer
7 days ago
Remote, Mexico · EPAM Systems, Inc. · Full-time
We are seeking a **Lead Systems Engineer** who will be instrumental in coaching team members, managing tasks, and improving platform automation. This role requires someone with a strong background in development, particularly in Site Reliability Engineering, and is designed for someone passionate about reducing toil through automation using various script...
-
Chief Systems Engineer
4 days ago
Remote, Mexico · EPAM Systems, Inc. · Full-time
We are seeking a **Chief Systems Engineer** to lead a team of Support/SRE engineers in managing critical platforms, including Backstage, Grafana, and Kubernetes. This role combines technical leadership with hands-on involvement, mentoring team members, and driving operational excellence. The position requires participating in a scheduled weekend on-call...
-
Chief Systems Engineer
3 days ago
Remote, Mexico · EPAM Systems, Inc. · Full-time
We are seeking a **Chief Systems Engineer** to lead a team of Support/SRE engineers in managing critical platforms, including Backstage, Grafana, and Kubernetes. This role combines technical leadership with hands-on involvement, mentoring team members, and driving operational excellence. The position requires participating in a scheduled weekend on-call duty...
-
Lead Database Engineer
7 days ago
Remote, Mexico · EPAM Systems, Inc. · Full-time
We are seeking an experienced **Lead Database Engineer** to spearhead our AWS Cost Optimization team. In this pivotal role, you will be responsible for optimizing data storage, access, and retrieval in AWS environments and leading a team of engineers. As a Lead Database Engineer, your leadership and technical skills will play a critical role in optimizing...
-
Senior Systems Engineer
6 days ago
Remote, Mexico · EPAM Systems, Inc. · Full-time
We are seeking a **Senior Systems Engineer** to drive modernization and migration efforts while implementing scalable, secure cloud solutions. This position is pivotal in managing AWS-focused infrastructure, developing operational excellence, and fostering collaboration across teams to enable seamless integration of systems and...
-
Lead DevOps Engineer
7 days ago
Remote, Mexico · EPAM Systems, Inc. · Full-time
We are looking for a seasoned **Lead DevOps Engineer** to spearhead the development, deployment, and optimization of cutting-edge software workflow automation systems. In this leadership role, you will oversee a team of DevOps engineers, collaborate with a world-class group of professionals, and drive the modernization and scaling of end-to-end SW processes...
-
Lead Data Engineer
4 days ago
Remote, Mexico · EPAM Systems, Inc. · Full-time
We are looking for a highly skilled **Lead Data Engineer** to join our team. In this role, you will lead technical efforts for NoSQL-related projects, evaluate database platform technologies, and provide expert analysis and recommendations based on business requirements. You will collaborate closely with the engineering team to design and implement best...
-
Lead Machine Learning Engineer
4 days ago
Remote, Mexico · EPAM Systems, Inc. · Full-time
We are looking for a **Lead Machine Learning Engineer** to spearhead the design, implementation, and optimization of ML-based systems that recommend user-generated content intelligently while maximizing engagement. You will lead the development of robust machine learning pipelines and oversee high-performance deployments optimized for real-time data...