Lead Systems Engineer

hace 3 semanas


Desde casa, México EPAM Systems, Inc. A tiempo completo

We are seeking a **Lead Systems Engineer** to guide a team of Support/SRE engineers in managing and maintaining critical platforms such as Backstage, Grafana, and Kubernetes.As a hands-on leader, you will provide technical expertise, mentor team members, and ensure operational excellence while driving innovation and efficiency in our infrastructure. This position includes a scheduled weekend on-call duty once per month, compensated separately.**Responsibilities**- Lead and mentor a team of Support/SRE engineers working on Backstage, Grafana, and Kubernetes platforms- Act as the technical authority, offering architectural direction, troubleshooting expertise, and best practices- Ensure system performance, scalability, and reliability through effective operations management- Collaborate with cross-functional teams to integrate systems and align them with organizational goals- Drive SRE principles, including incident management, root cause analysis, and process improvement- Ensure robust observability practices with Grafana, including dashboard development, alert configuration, and tool integrations- Manage and scale Kubernetes environments (e.g., Amazon EKS) to meet performance and resource needs- Oversee and optimize Backstage as a developer portal, ensuring usability and reliability- Define and implement SLOs, SLIs, and SLAs for system performance and availability- Automate repetitive tasks to improve operational efficiency and reduce manual workload- Cultivate a culture of accountability, learning, and collaboration within the team- Keep pace with industry trends and emerging technologies, proposing relevant enhancements to existing systems- Participate in one weekend on-call duty per month (paid separately)**Requirements**:- 5+ years of experience in Cloud Operations- Expertise in managing Backstage platforms, including configuration and customization- Proven experience in leading and mentoring technical teams- Knowledge of observability toolsets (e.g., LGTM stack) for monitoring, visualization, and alerting- Hands-on experience with Kubernetes platforms (e.g., Amazon EKS, OpenShift) and containerized environments- Solid understanding of cloud platforms such as AWS and Azure, as well as infrastructure management- Proficiency in scripting and automation with tools like Python or Bash- Experience with CI/CD pipelines and Infrastructure as Code (IaC) tools (e.g., Terraform, CloudFormation, Ansible)- Familiarity with Linux/Unix systems and system-level troubleshooting- Problem-solving excellence with a proactive, detail-oriented mindset- Strong communication skills to effectively interact with diverse technical and non-technical teams**Nice to have**- Background in leading teams in enterprise-scale or managed services environments- Familiarity with incident management tools like PagerDuty or Opsgenie- Knowledge of microservices architecture, distributed systems, and load balancing**We offer**- Career plan and real growth opportunities- Unlimited access to LinkedIn learning solutions- International Mobility Plan within 25 countries- Constant training, mentoring, online corporate courses, eLearning and more- English classes with a certified teacher- Support for employee’s initiatives (Algorithms club, toastmasters, agile club and more)- Enjoyable working environment (Gaming room, napping area, amenities, events, sport teams and more)- Flexible work schedule and dress code- Collaborate in a multicultural environment and share best practices from around the globe- Hired directly by EPAM & 100% under payroll- Law benefits (IMSS, INFONAVIT, 25% vacation bonus)- Major medical expenses insurance: Life, Major medical expenses with dental & visual coverage (for the employee and direct family members)- 13 % employee savings fund, capped to the law limit- Grocery coupons- 30 days December bonus- Employee Stock Purchase Plan- 12 vacations days plus 4 floating days- Official Mexican holidays, plus 5 extra holidays (Maundry Thursday and Friday, November 2nd, December 24th & 31st)- Monthly non-taxable amount for the electricity and internet billsEPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.


  • Lead Systems Engineer

    hace 1 hora


    Desde casa, México EPAM Systems, Inc. A tiempo completo

    We are seeking a **Lead Systems Engineer** who will be instrumental in coaching team members, managing tasks, and improving platform automation. This role requires someone with a strong background in development, particularly in Site Reliability Engineering, and is designed for someone passionate about reducing toil through automation using various script...

  • Lead Software Engineer

    hace 2 semanas


    Desde casa, México EPAM Systems, Inc. A tiempo completo

    We are looking for a **Lead Software Engineer** with exceptional skills to design, develop, and lead advanced AI-powered and agent-driven systems.This role focuses on leveraging state-of-the-art LLM tools, guiding team members, and defining the technical vision for groundbreaking projects in a dynamic and collaborative setting.**Responsibilities**- Lead the...

  • Chief Systems Engineer

    hace 3 semanas


    Desde casa, México EPAM Systems, Inc. A tiempo completo

    We are seeking a **Chief Systems Engineer** to lead a team of Support/SRE engineers in managing critical platforms, including Backstage, Grafana, and Kubernetes.This role combines technical leadership with hands-on involvement, mentoring team members, and driving operational excellence. The position requires participating in a scheduled weekend on-call duty...


  • Desde casa, México EPAM Systems, Inc. A tiempo completo

    We are seeking an experienced **Lead Database Engineer** to spearhead our AWS Cost Optimization team. In this pivotal role, you will be responsible for optimizing data storage, access, and retrieval in AWS environments and leading a team of engineers. As a Lead Database Engineer, your leadership and technical skills will play a critical role in optimizing...

  • Lead DevOps Engineer

    hace 2 horas


    Desde casa, México EPAM Systems, Inc. A tiempo completo

    We are looking for a seasoned **Lead DevOps Engineer**to spearhead the development, deployment, and optimization of cutting-edge software workflow automation systems. In this leadership role, you will oversee a team of DevOps engineers, collaborate with a world-class group of professionals, and drive the modernization and scaling of end-to-end SW processes...

  • Lead Data Engineer

    hace 4 semanas


    Desde casa, México EPAM Systems, Inc. A tiempo completo

    We are looking for a highly skilled **Lead Data Engineer**to join our team.In this role, you will lead technical efforts for NoSQL-related projects, evaluate database platform technologies, and provide expert analysis and recommendations based on business requirements. You will collaborate closely with the engineering team to design and implement best...


  • Desde casa, México EPAM Systems, Inc. A tiempo completo

    We are looking for a **Lead Machine Learning Engineer** to spearhead the design, implementation, and optimization of ML-based systems that recommend user-generated content intelligently while maximizing engagement. You will lead the development of robust machine learning pipelines and oversee high-performance deployments optimized for real-time data...


  • Desde casa, México EPAM Systems A tiempo completo

    We seek a seasoned **Senior Okta Systems Engineer** to join our client’s team in an independent capacity.The position entails the installation, integration, and deployment of Okta solutions, the design of technical architectures, and the provision of expertise in identity and access management systems.RESPONSIBILITIES- Conduct installation, integration,...

  • Senior Systems Engineer

    hace 3 semanas


    Desde casa, México EPAM Systems A tiempo completo

    We are actively seeking a highly experienced and skilled **Senior Systems Engineer** to become a key part of our dynamic team.This position is well-suited for a technical expert adept at managing complex systems and infrastructure effectively to meet the evolving needs of our growing organization.RESPONSIBILITIES- Implement and maintain CI/CD pipelines using...

  • Senior Systems Engineer

    hace 3 semanas


    Desde casa, México EPAM Systems A tiempo completo

    We are seeking a highly experienced and skilled **Senior Systems Engineer** to join our dynamic team.This role is ideal for a technical expert versed in managing complex systems and infrastructures efficiently to support our growing organization's needs.RESPONSIBILITIES- Implement and maintain CI/CD pipelines using Jenkins, Rundeck, GitLab, Artifactory, and...