Senior Elastic Platform Engineer
hace 1 semana
We are seeking a highly skilled **Senior Elastic Platform Engineer** to join our team, responsible for supporting, enhancing, and maintaining our Elastic & Observability Platform deployed across GCP and Elastic Cloud. This role will involve developing innovative solutions, maintaining platform reliability, and enabling self-service capabilities to empower platform consumers while participating in an on-call rotation to oversee platform health and functionality.
**Responsibilities**
- Ensure availability, functionality, performance, and security of observability and search platforms to meet business SLAs
- Respond to incidents and resolve escalations promptly during on-call periods
- Maintain platform documentation, standard operating procedures, and operational guidelines
- Collaborate with stakeholders and vendors to manage operational requirements, installations, and upgrades
- Enhance platform features and self-service capabilities, including Elastic Synthetics and chargeback automation
- Design proof-of-concepts for operational improvements like AI-driven observability or Kubernetes migration
- Build, deploy, and maintain Elastic clusters using Infrastructure-as-Code (IaC) tools like Terraform and Ansible
- Perform platform lifecycle management activities such as component upgrades, capacity planning, and cost optimisation
- Fine-tune ELK stack performance across ingestion, indexing, and query layers
- Configure and manage comprehensive alerting and incident management workflows, including Kibana Rules, Watchers, and PagerDuty
- Support ingestion, enrichment, backup, and restoration of platform data
- Plan and manage SSL certificate rotations and cluster scalability requirements
**Requirements**:
- 3+ years of experience in Operational Intelligence
- Proven expertise in implementing, operating, and managing Elastic clusters
- Knowledge of Elastic Stack components, including Elasticsearch, Kibana, and Logstash
- Proficiency in Infrastructure-as-Code (IaC) tools such as Terraform and Ansible, with flexibility to use Jenkins CI
- Skills in Python for automation and extending platform functionality
- Understanding of incident management workflows with tools like PagerDuty and Uptrends
- Background in troubleshooting and resolving complex platform issues efficiently
- Competency in managing scalable, fault-tolerant platforms with a focus on performance and security
- Strong communication skills in English (B2 level or higher) for collaborating with technical and non-technical stakeholders
**Nice to have**
- Familiarity with additional tools such as Groovy, Linux Administration, and Jenkins CI pipelines
- Capability to optimise observability workflows using advanced integrations in Uptrends and PagerDuty
- Showcase of previous work with Elastic Synthetics for advanced monitoring and testing
**We offer**
- Career plan and real growth opportunities
- Unlimited access to LinkedIn learning solutions
- Constant training, mentoring, online corporate courses, eLearning and more
- English classes with a certified teacher
- Support for employee’s initiatives (Algorithms club, toastmasters, agile club and more)
- Enjoyable working environment (Gaming room, napping area, amenities, events, sport teams and more)
- Flexible work schedule and dress code
- Collaborate in a multicultural environment and share best practices from around the globe
- Hired directly by EPAM & 100% under payroll
- Law benefits (IMSS, INFONAVIT, 25% vacation bonus)
- Major medical expenses insurance: Life, Major medical expenses with dental & visual coverage (for the employee and direct family members)
- 13 % employee savings fund, capped to the law limit
- Grocery coupons
- 30 days December bonus
- Employee Stock Purchase Plan
- 12 vacations days
- Official Mexican holidays, plus 5 extra holidays (Maundry Thursday and Friday, November 2nd, December 24th & 31st)
- Monthly non-taxable amount for the electricity and internet bills
EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.
-
Senior AI Platform Engineer
hace 20 minutos
Desde casa, México EPAM Systems A tiempo completoJoin our team as a Senior AI Platform Engineer, where you will design, deploy, and maintain next-generation Databricks platforms on AWS to support advanced analytics and machine learning workflows.You will collaborate closely with data scientists and ML engineers to deliver a seamless developer experience on the Lakehouse. Apply now to contribute to...
-
Senior Operational Intelligence Developer
hace 3 semanas
Desde casa, México EPAM Systems, Inc. A tiempo completoWe are seeking a highly skilled **Senior Operational Intelligence Developer**to join our team, responsible for supporting, enhancing, and maintaining our Elastic & Observability Platform deployed across GCP and Elastic Cloud. This role will involve developing innovative solutions, maintaining platform reliability, and enabling self-service capabilities to...
-
Data Platform Engineer
hace 4 semanas
Desde casa, México Bhuvi IT Solutions A tiempo completo**#We_Are_Hiring for #Data_Platform_Engineer**Job Title**:Data Platform Engineer**Location: Remote (Mexico)Type: Long-term Contract**Key Skills: AWS, Snowflake, Kafka, Data Pipelines, Communication****Job Overview**:We’re looking for a skilled Data Platform Engineer to design, build, and manage scalable data pipelines and infrastructure using AWS,...
-
Platform Engineer
hace 2 semanas
Desde casa, México STEFANINI LATAM A tiempo completo**Platform Engineer - Global Project | Contractor | Stefanini** Stefanini is looking for a skilled **Platform Engineer** to join a global initiative for a multinational client in the **manufacturing and coatings** industry. This is a **contractor position** where you’ll collaborate with an international team driving cloud innovation and...
-
Senior AI Platform Engineer
hace 18 minutos
Desde casa, México EPAM Systems A tiempo completoJoin our team as a Senior AI Platform Engineer, where you will design, deploy, and maintain next-generation Databricks platforms on AWS to support advanced analytics and machine learning workflows.You will collaborate closely with data scientists and ML engineers to deliver a seamless developer experience on the Lakehouse. Apply now to contribute to...
-
Senior Ai Platform Engineer
hace 2 semanas
Desde casa, México EPAM Systems, Inc. A tiempo completoWe are looking for a highly skilled **Senior AI Platform Engineer** to design and maintain advanced systems that equip stream-aligned teams with scalable, secure, and reliable AI-powered solutions. You will closely collaborate with diverse stakeholders to optimize software delivery and operationalize machine learning models for production, all while...
-
Senior Cloud Software Engineer
hace 2 semanas
Desde casa, México EPAM Systems, Inc. A tiempo completoWe are looking for an experienced **Senior Cloud Software Engineer** with proficiency in Kubernetes and AWS to join our team.In this position, you will help build, scale, and manage a dependable DevOps service mesh platform designed for cloud infrastructure deployment. If you excel in designing innovative solutions and value collaboration, we would love to...
-
Senior Software Engineer
hace 1 semana
Desde casa, México Meltwater Group A tiempo completo**Description** What We’re Looking For: We're seeking a Senior Software Engineer with 5+ years of experience who thrives in multi-technology environments. You should be a versatile engineer with proven expertise across multiple tech stacks, particularly Python and Node.js, who can rapidly adapt to new technologies and drive architectural decisions. This...
-
Senior Site Reliability Engineer
hace 3 semanas
Desde casa, México EPAM Systems, Inc. A tiempo completoWe are seeking an experienced **Senior Site Reliability Engineer**to join our team.As a key member of the Reliability Tooling team, you will be responsible for writing and reviewing code, contributing to critical technical decisions, and mentoring engineers within your squad. This role requires a deep understanding of SRE principles and best practices, as...
-
Platform Engineer
hace 3 semanas
Desde casa, México Sequoia Connect A tiempo completoOur client represents the connected world, offering innovative and customer-centric information technology services and solutions, enabling Enterprises, Associates and the Society to Rise. Our client is a USD 4.0 billion company with 107,100+ professionals across 90 countries, helping over 800 global customers including Fortune 500 companies.Its innovation...