Engineer - Observability
hace 3 semanas
Rockwell Automation is a global technology leader focused on helping the world's manufacturers be more productive, sustainable, and agile.With more than 28,000 employees who make the world better every day, we know we have something special.Behind our customers - amazing companies that help feed the world, provide life-saving medicine on a global scale, and focus on clean water and green mobility - our people are energized problem solvers that take pride in how the work we do changes the world for the better.We welcome all makers, forward thinkers, and problem solvers who are looking for a place to do their best work.And if that's you we would love to have you join us**Job Description**:Rockwell Automation is a global technology leader focused on helping the world's manufacturers be more productive, sustainable, and agile.With more than 28,000 employees who make the world better every day, we know we have something special.Behind our customers - amazing companies that help feed the world, provide life-saving medicine on a global scale, and focus on clean water and green mobility - our people are energized problem solvers that take pride in how the work we do changes the world for the better.We welcome all makers, forward thinkers, and problem solvers who are looking for a place to do their best work.And if that's you we would love to have you join usSr Engineer - Observability**Executive Summary****Key Responsibilities**:- Analyzes, designs, programs, debugs, and modifies observability tools and interfaces.- Code may be used to enrich and correlate telemetry from many data sources in order to isolate events that indicate future or immediate IT availability issues.- Will interact with users to define system requirements and/or necessary modifications.- Design and Implement Observability Solutions: Develop and implement comprehensive observability solutions utilizing industry-standard tools and technologies such as Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana), Jaeger, and Open Telemetry.- Distributed Tracing: Implement distributed tracing techniques to trace and visualize the flow of requests across microservices architectures.Utilize tracking data to identify performance bottlenecks and optimize system performance.- Performance Analysis and Optimization: Analyze system performance metrics and identify opportunities for optimization.Collaborate with development teams to implement performance improvements and ensure scalability of systems.- Incident Response and Post-Mortems: Actively participate in incident response activities, providing expertise in diagnosing and resolving complex issues.Conduct thorough post-incident reviews to identify root causes and recommend preventive measures.- Documentation and Knowledge Sharing: Document observability best practices, standards, and procedures.Share knowledge and insights with team members through presentations, workshops, and documentation to foster a culture of continuous learning and improvement.- Cross-Functional Collaboration: Collaborate with cross-functional teams including DevOps, SRE, and software engineering to drive observability initiatives and ensure alignment with organizational goals and objectives.**Qualifications**:- Bachelor's or Master's degree in Computer Science, Information Technology, or related field.- 2+ years of experience in software engineering, with a focus on observability, monitoring, and/or site reliability engineering.- 1-2 years of experience with one or more of the following: Application Performance Management APM, Monitoring / Alerting, New Relic, DynaTrace, AppDynamics, Zabbix, Big Panda and ServiceNow.- Proficiency in designing and implementing observability solutions using tools such as Prometheus, Grafana, ELK Stack, Jaeger, and OpenTelemetry.- Strong understanding of distributed systems, microservices architectures, and cloud computing platforms (e.g., AWS, Azure, GCP).- Experience with containerization technologies such as Docker and Kubernetes.- Ideally 2+ years of development experience with programming languages such as C#,.NET or JavaScript.- Excellent analytical and problem-solving skills, with a strong attention to detail.- Effective communication and collaboration skills, with the ability to work across teams and influence stakeholders.- Experience working in an Agile/Scrum environment is preferred.LI-PT2LI-remote
-
Sr Engineer
hace 3 semanas
Xico, México Rockwell Automation A tiempo completoRockwell Automation is a global technology leader focused on helping the world's manufacturers be more productive, sustainable, and agile.With more than 28,000 employees who make the world better every day, we know we have something special.Behind our customers - amazing companies that help feed the world, provide life-saving medicine on a global scale, and...
-
Tools Hosting
hace 3 semanas
Xico, México Ntt Data Services A tiempo completo**Req ID**: ******We are currently seeking a Tools Hosting & Observability Engineer to join our team in CDMX, Ciudad de México (MX-CMX), Mexico (MX).**Tools Hosting & Observability Engineer****Day to Day job Duties: (what this person will do on a daily/weekly basis)**- Making sure the hosted tools are available 24x7 in collaboration with current tools...
-
Software Engineer
hace 3 semanas
Xico, México Teradata Group A tiempo completoOur CompanyAt Teradata, we believe that people thrive when empowered with better information. That's why we built the most complete cloud analytics and data platform for AI. By delivering harmonized data, trusted AI, and faster innovation, we uplift and empower our customers—and our customers' customers—to make better, more confident decisions. The...
-
Site Reliability Engineer
hace 3 días
Xico, México Coderoad Inc A tiempo completoOverviewSenior Site Reliability Engineer / Observability Engineer At CodeRoad, we're more than just a software development company—we're your gateway to the global tech world. We offer end-to-end software development services and give you the opportunity to work on exciting, real-world projects in a supportive environment. Whether it's staff augmentation,...
-
Site Reliability Engineer
hace 2 días
Xico, México Coderoad Inc A tiempo completoOverviewSenior Site Reliability Engineer / Observability Engineer At CodeRoad, we're more than just a software development company—we're your gateway to the global tech world. We offer end-to-end software development services and give you the opportunity to work on exciting, real-world projects in a supportive environment. Whether it's staff augmentation,...
-
Dynatrace Automation
hace 3 semanas
Xico, México Zurich Insurance Group A tiempo completoA leading insurance firm is seeking a Dynatrace Automation Engineer in Mexico City.This role involves monitoring systems, providing technical recommendations, and collaborating with teams to enhance service availability.Ideal candidates have strong troubleshooting skills, experience with various monitoring tools, and a relevant academic...
-
Automation & Virtualization Sre Engineer
hace 3 semanas
Xico, México Mastercard A tiempo completoA leading financial services firm seeks a Site Reliability Engineer (Automation & virtualization) to enhance infrastructure reliability and performance.This role involves managing VMware ESXi hypervisors, implementing automation and observability, and collaborating across teams.Candidates should have over 5 years of SRE or related experience with strong...
-
Senior DevSecOps Engineer: Cloud-Native CI/CD
hace 3 semanas
Xico, México Ford De México A tiempo completoA leading automotive company in Mexico is seeking an experienced Site Reliability Engineer. The role involves architecting and automating CI/CD pipelines using Google Cloud Platform, managing infrastructure with Terraform, and ensuring system reliability and performance. The ideal candidate should have expertise in scripting and strong problem-solving...
-
Senior DevSecOps Engineer: Cloud-Native CI/CD
hace 3 semanas
Xico, México Ford De México A tiempo completoA leading automotive company in Mexico is seeking an experienced Site Reliability Engineer. The role involves architecting and automating CI/CD pipelines using Google Cloud Platform, managing infrastructure with Terraform, and ensuring system reliability and performance. The ideal candidate should have expertise in scripting and strong problem-solving...
-
Senior Platform Engineer: Architecture
hace 3 semanas
Xico, México Baubap A tiempo completoA tech company in Mexico City is seeking a Senior Software Engineer to enhance their platform with robust technical solutions.The ideal candidate will have over 5 years of experience in backend engineering, proficiency in a backend language like Python or Node.js, and a deep understanding of AWS and microservices.This full-time role offers competitive...