Site Reliability Engineer

hace 6 meses


Guadalajara, México Capgemini Engineering A tiempo completo

At Capgemini Engineering, the world leader in engineering services, we bring together a global team of engineers, scientists, and architects to help the world’s most innovative companies unleash their potential. From autonomous cars to life-saving robots, our digital and software technology.

**Your Role**
- Identify and diagnose issues and problems.
- Categorize and record reported queries and provide solutions.
- Monitor issues from start to resolution.
- Escalate if needed, unresolved problems to a higher level of support.
- Receive and handles request for service, following agreed procedures.
- Logs incidents and service requests and maintains relevant records: identifies and classifies incident types and service interruptions, record incidents cataloging them by symptom and resolution.
- Trouble shoot and resolve Azure-related issues.
- Create and maintain documentation for Azure configurations, processes, and procedures.
- Utilize Datadog metrics and analytics to perform capacity planning and resource optimization.
- Collaborate with clients to understand their monitoring needs and provide customized Datadog solutions.
- Manage server configurations of Linux systems using Puppet.

**Your Profile**
- Bachelor´s degree in Computer Science, information technology, or related field.
- Excellent communication (written and spoken) skills - English.
- Experience in tools such as JIRA, Confluence, ServiceNow, other monitoring tools.
- Excellent problem-solving and analytical skills
- Comprehensive ability to prioritize and delegate.
- Experience working with Microsoft Azure Services.
- Basic scripting and automation skills (PowerShell, Azure CLI).
- Experience performing troubleshooting in Azure Services.
- Understanding of networking concepts and protocols.
- Hands-on knowledge on Datadog and good knowledge on any other monitoring tool.
- Proficient in Datadog configurations, dashboards, and alerting.
- Experience provisioning and managing Linux servers.
- Experience troubleshooting and resolving incidents using Datadog insights.
- Demonstrate strong understanding of infrastructure components, networking and cloud environments.
- Good hands-on knowledge of configuration management and deployment tools like Puppet.
- Experience running Puppet-backup restore commands to restore PE infrastructure.
- Experience managing and working with Puppet servers.

**WHAT YOU’LL LOVE ABOUT WORKING HERE?**
- Capgemini Employer Promise: Learning + Flexibility + Team Spirit + Inclusion + Innovation.
- Work from home: fully remote position.
- Get competitive benefits above the law.
- Build your future within a worldwide leader in ER&D projects.
- Feel free to grow within different industries and choose your career path.
- Be part of a great family of Engineers, and people all over Mexico and the world.Capgemini is a global leader in partnering with companies to transform and manage their business by harnessing the power of technology. The Group is guided everyday by its purpose of unleashing human energy through technology for an inclusive and sustainable future. It is a responsible and diverse organization of over 300,000 team members in nearly 50 countries. With its strong 50-year heritage and deep industry expertise, Capgemini is trusted by its clients to address the entire breadth of their business needs, from strategy and design to operations, fueled by the fast evolving and innovative world of cloud, data, AI, connectivity, software, digital engineering and platforms.



  • Guadalajara, México f5 A tiempo completo

    Everything we do centers around people. That means we obsess over how to make the lives of our customers, and their customers, better. And it means we prioritize a diverse F5 community where each individual can thrive. - Site Reliability Engineer III Why do you want to join our team? - Everything we do centers around people. That means we obsess over how to...


  • Guadalajara, México Finastra USA Corporation A tiempo completo

    **Responsibilities**: **What will you contribute?** As a Site Reliability Engineer your mission is to protect and advance the software & systems behind Finastra’s Cloud hosted services running on Fusion Operate. Finastra believes in a blameless culture where the primary objective is continuous improvement. You’ll be treating operations as a software...


  • Guadalajara, México f5 A tiempo completo

    Everything we do centers around people. That means we obsess over how to make the lives of our customers, and their customers, better. And it means we prioritize a diverse F5 community where each individual can thrive. Business/Job Title: Senior Site Reliability Engineer Position Summary Software engineering is a core discipline at F5 for many roles. As a...


  • Guadalajara, México C3 AI A tiempo completo

    We are looking for **Associate Site Reliability Engineer**/**Site Reliability Engineer** to join our team in Guadalajara, Mexico. **Responsibilities**: - Maximize system uptime and availability, ensuring functional and performance SLAs. - Establish end-to-end monitoring and alerting on all critical aspects. - Solve complex problems for critical services...

  • Infrastructure Engineer

    hace 4 semanas


    Guadalajara, Jalisco, México Broadridge A tiempo completo

    Broadridge fosters a culture where innovation meets reliability, empowering associates to drive scalable solutions.**Job Overview**We are seeking an experienced Infrastructure Engineer - Site Reliability to join our team. As a key member of our SRE group, you will be responsible for designing and implementing scalable and highly reliable software...


  • Guadalajara, México f5 A tiempo completo

    Everything we do centers around people. That means we obsess over how to make the lives of our customers, and their customers, better. And it means we prioritize a diverse F5 community where each individual can thrive. Business/Job Title: Site Reliability Engineer - IAM - III Position Summary: Software engineering is a core discipline at F5 for many...


  • Guadalajara, México Finastra A tiempo completo

    Your deliverables as a Site Reliability Engineer will include, but are not limited to, the following: - Work with containers and container orchestration systems such as Kubernetes - Capacity Planning to determine resource requirements of your service for it to be scalable, efficient, and reliable - Collaborate with other engineers to implement operational...


  • Guadalajara, México Wizeline A tiempo completo

    **The Company**: Wizeline is a global digital services company helping mid-size to Fortune 500 companies build, scale, and deliver high-quality digital products and services. We thrive in solving our customer’s challenges through human-centered experiences, digital core modernization, and intelligence everywhere (AI/ML and data). We help them succeed in...


  • Guadalajara, Jalisco, México Azka IT Consulting A tiempo completo

    Azka IT Consulting is seeking a highly skilled Site Reliability Specialist to join our team.The ideal candidate will have a strong background in automation, Unix, Linux, Ubuntu, and Windows, as well as experience with Oracle, MYSQL, and NOSQL Solutions.The Systems Operations Expert will be responsible for understanding system design and architecture,...

  • Site Reliability Engineer

    hace 4 semanas


    Guadalajara, Jalisco, México Tech Holding A tiempo completo

    About UsTech Holding is a full-service consulting firm that delivers predictable outcomes and high-quality solutions to clients. Our team has industry experience and holds senior positions in various companies, including emerging startups and large Fortune 50 firms.Our unique approach is supported by the principles of deep expertise, integrity, transparency,...


  • Guadalajara, México f5 A tiempo completo

    Everything we do centers around people. That means we obsess over how to make the lives of our customers, and their customers, better. And it means we prioritize a diverse F5 community where each individual can thrive. Position Summary Software engineering is a core discipline at F5 for many roles. As a software engineer specializing in site reliability,...


  • Guadalajara, Jalisco, México FICO A tiempo completo

    About FICOFICO is a leading global analytics software company, helping businesses in 100+ countries make better decisions.The OpportunityThe Site Reliability Engineer is a critical role that combines software development and systems engineering. As a full-stack support engineer, you will be responsible for managing complex distributed enterprise SaaS...


  • Guadalajara, México Finastra USA Corporation A tiempo completo

    **Responsibilities**: **What will you contribute?** As a Site Reliability Engineer your mission is to protect and advance the software & systems behind Finastra’s Cloud hosted services running on Fusion Operate. Finastra believes in a blameless culture where the primary objective is continuous improvement. You’ll be treating operations as a software...


  • Guadalajara, Jalisco, México F5 A tiempo completo

    F5 is a global leader in the technology industry, dedicated to delivering exceptional products and services that enhance the lives of our customers.We are seeking a highly skilled Site Reliability Engineer III to join our team. As a key member of our organization, you will play a critical role in ensuring the security and scalability of our systems by...


  • Guadalajara, Jalisco, México F5 A tiempo completo

    Job Summary:At F5, we're seeking a skilled Site Reliability Engineer II to join our Technology Services team. As a key member of our CEDI team, you'll be responsible for managing the systems used by developers throughout their SDLC lifecycle. This includes JIRA, Confluence, and other Atlassian tools. Your expertise in automation, scripting, and DevOps...

  • Reliability Engineer Lead

    hace 2 semanas


    Guadalajara, Jalisco, México Capgemini Engineering A tiempo completo

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Capgemini Engineering. In this role, you will be responsible for ensuring the reliability, availability, and scalability of our clients' digital resources.Main RequirementsTo be successful in this position, you will need:Bachelor's degree in Computer Science,...


  • Guadalajara, México Tech Holding A tiempo completo

    **About us**: Working at Tech Holding isn't just a job, it's an opportunity to be a part of something bigger. We are a full-service consulting firm that was founded on the premise of delivering predictable outcomes and high-quality solutions to our clients. Our founders and team members have industry experience and have held senior positions in a wide...


  • Guadalajara, México Azka IT Consulting A tiempo completo

    AZKA IT is a Mexican company that seeks and connects the best IT talent with Latin American and United States companies.We are looking for your talent as Site Reliability Engineer.Requirements:Mediator b/w development and application team, and good knowledge on automation Unix, Linux, Ubuntu, or Windows, Oracle, MYSQL, NOSQL Solutions.Key...


  • Guadalajara, México Encora A tiempo completo

    **Responsibilities** - Architect and implement observability solutions utilizing advanced cloud monitoring tools such as New Relic, Dynatrace, Splunk or equivalent, to provide comprehensive insights into system metrics, logs, and traces - Configure and customize monitoring dashboards, alerts, and metrics to enable real-time visibility into system health,...


  • Guadalajara, Jalisco, México F5 A tiempo completo

    About F5F5 is a company that puts people at the forefront of everything we do. We strive to make the lives of our customers and their customers better by delivering innovative solutions that drive business success.About the RoleThe Site Reliability Engineer III will be responsible for ensuring the reliability, availability, and scalability of critical...