Site Reliability Engineering Manager

hace 1 semana


Guadalajara, México Canonical - Jobs A tiempo completo

This is a world-class **devops engineering management** challenge, bringing together software engineering and product development, operations management, and team leadership in a single high-value role.

We work across the full stack, from bare metal to Kubernetes, including cloud and virtualisation. We also work across the full range of infrastructure, from public cloud to private cloud and edge. You will need to be a Linux and operations expert, as well as a great manager capable of leading a high-performance team, to excel in this role.

If you have an affinity for open source development and a passion for operations, software engineering, and new technology, then you will enjoy working with some of the best people in the industry at Canonical.

**Summary of role and responsibilities**:
The IS team at Canonical runs the services used by over 60 million Ubuntu users. We automate all of Canonical's production services with model-driven operations techniques and technology. We are part of Canonical's effort to raise the bar on ops technology, encapsulating real-world operational knowledge into reusable and composable software operations packages. We use our real-life operational experiences to contribute to product improvements.

From Kubernetes to the kernel and everything in-between, you'll be working with the latest technology in a fast-paced engineering environment. As an SRE Manager you will be responsible for the operations engineers in your time zone. This includes customer service management, managed services operations and consistent product improvement engineering. Collaboration with internal customers, product engineering, and development groups is critical to success.

**As an Engineering Manager in devops you will**:

- Lead your team in daily agile devops practices
- Optimise the quality and velocity of both development and operations
- Mentor engineers to improve their skills
- Identify and measure team health indicators
- Implement structured engineering and operations processes
- Ensure proper team focus on priorities, milestones, and deliverables
- Work to meet service level agreements with customer deployments around the globe
- Deliver quality managed services in a consistent, timely manner
- Represent the IS team to stakeholders, customers, and internal teams
- Bachelors (or equivalent) Degree level education in a technology field
- Proven experience of software delivery using Python, Go, C, C++, or Java
- Proven experience managing devops teams for SAAS or similar offerings
- Understanding of testing methodologies and maintainable code quality
- Experience with Ubuntu system administration
- Experience with agile software development methodologies
- Experience working in and managing distributed teams
- Technical aptitude for understanding complex distributed systems
- Experience with cloud topologies and technologies
- Ability to travel to global company events 10-15% of the time

**About Canonical**:
Canonical is a growing international software company that works with the open-source community to deliver Ubuntu - the world's #1 cloud operating system. Our mission is to realize the potential of free software in the lives of individuals and organisations. Our services help businesses worldwide to reduce costs, improve efficiency and enhance security with Ubuntu.

We offer:

- Learning and development
- Competitive salary
- Recognition rewards
- Priority Pass for travel
- Remote work-from-anywhere policy
- Canonical is proud to foster a workplace free from discrimination. Diversity of experience, perspectives, and background lead to a better environment for our employees and a better platform for our users and customers. This is something we value deeply and we encourage everyone to come be a part of the world of Ubuntu._

LI-Remote

stack



  • Guadalajara, México Capgemini Engineering A tiempo completo

    **Site Reliability Engineer (REMOTE)**: **At Capgemini Engineering, the world leader in engineering services, we bring together a global team of engineers, scientists, and architects to help the world’s most innovative companies unleash their potential. From autonomous cars to life-saving robots, our digital and software technology experts think outside...


  • Guadalajara, México GSB A tiempo completo

    Important IT company At the Latin American level, growth requires: **SRE- Site Reliability Engineering** **Job description**: - We are looking for a Lead Site Reliability Engineer who takes the initiative on developing and maintain the system and services for our Cash Management Platform, automating the deployment process, ensuring system scaling,...


  • Guadalajara, México Capgemini Engineering A tiempo completo

    **Senior SRE - Capgemini**: We’re hiring a **Senior Site Reliability Engineer**to join a major telecom client through Capgemini Engineering. Join a collaborative team building and operating large-scale cloud platforms that power next‑generation connectivity and customer experiences. This is a hands‑on role where you’ll design, automate, secure, and...


  • Guadalajara, México Valce Talent Solutions A tiempo completo

    We are looking for a Lead Site Reliability Engineer who takes the initiative on developing and maintain the system and services for our Cash Management Platform, automating the deployment process, ensuring system scaling, investigating and resolving outdates, identifying and implementing preventive measures proactively, collaborating with key stakeholders,...


  • Guadalajara, México Intel A tiempo completo

    Come and join a dynamic and challenging team within the Intel Data Center and Artificial Intelligence Group focused on engineering, developing, and supporting world class platforms and component building blocks aligned to the Data Center roadmap and strategies. We are seeking a well-rounded Site Reliability Engineer to work with a team of architects and...


  • Guadalajara, México Arrive Logistics A tiempo completo

    **Who We Are****Who We Want**As a Senior Site Reliability Engineer for Arrive Logistics, you will be responsible for building a purposeful, proactive, and sustainable approach to reliability based on core SRE principles and practices. Your role covers the entire life-cycle of a product: from helping engineering teams with architecture and delivery to on-call...


  • Guadalajara, México Arrive Logistics A tiempo completo

    **Who We Are** **Who We Want** As a Senior Site Reliability Engineer for Arrive Logistics, you will be responsible for building a purposeful, proactive, and sustainable approach to reliability based on core SRE principles and practices. Your role covers the entire life-cycle of a product: from helping engineering teams with architecture and delivery to...

  • Site Reliability Engineer

    hace 2 semanas


    Guadalajara, México Wizeline A tiempo completo

    **The Company**:Wizeline is a global digital services company helping mid-size to Fortune 500 companies build, scale, and deliver high-quality digital products and services. We thrive in solving our customer’s challenges through human-centered experiences, digital core modernization, and intelligence everywhere (AI/ML and data). We help them succeed in...


  • Guadalajara, México Finastra USA Corporation A tiempo completo

    **Responsibilities**:**What will you contribute?**As a Site Reliability Engineer your mission is to protect and advance the software & systems behind Finastra’s Cloud hosted services running on Fusion Operate. Finastra believes in a blameless culture where the primary objective is continuous improvement. You’ll be treating operations as a software...


  • Guadalajara, México f5 A tiempo completo

    Everything we do centers around people. That means we obsess over how to make the lives of our customers, and their customers, better. And it means we prioritize a diverse F5 community where each individual can thrive.Position SummarySoftware engineering is a core discipline at F5 for many roles. As a software engineer specializing in site reliability, you...