Site Reliability

hace 2 días


Aguascalientes, México Canonical A tiempo completo

Site Reliability / Gitops Engineer Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public cloud and silicon providers, and industry leaders in many sectors. The company is a pioneer of global distributed collaboration, with 1200+ colleagues in 75+ countries and very few office‑based roles. Teams meet two to four times yearly in person, in interesting locations around the world, to align on strategy and execution. The company is founder‑led, profitable, and growing. Job Summary The IS team at Canonical supports and maintains all of Canonical's IT production services. The team is in charge of running services used by over 60 million Ubuntu users. As an SRE & Gitops engineer you will drive operations automation to the next level, both in our own private clouds and in public clouds, by utilizing the best open‑source infrastructure‑as‑code software, CI/CD pipelines, and Canonical's leading products for software operation automation. In addition to defining the infrastructure as code, you will improve Canonical products and open‑source technologies by providing critical feedback to developers on how their products operate at scale, submitting bugs and sometimes writing pull requests, and collaborating on design and implementation with other teams. Location This role is available remotely in any timezone. Responsibilities Apply your experience of IaC to develop infrastructure as code practice within IS by constantly increasing automation and improving IaC processes Automate software operations for re‑usability and consistency across private and public clouds, taking into consideration the complexities of distributed systems Develop new features and improve the resilience and scalability of the existing cloud and container portfolio at Canonical Maintain operational responsibility for all of Canonical's core services, networks, and infrastructure Develop skills in troubleshooting, capacity planning, and performance investigation; set up, maintain and use observability tools such as Prometheus, Grafana, and Elasticsearch; design, implement and maintain monitoring and alerting for various systems and services Collaborate with development teams to design service architecture, documentation, playbooks, policies and operational procedures Provide assistance and work with globally distributed engineering, operations, and support peers Be given uninterrupted development time to focus on larger projects and automation of manual tasks Share your experience, know‑how and best practices with other team members in design sessions, mentorship and “doing work together” Carry final responsibility for time‑critical escalations Qualifications A deep experience of, and knowledge to define operations in code, using version control, peer review and CI/CD to roll out changes both to applications and infrastructure Strong modern engineering background (peer‑review, unit testing, SCM, CI/CD, Agile) Python software development experience with large projects Practical knowledge of Linux networking, routing, and firewalls Affinity with various forms of Linux storage, from Ceph to databases Hands‑on experience administering enterprise Linux servers Extensive knowledge of cloud computing concepts and technologies Bachelor's degree or greater, preferably in computer science or related engineering field Able to communicate clearly and effectively in English over email, chat, video or voice calls and in‑person Motivated and able to troubleshoot from kernel to web, and willing to ask others when appropriate A willingness to be flexible and able to learn new things quickly Be inspired by the needs of fast‑changing environments Happy to work within distributed teams Be passionate and familiarized about open‑source, especially Ubuntu or Debian About Canonical Canonical is a pioneering tech firm at the front of the global move to open source. As the company that publishes Ubuntu, one of the most important open‑source projects and the platform for AI, IoT, and the cloud, we are changing the world of software. We recruit on a global basis and set a very high standard for people joining the company. We expect excellence; in order to succeed, we need to be the best at what we do. Most colleagues at Canonical work from home since our inception in 2004. Working here is a step into the future and will challenge you to think differently, work smarter, learn new skills, and raise your game. Canonical is an equal opportunity employer. We are proud to foster a workplace free from discrimination. Diversity of experience, perspectives, and background create a better work environment and better products. Whatever your identity, we will give your application fair consideration. #J-18808-Ljbffr



  • Aguascalientes, México Innova Solutions A tiempo completo

    Job ID: Remote, Aguascalientes Job Type: Contract Added - 03/06/24APPLY**Job Description**:Innova Solutions is immediately hiring for a Site Reliability Engineer**Position type**:Remote**Duration: 6 Months****Location**:USA**As a Site Reliability Engineer, you will:- Site Reliability Engineer responsibilities include monitoring computer systems and building...


  • Aguascalientes, México Innova Solutions A tiempo completo

    Job ID: 943898 Remote, Aguascalientes Job Type: Contract Added - 03/06/24 APPLY **Job Description**: Innova Solutions is immediately hiring for a Site Reliability Engineer **Position type**:Remote **Duration: 6 Months** **Location**:USA** As a Site Reliability Engineer, you will: - Site Reliability Engineer responsibilities include monitoring computer...


  • Aguascalientes, México Canonical A tiempo completo

    Senior Site Reliability Engineer at Canonical Location: Globally remote role We are hiring a Senior Site Reliability Engineer. Next‑gen operations at scale, with pure Python infra‑as‑code, from bare metal to containers and applications. Our goal is to perfect enterprise infrastructure devops. We run hundreds of private cloud, Kubernetes, and...


  • Aguascalientes, México Canonical A tiempo completo

    Senior Site Reliability Engineer at Canonical Location: Globally remote role We are hiring a Senior Site Reliability Engineer. Next‑gen operations at scale, with pure Python infra‑as‑code, from bare metal to containers and applications. Our goal is to perfect enterprise infrastructure devops. We run hundreds of private cloud, Kubernetes, and...

  • Site Reliability Engineer

    hace 4 semanas


    Aguascalientes, México Capgemini A tiempo completo

    **RH**:Héctor Hernández**Location**:any state of Mexico**Industry - Sector**:Financial Services - Banking**General Description**- SER who manage and administer Cloudera Hadoop CDP Clusters and related big data services, with hands-on experience with Azure Cloud (VMs, ADLS, Key Vault, Vnet) and AWS Cloud (EC2, VPC, S3). Strong knowledge of DevOps tools:...

  • Site Reliability Engineer

    hace 3 semanas


    Aguascalientes, México Capgemini A tiempo completo

    **RH: Lessly Mujica****Location**:any state of Mexico**Industry - Sector**:Financial Services - Banking**Key Responsibilities**:Manage and administer Bigdata Hadoop clusters in public cloud environments (Azure and AWS).Handle and maintain Data Hub and Data Lake clusters, ensuring high availability and zero downtime.Configure and manage critical services,...

  • Site Reliability Engineer

    hace 3 semanas


    Aguascalientes, México Capgemini A tiempo completo

    **RH: Lessly Mujica****Location**:any state of Mexico**Industry - Sector**:Financial Services - Banking**General Description**- SER who manage and administer Cloudera Hadoop CDP Clusters and related big data services, with hands-on experience with Azure Cloud (VMs, ADLS, Key Vault, Vnet) and AWS Cloud (EC2, VPC, S3). Strong knowledge of DevOps tools: Docker,...

  • Site Reliability Engineer

    hace 3 semanas


    Aguascalientes, México Capgemini A tiempo completo

    **RH: Lessly Mujica****Location**:any state of Mexico**Industry - Sector**:Financial Services - Banking**General Description**- SER who manage and administer Cloudera Hadoop CDP Clusters and related big data services, with hands-on experience with Azure Cloud (VMs, ADLS, Key Vault, Vnet) and AWS Cloud (EC2, VPC, S3).Strong knowledge of DevOps tools: Docker,...


  • Aguascalientes, México Capgemini A tiempo completo

    **RH**:Héctor Hernández**Location**:any state of Mexico**Industry - Sector**:Financial Services - Banking**General Description**- SER who manage and administer Cloudera Hadoop CDP Clusters and related big data services, with hands-on experience with Azure Cloud (VMs, ADLS, Key Vault, Vnet) and AWS Cloud (EC2, VPC, S3). Strong knowledge of DevOps tools:...

  • Senior SRE

    hace 17 minutos


    Aguascalientes, México Canonical A tiempo completo

    A leading open-source technology company is seeking a Senior Site Reliability / Gitops Engineer in Aguascalientes, Mexico. This role emphasizes automation-first IT operations, infrastructure as code, and support for services used by millions. The ideal candidate will have strong skills in cloud computing, Python development, and Kubernetes, and a background...