Site Reliability Engineer
2 days ago
Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world’s leading public cloud and silicon providers, and industry leaders in many sectors. The company is a pioneer of globally distributed collaboration, with 1200+ colleagues in 75+ countries and very few office-based roles.
Teams meet two to four times yearly in person, in interesting locations around the world, to align on strategy and execution. The company is founder‑led, profitable, and growing.

We are hiring a Site Reliability Engineer
Our goal is to perfect enterprise infrastructure DevOps practices, raising the bar on what’s possible with automation by embracing a model‑driven approach, whether on‑premise or on public clouds. We run hundreds of private clouds, Kubernetes clusters, and applications for customers across both physical and public cloud estates.
We identify and address incidents, monitor and observe applications, anticipate potential issues, and enable product refinement to ultimately achieve high-quality standards in our open source portfolio. To succeed in this role, you need to have a strong background in Linux, Python, networking, and knowledge of how clouds work. Your work will encompass the entire stack, from bare-metal networking and kernel up to Kubernetes and open source applications. You can expect to be trained in our core technologies like OpenStack, Kubernetes, security standards, open source products like Kubeflow, Kafka, OpenSearch, databases, and many others.
Automation for us is a software engineering problem that we approach with a scientific mindset to bring operations at scale, driven by metrics and code.

Location: Globally remote role

The role
We deploy and run OpenStack, Kubernetes, storage solutions, and open source applications, applying DevOps practices. To become a member of our team, you need to be a software engineer fluent in Python, you need a genuine interest in the full open source infrastructure stack from bare metal to containers, and you need the ability to work in operations with mission-critical services for global brand-name customers. As a member of the team, you will gain experience in a broad range of cloud technologies.
We evolve our offerings as the state of the art improves, so you get to stay current with the latest capabilities in open source infrastructure.

What we are looking for in you
Degree in software engineering or computer science
Python software development experience
Operational experience in Linux environments
Experience with Kubernetes deployment or operations
Excellent interpersonal skills, curiosity, flexibility, and accountability
Ability to travel internationally twice a year, for company events up to two weeks long

Bonus skills
Familiarity with OpenStack deployment or operations
Familiarity with public cloud deployment or operations
Familiarity with private cloud management

What we offer colleagues
Distributed work environment with twice-yearly team sprints in person
Personal learning and development budget of USD 2,000 per year
Compensation review every 6 months
Recognition rewards
Annual holiday leave
Maternity and paternity leave
Employee Assistance Programs
Opportunity to travel to new locations to meet your colleagues
Priority Pass and travel upgrades for long‑haul company events

About Canonical
Canonical is a pioneering tech firm at the forefront of the global move to open source. As the company that publishes Ubuntu, one of the most important open source projects and the platform for AI, IoT, and the cloud, we are changing the world of software. We recruit on a global basis and set a very high standard for people joining the company. We expect excellence; in order to succeed, we need to be the best at what we do.
Most colleagues at Canonical have worked from home since its inception in 2004. Working here is a step into the future, and will challenge you to think differently, work smarter, learn new skills, and raise your game.

Canonical is an equal opportunity employer
We are proud to foster a workplace free from discrimination. Diversity of experience, perspectives, and background creates a better work environment and better products.
Whatever your identity, we will give your application fair consideration.
-
Site Reliability Engineer
1 week ago
Baja California, Mexico | National Oilwell Varco, Inc. | Full-time
Site Reliability Engineer (SRE) – Application Performance Monitoring (APM)
Location: Monterrey, Nuevo León, Mexico (Hybrid – candidates must reside in Monterrey or the metropolitan area)
Language requirement: Fluent English (spoken and written)
About the Role
We’re looking for a Site Reliability Engineer (SRE) with a passion for Application...
-
Site Reliability Engineer ID45689
2 days ago
Baja California, Mexico | AgileEngine | Full-time
Join to apply for the Site Reliability Engineer ID45689 role at AgileEngine. AgileEngine is an Inc. 5000 company that creates award‑winning software for Fortune 500 brands and startups across 17+ industries. We rank among leaders in application development and AI/ML, and our people‑first culture has earned us multiple Best Place to Work awards. Why join...
-
Site Reliability Engineer: Cloud, Automation
1 week ago
Baja California, Mexico | Itj | Full-time
A leading technology company in Baja California is looking for a Site Reliability Engineer to enhance software architecture, automate infrastructure, and maintain monitoring solutions. This position requires experience in cloud-centric systems, particularly using Python and Terraform. The ideal candidate will collaborate with development teams and maintain...
-
Senior Site Reliability Engineer
2 days ago
Baja California, Mexico | Canonical | Full-time
A leading software company is seeking a Senior Site Reliability Engineer to join their remote team. The role involves architecting and running OpenStack and Kubernetes, focusing on DevSecOps. Ideal candidates will have a degree in Software Engineering or Computer Science, along with Python expertise and operational experience in large-scale cloud...
-
Remote Site Reliability Engineer — Cloud
2 days ago
Baja California, Mexico | AgileEngine | Full-time
A leading software company is seeking a Site Reliability Engineer to design and deploy secure, reliable cloud-native infrastructure. The role includes developing effective CI/CD pipelines, mentoring engineering teams, and collaborating with product teams. Ideal candidates should have 8–10 years of experience, strong AWS skills, and expertise in...
-
Remote Site Reliability Engineer – Open Source Cloud
2 weeks ago
Baja California, Mexico | Canonical | Full-time
A leading open source software provider is seeking a Site Reliability Engineer. In this role, you will deploy and manage OpenStack, Kubernetes, and various storage solutions while practicing DevOps. Ideal candidates will have a strong foundation in Python and Linux, with the ability to operate in mission-critical services globally. This is a remote position...
-
Site Reliability Engineer
2 days ago
Baja California, Mexico | BairesDev | Full-time
Overview
Site Reliability Engineer - Remote Work | REF# at BairesDev. We are looking for a Site Reliability Engineer to administer and provide support for the whole project infrastructure hosted in the cloud while implementing CI/CD pipelines for the automation of the deployments.
What You Will Do
Ensure high service availability, performance, security,...
-
Site Reliability Engineer
1 week ago
Baja California, Mexico | Itj | Full-time
Position Overview
SREs at ITJ support our mission by pushing out new features and applications every day. The Site Reliability Engineering team constantly practices the DevOps mindset to build and deploy distributed, fault‑tolerant systems at scale. As part of this team, you will work with developers, operations, and product sponsors to help design, build,...
-
Senior Reliability Engineer — RCA
2 days ago
Baja California Sur, Mexico | Outset Medical | Full-time
A pioneering medical device company in San José del Cabo seeks a Senior Reliability Engineer. The role focuses on leading reliability tests, conducting root cause investigations, and implementing improvements. Candidates should have a strong background in reliability engineering, excellent communication skills, and at least 7 years of relevant experience. A...
-
Senior Reliability Engineer
2 days ago
Baja California Sur, Mexico | Outset Medical | Full-time
Company Overview
Join us for an enriching journey with Outset, a trailblazing medical device company that is revolutionizing the field of dialysis. Our focus is to create one high-performing team, obsessed with progress, in an atmosphere that is brimming with transformative opportunities. The heart of our mission is pioneering a groundbreaking technology...