Staff Site Reliability Engineer

hace 3 días


Mexico City Crunchyroll A tiempo completo

About Crunchyroll

WE HELP EVERYONE BELONG. IT’S OUR PURPOSE.

Founded by fans, Crunchyroll delivers the art and culture of anime to a passionate community. We super-serve over 100 million anime and manga fans across 200+ countries and territories, and help them connect with the stories and characters they crave. Whether that experience is online or in-person, streaming video, theatrical, games, merchandise, events and more, it’s powered by the anime content we all love.

Join our team, and help us shape the future of anime

Who We Are

We're a cast of characters working to shine a spotlight on anime. is an international business focused on creating both online and offline experiences for fans through content (licensed, co-produced, originals, distribution), merchandise, events, gaming, news, and more. Visit our pages for more information about our collection of brands.

About the Team

The Site Reliability Engineering (SRE) team is dedicated to ensuring the reliability, scalability, and performance of our data infrastructure. We focus on standardizing and implementing monitoring and alerting across all datastores to track key metrics like errors, latency, and throughput, and to ensure critical systems are covered. Our team also leads efforts to keep databases up-to-date, implements Infrastructure as Code (IaC) for high availability and performance, and automates key processes to enhance operational efficiency. 

We lead and evangelize the principle of 100% automation. Additionally, we define and document operational requirements, develop incident response processes, and automate monitoring and compliance checks to maintain a secure and reliable data environment. By continuously improving load testing and optimizing data governance practices, we support the overall health and efficiency of our data systems.

About the Role

Crunchyroll is growing and changing, presenting unique challenges and opportunities to support millions of anime fans around the world. The Data Engineering team provides seamless help to our internal stakeholders, ensuring an exceptional experience for all Crunchyroll fans.

As a Staff Site Reliability Engineer for the Data Engineering team, you will be responsible for maintaining and enhancing the reliability of our data infrastructure. Your work will directly impact the availability and performance of our data services, enabling the organization to better decisions. You will collaborate closely with data engineers, and software engineers to develop and drive 100% automation, best practices for deep monitoring and alerting. This role will report to our Director of Data Engineering and will be based out of our Mexico City office. 

About You

Bachelor's degree in Computer Science, Information Technology, or a related field.12+ years of experience in site reliability engineering, database operations, or a related role with a focus on data platforms, data stores, data operations.Extensive experience with AWS cloud platform and their data-related services.Proficiency in monitoring tools (e.g., Datadog, CloudWatch, DevOps Guru, DB Performance Insights).Proficiency in one or more programming languages (e.g. Python, Java)Proficiency in automation frameworks (e.g., Terraform, Cloud Formation).Strong understanding of various performance metrics both at a high level and at a low level like Disk/IO saturation.Experience in identifying and eliminating the bottlenecks in the system.Strong understanding of database internals like types of indexes, schemas, query plans.Strong understanding of database systems (e.g., SQL, NoSQL) and experience in managing large-scale data infrastructures.Strong understanding and hands-on implementation of CI/CD pipelines and DataOps practices.Experience with data governance, compliance, and lifecycle management.Ability to own and execute projects while effectively collaborating with the team to influence and shape the vision of the data engineering organization.

#LifeAtCrunchyroll #LI-Hybrid

About our Values

We want to be everything for someone rather than something for everyone and we do this by living and modeling our values in all that we do. We value

Courage. We believe that when we overcome fear, we enable our best selves.

Curiosity. We are curious, which is the gateway to empathy, inclusion, and understanding.

Service. We serve our community with humility, enabling joy and belonging for others.

Kaizen. We have a growth mindset committed to constant forward progress.

Our commitment to diversity and inclusion

Our mission of helping people belong reflects our commitment to diversity & inclusion. It's just the way we do business.

We are an equal opportunity employer and value diversity at Crunchyroll. Pursuant to applicable law, we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Crunchyroll, LLC is an independently operated joint venture between US-based Sony Pictures Entertainment, and Japan's Aniplex, a subsidiary of Sony Music Entertainment (Japan) Inc., both subsidiaries of Tokyo-based Sony Group Corporation.

Questions about Crunchyroll’s hiring process? Please check out our Hiring FAQs: 

Please refer to our Candidate Privacy Policy for more information about how we process your personal information, and your data protection rights:

Please beware of recent scams to online job seekers. Those applying to our job openings will only be contacted directly from @crunchyroll.com email account.



  • Mexico City Trax A tiempo completo

    About The Position The Position Site Reliability Engineer About Trax Trax’s mission is to enable brands and retailers to harness the power of digital technologies to produce the best shopping experiences imaginable. Trax’s retail platform allows customers to understand what is happening on shelf, in every store, all the time so they...


  • Mexico City 1210 Kyndryl Mexico S. de R.L. de C.V. A tiempo completo

    Who We Are At Kyndryl, we design, build, manage and modernize the mission-critical technology systems that the world depends on every day. So why work at Kyndryl? We are always moving forward – always pushing ourselves to go further in our efforts to build a more equitable, inclusive world for our employees, our customers and our communities. The...


  • Mexico City Virtualent A tiempo completo

    Site Reliability Engineer (SRE)VirtualentAbout Us:We’re a leading IT Staffing company, passionate about connecting top talent with the best opportunities. We are looking for a Site Reliability Engineer (SRE) to join our team.Responsibilities:• Design, implement, and maintain scalable and highly available infrastructures.• Monitor and ensure the...


  • Mexico City Thales A tiempo completo

    Thales people architect identity management and data protection solutions at the heart of digital security. Business and governments rely on us to bring trust to the billons of digital interactions they have with people. Our technologies and services help banks exchange funds, people cross borders, energy become smarter and much more. More than 30,000...


  • Mexico City F5 A tiempo completo

    At F5, we strive to bring a better digital world to life. Our teams empower organizations across the globe to create, secure, and run applications that enhance how we experience our evolving digital world. We are passionate about cybersecurity, from protecting consumers from fraud to enabling companies to focus on innovation.    Everything we do centers...


  • Mexico City Epam A tiempo completo

    Description DESCRIPTION Are you a DevOps expert with a passion for improving communication between operational and developmental sides of the software development process? Do you thrive in dynamic, collaborative environments? If so, we have an exciting opportunity for you! We're currently seeking a Site Reliability Engineer to join...


  • Mexico City Oracle A tiempo completo

    Responsibilities Solve complex problems related to Linux infrastructure and Oracle Cloud Infrastructure  Act as a partner concern point for critical issues that may not have a detailed procedure and provide Root Cause Analysis (RCA) Understand the end-to-end configuration, technical dependencies, characteristics of production infrastructure and...


  • Mexico City Thomson Reuters A tiempo completo

    About the Role In this opportunity as a Site Reliability Engineer, you will:  Provides skilled technical support/delivery capability, with minimal supervision, for the current and future design, testing, delivery, support, and maintenance of production services in the technical operations environment. Provides technical and procedural consistency...


  • Mexico City Thales A tiempo completo

    Thales people architect identity management and data protection solutions at the heart of digital security. Business and governments rely on us to bring trust to the billons of digital interactions they have with people. Our technologies and services help banks exchange funds, people cross borders, energy become smarter and much more. More than 30,000...


  • Mexico City Thomson Reuters A tiempo completo

    About the Role In this opportunity as a Senior Site Reliability Engineer , you will:  Develop, Deliver, and Support: By applying modern SRE operational & development practices, you will be involved in the entire operational support, Monitoring, automation, building, and delivering high-quality solutions for the team. Be a Team Player: Working in...


  • City, México Svitla Systems A tiempo completo

    - Requirements: - 5+ years of experience in a SRE or similar role. - 3+ years of experience supporting containerized production services using Kubernetes. - 2+ years of experience with Infrastructure as Code and configuration tools like Terraform and Ansible. - 1+ year of recent experience in the cloud (Google Cloud Platform preferred, AWS and Azure will...

  • SRE Engineer

    hace 4 meses


    Mexico City Azka IT Consulting A tiempo completo

    AZKA IT is a Mexican company that seeks and connects the best IT talent with Latin American and United States companies.We are looking for your talent as Site Reliability EngineerRequirements:The Site Reliability Engineer (SRE) plays a crucial role in the design, implementation and maintenance of highly available, scalable and reliable systems.  Technical...


  • Mexico City Servicios Comerciales Amazon Mexico S. de R.L. de C.V. - D44 A tiempo completo

    Reliability, Maintenance, and Engineering (RME) Central Services is hiring for Systems Engineers!At Amazon we believe that Every Day is still Day One! We’re working to be the most customer-centric company on earth. To get there, we need talented, bright and driven people.The System Development Engineer position provides proactive technical support for...


  • Mexico City Thales A tiempo completo

    Thales people architect identity management and data protection solutions at the heart of digital security. Business and governments rely on us to bring trust to the billons of digital interactions they have with people. Our technologies and services help banks exchange funds, people cross borders, energy become smarter and much more. More than 30,000...

  • Staff Engineer

    hace 5 meses


    Mexico City BetterCloud A tiempo completo

    at BetterCloud BetterCloud is the market leader for SaaS Operations, enabling IT professionals to transform their employee experience, maximize operational efficiency, and centralize data protection. With no-code automation enabling zero-touch workflows, thousands of forward-thinking organizations like Twitch, Oscar Health, and Cloud Factory now rely on...


  • Mexico City Western Digital A tiempo completo

    Job DescriptionAs a Staff Software Engineer in the Flash Business Unit, you will play a pivotal role in designing, developing, and optimizing software solutions for our advanced hardware devices. You will join a team of skilled engineers and collaborate closely with cross-functional teams to deliver high-quality software that enhances the performance and...

  • Project Engineer

    hace 5 meses


    Mexico City Unilever A tiempo completo

    Unilever is currently hiring for Project Engineer Function: Project Enginner Work Level: 1C Reports to :ARACELI SOTO VAZQUEZ Scope :NUTRITION LATAM Location : LERMA. Terms & Conditions : Full time position.  ABOUT UNILEVER Unilever is the place where you can bring your purpose to life with the work that you do – creating a better...


  • Mexico City Lenovo A tiempo completo

    Description and Requirements The Lenovo Infrastructure Solutions Group (ISG) Premium Services Support Engineer is a critical member of the Lenovo ISG Services Delivery team. Strong technical skills, complex problem isolation skills, as well as outstanding customer support and professional communications skills are essential to ensuring the best...


  • Mexico City Siemens, S.A. de C.V. A tiempo completo

    At Siemens Global Business Services(GBS), we shape the Shared Services landscape of the future by supportingcompanies in all sectors worldwide. We aim to excite our customers throughproviding value generating and high-quality solutions appropriately tailored totheir needs. Our mission is to seamlessly integrate, digitalize, and optimizebusiness processes in...


  • City, México ICON plc A tiempo completo

    **What you will be doing**: - **What you will be doing**: Recognize, exemplify and adhere to ICON's values which centers around our commitment to People, Clients and Performance. - As a member of staff, the employee is expected to embrace and contribute to our culture of process improvement with a focus on streamlining our processes adding value to our...