Data Infrastructure Reliability Engineer

3 weeks ago


Ciudad de México, Ciudad de México Crunchyroll Full-time
About the Role

We are seeking a highly skilled Staff Site Reliability Engineer to join our Data Engineering team at Crunchyroll. This is an exceptional opportunity for an experienced professional to shape the future of anime by maintaining and enhancing the reliability of our data infrastructure.

The successful candidate will be responsible for ensuring the availability and performance of our data services, collaborating closely with data engineers and software engineers to drive 100% automation and best practices for deep monitoring and alerting.

Responsibilities
  • Maintain and enhance the reliability of our data infrastructure
  • Collaborate with data engineers and software engineers to develop and drive 100% automation and best practices for deep monitoring and alerting
  • Develop and implement effective monitoring and alerting strategies to ensure high availability and performance of our data services
  • Work closely with cross-functional teams to identify and resolve technical issues impacting data services
Requirements
  • Bachelor's degree in Computer Science, Information Technology, or a related field
  • 12+ years of experience in site reliability engineering, database operations, or a related role with a focus on data platforms, data stores, and data operations
  • Extensive experience with the AWS cloud platform and its data-related services
  • Proficiency in monitoring tools (e.g., Datadog, CloudWatch, DevOps Guru, DB Performance Insights)
  • Proficiency in one or more programming languages (e.g., Python, Java)
  • Proficiency in automation frameworks (e.g., Terraform, CloudFormation)
  • Strong understanding of performance metrics at both a high level and a low level, such as disk I/O saturation
  • Experience in identifying and eliminating bottlenecks in the system
  • Strong understanding of database internals such as index types, schemas, and query plans
  • Strong understanding of database systems (e.g., SQL, NoSQL) and experience in managing large-scale data infrastructures
  • Strong understanding and hands-on implementation of CI/CD pipelines and DataOps practices
  • Experience with data governance, compliance, and lifecycle management
Salary and Benefits

The salary range for this position is $180,000 - $220,000 per annum, depending on experience. We also offer a comprehensive benefits package, including medical, dental, and vision insurance, 401(k) matching, and generous paid time off.

Crunchyroll is committed to providing a diverse and inclusive work environment, and we welcome applicants from all backgrounds. As an equal opportunity employer, we do not discriminate based on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

About Crunchyroll

Crunchyroll is a leading global provider of anime content, delivering a wide range of titles to over 100 million fans across 200+ countries and territories. We strive to create a community-driven platform that fosters engagement and belonging among anime enthusiasts worldwide.



  • Ciudad de México, Ciudad de México Crunchyroll, LLC Full-time

    About Crunchyroll: At Crunchyroll, we're committed to delivering the art and culture of anime to our global community. As a Staff Site Reliability Engineer on our Data Engineering team, you'll play a pivotal role in ensuring the reliability, scalability, and performance of our data infrastructure. About the Role: We're looking for a highly skilled engineer to...


  • Ciudad de México, Ciudad de México Crunchyroll Full-time

    About Crunchyroll: We're a global entertainment company dedicated to delivering anime and manga experiences to our fans. As a leading platform, we serve over 100 million users across 200+ countries, providing an extensive library of content, merchandise, events, and more. This role is part of our Data Engineering team, which ensures seamless data operations and...


  • Ciudad de México, Ciudad de México Snaphunt Full-time

    About the Role: We are seeking a skilled DevOps Engineer with expertise in Google Cloud Platform to join our team at Snaphunt. This is an exciting opportunity to work with a leading data and business analytics consulting company that enables large enterprises to tame modern data complexities. Design, implement, and maintain scalable and efficient CI/CD...


  • Ciudad de México, Ciudad de México Thales Full-time

    Company Overview: Thales is a global leader in digital security and identity management, trusted by over 30,000 organizations to provide secure solutions for billions of digital interactions. Job Description: As a Cloud Infrastructure Reliability Engineer at Thales, you will play a crucial role in ensuring the reliability, availability, and performance of...


  • Ciudad de México, Ciudad de México Thales Full-time

    Thales is a global leader in digital security. Our solutions empower organizations to securely interact with people, objects, and services. As a Site Reliability Engineer, you will contribute to the development and maintenance of our large-scale ODC services. Your focus will be on ensuring the reliability, availability, and performance of these systems. This...


  • Ciudad de México, Ciudad de México Sequoia Connect Full-time

    Sequoia Connect is a USD 6 billion company with 163,000+ professionals across 90 countries, helping 1279 global customers, including Fortune 500 companies. We are currently searching for a Site Reliability Engineer (SRE) to join our team in Mexico. This position plays a critical role in ensuring the scalability and reliability of our Cash Management...


  • Ciudad de México, Ciudad de México NTT DATA, Inc. Full-time

    Job Summary: We are seeking a highly skilled Cloud Infrastructure Specialist - Azure to join our team in Mexico. The successful candidate will have expertise in Microsoft Azure, Windows Server, SCCM, and SolarWinds. Key Responsibilities: Design, deploy, and manage Microsoft Azure solutions to ensure optimal performance, security, and scalability. Support and...


  • Ciudad de México, Ciudad de México Crunchyroll Full-time

    About Crunchyroll: We empower a global community of anime enthusiasts by delivering exceptional data experiences. Our mission is to help everyone belong. Founded by fans, Crunchyroll provides seamless access to anime and manga content across 200+ countries and territories, serving over 100 million passionate individuals. We strive to create immersive...

  • Data Storage Engineer

    4 weeks ago


    Ciudad de México, Ciudad de México Microsoft Full-time

    Job Summary: We are seeking a highly skilled Data Storage Engineer to join our team at Microsoft. As a key member of our Azure Data engineering team, you will be responsible for designing, coding, testing, and developing features that improve the SQL DB service offerings, ensuring quality, maintainability, and end-to-end ownership. Key Responsibilities: Design...


  • Ciudad de México, Ciudad de México Thales Full-time

    About the Role: The Cloud Infrastructure Engineer - L2 will be responsible for ensuring the reliability, availability, and performance of large-scale ODC services on public cloud platforms. This involves working closely with development teams to design, build, and maintain scalable and reliable infrastructure, automate processes, monitor system health, and...


  • Ciudad de México, Ciudad de México Klar Full-time

    We are seeking an experienced Cloud Infrastructure Engineer to join our team at Klar, a Mexican fintech startup that is revolutionizing the way financial services are delivered in Mexico. As a key member of our infrastructure team, you will play a crucial role in designing, building, and maintaining scalable and secure cloud-based infrastructure to support...


  • Ciudad de México, Ciudad de México Trax Full-time

    About Trax: At Trax, we empower brands and retailers to harness the power of digital technologies and create exceptional shopping experiences. Our retail platform provides real-time insights into in-store activities, enabling businesses to focus on what matters most: delighting customers. As a pioneer in computer vision, Trax continues to innovate and lead...


  • Ciudad de México, Ciudad de México Crunchyroll Full-time

    At Crunchyroll, we are committed to delivering exceptional experiences for millions of anime fans worldwide. About the Role: We are seeking a highly skilled Data Reliability Specialist to join our Data Engineering team in Mexico City. As a key member of our team, you will be responsible for maintaining and enhancing the reliability of our data infrastructure,...


  • Ciudad de México, Ciudad de México ZMEX Zillow Mexico, S. de R.L. de C.V. Full-time

    About the Role: As a Senior Software Development Engineer in Big Data Infrastructure at ZMEX Zillow Mexico, S. de R.L. de C.V., you will be responsible for developing and supporting data platform infrastructure. You will utilize data technologies such as Spark, Hive, Airflow, Databricks, Kafka, Flink, and other industry and internal tools. Responsibilities: *...


  • Ciudad de México, Ciudad de México NTT DATA, Inc. Full-time

    About the Role: NTT DATA Services is seeking a Cloud Infrastructure Specialist to design, deploy, and manage Microsoft Azure solutions. The successful candidate will have strong knowledge of Microsoft Azure, Windows Server, and SCCM, with experience in system administration, network monitoring, and performance management. Key Responsibilities: Design and deploy...


  • Ciudad de México, Ciudad de México Tala Full-time

    About Tala: Tala is on a mission to unleash the economic power of underserved communities worldwide. Our innovative platform enables lending and other financial services, empowering individuals in emerging markets to start and grow small businesses, manage daily expenses, and achieve their financial goals with confidence. We have a global team with a...


  • Ciudad de México, Ciudad de México AgileEngine, LLC Full-time

    About AgileEngine, LLC: AgileEngine, LLC is a company that values innovation and collaboration. Job Title: Cloud Infrastructure Engineer. Estimated Salary: $120,000 - $180,000 per year. We are seeking an experienced Cloud Infrastructure Engineer to join our team. As a Cloud Infrastructure Engineer at AgileEngine, LLC, you will be responsible for designing,...


  • Ciudad de México, Ciudad de México NTT DATA Full-time

    Senior Platform Engineer Job Description: We are seeking a highly skilled Senior Platform Engineer to join our team. This role is responsible for designing, implementing, and maintaining the infrastructure that supports our applications. About Us: NTT DATA is a global leader in business and technology services. We help our clients innovate, optimize, and...


  • Ciudad de México, Ciudad de México Wiser Solutions Full-time

    Senior DevOps Engineer: We are looking for a seasoned Cloud-Native Infrastructure Architect to lead our engineering teams in delivering top-notch quality of service. As a key member of our infrastructure team, you will help set and drive the technical vision for our infrastructure, observability, site reliability, and software release pipeline.


  • Ciudad de México, Ciudad de México Thomson Reuters Full-time

    About the Role: In this opportunity as a Cloud Infrastructure Engineer - Service Reliability Specialist, you will be responsible for delivering high-quality solutions for the SRE team. Provides skilled technical support/delivery capability, with minimal supervision, for the current and future design, testing, delivery, support, and maintenance of production...