Data Infrastructure Reliability Engineer
hace 3 semanas
We are seeking a highly skilled Staff Site Reliability Engineer to join our Data Engineering team at Crunchyroll. This is an exceptional opportunity for an experienced professional to shape the future of anime by maintaining and enhancing the reliability of our data infrastructure.
The successful candidate will be responsible for ensuring the availability and performance of our data services, collaborating closely with data engineers and software engineers to drive 100% automation and best practices for deep monitoring and alerting.
Responsibilities- Maintain and enhance the reliability of our data infrastructure
- Collaborate with data engineers and software engineers to develop and drive 100% automation and best practices for deep monitoring and alerting
- Develop and implement effective monitoring and alerting strategies to ensure high availability and performance of our data services
- Work closely with cross-functional teams to identify and resolve technical issues impacting data services
- Bachelor's degree in Computer Science, Information Technology, or a related field
- 12+ years of experience in site reliability engineering, database operations, or a related role with a focus on data platforms, data stores, and data operations
- Extensive experience with AWS cloud platform and their data-related services
- Proficiency in monitoring tools (e.g., Datadog, CloudWatch, DevOps Guru, DB Performance Insights)
- Proficiency in one or more programming languages (e.g., Python, Java)
- Proficiency in automation frameworks (e.g., Terraform, Cloud Formation)
- Strong understanding of various performance metrics both at a high level and at a low level like Disk/IO saturation
- Experience in identifying and eliminating bottlenecks in the system
- Strong understanding of database internals like types of indexes, schemas, query plans
- Strong understanding of database systems (e.g., SQL, NoSQL) and experience in managing large-scale data infrastructures
- Strong understanding and hands-on implementation of CI/CD pipelines and DataOps practices
- Experience with data governance, compliance, and lifecycle management
The salary range for this position is $180,000 - $220,000 per annum, depending on experience. We also offer a comprehensive benefits package, including medical, dental, and vision insurance, 401(k) matching, and generous paid time off.
Crunchyroll is committed to providing a diverse and inclusive work environment, and we welcome applicants from all backgrounds. As an equal opportunity employer, we do not discriminate based on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
About CrunchyrollCrunchyroll is a leading global provider of anime content, delivering a wide range of titles to over 100 million fans across 200+ countries and territories. We strive to create a community-driven platform that fosters engagement and belonging among anime enthusiasts worldwide.
-
Staff Site Reliability Engineer
hace 1 mes
Ciudad de México, Ciudad de México Crunchyroll, LLC A tiempo completoAbout CrunchyrollAt Crunchyroll, we're committed to delivering the art and culture of anime to our global community. As a Staff Site Reliability Engineer on our Data Engineering team, you'll play a pivotal role in ensuring the reliability, scalability, and performance of our data infrastructure.About the RoleWe're looking for a highly skilled engineer to...
-
Data Infrastructure Reliability Specialist
hace 4 semanas
Ciudad de México, Ciudad de México Crunchyroll A tiempo completoAbout CrunchyrollWe're a global entertainment company dedicated to delivering anime and manga experiences to our fans.As a leading platform, we serve over 100 million users across 200+ countries, providing an extensive library of content, merchandise, events, and more.This role is part of our Data Engineering team, which ensures seamless data operations and...
-
Cloud Infrastructure Engineer
hace 1 mes
Ciudad de México, Ciudad de México Snaphunt A tiempo completoAbout the RoleWe are seeking a skilled DevOps Engineer with expertise in Google Cloud Platform to join our team at Snaphunt. This is an exciting opportunity to work with a leading data and business analytics consulting company that enables large enterprises to tame modern data complexities.Design, implement, and maintain scalable and efficient CI/CD...
-
Cloud Infrastructure Reliability Engineer
hace 2 semanas
Ciudad de México, Ciudad de México Thales A tiempo completoCompany OverviewThales is a global leader in digital security and identity management, trusted by over 30,000 organizations to provide secure solutions for billions of digital interactions.Job DescriptionAs a Cloud Infrastructure Reliability Engineer at Thales, you will play a crucial role in ensuring the reliability, availability, and performance of...
-
Site Reliability Engineer
hace 1 mes
Ciudad de México, Ciudad de México Thales A tiempo completoThales is a global leader in digital security. Our solutions empower organizations to securely interact with people, objects, and services. As a Site Reliability Engineer, you will contribute to the development and maintenance of our large-scale ODC services. Your focus will be on ensuring the reliability, availability, and performance of these systems. This...
-
Reliability Engineer for Cloud Infrastructure
hace 4 semanas
Ciudad de México, Ciudad de México Sequoia Connect A tiempo completoSequoia Connect is a USD 6 billion company with 163,000+ professionals across 90 countries, helping 1279 global customers, including Fortune 500 companies.We are currently searching for a Site Reliability Engineer (SRE) to join our team in Mexico. This position plays a critical role in ensuring the scalability and reliability of our Cash Management...
-
Cloud Infrastructure Specialist
hace 1 mes
Ciudad de México, Ciudad de México NTT DATA, Inc. A tiempo completoJob Summary:We are seeking a highly skilled Cloud Infrastructure Specialist - Azure to join our team in Mexico. The successful candidate will have expertise in Microsoft Azure, Windows Server, SCCM, and SolarWinds.Key Responsibilities:Design, deploy, and manage Microsoft Azure solutions to ensure optimal performance, security, and scalability.Support and...
-
Reliable Data Infrastructure Specialist
hace 4 semanas
Ciudad de México, Ciudad de México Crunchyroll A tiempo completoAbout CrunchyrollWe empower a global community of anime enthusiasts by delivering exceptional data experiences. Our mission is to help everyone belong.Founded by fans, Crunchyroll provides seamless access to anime and manga content across 200+ countries and territories, serving over 100 million passionate individuals. We strive to create immersive...
-
Data Storage Engineer
hace 4 semanas
Ciudad de México, Ciudad de México Microsoft A tiempo completoJob Summary:We are seeking a highly skilled Data Storage Engineer to join our team at Microsoft. As a key member of our Azure Data engineering team, you will be responsible for designing, coding, testing, and developing features that improve the SQL DB service offerings, ensuring quality, maintainability, and end-to-end ownership.Key Responsibilities:Design...
-
Cloud Infrastructure Engineer
hace 4 semanas
Ciudad de México, Ciudad de México Thales A tiempo completoAbout the RoleThe Cloud Infrastructure Engineer - L2 will be responsible for ensuring the reliability, availability, and performance of large-scale ODC services on public cloud platforms. This involves working closely with development teams to design, build, and maintain scalable and reliable infrastructure, automate processes, monitor system health, and...
-
Ciudad de México, Ciudad de México Klar A tiempo completoWe are seeking an experienced Cloud Infrastructure Engineer to join our team at Klar, a Mexican fintech startup that is revolutionizing the way financial services are delivered in Mexico.As a key member of our infrastructure team, you will play a crucial role in designing, building, and maintaining scalable and secure cloud-based infrastructure to support...
-
Site Reliability Engineer
hace 1 mes
Ciudad de México, Ciudad de México Trax A tiempo completoAbout TraxAt Trax, we empower brands and retailers to harness the power of digital technologies and create exceptional shopping experiences. Our retail platform provides real-time insights into in-store activities, enabling businesses to focus on what matters most – delighting customers.As a pioneer in computer vision, Trax continues to innovate and lead...
-
Data Reliability Specialist
hace 3 semanas
Ciudad de México, Ciudad de México Crunchyroll A tiempo completoAt Crunchyroll, we are committed to delivering exceptional experiences for millions of anime fans worldwide.About the RoleWe are seeking a highly skilled Data Reliability Specialist to join our Data Engineering team in Mexico City. As a key member of our team, you will be responsible for maintaining and enhancing the reliability of our data infrastructure,...
-
Senior Software Development Engineer
hace 4 semanas
Ciudad de México, Ciudad de México ZMEX Zillow Mexico, S. de R.L. de C.V. A tiempo completoAbout the RoleAs a Senior Software Development Engineer in Big Data Infrastructure at ZMEX Zillow Mexico, S. de R.L. de C.V., you will be responsible for developing and supporting data platform infrastructure. You will utilize data technologies such as Spark, Hive, Airflow, Databricks, Kafka, Flink, and other industry and internal tools.Responsibilities*...
-
Cloud Infrastructure Specialist
hace 1 mes
Ciudad de México, Ciudad de México NTT DATA, Inc. A tiempo completoAbout the RoleNTT DATA Services is seeking a Cloud Infrastructure Specialist to design, deploy, and manage Microsoft Azure solutions. The successful candidate will have strong knowledge of Microsoft Azure, Windows Server, and SCCM, with experience in system administration, network monitoring, and performance management.Key ResponsibilitiesDesign and deploy...
-
Data Infrastructure Specialist
hace 4 semanas
Ciudad de México, Ciudad de México Tala A tiempo completoAbout TalaTala is on a mission to unleash the economic power of underserved communities worldwide. Our innovative platform enables lending and other financial services, empowering individuals in emerging markets to start and grow small businesses, manage daily expenses, and achieve their financial goals with confidence.We have a global team with a...
-
Cloud Infrastructure Engineer
hace 4 semanas
Ciudad de México, Ciudad de México AgileEngine, LLC A tiempo completoAbout AgileEngine, LLCAgileEngine, LLC is a company that values innovation and collaboration.Job Title: Cloud Infrastructure EngineerEstimated Salary: $120,000 - $180,000 per yearWe are seeking an experienced Cloud Infrastructure Engineer to join our team. As a Cloud Infrastructure Engineer at AgileEngine, LLC, you will be responsible for designing,...
-
Cloud Infrastructure Architect
hace 3 semanas
Ciudad de México, Ciudad de México NTT DATA A tiempo completoSenior Platform Engineer Job DescriptionWe are seeking a highly skilled Senior Platform Engineer to join our team. This role is responsible for designing, implementing, and maintaining the infrastructure that supports our applications.About UsNTT DATA is a global leader in business and technology services. We help our clients innovate, optimize, and...
-
Cloud-Native Infrastructure Architect
hace 4 semanas
Ciudad de México, Ciudad de México Wiser Solutions A tiempo completoSenior DevOps EngineerWe are looking for a seasoned Cloud-Native Infrastructure Architect to lead our engineering teams in delivering top-notch quality of service. As a key member of our infrastructure team, you will help set and drive the technical vision for our infrastructure, observability, site reliability, and software release pipeline.
-
Cloud Infrastructure Engineer
hace 4 semanas
Ciudad de México, Ciudad de México Thomson Reuters A tiempo completoAbout the RoleIn this opportunity as a Cloud Infrastructure Engineer - Service Reliability Specialist, you will be responsible for delivering high-quality solutions for SRE team.Provides skilled technical support/delivery capability, with minimal supervision, for the current and future design, testing, delivery, support, and maintenance of production...