Manager, Reliability Engineering

hace 3 semanas


Santiago de Querétaro, México Petco A tiempo completo

Create a healthier, brighter future for pets, pet parents and people If you want to make a real difference, create an exciting career path, feel welcome to be your whole self and nurture your wellbeing, Petco is the place for you. Our core values capture that spirit as we work to improve lives by doing what’s right for pets, people and our planet. We love all pets like our own We’re the future of the pet industry We’re here to improve lives We drive outstanding results together We’re welcome as we are Petco is a category-defining health and wellness company focused on improving the lives of pets, pet parents and Petco partners. We are 29,000 strong and operate 1,500+ pet care centers in the U.S., Mexico and Puerto Rico, including 250+ Vetco Total Care hospitals, hundreds of preventive care clinics and eight distribution centers. We’re focused on purpose-driven work, and strongly believe what’s good for pets, people and our planet is good for Petco. Summary Lead a team responsible for designing, implementing, and maintaining the infrastructure and processes that support the development, deployment, and operation of our software systems. You will play a critical role in driving efficiency, reliability, and scalability across our software development lifecycle while ensuring alignment with business objectives. This role requires strong leadership skills, technical expertise, and a strategic mindset to effectively manage resources, foster collaboration, and drive continuous improvement within the DevOps team. Duties & Responsibilities Lead and manage a team of DevOps engineers, providing coaching, mentorship, and performance feedback. Collaborate with senior leadership to define and align DevOps strategies with overall business objectives. Architect, implement, and manage CI/CD pipelines to automate software build, test, and deployment processes. Design and maintain infrastructure as code using tools such as Terraform, Ansible, or Chef. Oversee the implementation and management of containerization solutions using Docker and orchestration tools like Kubernetes. Ensure monitoring and troubleshooting of production systems to maintain high availability and reliability. Drive the adoption of best practices for infrastructure security, compliance, and governance. Evaluate and recommend new tools and technologies to improve efficiency and reliability of development and deployment processes. Collaborate with cross-functional teams to define infrastructure requirements and design scalable solutions. Champion a culture of collaboration, innovation, and continuous improvement within the DevOps team. Manage relationships with external vendors and service providers as needed. Develop and manage departmental budgets, forecasts, and resource allocation plans. Minimum Qualifications Bachelor's degree in Computer Science, Engineering, or a related field. (Master's degree preferred) 5+ years of experience in software development, IT operations, or a related field, with at least 2 years in a leadership or management role. Strong proficiency in scripting and programming languages such as Python, Bash, or Ruby. Experience with cloud computing platforms such as AWS, Azure, or Google Cloud Platform. In-depth knowledge of containerization technologies such as Docker and container orchestration tools like Kubernetes. Experience with configuration management tools such as Ansible, Puppet, or Chef. Proficiency in infrastructure as code concepts and tools such as Terraform. Hands‑on experience with CI/CD tools such as Jenkins, GitLab CI, or CircleCI. Excellent problem‑solving and troubleshooting skills. Strong leadership, communication, and collaboration skills, with the ability to motivate and inspire team members. Preferred Qualifications Certifications in relevant technologies such as AWS Certified DevOps Engineer, Kubernetes Certified Administrator, etc. Experience with monitoring and logging tools such as Prometheus, Grafana, ELK stack, or Splunk. Knowledge of agile software development methodologies. Experience with implementing and managing microservices architectures. Benefits For a more detailed overview of Petco Total Rewards, including health and financial benefits, 401K, incentives, and PTO, see Petco Total Rewards. Petco Animal Supplies, Inc. is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, age, protected veteran status, or any other protected classification. Referrals increase your chances of interviewing at Petco by 2x. Seniority Level Mid‑Senior level Employment Type Full‑time Job Function Quality Assurance Industry Retail #J-18808-Ljbffr



  • Santiago de Querétaro, México Petco A tiempo completo

    Summary: Lead and grow a team of Reliability Engineers responsible for designing, implementing, and operating the platforms, practices, and tooling that ensure the availability, performance, and resilience of our production systems. You will drive reliability, scalability, and operational excellence across the software delivery lifecycle, while aligning...


  • Santiago de Querétaro, México Petco A tiempo completo

    Summary: Lead and grow a team of Reliability Engineers responsible for designing, implementing, and operating the platforms, practices, and tooling that ensure the availability, performance, and resilience of our production systems. You will drive reliability, scalability, and operational excellence across the software delivery lifecycle, while aligning...


  • Santiago de Querétaro, México Petco A tiempo completo

    Summary: Lead and grow a team of Reliability Engineers responsible for designing, implementing, and operating the platforms, practices, and tooling that ensure the availability, performance, and resilience of our production systems. You will drive reliability, scalability, and operational excellence across the software delivery lifecycle, while aligning...


  • Querétaro, México Petco A tiempo completo

    Summary: Lead and grow a team of Reliability Engineers responsible for designing, implementing, and operating the platforms, practices, and tooling that ensure the availability, performance, and resilience of our production systems. You will drive reliability, scalability, and operational excellence across the software delivery lifecycle, while aligning...


  • Querétaro, México Petco A tiempo completo

    Summary: Lead and grow a team of Reliability Engineers responsible for designing, implementing, and operating the platforms, practices, and tooling that ensure the availability, performance, and resilience of our production systems. You will drive reliability, scalability, and operational excellence across the software delivery lifecycle, while aligning...


  • Querétaro, Qro., México Petco A tiempo completo

    Lead and grow a team of Reliability Engineers responsible for designing, implementing, and operating the platforms, practices, and tooling that ensure the availability, performance, and resilience of our production systems. You will drive reliability, scalability, and operational excellence across the software delivery lifecycle, while aligning reliability...


  • Querétaro City, México Petco A tiempo completo

    Summary: Lead and grow a team of Reliability Engineers responsible for designing, implementing, and operating the platforms, practices, and tooling that ensure the availability, performance, and resilience of our production systems. You will drive reliability, scalability, and operational excellence across the software delivery lifecycle, while aligning...


  • Querétaro, Qro., México Petco A tiempo completo

    Summary: Lead and grow a team of Reliability Engineers responsible for designing, implementing, and operating the platforms, practices, and tooling that ensure the availability, performance, and resilience of our production systems. You will drive reliability, scalability, and operational excellence across the software delivery lifecycle, while aligning...


  • Querétaro, Qro., México Petco A tiempo completo

    Summary: Lead and grow a team of Reliability Engineers responsible for designing, implementing, and operating the platforms, practices, and tooling that ensure the availability, performance, and resilience of our production systems. You will drive reliability, scalability, and operational excellence across the software delivery lifecycle, while aligning...


  • Querétaro City, México Petco A tiempo completo

    Summary: Lead and grow a team of Reliability Engineers responsible for designing, implementing, and operating the platforms, practices, and tooling that ensure the availability, performance, and resilience of our production systems. You will drive reliability, scalability, and operational excellence across the software delivery lifecycle, while aligning...