Site Reliability Engineer
hace 2 días
itD is seeking a
Site Reliability Engineer
who will report to the Sr. Engineering Manager for a client in the gaming and entertainment space.
As a Site Reliability Engineer, you will focus on designing, deploying, and operating resilient, secure, and globally scalable services in AWS, with , TypeScript, Kubernetes, GitLab, Argo CD (CI/CD).
This long-term W2 opportunity is based remotely with a strong preference for candidates able to support PST working hours.
We provide comprehensive medical benefits, a 401k plan, paid holidays, and more.
Please note that we are only considering direct W2 candidates at this time, as we are unable to offer sponsorship.
Key Responsibilities
- Design, operate, and evolve cloud infrastructure to support internal users.
- Ensure security, governance, and cost efficiency across our GCP environments.
- Develop and support scalable services using and TypeScript, deployed with Kubernetes.
- Improve best practices for CI/CD pipelines, observability, incident response, and live service operation.
- Troubleshoot and improve live services for performance, scalability, and reliability.
- Automate deployment, monitoring, and recovery strategies to maximize uptime.
- Participate in on-call rotation, incident response, and root-cause analysis.
- Collaborate with development teams during the game prototyping and iteration process.
- Contribute to a culture of continuous improvement and knowledge sharing.
Required Qualifications And Skills
- Bachelor's/Master's degree in Computer Science, Software Engineering, or equivalent experience.
- 5+ years in SRE, DevOps, or cloud architecture, with hands-on experience running production services at scale.
- Deep expertise with public cloud environments (GCP preferred, AWS/Azure also valuable).
- Experience with Kubernetes for container orchestration and deployment.
- Proficiency with CI/CD pipelines using ArgoCD
- Solid scripting/automation skills (TypeScript/Node, js Python or Bash).
- Knowledge of observability tools (Prometheus, Grafana, Loki etc).
- Experience designing for high availability, reliability, and disaster recovery.
Education
Bachelor's/Master's degree in Computer Science, Software Engineering, or equivalent experience.
Company Description
About itD:
We are part of a new generation of consulting and software development company that blends diversity, innovation, and integrity with real business results. Our structure rejects any strong hierarchy, empowering us to deliver excellent results. We are a woman- and minority-led firm. Every day, we challenge ourselves to be considerate, fair and to re-think what great outcomes mean for our customers. This permeates down to how we approach every interaction, on every project, for every client. You'll thrive here if you are a dynamic self-starter, a difference-maker or someone who wants to deliver great results, without constraints.
The itD Experience
Joining us means you'll be part of our global community, you have a say about your own career journey, and you'll get a chance to give back to causes that matter. You will experience working with Fortune 500 companies and high-performance teams across numerous industries.
itD offers our employees excellent benefits such as medical, dental, vision, life insurance, paid holidays, PTO, 401K + matching, networking & career learning and development programs. We are growing and we want to see you grow
Visit to learn more about what working at itD can mean for you.
Company Description
About itD: We are part of a new generation of consulting and software development company that blends diversity, innovation, and integrity with real business results. Our structure rejects any strong hierarchy, empowering us to deliver excellent results. We are a woman- and minority-led firm. Every day, we challenge ourselves to be considerate, fair and to re-think what great outcomes mean for our customers. This permeates down to how we approach every interaction, on every project, for every client. You'll thrive here if you are a dynamic self-starter, a difference-maker or someone who wants to deliver great results, without constraints.
The itD Experience: Joining us means you'll be part of our global community, you have a say about your own career journey, and you'll get a chance to give back to causes that matter. You will experience working with Fortune 500 companies and high-performance teams across numerous industries. itD offers our employees excellent benefits such as medical, dental, vision, life insurance, paid holidays, 401K + matching, networking & career learning and development programs. We are growing and we want to see you grow Visit to learn more about what working at itD can mean for you.
All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability or protected veteran status, or any other legally protected basis, in accordance with applicable law. itD is committed to working with and providing reasonable accommodation to individuals with disabilities. If, because of a medical condition or disability, you need a reasonable accommodation for any part of the application process, or to perform the essential functions of a position, please contact us at and let us know the nature of your request and your contact information.
Additional Info
Dynamic environment in a culture of respect, empowerment and recognition for a job well done, apply today
-
Site Reliability Engineer
hace 2 semanas
Ciudad de México, Ciudad de México Azkait A tiempo completoAZKAITes una empresa mexicana que busca y conecta el mejor talento IT con empresas Latinoamericanas y de Estados Unidos.Estamos en la búsqueda de tu talento comoSite Reliability Engineer (SRE)Requisitos:Licenciatura o Ingeniería en Sistemas, Informática o afín.+5 años de experiencia en roles de SRE, DevOps o Ingeniería de Software.Experiencia...
-
Site Reliability Engineer
hace 4 días
Ciudad de México, Ciudad de México Mastercard A tiempo completoOur PurposeMastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we're helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships...
-
Lead Site Reliability Engineer
hace 2 semanas
Ciudad de México, Ciudad de México Pathlock A tiempo completoAbout Pathlock:Pathlock is a leader in application security, access governance, and compliance automation. Our cloud-based solutions help organizations secure critical applications, mitigate risk, and enforce policies across a diverse IT landscape.Job Summary:Join Pathlock, a fast-growing leader in Governance, Access and Compliance, where you'll help shape...
-
FBS Site Reliability Engineer
hace 2 semanas
Ciudad de México, Ciudad de México Capgemini A tiempo completoOur Client is one of the United States' largest insurers, providing a wide range of insurance and financial services products with gross written premiums well over US$25 Billion (P&C). They proudly serve more than 10 million U.S. households with more than 19 million individual policies across all 50 states through the efforts of over 48,000 exclusive and...
-
Senior Site Reliability Engineer
hace 7 días
Ciudad de México, Ciudad de México Thomson Reuters México A tiempo completoAre you passionate about the chance to bring your experience to a world-class company that is market-leading or both content and technology? If yes, we're looking for you.Join our team Senior Site Reliability Engineer (SRE) will be implement Site Reliability Engineering and DevOps best practices. Feed non-functional requirements into the product backlog,...
-
Senior Site Reliability Engineer
hace 4 días
Ciudad de México, Ciudad de México Third-Party Job Posts A tiempo completoWhat Makes Us Unique At Cloudbeds, we're not just building software, we're transforming hospitality. Our intelligently designed platform powers properties across 150 countries, processing billions in bookings annually. From independent properties to hotel groups, we help hoteliers transform operations and uplevel their commercial strategy through a unified...
-
Linux Site Reliability Engineer
hace 2 semanas
Ciudad de México, Ciudad de México AXA Group Operations A tiempo completoMain missionsBeing part of our global team as a Linux Engineer and become a key member of the SRO Squad (Site Reliability Operations), collaborating with a diverse group of experts to ensure robust and secure Linux (RHEL) infrastructure worldwide.Engineer (Build) and test solutions, document accordingly and handover to operations team. Provide 3rd level...
-
Site Reliability Engineer
hace 2 semanas
Santiago de Querétaro, Querétaro de Arteaga, México RELEX Solutions A tiempo completoTechnical Service Consultant/Site Reliability EngineerBased at: RELEX office in MexicoEmployment type: Permanent, full-timeTravel: Some ad hoc travel to client sites and the Atlanta office may be requiredThe RELEX team in the Americas is growing, and we're now looking for a Technical Consultant/Site Reliability Engineer. You'll join our global continuous...
-
Linux Site Reliability Engineer
hace 5 días
Ciudad de México, Ciudad de México AXA A tiempo completoAbout AXAAs a world-leading insurance company, we act for human progress by protecting what matters. With 153,000 employees in 54 countries working for 105 million customers, we've created a truly dynamic and vibrant community. Inclusion and diversity link closely with our values, and together we're nurturing a culture of respect, for each other, for our...
-
Sr. Site Reliability Engineer
hace 2 semanas
Ciudad de México, Ciudad de México Nova A tiempo completoAbout IO Connect Services: IO Connect Services is an AWS Advanced Tier Services Partner and Datadog Partner with a commitment to delivering complex and well-architected technical solutions worldwide. Founded in 2016, our professionals are dedicated to establishing and maintaining trust with our clients and business partners for long-term...