IT Site Reliability Engineer

hace 3 semanas


Lagos de Moreno, México Tata Consultancy Services A tiempo completo

About the Role

We are seeking a talented and experienced IT Engineer / Architect with a strong focus on site reliability engineering responsibilities to join our team. As a key member of our team, you will be responsible for ensuring the reliability, scalability, and performance of our infrastructure and applications, with a specific focus on the architecture design and implementation.

Responsibilities

- Design, build, and maintain the architecture of our cloud-based infrastructure to ensure high availability, scalability, and security for our medical device applications, including but not limited to Ignition, PostgreSQL, HiveMQ, Qlik, Confluent Kafka, and Tanzu.
- Collaborate with cross-functional teams to develop and implement best practices for container orchestration and management, with a specific focus on Kubernetes.
- Develop and maintain CI/CD pipelines to automate the deployment and testing of applications and infrastructure changes, utilizing tools such as Tanzu, Confluent Kafka, and others as needed.
- Manage and maintain the repository of infrastructure as code, ensuring proper version control and documentation, with a specific focus on the specified applications.
- Monitor and analyze system performance, identifying and resolving potential issues to ensure optimal reliability and performance for the specified applications.
- Lead efforts to implement disaster recovery and business continuity plans for critical systems and applications, including those utilizing Ignition, PostgreSQL, HiveMQ, Qlik, Confluent Kafka, and Tanzu.
- Strong Linux Experience:
- Proficient in administering Linux systems (e.g., Ubuntu, CentOS, RHEL, Debian) in production environments.
- Strong knowledge of Linux internals including system calls, process management, networking, and filesystems.
- Experience with system monitoring and performance tuning on Linux servers.

- DevOps:
- Implements GitOps workflows for Kubernetes using declarative infrastructure in Git.
- Manages manifests, Helm charts, or Kustomize in version control.
- Automates reconciliation between Git and clusters for consistent deployments.
- Monitors and troubleshoots GitOps deployment issues, enforcing drift detection with Git-centric tools.

Qualifications

- Bachelor's degree in computer science, Engineering, or a related field.
- Proven experience in designing and implementing architecture for cloud-based infrastructure, preferably in the medical device or healthcare industry, with expertise in the specified applications.
- Strong expertise in Kubernetes and other container orchestration technologies, with experience in managing the specified applications.
- Experience with infrastructure as code tools such as Terraform, Ansible, or CloudFormation, with a focus on the specified applications.
- Proficiency in developing and maintaining CI/CD pipelines using tools such as Tanzu, Confluent Kafka, and others as needed.
- Solid understanding of networking, security, and monitoring concepts in a cloud environment, with a focus on the specified applications.
- Experience working in Global / Multisite deployments of new architecture, change control for new requests as well as support in cases of issues.

Required Skills

- Drive for Results
- Interpersonal Relationships
- Adaptability

Preferred Skills

- Previous Medical Devices or Pharma experience
- Certified AWS Solution Architect
- Certified Kubernetes (CKS or CKA)
- ITILv4

Equal Opportunity Statement

We are committed to diversity and inclusivity in our hiring practices.



  • Ciudad de México Tata Consultancy Services A tiempo completo

    We are looking for a Site Reliability Engineer (SRE) to join our team and help us ensure seamless, high-performing, and reliable technology operations. What you’ll work with: Azure DevOps - Pipelines, repositories, and automation ServiceNow - Incident, change, and problem management AppDynamics - Application performance monitoring and alerting Microsoft...

  • Site Reliability Engineer

    hace 3 semanas


    Ciudad de México Atos A tiempo completo

    **Job Applicant Privacy Notice**:**Site Reliability Engineer**:- Publication Date: Jan 14, 2025- Ref. No: - Location: Mexico City, MXEviden, part of the Atos Group, with an annual revenue of circa € 5 billion is a global leader in data-driven, trusted and sustainable digital transformation. As a next generation digital business with worldwide leading...


  • Ciudad de México Royal Caribbean Group A tiempo completo

    Press Tab to Move to Skip to Content Link Select how often (in days) to receive an alert: Site Reliability Engineer Journey with us! Combine your career goals and sense of adventure by joining our incredible team of employees at Royal Caribbean Group . We are proud to offer a competitive compensation and benefits package, and excellent career development...


  • Ciudad de México Royal Caribbean Group A tiempo completo

    Join to apply for the Site Reliability Engineer role at Royal Caribbean Group 1 week ago Be among the first 25 applicants Join to apply for the Site Reliability Engineer role at Royal Caribbean Group Get AI-powered advice on this job and more exclusive features. Journey with us! Combine your career goals and sense of adventure by joining our incredible team...

  • Site Reliability Engineer

    hace 2 semanas


    Ciudad de México UST A tiempo completo

    Join to apply for the Site Reliability Engineer role at UST Continue with Google Continue with Google Join to apply for the Site Reliability Engineer role at UST Get AI-powered advice on this job and more exclusive features. Sign in to access AI-powered advices Continue with Google Continue with Google Continue with Google Continue with Google Continue with...

  • Site Reliability Engineer

    hace 3 semanas


    Estado de México BairesDev A tiempo completo

    Site Reliability Engineer - Remote Work | REF# Join to apply for the Site Reliability Engineer - Remote Work | REF# role at BairesDev Site Reliability Engineer - Remote Work | REF# 6 months ago Be among the first 25 applicants Join to apply for the Site Reliability Engineer - Remote Work | REF# role at BairesDev At BairesDev, we've been leading the way in...

  • Site Reliability Engineer

    hace 3 semanas


    Ciudad de México Royal Caribbean Group A tiempo completo

    **Journey with us!** Combine your career goals and sense of adventure by joining our incredible team of employees at **Royal Caribbean Group** We are proud to offer a competitive compensation and benefits package and excellent career development opportunities each offering unique ways to explore the worldWe are proud to be the vacation-industry leader with...


  • Ciudad de México Royal Caribbean Group A tiempo completo

    Join to apply for the Senior Site Reliability Engineer role at Royal Caribbean Group . 1 week ago Be among the first 25 applicants. Journey with us! Combine your career goals and sense of adventure by joining our incredible team at Royal Caribbean Group . We offer a competitive compensation and benefits package, along with excellent career development...

  • Site Reliability Engineer

    hace 2 semanas


    Ciudad de México The Functionary A tiempo completo

    Direct message the job poster from The Functionary Experienced Technical recruiter with 6+ years of experience. Now hiring for LATAM, India and US. Must-Haves: Looking for a Senior Site Reliability Engineer with strong experience in Terraform, EKS, and Kubernetes. Ability to work with stakeholders and has experience leading P1 and P2 teams. Experience...


  • Ciudad de México Thomson Reuters A tiempo completo

    Are you passionate about the chance to bring your extensive technical experience to drive the Site Reliability Engineering team using industry best practices in a world class company? Thomson Reuters ONESOURCE Platform’s SRE team is looking for a Site Reliability Engineer who will provide hands-on technical skills and share industry best practices with...