Site Reliability Engineer

4 weeks ago


Aguascalientes, México Capgemini Full-time

**HR**: Lessly Mujica

**Location**: Any state of Mexico

**Industry - Sector**: Financial Services - Banking

**Key Responsibilities**:
- Manage and administer Big Data Hadoop clusters in public cloud environments (Azure and AWS).
- Handle and maintain Data Hub and Data Lake clusters, ensuring high availability and zero downtime.
- Configure and manage critical services, including HDFS, YARN, Spark, Kafka, Impala, Hive, ZooKeeper, and Ranger.
- Secure clusters by configuring Kerberos authentication, Ranger policies, and Active Directory integration.
- Perform cluster maintenance, node commissioning and decommissioning, snapshot creation, and user quota management on HDFS.
- Optimize cluster performance by tuning services such as YARN and Hive.
- Automate administrative tasks and deployments using Docker, Kubernetes, and GitLab.
- Resolve user tickets, troubleshoot issues, and document problem resolutions.
- Must have experience in operations support and release deployments.
- Monitor and manage Hadoop cluster health using Cloudera Manager.
- Manage job queues, troubleshoot long-running jobs, and ensure efficient cluster resource allocation.
- Perform cluster-to-cluster data transfers with DistCp and other migration tasks.
- Execute backup and recovery strategies and performance monitoring for data-intensive environments.

**Key Skills and Competencies**:
- Proficient in managing Big Data Hadoop (AWS EMR Hadoop). This is the most important skill and is a must.
- Hands-on experience with AWS Cloud (EC2, VPC, S3); the client's entire systems have been migrated to AWS infrastructure.
- Strong knowledge of CI/CD, Git, GitHub, and Jenkins (strong or acceptable knowledge).
- Expertise in implementing Kerberos authentication, user authorization via Ranger/Sentry, and HDFS access control (knowledge required, not expert level).
- Experience in performance tuning, security configurations, and troubleshooting (knowledge required, not expert level).
- Knowledge of scripting and automation using PowerShell and Bash; Python/Linux for automation (knowledge required, not expert level).
- Familiarity with ITIL and Agile/Scrum methodologies (production support).

**Must have Skills**:
- Big Data Hadoop (the client uses the AWS EMR Hadoop flavor, NOT Cloudera Hadoop).
- AWS Cloud (the client's entire systems have been migrated to AWS infrastructure).
- CI/CD, Jenkins, Git/GitHub.

**Nice to have skills**:
- Operations support and release deployment knowledge.
- Python / Linux for automation.
- Terraform, Tableau.
- DevOps and SRE technical skills are an added advantage.
- Knowledge of containerization tools such as Docker and Kubernetes.
- Familiarity with monitoring tools like Splunk, Cloudera Manager, and data security best practices.
- Strong analytical skills and ability to lead troubleshooting efforts.

**Soft skills**: Quality at work, Results Oriented

**What can YOU expect in a career with Capgemini?**
- Working in a team environment, Consultants will focus on the analysis, design and development of technology-based solutions for Capgemini’s clients.
- You will work alongside technical, functional and industry specialists to assist with the development, implementation and integration of innovative system solutions including methods, techniques and tools.
- You will contribute to client satisfaction by providing timely and responsive value-added services and work products.
- Capgemini offers a competitive compensation and benefits package.
- Headquartered in Paris, France, Capgemini has more than 340 thousand professionals, with a presence in Mexico distributed among 3 sites located in Mexico City, Monterrey and Aguascalientes. A deeply multicultural organization.
- Capgemini has developed its own way of working, the Collaborative Business Experience™, and draws on Rightshore, its worldwide delivery model.

**You will love this job because**
- Capgemini focuses on giving each new hire a YOU-nique experience through our recruitment process and on-boarding program, as well as by helping you to build your own career and professional skills foundation.
- Capgemini provides a collaborative environment that embodies and holds the following stated values close to heart: Honesty, Boldness, Trust, Freedom, Team Spirit, Modesty, and Fun.
- Capgemini cultivates an atmosphere for development that enables YOU to be hands-on, planning for your growth, both horizontally and vertically.



  • Aguascalientes, México Innova Solutions Full-time

    Job ID: Remote, Aguascalientes Job Type: Contract Added - 03/06/24 **Job Description**: Innova Solutions is immediately hiring for a Site Reliability Engineer **Position type**: Remote **Duration**: 6 Months **Location**: USA As a Site Reliability Engineer, you will: - Site Reliability Engineer responsibilities include monitoring computer systems and building...


  • Aguascalientes, México Innova Solutions Full-time

    Job ID: 943898 Remote, Aguascalientes Job Type: Contract Added - 03/06/24 **Job Description**: Innova Solutions is immediately hiring for a Site Reliability Engineer **Position type**: Remote **Duration**: 6 Months **Location**: USA As a Site Reliability Engineer, you will: - Site Reliability Engineer responsibilities include monitoring computer...


  • Aguascalientes, México Canonical Full-time

    Senior Site Reliability Engineer at Canonical Location: Globally remote role We are hiring a Senior Site Reliability Engineer. Next‑gen operations at scale, with pure Python infra‑as‑code, from bare metal to containers and applications. Our goal is to perfect enterprise infrastructure devops. We run hundreds of private cloud, Kubernetes, and...



  • Senior Site Reliability

    22 hours ago


    Aguascalientes, México Canonical Full-time

    Senior Site Reliability / Gitops Engineer Canonical is hiring a Senior Site Reliability / Gitops Engineer to join our Information Systems (IS) team. This role focuses on automation-first IT operations, infrastructure as code, and scalable service delivery across private and public clouds. Overview: We are a leading provider of open source software and operating...

  • Senior Site Reliability

    19 hours ago


    Aguascalientes, México Canonical Full-time

    Senior Site Reliability / Gitops Engineer Canonical is hiring a Senior Site Reliability / Gitops Engineer to join our Information Systems (IS) team. This role focuses on automation-first IT operations, infrastructure as code, and scalable service delivery across private and public clouds. Overview: We are a leading provider of open source software and operating...

  • Site Reliability

    1 week ago


    Aguascalientes, México Canonical Full-time

    Site Reliability / Gitops Engineer Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public...

  • Site Reliability

    6 days ago


    Aguascalientes, México Canonical Full-time

    Site Reliability / Gitops Engineer Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public...

  • Senior SRE

    5 days ago


    Aguascalientes, México Canonical Full-time

    A leading open-source technology company is seeking a Senior Site Reliability / Gitops Engineer in Aguascalientes, Mexico. This role emphasizes automation-first IT operations, infrastructure as code, and support for services used by millions. The ideal candidate will have strong skills in cloud computing, Python development, and Kubernetes, and a background...

  • Remote SRE

    1 week ago


    Aguascalientes, México Canonical Full-time

    A leading open-source software company seeks a Site Reliability / Gitops Engineer in a remote role. You will focus on operations automation across private and public clouds using Infrastructure as Code. Responsibilities include maintaining services for over 60 million users, improving operational processes, and collaborating with globally distributed teams....