Site Reliability Engineer

hace 6 días


Aguascalientes, México Capgemini A tiempo completo

**RH**:Héctor Hernández**Location**:any state of Mexico**Industry - Sector**:Financial Services - Banking**General Description**- SER who manage and administer Cloudera Hadoop CDP Clusters and related big data services, with hands-on experience with Azure Cloud (VMs, ADLS, Key Vault, Vnet) and AWS Cloud (EC2, VPC, S3). Strong knowledge of DevOps tools: Docker, Kubernetes, GitLab and Jenkins. Expertise in implementing Kerberos authentication, user authorization via Ranger/Sentry, and HDFS access control. Experience in performance tuning, security configurations, and troubleshooting. Knowledge of scripting and automation using PowerShell and Bash. Familiarity with ITIL and Agile/Scrum methodologies.**Responsabilities**:- Adopt various tools developed by AppBank Engineering team to automate failures using machine learning techniques and notify discrepancies in the health of production and automation of health-restoration, with a focus on continuous measurement of risk and cost.- Manage and administer Cloudera Hadoop CDP clusters in public cloud environments (Azure and AWS).- Handle and maintain Data Hub and Data Lake clusters, ensuring high availability and zero downtime.- Configure and manage critical services, including HDFS, YARN, Spark, Kafka, Impala, Hive, Zookeeper, and Ranger.- Secure clusters by configuring Kerberos authentication, Ranger policies, and Active Directory integration.- Perform cluster maintenance, node commissioning and decommissioning, snapshot creation, and user quota management on HDFS.- Optimize cluster performance by tuning services such as YARN and Hive.- Automate administrative tasks and deployments using Azure DevOps, Docker, Kubernetes, and GitLab.- Resolve user tickets, troubleshoot issues, and document problem resolutions.- Monitor and manage the Hadoop cluster health using Cloudera Manager.- Manage job queues, troubleshoot long-running jobs, and ensure efficient cluster resource allocation.- Perform DISK-based cluster-to-cluster data transfers (DISTCP) and other migration tasks.- Execute backup, recovery strategies, and performance monitoring for data-intensive environments.**Preferred Experience**:- Knowledge of containerization tools such as Docker and Kubernetes.- Familiarity with monitoring tools like Splunk, Cloudera Manager, and data security best practices.- Strong analytical skills and ability to lead troubleshooting efforts.**Soft skills**:Quality at work, Results Oriented**What can YOU expect in a career with Capgemini?**- Working in a team environment, Consultants will focus on the analysis, design and development of technology-based solutions for Capgemini’s clients.- You will work alongside technical, functional and industry specialists to assist with the development, implementation and integration of innovative system solutions including methods, techniques and tools.- You will contribute to client satisfaction by providing timely and responsive value-added services and work products.- Capgemini offers a competitive compensation and benefits package.- Headquartered in Paris, France, Capgemini has a presence of more than 340 thousand professionals in Mexico distributed among 3 sites located in Mexico City, Monterrey and Aguascalientes. A deeply multicultural organization.- Capgemini has developed its own way of working, the Collaborative Business ExperienceTM, and draws on Rightshore, its worldwide delivery model.**You will love this job because**- Capgemini focuses on giving each new hire a YOU-nique experience through our recruitment process and on-boarding program, as well as by helping you to build your own career and professional skills foundation.- Capgemini provides a collaborative environment that embodies and holds the following stated values close to heart: Honesty, Boldness, Trust, Freedom, Team Spirit, Modesty, and Fun.- Capgemini cultivates an atmosphere for development that enables YOU to be hands-on, planning for your growth, both horizontally and vertically.



  • Aguascalientes, México Innova Solutions A tiempo completo

    Job ID: Remote, Aguascalientes Job Type: Contract Added - 03/06/24APPLY**Job Description**:Innova Solutions is immediately hiring for a Site Reliability Engineer**Position type**:Remote**Duration: 6 Months****Location**:USA**As a Site Reliability Engineer, you will:- Site Reliability Engineer responsibilities include monitoring computer systems and building...


  • Aguascalientes, México Innova Solutions A tiempo completo

    Job ID: 943898 Remote, Aguascalientes Job Type: Contract Added - 03/06/24 APPLY **Job Description**: Innova Solutions is immediately hiring for a Site Reliability Engineer **Position type**:Remote **Duration: 6 Months** **Location**:USA** As a Site Reliability Engineer, you will: - Site Reliability Engineer responsibilities include monitoring computer...


  • Aguascalientes, México Canonical A tiempo completo

    Senior Site Reliability Engineer at Canonical Location: Globally remote role We are hiring a Senior Site Reliability Engineer. Next‑gen operations at scale, with pure Python infra‑as‑code, from bare metal to containers and applications. Our goal is to perfect enterprise infrastructure devops. We run hundreds of private cloud, Kubernetes, and...


  • Aguascalientes, México Canonical A tiempo completo

    Senior Site Reliability Engineer at Canonical Location: Globally remote role We are hiring a Senior Site Reliability Engineer. Next‑gen operations at scale, with pure Python infra‑as‑code, from bare metal to containers and applications. Our goal is to perfect enterprise infrastructure devops. We run hundreds of private cloud, Kubernetes, and...

  • Site Reliability

    hace 3 días


    Aguascalientes, México Canonical A tiempo completo

    Site Reliability / Gitops Engineer Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public...

  • Site Reliability

    hace 12 horas


    Aguascalientes, México Canonical A tiempo completo

    Site Reliability / Gitops Engineer Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public...

  • Remote SRE

    hace 3 días


    Aguascalientes, México Canonical A tiempo completo

    A leading open-source software company seeks a Site Reliability / Gitops Engineer in a remote role. You will focus on operations automation across private and public clouds using Infrastructure as Code. Responsibilities include maintaining services for over 60 million users, improving operational processes, and collaborating with globally distributed teams....


  • Aguascalientes, México Canonical A tiempo completo

    A leading open source technology firm is seeking a Senior Site Reliability Engineer for a globally remote role. You will utilize your Python software development expertise in a high-pressure operations environment, working on automation and full-stack open source infrastructure. Responsibilities include architecting OpenStack and Kubernetes, as well as...

  • Site Reliability Engineer

    hace 4 semanas


    Aguascalientes, México Capgemini A tiempo completo

    **RH**:Héctor Hernández**Location**:any state of Mexico**Industry - Sector**:Financial Services - Banking**General Description**- SER who manage and administer Cloudera Hadoop CDP Clusters and related big data services, with hands-on experience with Azure Cloud (VMs, ADLS, Key Vault, Vnet) and AWS Cloud (EC2, VPC, S3). Strong knowledge of DevOps tools:...

  • Site Reliability Engineer

    hace 3 semanas


    Aguascalientes, México Capgemini A tiempo completo

    **RH: Lessly Mujica****Location**:any state of Mexico**Industry - Sector**:Financial Services - Banking**Key Responsibilities**:Manage and administer Bigdata Hadoop clusters in public cloud environments (Azure and AWS).Handle and maintain Data Hub and Data Lake clusters, ensuring high availability and zero downtime.Configure and manage critical services,...