Site Reliability Engineer
hace 4 semanas
**RH: Lessly Mujica****Location**:any state of Mexico**Industry - Sector**:Financial Services - Banking**Key Responsibilities**:Manage and administer Bigdata Hadoop clusters in public cloud environments (Azure and AWS).Handle and maintain Data Hub and Data Lake clusters, ensuring high availability and zero downtime.Configure and manage critical services, including HDFS, YARN, Spark, Kafka, Impala, Hive, Zookeeper, and Ranger.Secure clusters by configuring Kerberos authentication, Ranger policies, and Active Directory integration.Perform cluster maintenance, node commissioning and decommissioning, snapshot creation, and user quota management on HDFS.Optimize cluster performance by tuning services such as YARN and Hive.Automate administrative tasks and deployments using, Docker, Kubernetes, and GitLab.Resolve user tickets, troubleshoot issues, and document problem resolutions.Must have experience in Operations Support, Release DeploymentsMonitor and manage the Hadoop cluster health using Cloudera Manager.Manage job queues, troubleshoot long-running jobs, and ensure efficient cluster resource allocation.Perform DISK-based cluster-to-cluster data transfers (DISTCP) and other migration tasks.Execute backup, recovery strategies, and performance monitoring for data-intensive environments.**Key Skills and Competencies**:Proficient in managing Bigdata Hadoop (AWS-EMR Hadoop). - MOST IMPORTANT is a mustHands-on experience with AWS Cloud (EC2, VPC, S3) Entire client systems are migrated to AWS Infra. - AWS Cloud (EC2, VPC, S3)Strong knowledge of CI/CD GIT, GITHUB, Jenkins Strong/Acceptable KnowledgeExpertise in implementing Kerberos authentication, user authorization via Ranger/Sentry, and HDFS access control. - Knowledge but not an expertExperience in performance tuning, security configurations, and troubleshooting. - Knowledge but not an expertKnowledge of scripting and automation using PowerShell and Bash. Phyton/Linux for automation- Knowledge but not an expertFamiliarity with ITIL and Agile/Scrum methodologies. - Production Support**Must have Skills**:Bigdata Hadoop ( Client is using AWS-EMR Hadoop Flavor, NOT Cloudera Hadoop )AWS cloud ( Since entire client systems are migrated to AWS Infra )CI/CD, Jenkins, GIT/GITHUB.Nice to have skillsOperations support & Release deployments knowledge.Phyton / Linux for automationTerraform, TableauDevOps & SRE Technical Skills are add an advantageKnowledge of containerization tools such as Docker and Kubernetes.Familiarity with monitoring tools like Splunk, Cloudera Manager, and data security best practices.Strong analytical skills and ability to lead troubleshooting efforts.**Nice to have skills**:Operations support & Release deployments knowledge.Phyton / Linux for automationTerraform, TableauDevOps & SRE Technical Skills are add an advantageKnowledge of containerization tools such as Docker and Kubernetes.Familiarity with monitoring tools like Splunk, Cloudera Manager, and data security best practices.Strong analytical skills and ability to lead troubleshooting efforts.**Soft skills**:Quality at work, Results Oriented**What can YOU expect in a career with Capgemini?**- Working in a team environment, Consultants will focus on the analysis, design and development of technology-based solutions for Capgemini’s clients.- You will work alongside technical, functional and industry specialists to assist with the development, implementation and integration of innovative system solutions including methods, techniques and tools.- You will contribute to client satisfaction by providing timely and responsive value-added services and work products.- Capgemini offers a competitive compensation and benefits package.- Headquartered in Paris, France, Capgemini has a presence of more than 340 thousand professionals in Mexico distributed among 3 sites located in Mexico City, Monterrey and Aguascalientes. A deeply multicultural organization.- Capgemini has developed its own way of working, the Collaborative Business ExperienceTM, and draws on Rightshore, its worldwide delivery model.**You will love this job because**- Capgemini focuses on giving each new hire a YOU-nique experience through our recruitment process and on-boarding program, as well as by helping you to build your own career and professional skills foundation.- Capgemini provides a collaborative environment that embodies and holds the following stated values close to heart: Honesty, Boldness, Trust, Freedom, Team Spirit, Modesty, and Fun.- Capgemini cultivates an atmosphere for development that enables YOU to be hands-on, planning for your growth, both horizontally and vertically.
-
Site Reliability Engineer
hace 1 semana
Aguascalientes, México Innova Solutions A tiempo completoJob ID: Remote, Aguascalientes Job Type: Contract Added - 03/06/24APPLY**Job Description**:Innova Solutions is immediately hiring for a Site Reliability Engineer**Position type**:Remote**Duration: 6 Months****Location**:USA**As a Site Reliability Engineer, you will:- Site Reliability Engineer responsibilities include monitoring computer systems and building...
-
Site Reliability Engineer
hace 1 semana
Aguascalientes, México Innova Solutions A tiempo completoJob ID: 943898 Remote, Aguascalientes Job Type: Contract Added - 03/06/24 APPLY **Job Description**: Innova Solutions is immediately hiring for a Site Reliability Engineer **Position type**:Remote **Duration: 6 Months** **Location**:USA** As a Site Reliability Engineer, you will: - Site Reliability Engineer responsibilities include monitoring computer...
-
Senior Site Reliability Engineer
hace 3 semanas
Aguascalientes, México Canonical A tiempo completoSenior Site Reliability Engineer at Canonical Location: Globally remote role We are hiring a Senior Site Reliability Engineer. Next‑gen operations at scale, with pure Python infra‑as‑code, from bare metal to containers and applications. Our goal is to perfect enterprise infrastructure devops. We run hundreds of private cloud, Kubernetes, and...
-
Senior Site Reliability Engineer
hace 3 semanas
Aguascalientes, México Canonical A tiempo completoSenior Site Reliability Engineer at Canonical Location: Globally remote role We are hiring a Senior Site Reliability Engineer. Next‑gen operations at scale, with pure Python infra‑as‑code, from bare metal to containers and applications. Our goal is to perfect enterprise infrastructure devops. We run hundreds of private cloud, Kubernetes, and...
-
Senior Site Reliability
hace 22 horas
Aguascalientes, México Canonical A tiempo completoSenior Site Reliability / Gitops EngineerCanonical is hiring a Senior Site Reliability / Gitops Engineer to join our Information Systems (IS) team. This role focuses on automation-first IT operations, infrastructure as code, and scalable service delivery across private and public clouds.OverviewWe are a leading provider of open source software and operating...
-
Senior Site Reliability
hace 19 horas
Aguascalientes, México Canonical A tiempo completoSenior Site Reliability / Gitops EngineerCanonical is hiring a Senior Site Reliability / Gitops Engineer to join our Information Systems (IS) team. This role focuses on automation-first IT operations, infrastructure as code, and scalable service delivery across private and public clouds.OverviewWe are a leading provider of open source software and operating...
-
Site Reliability
hace 1 semana
Aguascalientes, México Canonical A tiempo completoSite Reliability / Gitops Engineer Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public...
-
Site Reliability
hace 6 días
Aguascalientes, México Canonical A tiempo completoSite Reliability / Gitops Engineer Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public...
-
Senior SRE
hace 5 días
Aguascalientes, México Canonical A tiempo completoA leading open-source technology company is seeking a Senior Site Reliability / Gitops Engineer in Aguascalientes, Mexico. This role emphasizes automation-first IT operations, infrastructure as code, and support for services used by millions. The ideal candidate will have strong skills in cloud computing, Python development, and Kubernetes, and a background...
-
Remote SRE
hace 1 semana
Aguascalientes, México Canonical A tiempo completoA leading open-source software company seeks a Site Reliability / Gitops Engineer in a remote role. You will focus on operations automation across private and public clouds using Infrastructure as Code. Responsibilities include maintaining services for over 60 million users, improving operational processes, and collaborating with globally distributed teams....