CLOUDERA HADOOP ADMINISTRATOR - Hybrid, Reston VA at Reston, Virginia, USA |
Email: [email protected] |
http://bit.ly/4ey8w48
https://jobs.nvoids.com/job_details.jsp?id=2059986&uid=

From: Anand Mohan, Sibitalent Corp [email protected]
Reply to: [email protected]

Hi Associates,

Title: CLOUDERA Hadoop ADMINISTRATOR
Location: Hybrid, Reston VA
Duration: 12 Month+ Contract to Hire
Visa: USC, GC

Cloudera CDP PUBLIC CLOUD Big Data Administrator (NOT a Developer) to support a Reston, VA-based health insurance customer. This is a hybrid, contract-to-hire position.

RESPONSIBILITIES
- Build Cloudera clusters and set up NiFi, SOLR, HBase, Kafka, and Knox in the cloud using CDP Public Cloud v7.2.17 or higher.
- Set up High Availability of services like Hue, Hive, HBase REST, SOLR, and Impala on top of the new clusters built on the BD PaaS Platform.
- Write scripts to monitor the health of services and respond accordingly to any warning or failure conditions.
- Monitor the health of all services running in the production cluster using Cloudera Manager.
- Access databases and metastore tables, and write Hive and Impala queries using Hue.
- Monitor the health of the services on top of all clusters.
- Work closely with the Application Development team, Security team, and Platform Support to identify and implement the configuration changes needed on the cluster for better performance of the services.
REQUIRED SKILLS
- Cloudera CDP Public Cloud v7.2.17 or higher
- Apache Kafka - strong administration & troubleshooting skills
- Kafka Streams - API stream processing with KStreams & KTables
- Kafka integration with IBM MQ
- Kafka broker management
- Topic/offset management
- Apache NiFi administration
- Flow management, registry server management, controller service management
- NiFi to Kafka/HBase/SOLR integration
- HBase administration, database management and troubleshooting
- SOLR administration, manage logging level, manage shards & high availability
- Collection management
- Rectify resource-intensive & long-running SOLR queries

ADDITIONAL MUST HAVE SKILLS:
- Proficient with handling AWS EC2, S3, EBS, EFS.
- Ensure Cloudera installation and configuration meet optimal specifications (CDP, CDSW, Hive, Spark, NiFi).
- Design and implement big data pipelines and automated data flows using Python/R and NiFi.
- Assist and provide expertise as it pertains to automating the entire project lifecycle.
- Perform incremental updates and upgrades to the Cloudera environment with newer versions.
- Assist with new use cases (e.g., analytics/ML, data science, data ingest and processing) and infrastructure (including new cluster deployments, cluster migration, expansion, major upgrades, COOP/DR, and security).
- Assist in testing, governance, data quality, training, and documentation efforts.
- Move data and use YARN to allocate resources and schedule jobs.
- Manage job workflows with Hue.
- Implement comprehensive security policies across the Hadoop cluster using Ranger.
- Troubleshoot potential issues with Kerberos, TLS/SSL, Models, and Experiments, as well as other workload issues that data scientists might encounter once the application is running.
- Support Big Data / Hadoop databases throughout the development and production lifecycle.
- Troubleshoot and resolve database integrity, performance, blocking and deadlocking, replication, log shipping, connectivity, and security issues, and carry out performance tuning and query optimization, using monitoring and troubleshooting tools.
- Create, test, and implement scripting for automation support.
- Experience with the Kafka ecosystem (Kafka Brokers, Connect, ZooKeeper) in production is ideal.
- Implement and support streaming technologies such as Kafka, Spark & Kudu.

Thanks,
Anand

Keywords: machine learning, message queue, S3, R, information technology, green card, Virginia
03:58 AM 08-Jan-25 |