
Hadoop Administrator
5 days ago
Dindigul, India
Smartavya Analytica Private Limited
Full time
We are seeking an experienced Hadoop Administrator to manage and support our Hadoop ecosystem with 5+ Years of Experience. The ideal candidate will have strong expertise in Hadoop cluster administration, excellent troubleshooting skills, and a proven track record of maintaining and optimizing Hadoop environments.
Location: Mumbai
Key Responsibilities:
- Install, configure, and manage Hadoop clusters, including HDFS, YARN, Hive, HBase, and other ecosystem components.
- Monitor and manage Hadoop cluster performance, capacity, and security.
- Perform routine maintenance tasks such as upgrades, patching, and backups.
- Implement and maintain data ingestion processes using tools like Sqoop, Flume, and Kafka.
- Ensure high availability and disaster recovery of Hadoop clusters.
- Collaborate with development teams to understand requirements and provide appropriate Hadoop solutions.
- Troubleshoot and resolve issues related to the Hadoop ecosystem.
Maintain documentation of Hadoop environment configurations, processes, and procedures.
Requirement:
- Experience in Installing, configuring and tuning Hadoop distributions.
- Hands on experience in Cloudera.
- Understanding of Hadoop design principals and factors that affect distributed system performance, including hardware and network considerations
- Provide Infrastructure Recommendations, Capacity Planning, work load management.
- Develop utilities to monitor cluster better Ganglia, Nagios etc.
- Manage large clusters with huge volumes of data
- Perform Cluster maintenance tasks
- Create and removal of nodes, cluster monitoring and troubleshooting
- Manage and review Hadoop log files
- Install and implement security for Hadoop clusters
- Install Hadoop Updates, patches and version upgrades. Automate the same through scripts
- Point of Contact for Vendor escalation. Work with Hortonworks in resolving issues
- Should have Conceptual/working knowledge of basic data management concepts like ETL, Ref/Master data, Data quality, RDBMS
- Working knowledge of any scripting language like Shell, Python, Perl
- Should have experience in Orchestration & Deployment tools
- Cloudera and CDP Certifications mandatory.