Site Reliability Engineer- Big Data

1 week ago


Bengaluru, Karnataka, India PhonePe Full time

Job Overview:

As a Site Reliability Engineer (SRE) specializing in Data Platform OnPremise, you will play a critical role in deployment, ensuring the reliability, scalability, and performance of our Cloudera Data Platform (CDP) infrastructure. You will collaborate closely with cross-functional teams to design, implement, and maintain robust systems that support our data-driven initiatives. The ideal candidate will have a deep understanding of Cloudera Data Platform, strong troubleshooting skills, and a proactive mindset towards automation and optimization. You will play a pivotal role in ensuring the smooth functioning, operation, performance and security of large high density Cloudera-based infrastructure.

Key Responsibilities:

  1. Implementation of Cloudera Data Platform: Lead the implementation process of Cloudera Data Platform on-premises, including planning, installation, configuration, and integration with existing systems.
  2. Infrastructure Management: Manage and maintain the Cloudera-based infrastructure, ensuring optimal performance, high availability, and scalability. This includes monitoring system health, troubleshooting issues, and performing routine maintenance tasks.
  3. Data Security and Compliance: Implement and enforce security best practices to safeguard data integrity and confidentiality within the Cloudera environment. Ensure compliance with relevant regulations and standards (e.g., GDPR, HIPAA, DPR).
  4. Performance Optimization: Continuously optimize the Cloudera infrastructure to enhance performance, efficiency, and cost-effectiveness. Identify and resolve bottlenecks, tune configurations, and implement best practices for resource utilization.
  5. Capacity Planning: Monitor resource utilization trends and plan for future capacity needs. Proactively identify potential capacity constraints and propose solutions to address them.
  6. Backup and Disaster Recovery: Implement robust backup and disaster recovery strategies to ensure data protection and business continuity. Test and maintain backup and recovery procedures regularly.
  7. Patches & Upgrades: Routinely apply recommended patches and perform rolling upgrades of the platform in accordance with the advisory from Cloudera, InfoSec and Compliance.
  8. Documentation and Knowledge Sharing: Create comprehensive documentation for configurations, processes, and procedures related to the Cloudera Data Platform. Share knowledge and best practices with team members to foster continuous learning and improvement.
  9. Collaboration and Communication: Collaborate effectively with cross-functional teams including data engineers, developers, and IT operations personnel. Communicate project status, issues, and resolutions clearly and promptly.

Qualifications:

  1. Bachelor's degree in Computer Science, Engineering, or related field.
  2. Proficiency in Linux system administration, shell scripting, and networking concepts.
  3. 5+ years of experience in managing Big Data infrastructure.
  4. Strong understanding of distributed computing principles and experience with Hadoop ecosystem technologies (HDFS, MapReduce, YARN, Hive, Spark, etc.).
  5. Hands-on experience with configuration management tools (e.g., Salt,Ansible, Puppet, Chef).
  6. Strong scripting skills (e.g., Python, Bash) for automation and troubleshooting.
  7. Experience with monitoring and logging solutions (e.g., Prometheus, Grafana, ELK stack).
  8. Knowledge of networking principles and protocols (TCP/IP, UDP, DNS, DHCP, etc.).
  9. Experience with managing *nix based machines and strong working knowledge of quintessential Unix programs and tools (e.g. Ubuntu, Fedora, Redhat, etc.)
  10. Excellent communication skills and the ability to collaborate effectively with cross-functional teams.
  11. Excellent analytical, problem-solving, and troubleshooting skills..
  12. Proven ability to work well under pressure and manage multiple priorities simultaneously.

Good To Have:

  1. Cloudera Certified Administrator (CCA) or Cloudera Certified Professional (CCP) certification preferred.
  2. Minimum 5 years of experience in managing and administering medium/large hadoop based environments (>100 machines), including Cloudera Data Platform (CDP) experience is highly desirable.
  3. Familiarity with Open Data Lake components such as Ozone, Iceberg, Spark, Flink, etc.
  4. Familiarity with containerization and orchestration technologies (e.g. Docker, Kubernetes, OpenShift) is a plus

  • Big Data Engineer

    1 week ago


    Bengaluru, Karnataka, India Virtusa Full time

    Big Data Engineer - CREQ188642 Description Looking for a Senior Software Engineer with strong analytical skills to help build and enhance our Big data application. This individual's day-to-day work will involve using Java, Scala, SQL, Big Data, Linux, git, and maven/gradle to design and build software solutions that are fast, scalable, and...


  • Bengaluru, Karnataka, India Cyitechsearch Full time

    We are hiring for Site Reliability Engineer Skills : Develop and provide operational support for fullstack software applications. Relevant industry certifications, such as through the Site Reliability Engineering (SRE) Foundation. Five years' experience as a site reliability engineer or similar role. Collaborate with development operations staff to create,...


  • Bengaluru, Karnataka, India The Nielsen Company Full time

    At Nielsen, we believe that career growth is a partnership. You ultimately own, fuel and set the journey. By joining our team of nearly 14,000 associates, you will become part of a community that will help you to succeed. We champion you because when you succeed, we do too. Embark on a new initiative, explore a fresh approach, and take license to think big,...


  • Bengaluru, Karnataka, India Integra Connect Full time

    About IntegraConnectIntegra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud platform, the company's core applications span population health including care...


  • Bengaluru, Karnataka, India ViewSonic Full time

    Job Requirements:Bachelor's degree in Computer Science, Engineering, or a related field.1+ year of experience in a relevant role, such as Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory.Basic understanding of AWS solutions including EC2, S3, CloudWatch, Lambda, and RDS.Interest and understanding of Platform Engineering...


  • Bengaluru, Karnataka, India Apple Full time

    Summary:The people here at Apple don't just build products— they craft the kind of wonder that has revolutionized entire industries. It's the diversity of those people and their ideas that inspires the innovation that runs through everything we do, from amazing technology to industry-leading environmental efforts. Imagine what you could do here. Join...


  • Bengaluru, Karnataka, India Cricbuzz Full time

    Site Reliability EngineerWe are looking for a highly skilled and motivated Web Server Site Reliability Engineer to join our team. As a Web Server Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our web server infrastructure and CDN services.Experience yearsResponsibilities: Design, implement,...


  • Bengaluru, Karnataka, India First American (India) Full time

    The Role: A SRE Manager is ultimately responsible for system reliability, developer productivity and reducing time to market by striving to reduce technical debt of the services your SRE team supports. We seek managers who are passionate about site reliability to influence and drive the strategic SRE mission. As a Site Reliability Engineering Manager...


  • Bengaluru, Karnataka, India First American (India) Full time

    The Role:A SRE Manager is ultimately responsible for system reliability, developer productivity and reducing time to market by striving to reduce technical debt of the services your SRE team supports. We seek managers who are passionate about site reliability to influence and drive the strategic SRE mission.As a Site Reliability Engineering Manager working...


  • Bengaluru, Karnataka, India First American (India) Full time

    The Role:A SRE Manager is ultimately responsible for system reliability, developer productivity and reducing time to market by striving to reduce technical debt of the services your SRE team supports. We seek managers who are passionate about site reliability to influence and drive the strategic SRE mission.As a Site Reliability Engineering Manager working...


  • Bengaluru, Karnataka, India Australia and New Zealand Banking Group Limited (ANZ) Full time

    Site Reliability Engineer Site Reliability Engineer Req ID: Department: AR Strategic Delivery Australia Data Division: Australia Retail Location: Bengaluru About the role At ANZ our purpose is to shape a world where people and communities thrive and to achieve this, we need a talented Site Reliability Engineer to join our Australia Data tribe. The...


  • Bengaluru, Karnataka, India Signify Netherlands B.V. Full time

    Site Reliability EngineerSignify, the new company name of Philips Lighting, is the global leader in lighting building on 125+ years of innovations. Our purpose is to unlock the extraordinary potential of light for brighter lives and a better world.We are proud to be ahead of the game in the Internet of Things and on track to be carbon neutral by 2020. We...


  • Bengaluru, Karnataka, India Signify Netherlands B.V. Full time

    Site Reliability EngineerSignify, the new company name of Philips Lighting, is the global leader in lighting building on 125+ years of innovations. Our purpose is to unlock the extraordinary potential of light for brighter lives and a better world.We are proud to be ahead of the game in the Internet of Things and on track to be carbon neutral by 2020. We...


  • Bengaluru, Karnataka, India Kunato Full time

    Site Reliability Engineer (SRE) - Python/GolangJob Description:We are seeking a highly skilled and passionate Site Reliability Engineer (SRE) to join our technology team. The ideal candidate will possess strong programming skills with expertise in Python, Golang, or both. This role is pivotal in ensuring the high availability, performance, and security of...


  • Bengaluru, Karnataka, India Microsoft Full time

    OverviewMicrosoft is a company where passionate innovators come to collaborate, envision what can be and take their careers further. This is a world of more possibilities, more innovation, more openness, and the sky is the limit thinking in a cloud-enabled world.Microsoft's Azure Data engineering team is leading the transformation of analytics in the world...


  • Bengaluru, Karnataka, India Microsoft Full time

    OverviewMicrosoft is a company where passionate innovators come to collaborate, envision what can be and take their careers further. This is a world of more possibilities, more innovation, more openness, and the sky is the limit thinking in a cloud-enabled world.Microsoft's Azure Data engineering team is leading the transformation of analytics in the world...


  • Bengaluru, Karnataka, India Luxoft Full time

    What you will need: 6+ years in Site Reliability role (DevOps/System Administration) maintaining Linux systems in cloud environments (We use AWS) Excellent understanding of Linux systems and Network Fundamentals Deep understanding of Monitoring and Alerting systems such as Prometheus, Graphite, or equivalent Experience with infrastructure automation tools...


  • Bengaluru, Karnataka, India Ensono Full time

    About RoleEnsono is continuing its growth and building a cloud-native managed service offering for our clients. We are looking for energetic and skilled remote Site Reliability Engineers to join us on this exciting new journey. As a Site Reliability Engineer, you and your team will be responsible for between four and ten of Ensono cloud-native managed...


  • Bengaluru, Karnataka, India Ensono Full time

    About Role Ensono is continuing its growth and building a cloud-native managed service offering for our clients. We are looking for energetic and skilled remote Site Reliability Engineers to join us on this exciting new journey. As a Site Reliability Engineer, you and your team will be responsible for between four and ten of Ensono cloud-native managed...


  • Bengaluru, Karnataka, India ANZ Full time

    About the role At ANZ our purpose is to shape a world where people and communities thrive and to achieve this, we need a talented Site Reliability Engineer to join our Australia Data tribe. The Australia Data tribe sits within our Australia Retail & Commercial division and our mission is to combine business and technology capabilities to enable...