Site Reliability Engineer- Data Platform

7 days ago


Bangalore Urban, India PhonePe Full time

Site Reliability Engineer - Data Platform

Job Overview:

As a Site Reliability Engineer (SRE) specializing in Data Platform, you will play a

critical role in deployment, ensuring the reliability, scalability, and performance of our

data infrastructure. You will collaborate closely with cross-functional teams to

design, implement, and maintain robust systems that support our data-driven

initiatives. The ideal candidate will be able to write robust code that helps in

provisioning infrastructure components in Cloud / OnPrem. You will play a pivotal

role in ensuring the smooth functioning, operation, performance and security of large

high density Data infrastructure.

Key Responsibilities:

1. Configuration Management: Hands On Experience in Terraform / Ansible

to automate Cloud / OnPrem provisioning.

2. Infrastructure Management: Manage and maintain the Cloudera-based

infrastructure, ensuring optimal performance, high availability, and

scalability. This includes monitoring system health, troubleshooting

issues, and performing routine maintenance tasks.

3. Data Security and Compliance: Implement and enforce security best

practices to safeguard data integrity and confidentiality within the Data

environment. Ensure compliance with relevant regulations and standards

(e.g., GDPR, HIPAA, DPR).

4. Performance Optimization: Continuously optimize the Data infrastructure

to enhance performance, efficiency, and cost-effectiveness. Identify and

resolve bottlenecks, tune configurations, and implement best practices

for resource utilization.

5. Capacity Planning: Monitor resource utilization trends and plan for future

capacity needs. Proactively identify potential capacity constraints and

propose solutions to address them.

6. Backup and Disaster Recovery: Implement robust backup and disaster

recovery strategies to ensure data protection and business continuity.

Test and maintain backup and recovery procedures regularly.

7. Patches & Upgrades: Routinely apply recommended patches and perform

rolling upgrades of the platform in accordance with the advisory from

Cloudera, InfoSec and Compliance.

8. Documentation and Knowledge Sharing: Create comprehensive

documentation for configurations, processes, and procedures related to

the Data Platform. Share knowledge and best practices with team

members to foster continuous learning and improvement.

9. Collaboration and Communication: Collaborate effectively with

cross-functional teams including data engineers, developers, and IT

operations personnel. Communicate project status, issues, and

resolutions clearly and promptly.

Qualifications:

1. Bachelor's degree in Computer Science, Engineering, or related field.

2. Proficiency in Linux system administration, shell scripting, and

networking concepts.

3. 3 to 8 years of experience in Infrastructure Automation.

4. Hands-on experience with configuration management tools (e.g.,

Terraform, Salt, Ansible, Puppet, Chef).

5. Strong scripting skills (e.g., Python, Bash) for automation and

troubleshooting.

6. Experience with monitoring and logging solutions (e.g., Prometheus,

Grafana, ELK stack).

7. Knowledge of networking principles and protocols (TCP/IP, UDP, DNS,

DHCP, etc.).

8. Experience with managing *nix based machines and strong working

knowledge of quintessential Unix programs and tools (e.g. Ubuntu,

Fedora, Redhat, etc.)

9. Excellent communication skills and the ability to collaborate effectively

with cross-functional teams.

10.Excellent analytical, problem-solving, and troubleshooting skills..

11.Proven ability to work well under pressure and manage multiple priorities

simultaneously.

Good To Have:

1. Exposure in cloud platforms like Azure or AWS.

2. Understanding of distributed computing principles and experience with

Hadoop ecosystem technologies (HDFS, MapReduce, YARN, Hive,

Spark, etc.).

3. Familiarity with Open Data Lake components such as Ozone, Iceberg,

Spark, Flink, etc.

4. Familiarity with containerization and orchestration technologies (e.g.

Docker, Kubernetes, OpenShift) is a plu



  • Bangalore Urban, India Integra Connect Full time

    About IntegraConnect Integra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud platform, the company’s core applications span population health including care...


  • Bangalore Urban, India Integra Connect Full time

    About IntegraConnect Integra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud platform, the company’s core applications span population health including care...


  • Bangalore Urban, India Integra Connect Full time

    About IntegraConnect Integra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud platform, the company’s core applications span population health including care...


  • Bangalore Urban, India Integra Connect Full time

    About IntegraConnect Integra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud platform, the company’s core applications span population health including care...


  • bangalore, India PhonePe Full time

    Site Reliability Engineer - Data PlatformJob Overview:As a Site Reliability Engineer (SRE) specializing in Data Platform, you will play acritical role in deployment, ensuring the reliability, scalability, and performance of ourdata infrastructure. You will collaborate closely with cross-functional teams todesign, implement, and maintain robust systems that...


  • bangalore, India PhonePe Full time

    Site Reliability Engineer - Data Platform Job Overview: As a Site Reliability Engineer (SRE) specializing in Data Platform, you will play a critical role in deployment, ensuring the reliability, scalability, and performance of our data infrastructure. You will collaborate closely with cross-functional teams to design, implement, and maintain robust systems...


  • Bangalore Urban, India PhonePe Full time

    Job Overview: As a Site Reliability Engineer (SRE) specializing in Data Platform OnPremise, you will play a critical role in deployment, ensuring the reliability, scalability, and performance of our Cloudera Data Platform (CDP) infrastructure. You will collaborate closely with cross-functional teams to design, implement, and maintain robust systems that...


  • Bangalore Urban, India PhonePe Full time

    Job Overview: As a Site Reliability Engineer (SRE) specializing in Data Platform OnPremise, you will play a critical role in deployment, ensuring the reliability, scalability, and performance of our Cloudera Data Platform (CDP) infrastructure. You will collaborate closely with cross-functional teams to design, implement, and maintain robust systems that...


  • Bangalore Urban, India PhonePe Full time

    Job Overview:As a Site Reliability Engineer (SRE) specializing in Data Platform OnPremise, you will play a critical role in deployment, ensuring the reliability, scalability, and performance of our Cloudera Data Platform (CDP) infrastructure. You will collaborate closely with cross-functional teams to design, implement, and maintain robust systems that...


  • Bangalore Urban, India PhonePe Full time

    Job Overview: As a Site Reliability Engineer (SRE) specializing in Data Platform OnPremise, you will play a critical role in deployment, ensuring the reliability, scalability, and performance of our Cloudera Data Platform (CDP) infrastructure. You will collaborate closely with cross-functional teams to design, implement, and maintain robust systems that...


  • bangalore, India Integra Connect Full time

    About IntegraConnectIntegra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud platform, the company’s core applications span population health including care...


  • bangalore, India Integra Connect Full time

    About IntegraConnectIntegra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud platform, the company’s core applications span population health including care...


  • bangalore, India Cricbuzz.com Full time

    Site Reliability EngineerWe are looking for a highly skilled and motivated Web Server Site Reliability Engineer to join our team. As a Web Server Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our web server infrastructure and CDN services.Experience - 3 - 5 yearsResponsibilities:● Design,...


  • bangalore, India 5100 Kyndryl Solutions Private Limited Full time

    Who We Are At Kyndryl, we design, build, manage and modernize the mission-critical technology systems that the world depends on every day. So why work at Kyndryl? We are always moving forward – always pushing ourselves to go further in our efforts to build a more equitable, inclusive world for our employees, our customers and our communities. The...


  • bangalore, India 5100 Kyndryl Solutions Private Limited Full time

    Who We Are At Kyndryl, we design, build, manage and modernize the mission-critical technology systems that the world depends on every day. So why work at Kyndryl? We are always moving forward – always pushing ourselves to go further in our efforts to build a more equitable, inclusive world for our employees, our customers and our communities. The...


  • bangalore, India ANZ Full time

    About the role At ANZ our purpose is to shape a world where people and communities thrive and to achieve this, we need a talented Site Reliability Engineer to join our Australia Data tribe. The Australia Data tribe sits within our Australia Retail & Commercial division and our mission is to combine business and technology capabilities...


  • bangalore, India Qure.ai Full time

    About the jobJob Title: Site Reliability EngineerDepartment: EngineeringLocation: BangaloreYears of experience: 2-5 yearsType: Full Time EmploymentAbout Qure.ai:Qure.ai is one of the fastest-growing startups in India, which develops Artificial Intelligence enabled products and platforms for healthcare diagnostics. We create cutting-edge solutions that...


  • bangalore, India Encora Inc. Full time

    Position: Site Reliability Engineer Location: Bangalore Experience: 4+ Years  Job Mode: Full-time Work Mode: Remote Responsibilities and Duties Collaborate with cross-functional teams to design, implement, and maintain reliable and scalable infrastructure solutions on the Azure cloud platform. Implement and maintain monitoring and...


  • bangalore, India Encora Inc. Full time

    Position: Site Reliability Engineer Location: Bangalore Experience: 4+ Years  Job Mode: Full-time Work Mode: Remote Responsibilities and Duties Collaborate with cross-functional teams to design, implement, and maintain reliable and scalable infrastructure solutions on the Azure cloud platform. Implement and maintain monitoring and...


  • Bangalore Urban, India Smarsh Full time

    Smarsh is the leader in communications compliance, archiving, and analytics. We provide compliance across the broadest set of communications channels with insights on what’s being captured. Smarsh customers manage over 500 million daily conversations across 80 channels and growing. Customers include the top 10 U.S., top 8 European, top 5 Canadian, and top...