Site Reliability Engineer - Data Platform
7 days ago
Job Overview:
As a Site Reliability Engineer (SRE) specializing in Data Platform, you will play a
critical role in deploying our data infrastructure and ensuring its reliability,
scalability, and performance. You will collaborate closely with cross-functional teams to
design, implement, and maintain robust systems that support our data-driven
initiatives. The ideal candidate can write robust code that provisions
infrastructure components in Cloud / OnPrem environments. You will also be
responsible for the smooth operation, performance, and security of large,
high-density data infrastructure.
Key Responsibilities:
1. Configuration Management: Hands-on experience with Terraform / Ansible
to automate Cloud / OnPrem provisioning.
2. Infrastructure Management: Manage and maintain the Cloudera-based
infrastructure, ensuring optimal performance, high availability, and
scalability. This includes monitoring system health, troubleshooting
issues, and performing routine maintenance tasks.
3. Data Security and Compliance: Implement and enforce security best
practices to safeguard data integrity and confidentiality within the Data
environment. Ensure compliance with relevant regulations and standards
(e.g., GDPR, HIPAA, DPR).
4. Performance Optimization: Continuously optimize the Data infrastructure
to enhance performance, efficiency, and cost-effectiveness. Identify and
resolve bottlenecks, tune configurations, and implement best practices
for resource utilization.
5. Capacity Planning: Monitor resource utilization trends and plan for future
capacity needs. Proactively identify potential capacity constraints and
propose solutions to address them.
6. Backup and Disaster Recovery: Implement robust backup and disaster
recovery strategies to ensure data protection and business continuity.
Test and maintain backup and recovery procedures regularly.
7. Patches & Upgrades: Routinely apply recommended patches and perform
rolling upgrades of the platform in accordance with the advisory from
Cloudera, InfoSec and Compliance.
8. Documentation and Knowledge Sharing: Create comprehensive
documentation for configurations, processes, and procedures related to
the Data Platform. Share knowledge and best practices with team
members to foster continuous learning and improvement.
9. Collaboration and Communication: Collaborate effectively with
cross-functional teams including data engineers, developers, and IT
operations personnel. Communicate project status, issues, and
resolutions clearly and promptly.
Qualifications:
1. Bachelor's degree in Computer Science, Engineering, or related field.
2. Proficiency in Linux system administration, shell scripting, and
networking concepts.
3. 3 to 8 years of experience in Infrastructure Automation.
4. Hands-on experience with configuration management tools (e.g.,
Terraform, Salt, Ansible, Puppet, Chef).
5. Strong scripting skills (e.g., Python, Bash) for automation and
troubleshooting.
6. Experience with monitoring and logging solutions (e.g., Prometheus,
Grafana, ELK stack).
7. Knowledge of networking principles and protocols (TCP/IP, UDP, DNS,
DHCP, etc.).
8. Experience managing *nix-based machines (e.g., Ubuntu, Fedora,
Red Hat) and strong working knowledge of essential Unix programs
and tools.
9. Excellent communication skills and the ability to collaborate effectively
with cross-functional teams.
10. Excellent analytical, problem-solving, and troubleshooting skills.
11. Proven ability to work well under pressure and manage multiple priorities
simultaneously.
Good To Have:
1. Exposure to cloud platforms such as Azure or AWS.
2. Understanding of distributed computing principles and experience with
Hadoop ecosystem technologies (HDFS, MapReduce, YARN, Hive,
Spark, etc.).
3. Familiarity with Open Data Lake components such as Ozone, Iceberg,
Spark, Flink, etc.
4. Familiarity with containerization and orchestration technologies (e.g.,
Docker, Kubernetes, OpenShift) is a plus.