Site Reliability Engineer III

1 week ago


Hyderabad, Telangana, India Sonata Software Full time ₹ 20,00,000 - ₹ 25,00,000 per year

Role & responsibilities

Category

Details

Role

Site Reliability Engineer (SRE) III Data Engineering

Location

Hyderabad- Hybrid

Employment Type

Full Time

Experience

7–12 years in site reliability, cloud-based data infrastructure, data pipeline observability, automation, and high-availability engineering within EdTech platforms (2U)

Primary Skills (Must-Have)

AWS, CI/CD, Jenkins, IAAC, Terraform, Kubernetes

Secondary Skills (Good-to-Have)

AWS systems; Dataiku data, Platform updates and patching

Tools & Platforms

Data Warehousing & Processing: Snowflake, Redshift, Apache Airflow, dbt

CI/CD & Deployment: Jenkins, GitHub Actions, AWS CodePipeline, Terraform

Cloud & Event Processing: AWS Lambda, API Gateway, SNS/SQS, Kafka, Step Functions

Monitoring & Logging: DataDog, AWS CloudWatch, Prometheus, Splunk

Incident Management: PagerDuty, Opsgenie, AWS Health Dashboard

Collaboration & Code Review: GitHub, Jira, Confluence

Key Responsibilities

Data Pipeline Reliability & Observability:

  • Maintain and optimize highly available, fault-tolerant infrastructure for data pipelines, ETL jobs, and real-time data processing

  • Implement end-to-end monitoring of Airflow DAGs, Snowflake queries, and AWS-based data workflows

  • Automate data pipeline health checks, error handling, and auto-remediation strategies

Infrastructure & Cloud Automation:

  • Deploy and manage AWS-based data infrastructure using Terraform and CloudFormation

  • Optimize Kubernetes (EKS) clusters for processing large-scale datasets and real-time analytics

  • Ensure high availability and cost-efficient scaling for Redshift, Snowflake, and data storage solutions

Performance, Monitoring & Incident Response:

  • Implement real-time monitoring, logging, and alerting using DataDog, AWS CloudWatch, and Prometheus

  • Define and track SLOs, SLIs, and error budgets to improve data reliability and uptime

  • Conduct Root Cause Analysis (RCA), security audits, and post-mortems for incidents

Security & Compliance:

  • Ensure GDPR, CCPA, and SOC 2 compliance for data storage, access controls, and retention policies

  • Implement AWS security best practices (IAM, KMS, Shield, WAF) to secure data access and encryption

  • Secure API gateways, authentication mechanisms, and data lake permissions to prevent unauthorized access

Collaboration & Leadership:

  • Work closely with data engineers, analytics teams, and DevOps engineers to enhance data platform reliability

  • Participate in incident response drills, disaster recovery planning, and security compliance reviews

  • Advocate for best practices in automation, cost optimization, and cloud-native data solutions



  • Hyderabad, Telangana, India JPMorganChase Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    JOB DESCRIPTIONThere's nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems.As a Site Reliability Engineer III at JPMorgan Chase within the Consumer & Community Banking, you will solve complex and broad...


  • Hyderabad, Telangana, India Chase- Candidate Experience page Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    There's nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems.As a Site Reliability Engineer III at JPMorgan Chase within the Consumer and Community Banking, you will solve complex and broad business problems...


  • Hyderabad, Telangana, India, Telangana Sonata Software Full time

    CategoryDetailsRoleSite Reliability Engineer (SRE) III – Data EngineeringLocationHyderabad- Employment TypeFull TimeExperience7–12 years in site reliability, cloud-based data infrastructure, data pipeline observability, automation, and high-availability engineering within EdTech platforms (2U)Primary Skills (Must-Have)AWS, CI/CD, Jenkins, IAAC,...


  • Hyderabad, Telangana, India Apple Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Imagine what you could do here. Apple is a place where extraordinary people gather to do their best work. Together we craft products and experiences people once couldn't have imagined — and now can't imagine living without. If you're motivated by the idea of making a real impact, and joining a team where we pride ourselves in being one of the most diverse...


  • Hyderabad, Telangana, India Talent Worx Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    SRE (Site Reliability Engineer)Talent Worx is seeking a talented SRE (Site Reliability Engineer) to enhance our technology team. In this role, you will be pivotal in ensuring the reliability, performance, and availability of our applications and services. Your work will involve both software engineering and systems operations as you strive to improve...


  • Hyderabad, Telangana, India SID Global Solutions Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    Job Role: Site Reliability Engineer (SRE) – GCPExperience: 3+ yearsLocation: HyderabadAbout SIDGS:SIDGS is a premium global systems integrator and global implementation partner of Google corporation, providing Digital Solutions & Services to Fortune 500 companies. Our Digital solutions go across following domains: User Experience, CMS, API Management,...


  • Hyderabad, Telangana, India TurboHire Full time ₹ 15,00,000 - ₹ 28,00,000 per year

    Site Reliability Engineer (SRE)Location: Hyderabad (Hybrid)Experience: 3–5 yearsAbout the RoleWe are looking for an SRE Engineer to own reliability, deployment, and monitoringof TurboHire's cloud infrastructure. You will ensure our platform is scalable, secure,and highly available. The role balances hands-on coding, automation, and infraoperations, freeing...


  • Hyderabad, Telangana, India LivePerson Full time ₹ 8,00,000 - ₹ 15,00,000 per year

    LivePerson (NASDAQ: LPSN) is a leading customer engagement company, creating digital experiences powered by Curiously Human AI. Every person is unique, and our technology makes it possible for companies, including leading brands like HSBC, Orange, and GM Financial, to treat their audiences that way at scale. Nearly a billion conversational interactions are...


  • Hyderabad, Telangana, India EPAM Systems Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    We are seeking a skilledLead Site Reliability Engineerto drive the stability, scalability, and reliability of our systems while improving efficiency through automation and best practices.This role calls for deep expertise in DevOps methodologies, Infrastructure as Code (IaC), and collaboration across teams to ensure optimal system...


  • Hyderabad, Telangana, India Amgen Inc Full time ₹ 8,00,000 - ₹ 12,00,000 per year

    We are looking for a Site Reliability Engineer/Cloud Engineer (SRE) to work on the performance optimization, standardization, and automation of Amgens critical infrastructure and systems. This role is crucial to ensuring the reliability, scalability, and cost-effectiveness of our production systems. The ideal candidate will work on operational excellence...