Site Reliability Engineer

5 days ago


Bengaluru Hyderabad, India Sonata Software Full time ₹ 12,00,000 - ₹ 36,00,000 per year

We are seeking a Site Reliability Engineer with strong expertise in AWS, CI/CD, IaC, and Kubernetes to ensure the reliability, scalability, and security of large-scale data infrastructure. The ideal candidate will blend DevOps best practices with data engineering operations, focusing on automation, observability, and cloud-native solutions.

Primary Skills (Must-Have)

  • AWS (core services: EC2, EKS, Lambda, Redshift, S3, IAM, VPC)
  • CI/CD (Jenkins, GitHub Actions, AWS CodePipeline)
  • Infrastructure as Code (Terraform, CloudFormation)
  • Kubernetes (EKS) and container orchestration

Secondary Skills (Good-to-Have)

  • AWS Systems Manager, Dataiku platform operations
  • Experience with platform patching, upgrades, and maintenance

Tools & Platforms

  • Data Warehousing & Processing: Snowflake, Redshift, Apache Airflow, dbt
  • CI/CD & Deployment: Jenkins, GitHub Actions, AWS CodePipeline, Terraform
  • Cloud & Event Processing: AWS Lambda, API Gateway, SNS/SQS, Kafka, Step Functions
  • Monitoring & Logging: DataDog, AWS CloudWatch, Prometheus, Splunk
  • Incident Management: PagerDuty, Opsgenie, AWS Health Dashboard
  • Collaboration & Code Review: GitHub, Jira, Confluence

Key Responsibilities

Data Pipeline Reliability & Observability

  • Maintain highly available, fault-tolerant infrastructure for ETL jobs and real-time data processing
  • Implement monitoring of Airflow DAGs, Snowflake queries, and AWS data workflows
  • Automate health checks, error handling, and self-healing for data pipelines

Infrastructure & Cloud Automation

  • Deploy and manage AWS-based infrastructure with Terraform & CloudFormation
  • Optimize Kubernetes (EKS) clusters for scale and cost efficiency
  • Support scaling and reliability for Redshift, Snowflake, and storage solutions

Performance, Monitoring & Incident Response

  • Build real-time monitoring, logging, and alerting with DataDog, CloudWatch, and Prometheus
  • Define & track SLOs/SLIs to improve data platform uptime
  • Perform RCA, post-mortems, and security audits after incidents

Security & Compliance

  • Ensure compliance with GDPR, CCPA, SOC 2 across data pipelines
  • Apply AWS security best practices (IAM, KMS, Shield, WAF)
  • Secure API Gateways, data access policies, and encryption standards

Collaboration & Leadership

  • Partner with data engineers, analytics, and DevOps teams to improve reliability
  • Participate in DR (Disaster Recovery) planning and security compliance reviews
  • Promote best practices in automation, observability, and cost optimization


  • Bengaluru, Karnataka, India AppHelix Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    Role DescriptionThis is a full-time on-site role located in Bengaluru for a Site Reliability Engineer. The Site Reliability Engineer will be responsible for maintaining and improving the reliability of AppHelix's systems. Daily tasks include monitoring system performance, troubleshooting issues, managing infrastructure, and supporting software development....


  • Hyderabad, India Talent Worx Full time

    Site Reliability Engineer (SRE) At Talent Worx, we are looking for a dedicated Site Reliability Engineer (SRE) to join our team. This role involves maintaining high availability and reliability of our services through the application of software engineering practices and systems administration skills. The ideal candidate will bridge the gap between...


  • Hyderabad, India Sonata Software Full time

    Hello Connetions Greetings of the day!!! We have immediate openings for SRE Role - Site Reliability Engineer Experience - 7 to 12yrs Work Location -Hyderabad Notice Period -immediate Interested candidates can share your CVs to -


  • Bengaluru, Karnataka, India FIS Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    About the Role :Site Reliability Engineer (SRE)with deep expertise inMainframe technologies like COBOL, JCL, etc. to support and enhance ourCard Management & Payment processing functions. This role will be responsible for ensuring reliability, high availability, scalability, stability and performance of mission-critical mainframe software applications and...


  • Hyderabad, India Talentiser Full time

    Hiring hybrid Site Reliability Engineers for a fast-growing product company building scalable tech solutions and transforming how businesses run mission-critical operations. Our Saa S platform is designed for high performance, reliability, and automation at scale. Your Impact As a Site Reliability Engineer , you’ll play a key role in ensuring ...


  • Hyderabad, India Sonata Software Full time

    Category Details Role Site Reliability Engineer (SRE) III – Data Engineering Location Hyderabad- Employment Type Full Time Experience 7–12 years in site reliability, cloud-based data infrastructure, data pipeline observability, automation, and high-availability engineering within EdTech platforms (2U) Primary Skills (Must-Have) AWS, CI/CD, Jenkins, IAAC,...


  • Hyderabad, India Sonata Software Full time

    Category Details Role Site Reliability Engineer (SRE) III – Data Engineering Location Hyderabad- Employment Type Full Time Experience 7–12 years in site reliability, cloud-based data infrastructure, data pipeline observability, automation, and high-availability engineering within EdTech platforms (2U) Primary Skills (Must-Have) AWS, CI/CD, Jenkins, IAAC,...


  • Hyderabad, India Sonata Software Full time

    Role: Site Reliability Engineer Location: HyderabadNotice Period: Immediate to 20 DaysEmployment Type: Full TimeExperience7–12 years in site reliability, cloud-based data infrastructure, data pipeline observability, automation, and high-availability engineering within EdTech platforms (2U)Primary Skills (Must-Have)AWS, CI/CD, Jenkins, IAAC,...


  • Hyderabad, Telangana, India Apple Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Imagine what you could do here. Apple is a place where extraordinary people gather to do their best work. Together we craft products and experiences people once couldn't have imagined — and now can't imagine living without. If you're motivated by the idea of making a real impact, and joining a team where we pride ourselves in being one of the most diverse...


  • Hyderabad, India Pythian Full time

    Site Reliability Engineer HyderabadSite Reliability Engineering – Site Reliability Engineering /Full Time /HybridSite Reliability Engineer Hyderabad-based | Multiple timezones available | Hybrid | Work from Home and the OfficeWhy Pythian: At Pythian, we are experts in strategic database and analytics services, driving digital transformation and...