Site Reliability Engineer

3 days ago


Hyderabad, India Sonata Software Full time

Role: Site Reliability Engineer (SRE) III – Data Engineering
Location: Hyderabad
Employment Type: Full Time
Experience: 7–12 years in site reliability, cloud-based data infrastructure, data pipeline observability, automation, and high-availability engineering within EdTech platforms (2U)
Primary Skills (Must-Have): AWS, CI/CD, Jenkins, IaC, Terraform, Kubernetes
Secondary Skills (Good-to-Have): AWS systems; Dataiku data, Platform updates and patching

Tools & Platforms
- Data Warehousing & Processing: Snowflake, Redshift, Apache Airflow, dbt
- CI/CD & Deployment: Jenkins, GitHub Actions, AWS CodePipeline, Terraform
- Cloud & Event Processing: AWS Lambda, API Gateway, SNS/SQS, Kafka, Step Functions
- Monitoring & Logging: DataDog, AWS CloudWatch, Prometheus, Splunk
- Incident Management: PagerDuty, Opsgenie, AWS Health Dashboard
- Collaboration & Code Review: GitHub, Jira, Confluence

Key Responsibilities

Data Pipeline Reliability & Observability:
- Maintain and optimize highly available, fault-tolerant infrastructure for data pipelines, ETL jobs, and real-time data processing
- Implement end-to-end monitoring of Airflow DAGs, Snowflake queries, and AWS-based data workflows
- Automate data pipeline health checks, error handling, and auto-remediation strategies

Infrastructure & Cloud Automation:
- Deploy and manage AWS-based data infrastructure using Terraform and CloudFormation
- Optimize Kubernetes (EKS) clusters for processing large-scale datasets and real-time analytics
- Ensure high availability and cost-efficient scaling for Redshift, Snowflake, and data storage solutions

Performance, Monitoring & Incident Response:
- Implement real-time monitoring, logging, and alerting using DataDog, AWS CloudWatch, and Prometheus
- Define and track SLOs, SLIs, and error budgets to improve data reliability and uptime
- Conduct Root Cause Analysis (RCA), security audits, and post-mortems for incidents

Security & Compliance:
- Ensure GDPR, CCPA, and SOC 2 compliance for data storage, access controls, and retention policies
- Implement AWS security best practices (IAM, KMS, Shield, WAF) to secure data access and encryption
- Secure API gateways, authentication mechanisms, and data lake permissions to prevent unauthorized access

Collaboration & Leadership:
- Work closely with data engineers, analytics teams, and DevOps engineers to enhance data platform reliability
- Participate in incident response drills, disaster recovery planning, and security compliance reviews
- Advocate for best practices in automation, cost optimization, and cloud-native data solutions



  • Hyderabad, India Talent Worx Full time

    Site Reliability Engineer (SRE) At Talent Worx, we are looking for a dedicated Site Reliability Engineer (SRE) to join our team. This role involves maintaining high availability and reliability of our services through the application of software engineering practices and systems administration skills. The ideal candidate will bridge the gap between...


  • Hyderabad, India Jigya Software Services Full time

    Job Title: Senior Site Reliability Engineer (SRE) - AWS/Kubernetes. Location: Hyderabad - Onsite. Job Type: Full-Time. About the Role: We are looking for a highly skilled and motivated Site Reliability Engineer to design, build, and maintain our high-performance, scalable cloud infrastructure. You will play a critical role in ensuring the reliability, performance,...


  • Hyderabad, India Talentiser Full time

    Hiring hybrid Site Reliability Engineers for a fast-growing product company building scalable tech solutions and transforming how businesses run mission-critical operations. Our SaaS platform is designed for high performance, reliability, and automation at scale. Your Impact: As a Site Reliability Engineer, you’ll play a key role in ensuring ...


  • Hyderabad, India Sonata Software Full time

    Hello Connections, greetings of the day! We have immediate openings for an SRE role. Role: Site Reliability Engineer. Experience: 7 to 12 years. Work Location: Hyderabad. Notice Period: Immediate. Interested candidates can share their CVs to -


  • Hyderabad, India Sonata Software Full time

    Role: Site Reliability Engineer (SRE) III – Data Engineering. Location: Hyderabad. Employment Type: Full Time. Experience: 7–12 years in site reliability, cloud-based data infrastructure, data pipeline observability, automation, and high-availability engineering within EdTech platforms (2U). Primary Skills (Must-Have): AWS, CI/CD, Jenkins, IAAC,...


  • Hyderabad, India Sonata Software Full time

    Role: Site Reliability Engineer. Location: Hyderabad. Notice Period: Immediate to 20 Days. Employment Type: Full Time. Experience: 7–12 years in site reliability, cloud-based data infrastructure, data pipeline observability, automation, and high-availability engineering within EdTech platforms (2U). Primary Skills (Must-Have): AWS, CI/CD, Jenkins, IAAC,...


  • Hyderabad, Telangana, India Apple Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Imagine what you could do here. Apple is a place where extraordinary people gather to do their best work. Together we craft products and experiences people once couldn't have imagined — and now can't imagine living without. If you're motivated by the idea of making a real impact, and joining a team where we pride ourselves in being one of the most diverse...


  • Hyderabad, India Pythian Full time

    Site Reliability Engineer, Hyderabad. Site Reliability Engineering – Site Reliability Engineering / Full Time / Hybrid. Site Reliability Engineer: Hyderabad-based | Multiple timezones available | Hybrid | Work from Home and the Office. Why Pythian: At Pythian, we are experts in strategic database and analytics services, driving digital transformation and...


  • Hyderabad, India Talentiser Full time

    Hiring hybrid Site Reliability Engineers for a fast-growing product company building scalable tech solutions and transforming how businesses run mission-critical operations. Our SaaS platform is designed for high performance, reliability, and automation at scale. Your Impact: As a Site Reliability Engineer, you’ll play a key role in ensuring reliability,...