
Site Reliability Engineer III
1 week ago
Role & responsibilities
Category
Details
Role
Site Reliability Engineer (SRE) III Data Engineering
Location
Hyderabad- Hybrid
Employment Type
Full Time
Experience
7–12 years in site reliability, cloud-based data infrastructure, data pipeline observability, automation, and high-availability engineering within EdTech platforms (2U)
Primary Skills (Must-Have)
AWS, CI/CD, Jenkins, IAAC, Terraform, Kubernetes
Secondary Skills (Good-to-Have)
AWS systems; Dataiku data, Platform updates and patching
Tools & Platforms
Data Warehousing & Processing: Snowflake, Redshift, Apache Airflow, dbt
CI/CD & Deployment: Jenkins, GitHub Actions, AWS CodePipeline, Terraform
Cloud & Event Processing: AWS Lambda, API Gateway, SNS/SQS, Kafka, Step Functions
Monitoring & Logging: DataDog, AWS CloudWatch, Prometheus, Splunk
Incident Management: PagerDuty, Opsgenie, AWS Health Dashboard
Collaboration & Code Review: GitHub, Jira, Confluence
Key Responsibilities
Data Pipeline Reliability & Observability:
Maintain and optimize highly available, fault-tolerant infrastructure for data pipelines, ETL jobs, and real-time data processing
Implement end-to-end monitoring of Airflow DAGs, Snowflake queries, and AWS-based data workflows
Automate data pipeline health checks, error handling, and auto-remediation strategies
Infrastructure & Cloud Automation:
Deploy and manage AWS-based data infrastructure using Terraform and CloudFormation
Optimize Kubernetes (EKS) clusters for processing large-scale datasets and real-time analytics
Ensure high availability and cost-efficient scaling for Redshift, Snowflake, and data storage solutions
Performance, Monitoring & Incident Response:
Implement real-time monitoring, logging, and alerting using DataDog, AWS CloudWatch, and Prometheus
Define and track SLOs, SLIs, and error budgets to improve data reliability and uptime
Conduct Root Cause Analysis (RCA), security audits, and post-mortems for incidents
Security & Compliance:
Ensure GDPR, CCPA, and SOC 2 compliance for data storage, access controls, and retention policies
Implement AWS security best practices (IAM, KMS, Shield, WAF) to secure data access and encryption
Secure API gateways, authentication mechanisms, and data lake permissions to prevent unauthorized access
Collaboration & Leadership:
Work closely with data engineers, analytics teams, and DevOps engineers to enhance data platform reliability
Participate in incident response drills, disaster recovery planning, and security compliance reviews
Advocate for best practices in automation, cost optimization, and cloud-native data solutions
-
Site Reliability Engineer III
2 weeks ago
Hyderabad, Telangana, India JPMorganChase Full time ₹ 20,00,000 - ₹ 25,00,000 per yearJOB DESCRIPTIONThere's nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems.As a Site Reliability Engineer III at JPMorgan Chase within the Consumer & Community Banking, you will solve complex and broad...
-
Site Reliability Engineer III
2 hours ago
Hyderabad, Telangana, India Chase- Candidate Experience page Full time ₹ 20,00,000 - ₹ 25,00,000 per yearThere's nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems.As a Site Reliability Engineer III at JPMorgan Chase within the Consumer and Community Banking, you will solve complex and broad business problems...
-
Site Reliability Engineer
3 days ago
Hyderabad, Telangana, India, Telangana Sonata Software Full timeCategoryDetailsRoleSite Reliability Engineer (SRE) III – Data EngineeringLocationHyderabad- Employment TypeFull TimeExperience7–12 years in site reliability, cloud-based data infrastructure, data pipeline observability, automation, and high-availability engineering within EdTech platforms (2U)Primary Skills (Must-Have)AWS, CI/CD, Jenkins, IAAC,...
-
Site Reliability Engineer
2 days ago
Hyderabad, Telangana, India Apple Full time ₹ 15,00,000 - ₹ 25,00,000 per yearImagine what you could do here. Apple is a place where extraordinary people gather to do their best work. Together we craft products and experiences people once couldn't have imagined — and now can't imagine living without. If you're motivated by the idea of making a real impact, and joining a team where we pride ourselves in being one of the most diverse...
-
SRE(Site Reliability Engineer)
4 hours ago
Hyderabad, Telangana, India Talent Worx Full time ₹ 20,00,000 - ₹ 25,00,000 per yearSRE (Site Reliability Engineer)Talent Worx is seeking a talented SRE (Site Reliability Engineer) to enhance our technology team. In this role, you will be pivotal in ensuring the reliability, performance, and availability of our applications and services. Your work will involve both software engineering and systems operations as you strive to improve...
-
Site Reliability Engineer
2 weeks ago
Hyderabad, Telangana, India SID Global Solutions Full time ₹ 9,00,000 - ₹ 12,00,000 per yearJob Role: Site Reliability Engineer (SRE) – GCPExperience: 3+ yearsLocation: HyderabadAbout SIDGS:SIDGS is a premium global systems integrator and global implementation partner of Google corporation, providing Digital Solutions & Services to Fortune 500 companies. Our Digital solutions go across following domains: User Experience, CMS, API Management,...
-
Site Reliability Engineer
1 week ago
Hyderabad, Telangana, India TurboHire Full time ₹ 15,00,000 - ₹ 28,00,000 per yearSite Reliability Engineer (SRE)Location: Hyderabad (Hybrid)Experience: 3–5 yearsAbout the RoleWe are looking for an SRE Engineer to own reliability, deployment, and monitoringof TurboHire's cloud infrastructure. You will ensure our platform is scalable, secure,and highly available. The role balances hands-on coding, automation, and infraoperations, freeing...
-
Site Reliability Engineer
3 hours ago
Hyderabad, Telangana, India LivePerson Full time ₹ 8,00,000 - ₹ 15,00,000 per yearLivePerson (NASDAQ: LPSN) is a leading customer engagement company, creating digital experiences powered by Curiously Human AI. Every person is unique, and our technology makes it possible for companies, including leading brands like HSBC, Orange, and GM Financial, to treat their audiences that way at scale. Nearly a billion conversational interactions are...
-
Lead Site Reliability Engineer
4 days ago
Hyderabad, Telangana, India EPAM Systems Full time ₹ 15,00,000 - ₹ 25,00,000 per yearWe are seeking a skilledLead Site Reliability Engineerto drive the stability, scalability, and reliability of our systems while improving efficiency through automation and best practices.This role calls for deep expertise in DevOps methodologies, Infrastructure as Code (IaC), and collaboration across teams to ensure optimal system...
-
Principal Site Reliability Engineer
1 week ago
Hyderabad, Telangana, India Amgen Inc Full time ₹ 8,00,000 - ₹ 12,00,000 per yearWe are looking for a Site Reliability Engineer/Cloud Engineer (SRE) to work on the performance optimization, standardization, and automation of Amgens critical infrastructure and systems. This role is crucial to ensuring the reliability, scalability, and cost-effectiveness of our production systems. The ideal candidate will work on operational excellence...