
Site Reliability Engineer
5 days ago
We are seeking a Site Reliability Engineer with strong expertise in AWS, CI/CD, IaC, and Kubernetes to ensure the reliability, scalability, and security of large-scale data infrastructure. The ideal candidate will blend DevOps best practices with data engineering operations, focusing on automation, observability, and cloud-native solutions.
Primary Skills (Must-Have)
- AWS (core services: EC2, EKS, Lambda, Redshift, S3, IAM, VPC)
- CI/CD (Jenkins, GitHub Actions, AWS CodePipeline)
- Infrastructure as Code (Terraform, CloudFormation)
- Kubernetes (EKS) and container orchestration
Secondary Skills (Good-to-Have)
- AWS Systems Manager, Dataiku platform operations
- Experience with platform patching, upgrades, and maintenance
Tools & Platforms
- Data Warehousing & Processing: Snowflake, Redshift, Apache Airflow, dbt
- CI/CD & Deployment: Jenkins, GitHub Actions, AWS CodePipeline, Terraform
- Cloud & Event Processing: AWS Lambda, API Gateway, SNS/SQS, Kafka, Step Functions
- Monitoring & Logging: DataDog, AWS CloudWatch, Prometheus, Splunk
- Incident Management: PagerDuty, Opsgenie, AWS Health Dashboard
- Collaboration & Code Review: GitHub, Jira, Confluence
Key Responsibilities
Data Pipeline Reliability & Observability
- Maintain highly available, fault-tolerant infrastructure for ETL jobs and real-time data processing
- Implement monitoring of Airflow DAGs, Snowflake queries, and AWS data workflows
- Automate health checks, error handling, and self-healing for data pipelines
Infrastructure & Cloud Automation
- Deploy and manage AWS-based infrastructure with Terraform & CloudFormation
- Optimize Kubernetes (EKS) clusters for scale and cost efficiency
- Support scaling and reliability for Redshift, Snowflake, and storage solutions
Performance, Monitoring & Incident Response
- Build real-time monitoring, logging, and alerting with DataDog, CloudWatch, and Prometheus
- Define & track SLOs/SLIs to improve data platform uptime
- Perform RCA, post-mortems, and security audits after incidents
Security & Compliance
- Ensure compliance with GDPR, CCPA, SOC 2 across data pipelines
- Apply AWS security best practices (IAM, KMS, Shield, WAF)
- Secure API Gateways, data access policies, and encryption standards
Collaboration & Leadership
- Partner with data engineers, analytics, and DevOps teams to improve reliability
- Participate in DR (Disaster Recovery) planning and security compliance reviews
- Promote best practices in automation, observability, and cost optimization
-
Site Reliability Engineer
3 days ago
Bengaluru, Karnataka, India AppHelix Full time ₹ 9,00,000 - ₹ 12,00,000 per yearRole DescriptionThis is a full-time on-site role located in Bengaluru for a Site Reliability Engineer. The Site Reliability Engineer will be responsible for maintaining and improving the reliability of AppHelix's systems. Daily tasks include monitoring system performance, troubleshooting issues, managing infrastructure, and supporting software development....
-
Site Reliability Engineer
3 weeks ago
Hyderabad, India Talent Worx Full timeSite Reliability Engineer (SRE) At Talent Worx, we are looking for a dedicated Site Reliability Engineer (SRE) to join our team. This role involves maintaining high availability and reliability of our services through the application of software engineering practices and systems administration skills. The ideal candidate will bridge the gap between...
-
Site Reliability Engineer
5 days ago
Hyderabad, India Sonata Software Full timeHello Connetions Greetings of the day!!! We have immediate openings for SRE Role - Site Reliability Engineer Experience - 7 to 12yrs Work Location -Hyderabad Notice Period -immediate Interested candidates can share your CVs to -
-
Site Reliability Engineer
2 days ago
Bengaluru, Karnataka, India FIS Full time ₹ 12,00,000 - ₹ 36,00,000 per yearAbout the Role :Site Reliability Engineer (SRE)with deep expertise inMainframe technologies like COBOL, JCL, etc. to support and enhance ourCard Management & Payment processing functions. This role will be responsible for ensuring reliability, high availability, scalability, stability and performance of mission-critical mainframe software applications and...
-
Site reliability engineer
2 weeks ago
Hyderabad, India Talentiser Full timeHiring hybrid Site Reliability Engineers for a fast-growing product company building scalable tech solutions and transforming how businesses run mission-critical operations. Our Saa S platform is designed for high performance, reliability, and automation at scale. Your Impact As a Site Reliability Engineer , you’ll play a key role in ensuring ...
-
Site Reliability Engineer
5 days ago
Hyderabad, India Sonata Software Full timeCategory Details Role Site Reliability Engineer (SRE) III – Data Engineering Location Hyderabad- Employment Type Full Time Experience 7–12 years in site reliability, cloud-based data infrastructure, data pipeline observability, automation, and high-availability engineering within EdTech platforms (2U) Primary Skills (Must-Have) AWS, CI/CD, Jenkins, IAAC,...
-
Site Reliability Engineer
5 days ago
Hyderabad, India Sonata Software Full timeCategory Details Role Site Reliability Engineer (SRE) III – Data Engineering Location Hyderabad- Employment Type Full Time Experience 7–12 years in site reliability, cloud-based data infrastructure, data pipeline observability, automation, and high-availability engineering within EdTech platforms (2U) Primary Skills (Must-Have) AWS, CI/CD, Jenkins, IAAC,...
-
Site Reliability Engineer
5 days ago
Hyderabad, India Sonata Software Full timeRole: Site Reliability Engineer Location: HyderabadNotice Period: Immediate to 20 DaysEmployment Type: Full TimeExperience7–12 years in site reliability, cloud-based data infrastructure, data pipeline observability, automation, and high-availability engineering within EdTech platforms (2U)Primary Skills (Must-Have)AWS, CI/CD, Jenkins, IAAC,...
-
Site Reliability Engineer
5 days ago
Hyderabad, Telangana, India Apple Full time ₹ 15,00,000 - ₹ 25,00,000 per yearImagine what you could do here. Apple is a place where extraordinary people gather to do their best work. Together we craft products and experiences people once couldn't have imagined — and now can't imagine living without. If you're motivated by the idea of making a real impact, and joining a team where we pride ourselves in being one of the most diverse...
-
Site Reliability Engineer
3 weeks ago
Hyderabad, India Pythian Full timeSite Reliability Engineer HyderabadSite Reliability Engineering – Site Reliability Engineering /Full Time /HybridSite Reliability Engineer Hyderabad-based | Multiple timezones available | Hybrid | Work from Home and the OfficeWhy Pythian: At Pythian, we are experts in strategic database and analytics services, driving digital transformation and...