Infrastructure Automation Site Reliability Engineer
3 weeks ago
Job Description

About The Role

The Infrastructure Automation Site Reliability Engineer (SRE) bridges the gap between development and operations by applying software engineering principles to infrastructure and operational challenges. Responsibilities include creating support documentation, developing key metrics for tracking and reporting, managing monitoring services, using automation tools, and coordinating cross-team communications related to releases and maintenance.

Automation SREs support existing Infrastructure Developers by taking ownership of the application support and process work required to manage these applications at scale in a 24/7 environment. This allows developers to focus on building new features and functionality.

Key Functions

Application / Tool Support
- Support existing applications and services hosted by the Infrastructure Automation (InfAuto) team
- Develop runbooks for application support and maintenance
- Create detailed alerts for incident management and monitoring tools
- Implement and manage an updated operations platform for the Technical Operations team

Service Introduction & Communications
- Develop communication plans for service and tool launches
- Improve messaging around service interruptions and maintenance

Infrastructure & Automation
- Expand use of cloud development pipelines for new observability capabilities
- Support cloud infrastructure integration
- Use scripts to perform maintenance tasks

Monitoring & Observability
- Define KPIs and SLAs for managed services
- Assist with dashboard development and management
- Integrate cloud infrastructure with monitoring and reporting tools
- Conduct capacity planning to support proactive scaling

Operational Excellence
- Design and execute high availability (HA) and disaster recovery (DR) infrastructure testing
- Partner with operations teams to expedite issue analysis
- Coordinate change management activities with application users

Required Skills And Tools Experience

Experience Range: 3-6 years using tools in the following categories:
- Infrastructure as Code: Terraform, CloudFormation, or similar
- Configuration Management: Ansible, Puppet, or Chef
- Container Technologies: Docker, Podman, basic Kubernetes concepts
- Observability Platforms: Grafana, Elastic (ELK), DataDog, Splunk
- Issue / Project Tracking: JIRA, ServiceNow, Trello, or similar
- CI/CD Pipelines: Jenkins, GitLab CI, GitHub Actions
- Documentation Tools: SharePoint, Confluence (for user guides, runbooks, etc.)
- Linux Operating Systems: Red Hat Enterprise Linux or similar (CentOS, Rocky, Fedora)
- Database Operations: SQL, PostgreSQL
- IDEs: Visual Studio Code (VS Code), JetBrains IntelliJ IDEA

Desired Skills
- 2-4 years in an L1 SRE or DevOps role
- Experience as a Systems Engineer (infrastructure design and implementation)
- Platform Engineer (internal tooling and platform development)
- Cloud Engineer (multi-cloud experience and migration projects)
- Application Support (production troubleshooting)
- Release Engineer (software deployment and release management)
- Incident Response (on-call experience and production issue resolution)

Company Benefits & Perks
- Competitive salary package.
- Performance-based annual bonus (cash and stocks).
- Hybrid working model (3 days office/week).
- Group Medical & Life Insurance.
- Modern offices with free amenities & fully stocked cafeterias.
- Monthly food card & company-paid snacks.
- Hardship/shift allowance with company-provided pickup & drop facility*
- Attractive employee referral bonus.
- Frequent company-sponsored team-building events and outings.

* Depending upon the shifts.
The benefits package is subject to change at the management's discretion.
-
Hyderabad, Telangana, India. Interactive Brokers External. Full time. ₹ 6,00,000 - ₹ 12,00,000 per year. About the Role: The Infrastructure Automation Site Reliability Engineer (SRE) bridges the gap between development and operations by applying software engineering principles to infrastructure and operational challenges. Responsibilities include creating support documentation, developing key metrics for tracking and reporting, managing monitoring services,...
-
Site Reliability Engineer
3 days ago
India. InstaService. Full time. About InstaService: InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+ states and expanding nationwide, backed by strong traction, rapid adoption, and a mission to simplify how people get work done at home. We’re looking for a...
-
Site Reliability Engineer
2 days ago
India. InstaService. Full time. About InstaService: InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+ states and expanding nationwide, backed by strong traction, rapid adoption, and a mission to simplify how people get work done at home. We’re looking for a...
-
Site Reliability Engineer
3 days ago
India. InstaService. Full time. About InstaService: InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+ states and expanding nationwide, backed by strong traction, rapid adoption, and a mission to simplify how people get work done at home. We’re looking for a...
-
Senior Infrastructure
6 days ago
Hyderabad, Chennai, Pune, India. Brighttier Inc. Full time. Job Description: As a Senior Site Reliability Engineer, you have a proven track record of supporting operational environments while using DevOps tooling to automate management and reliability. You will work closely with our Operations teams, Architects, and other Site Reliability and DevOps engineers to help meet the day-to-day business needs of Lemongrass...
-
Site Reliability Engineer
5 days ago
Hyderabad, Telangana, India. Technology Next. Full time. ₹ 20,00,000 - ₹ 30,00,000 per year. Urgently hiring for Site Reliability Engineer (SRE) / Chaos Engineer. Location: Hyderabad. Job Type: Full-time, Permanent. Job Description: We are looking for an experienced Site Reliability Engineer (SRE) with strong Python automation skills (Boto3 required) and hands-on experience in chaos engineering to improve system reliability and resilience. The ideal...
-
Site Reliability Engineer
4 weeks ago
Hyderabad, India. ValueMomentum. Full time. Job Description, About the Role: We are seeking an experienced Site Reliability / Azure DevOps Engineer with Dynatrace experience to join our engineering team and contribute to scalable CI/CD practices, infrastructure automation, and cloud operations. The ideal candidate will have deep expertise in Azure DevOps, Infrastructure as Code (IaC), Azure services,...
-
Site Reliability Engineer III
4 weeks ago
Hyderabad, India. hackajob. Full time. Job Description: hackajob is collaborating with J.P. Morgan to connect them with exceptional tech professionals for this role. As a Site Reliability Engineer III at JPMorgan Chase within the Chief Technology Office, you will collaborate with engineering, support, and operations teams to maintain and improve the reliability of mission-critical applications....
-
Site Reliability Engineer
4 weeks ago
Hyderabad, India. Sonata Software. Full time. Role: Site Reliability Engineer (SRE) III – Data Engineering. Location: Hyderabad. Employment Type: Full Time. Experience: 7–12 years in site reliability, cloud-based data infrastructure, data pipeline observability, automation, and high-availability engineering within EdTech platforms (2U). Primary Skills (Must-Have): AWS, CI/CD, Jenkins, IAAC,...