Infrastructure Automation Site Reliability Engineer

24 hours ago


Hyderabad, Telangana, India Interactive Brokers External Full time ₹ 6,00,000 - ₹ 12,00,000 per year

About the Role:

The Infrastructure Automation Site Reliability Engineer (SRE) bridges the gap between development and operations by applying software engineering principles to infrastructure and operational challenges. Responsibilities include creating support documentation, developing key metrics for tracking and reporting, managing monitoring services, using automation tools, and coordinating cross-team communications related to releases and maintenance.

Automation SREs support existing Infrastructure Developers by taking ownership of application support and process work required to manage these applications at scale in a 24×7 environment. This allows developers to focus on building new features and functionality.

Key Functions:

Application / Tool Support

  • Support existing applications and services hosted by the Infrastructure Automation (InfAuto) team
  • Develop runbooks for application support and maintenance
  • Create detailed alerts for incident management and monitoring tools
  • Implement and manage an updated operations platform for the Technical Operations team

Service Introduction & Communications

  • Develop communication plans for service and tool launches
  • Improve messaging around service interruptions and maintenance

Infrastructure & Automation

  • Expand use of cloud development pipelines for new observability capabilities
  • Support cloud infrastructure integration
  • Use scripts to perform maintenance tasks

Monitoring & Observability

  • Define KPIs and SLAs for managed services
  • Assist with dashboard development and management
  • Integrate cloud infrastructure with monitoring and reporting tools
  • Conduct capacity planning to support proactive scaling

Operational Excellence

  • Design and execute high availability (HA) and disaster recovery (DR) infrastructure testing
  • Partner with operations teams to expedite issue analysis
  • Coordinate change management activities with application users

Required Skills and Tools Experience:

Experience Range: 3–6 years using tools in the following categories:

  • Infrastructure as Code: Terraform, CloudFormation, or similar
  • Configuration Management: Ansible, Puppet, or Chef
  • Container Technologies: Docker, Podman, basic Kubernetes concepts
  • Observability Platforms: Grafana, Elastic (ELK), DataDog, Splunk
  • Issue / Project Tracking: JIRA, ServiceNow, Trello, or similar
  • CI/CD Pipelines: Jenkins, GitLab CI, GitHub Actions
  • Documentation Tools: SharePoint, Confluence (for user guides, runbooks, etc.)
  • Linux Operating Systems: Red Hat Enterprise Linux or similar (CentOS, Rocky, Fedora)
  • Database Operations: SQL, PostgreSQL
  • IDEs: Visual Studio Code (VS Code), JetBrains IntelliJ IDEA

Desired Skills

  • 2–4 years in an L1 SRE or DevOps role
  • Experience as a Systems Engineer (infrastructure design and implementation)
  • Platform Engineer (internal tooling and platform development)
  • Cloud Engineer (multi-cloud experience and migration projects)
  • Application Support (production troubleshooting)
  • Release Engineer (software deployment and release management)
  • Incident Response (on-call experience and production issue resolution)
Company Benefits & Perks: 
  • Competitive salary package.
  • Performance-based annual bonus (cash and stocks).
  • Hybrid working model (3 days office/week).
  • Group Medical & Life Insurance.
  • Modern offices with free amenities & fully stocked cafeterias.
  • Monthly food card & company-paid snacks.
  • Hardship/shift allowance with company-provided pickup & drop facility*
  • Attractive employee referral bonus.
  • Frequent company-sponsored team-building events and outings.

* Depending upon the shifts.
**The benefits package is subject to change at the management's discretion.



  • Hyderabad, Telangana, India Jigya Software Services Full time ₹ 1,50,000 - ₹ 28,00,000 per year

    Job Title:Senior Site Reliability Engineer (SRE) - AWS/KubernetesLocation:Hyderabad - OnsiteJob Type:Full-TimeAbout the Role:We are looking for a highly skilled and motivated Site Reliability Engineer to design, build, and maintain our high-performance, scalable cloud infrastructure. You will play a critical role in ensuring the reliability, performance, and...


  • Hyderabad, Telangana, India Apple Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Imagine what you could do here. Apple is a place where extraordinary people gather to do their best work. Together we craft products and experiences people once couldn't have imagined — and now can't imagine living without. If you're motivated by the idea of making a real impact, and joining a team where we pride ourselves in being one of the most diverse...


  • Hyderabad, Telangana, India TurboHire Full time ₹ 15,00,000 - ₹ 28,00,000 per year

    Site Reliability Engineer (SRE)Location: Hyderabad (Hybrid)Experience: 3–5 yearsAbout the RoleWe are looking for an SRE Engineer to own reliability, deployment, and monitoringof TurboHire's cloud infrastructure. You will ensure our platform is scalable, secure,and highly available. The role balances hands-on coding, automation, and infraoperations, freeing...


  • Hyderabad, Telangana, India Evalify-IQ Full time ₹ 6,00,000 - ₹ 18,00,000 per year

    Skills Required:AWS, Azure, Terraform, CloudFormation, Cloudformation, Pulumi, CICD, GitHub Actions,GitLab CI, Jenkins, ArgoCD, Prometheus, Splunk, Grafana, Cloudwatch, Datadog, SRE,Site Reliability, Python, Powershell, Shell, Go, Kubernetes, Docker, Performance Tuning,Performance Enhancements, Performance Enhancement, PerformanceExperience Range:2 - 5...


  • Hyderabad, Telangana, India Amgen Inc Full time ₹ 8,00,000 - ₹ 12,00,000 per year

    We are looking for a Site Reliability Engineer/Cloud Engineer (SRE) to work on the performance optimization, standardization, and automation of Amgens critical infrastructure and systems. This role is crucial to ensuring the reliability, scalability, and cost-effectiveness of our production systems. The ideal candidate will work on operational excellence...


  • Hyderabad, Telangana, India INDIGLOBE IT SOLUTIONS PRIVATE LIMITED Full time

    Job Summary :We are looking for a Senior Site Reliability Engineer (SRE) to join our growing Engineering team. As an SRE, you will play a key role in ensuring the reliability, scalability, and performance of our production systems across a multi-cloud environment (GCP & AWS). Youll be responsible for owning application support, maintaining our microservices...


  • Hyderabad, Telangana, India Self Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    We are seeking a highly experienced Site Reliability Engineer (SRE) with a strong background in Microsoft Dynamics 365 (D365) infrastructure, including Power Platform, Dataverse, and Finance & Operations (F&O). This role requires deep expertise in Azure DevOps (ADO), GitHub, and infrastructure automation. You will be responsible for deploying, configuring,...


  • Hyderabad, Telangana, India Chase- Candidate Experience page Full time ₹ 1,04,000 - ₹ 1,30,878 per year

    There's nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems.As a Site Reliability Engineer III at JPMorgan Chase within the Chief Technology Office team, you will solve complex and broad business problems...


  • Hyderabad, Telangana, India Turbo Hire Full time ₹ 3,000 - ₹ 5,000 per year

    Full TimeJob Code: TTPLO-3796 | Hyderabad, Telangana, India1 positionExpires on 21/10/2025Compensation₹ 3 - 5 per yearRequired ExperienceSkillsnodejs,windows/apache/mysql...Site Reliability Engineer (SRE)Location: Hyderabad (Hybrid)Experience: 3–5 yearsAbout the RoleWe are looking for an SRE Engineer to own reliability, deployment, and monitoringof...


  • Hyderabad, Telangana, India Talent Worx Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Site Reliability Engineer (SRE)At Talent Worx, we are looking for a dedicated Site Reliability Engineer (SRE) to join our team. This role involves maintaining high availability and reliability of our services through the application of software engineering practices and systems administration skills. The ideal candidate will bridge the gap between...