Infrastructure Automation Site Reliability Engineer

2 weeks ago


Mumbai, Maharashtra, India Interactive Brokers Full time ₹ 12,00,000 - ₹ 24,00,000 per year
Company Overview

Interactive Brokers Group, Inc. (Nasdaq: IBKR) is a global financial services company headquartered in Greenwich, CT, USA, with offices in over 15 countries. We have been at the forefront of financial innovation for over four decades, known for our cutting-edge technology and client commitment.

IBKR affiliates provide global electronic brokerage services around the clock on stocks, options, futures, currencies, bonds, and funds to clients in over 200 countries and territories. We serve individual investors and institutions, including financial advisors, hedge funds and introducing brokers. Our advanced technology, competitive pricing, and global market help our clients to make the most of their investments.

Barron's has recognized Interactive Brokers as the #1 online broker for six consecutive years. Join our dynamic, multi-national team and be a part of a company that simplifies and enhances financial opportunities using state-of-the-art technology.

About the Role:

The Infrastructure Automation Site Reliability Engineer (SRE) bridges the gap between development and operations by applying software engineering principles to infrastructure and operational challenges. Responsibilities include creating support documentation, developing key metrics for tracking and reporting, managing monitoring services, using automation tools, and coordinating cross-team communications related to releases and maintenance.

Automation SREs support existing Infrastructure Developers by taking ownership of application support and process work required to manage these applications at scale in a 24×7 environment. This allows developers to focus on building new features and functionality.

Key Functions:

Application / Tool Support

  • Support existing applications and services hosted by the Infrastructure Automation (InfAuto) team
  • Develop runbooks for application support and maintenance
  • Create detailed alerts for incident management and monitoring tools
  • Implement and manage an updated operations platform for the Technical Operations team

Service Introduction & Communications

  • Develop communication plans for service and tool launches
  • Improve messaging around service interruptions and maintenance

Infrastructure & Automation

  • Expand use of cloud development pipelines for new observability capabilities
  • Support cloud infrastructure integration
  • Use scripts to perform maintenance tasks

Monitoring & Observability

  • Define KPIs and SLAs for managed services
  • Assist with dashboard development and management
  • Integrate cloud infrastructure with monitoring and reporting tools
  • Conduct capacity planning to support proactive scaling

Operational Excellence

  • Design and execute high availability (HA) and disaster recovery (DR) infrastructure testing
  • Partner with operations teams to expedite issue analysis
  • Coordinate change management activities with application users

Required Skills and Tools Experience:

Experience Range: 3–6 years using tools in the following categories:

  • Infrastructure as Code: Terraform, CloudFormation, or similar
  • Configuration Management: Ansible, Puppet, or Chef
  • Container Technologies: Docker, Podman, basic Kubernetes concepts
  • Observability Platforms: Grafana, Elastic (ELK), DataDog, Splunk
  • Issue / Project Tracking: JIRA, ServiceNow, Trello, or similar
  • CI/CD Pipelines: Jenkins, GitLab CI, GitHub Actions
  • Documentation Tools: SharePoint, Confluence (for user guides, runbooks, etc.)
  • Linux Operating Systems: Red Hat Enterprise Linux or similar (CentOS, Rocky, Fedora)
  • Database Operations: SQL, PostgreSQL
  • IDEs: Visual Studio Code (VS Code), JetBrains IntelliJ IDEA

Desired Skills

  • 2–4 years in an L1 SRE or DevOps role
  • Experience as a Systems Engineer (infrastructure design and implementation)
  • Platform Engineer (internal tooling and platform development)
  • Cloud Engineer (multi-cloud experience and migration projects)
  • Application Support (production troubleshooting)
  • Release Engineer (software deployment and release management)
  • Incident Response (on-call experience and production issue resolution)
Company Benefits & Perks:
  • Competitive salary package.
  • Performance-based annual bonus (cash and stocks).
  • Hybrid working model (3 days office/week).
  • Group Medical & Life Insurance.
  • Modern offices with free amenities & fully stocked cafeterias.
  • Monthly food card & company-paid snacks.
  • Hardship/shift allowance with company-provided pickup & drop facility*
  • Attractive employee referral bonus.
  • Frequent company-sponsored team-building events and outings.

  • Depending upon the shifts.

**The benefits package is subject to change at the management's discretion.



  • Mumbai, Maharashtra, India Oracle Financial Services Software Ltd Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Senior Site Reliability Developer OCI is Oracle's next-generation cloud platform, built for the most demanding enterprise workloads. We deliver high-performance computing, storage, networking, and platform services at global scale. The AI Platform, Services & Solutions organization within OCI is building the foundation for enterprise AI—spanning GPU...


  • Mumbai, Maharashtra, India Avant-Garde Corporate Services Private Limited Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    We are seeking a skilled and proactive Site Reliability Engineer (SRE) to join the IT Transformation team.The role involves driving automation, reliability, and performance optimization across mission-critical applications and infrastructure within a financial market ecosystem.The successful candidate will manage end-to-end deployment automation, CI/CD...


  • Mumbai, Maharashtra, India Oracle Financial Services Software Ltd Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    Site Reliability Developer 3 Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Design and develop designs, architectures, standards, and methods for large-scale...


  • Mumbai, Maharashtra, India Talent Leads HR Solutions Pvt Ltd Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    Skill, Knowledge &Trainings : - Site Reliability Engineer will be responsible to develop and implement services that improve Software development Life Cycle. - Build automations which will help optimize software delivery. - Improve reliability, quality, and time-to-market of our suite of software solutions. - Will be responsible for availability,...


  • Navi Mumbai, Maharashtra, India Uplers Full time ₹ 8,00,000 - ₹ 25,00,000 per year

    Experience: 4+ yearsSalary: ConfidentialShift: (GMT+05:30) Asia/Kolkata (IST)Opportunity Type: Office (Mumbai)Placement Type: Full time Permanent Position(*Note: This is a requirement for one of Uplers' client--Gofynd)What do you need for this opportunity?Must have skills required: and AWS/Google Cloud and MongoDB/CI/CD/GrafanaJob descriptionFynd is Indias...


  • Mumbai, Maharashtra, India Search Synergy Pvt Ltd Full time ₹ 6,00,000 - ₹ 18,00,000 per year

    Note - Location - Dadar/Kurla (Mumbai)Skill, Knowledge &Trainings : - Own and manage the CI/CD pipelines for automated build, test, and deployment. - Design and implement robust deployment strategies for microservices and web applications. - Set up and maintain monitoring, alerting, and logging frameworks (e.g., Prometheus, Grafana, ELK) - Build...


  • Mumbai, Maharashtra, India RELX Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    Would you like to be part of a team that delivers high-quality software to our customers?Are you a visible champion with a 'can do' attitude and enthusiasm that inspires others?About The BusinessLexisNexis Risk Solutions is the essential partner in the assessment of risk. Within our Business Services vertical, we offer a multitude of solutions focused on...


  • Mumbai, Maharashtra, India RELX Group Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Would you like to be part of a team that delivers high-quality software to our customers?Are you a visible champion with a 'can do' attitude and enthusiasm that inspires others?About the BusinessLexisNexis Risk Solutions is the essential partner in the assessment of risk. Within our Business Services vertical, we offer a multitude of solutions focused on...


  • Mumbai, Maharashtra, India Zycus Infotech Pvt Ltd Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Job Description : Zycus is looking for a Site Reliability Engineer (SRE) with deep expertise in Kubernetes, automation, and Linux systems. The ideal candidate will have hands-on experience in deploying, administrating, and optimizing large-scale production systems, with a strong focus on microservices architecture, ensuring automation, performance,...


  • Mumbai, Maharashtra, India Pivotree Full time ₹ 10,00,000 - ₹ 25,00,000 per year

    IntroductionOur goal at Pivotree is to help accelerate the future of frictionless commerce. We will help lead this change over the next decade because we believe a future where technology is embedded intimately into all aspects of our everyday lives can benefit everyone and will shape the interactions with the brands we love. We will help shape the future of...