Infrastructure Automation Site Reliability Engineer

2 weeks ago


India Interactive Brokers Full time

Company Overview Interactive Brokers Group Inc Nasdaq IBKR is a global financial services company headquartered in Greenwich CT USA with offices in over 15 countries We have been at the forefront of financial innovation for over four decades known for our cutting-edge technology and client commitment IBKR affiliates provide global electronic brokerage services around the clock on stocks options futures currencies bonds and funds to clients in over 200 countries and territories We serve individual investors and institutions including financial advisors hedge funds and introducing brokers Our advanced technology competitive pricing and global market help our clients to make the most of their investments Barron s has recognized Interactive Brokers as the 1 online broker for six consecutive years Join our dynamic multi-national team and be a part of a company that simplifies and enhances financial opportunities using state-of-the-art technology About the Role The Infrastructure Automation Site Reliability Engineer SRE bridges the gap between development and operations by applying software engineering principles to infrastructure and operational challenges Responsibilities include creating support documentation developing key metrics for tracking and reporting managing monitoring services using automation tools and coordinating cross-team communications related to releases and maintenance Automation SREs support existing Infrastructure Developers by taking ownership of application support and process work required to manage these applications at scale in a 24x7 environment This allows developers to focus on building new features and functionality Key Functions Application Tool Support Support existing applications and services hosted by the Infrastructure Automation InfAuto team Develop runbooks for application support and maintenance Create detailed alerts for incident management and monitoring tools Implement and manage an updated operations platform for the Technical Operations team Service Introduction Communications Develop communication plans for service and tool launches Improve messaging around service interruptions and maintenance Infrastructure Automation Expand use of cloud development pipelines for new observability capabilities Support cloud infrastructure integration Use scripts to perform maintenance tasks Monitoring Observability Define KPIs and SLAs for managed services Assist with dashboard development and management Integrate cloud infrastructure with monitoring and reporting tools Conduct capacity planning to support proactive scaling Operational Excellence Design and execute high availability HA and disaster recovery DR infrastructure testing Partner with operations teams to expedite issue analysis Coordinate change management activities with application users Required Skills and Tools Experience Experience Range 3-6 years using tools in the following categories Infrastructure as Code Terraform CloudFormation or similar Configuration Management Ansible Puppet or Chef Container Technologies Docker Podman basic Kubernetes concepts Observability Platforms Grafana Elastic ELK DataDog Splunk Issue Project Tracking JIRA ServiceNow Trello or similar CI CD Pipelines Jenkins GitLab CI GitHub Actions Documentation Tools SharePoint Confluence for user guides runbooks etc Linux Operating Systems Red Hat Enterprise Linux or similar CentOS Rocky Fedora Database Operations SQL PostgreSQL IDEs Visual Studio Code VS Code JetBrains IntelliJ IDEA Desired Skills 2-4 years in an L1 SRE or DevOps role Experience as a Systems Engineer infrastructure design and implementation Platform Engineer internal tooling and platform development Cloud Engineer multi-cloud experience and migration projects Application Support production troubleshooting Release Engineer software deployment and release management Incident Response on-call experience and production issue resolution Company Benefits Perks Competitive salary package Performance-based annual bonus cash and stocks Hybrid working model 3 days office week Group Medical Life Insurance Modern offices with free amenities fully stocked cafeterias Monthly food card company-paid snacks Hardship shift allowance with company-provided pickup drop facility Attractive employee referral bonus Frequent company-sponsored team-building events and outings Depending upon the shifts The benefits package is subject to change at the management s discretion



  • India Concord Full time

    SRE Sr. Engineers (Individual Contributors) Key Attributes : Strong SRE (Site Reliability Engineering) experience Dev Ops skills – CI/CD, monitoring, automation, infrastructure as code, etc. Excellent troubleshooting and debugging skills (infrastructure + application level) Perseverance – must push through complex/challenging issues without...


  • Chandigarh, India Wits Innovation Lab Full time

    Job Description Site Reliability Engineer (SRE) Senior Role Location : Mohali Experience : 4+ years We are looking for an experienced Site Reliability Engineer (SRE) to strengthen our cloud and infrastructure team. The role involves owning reliability, availability, and scalability of distributed platforms, while driving automation and observability best...


  • India Employ Full time

    Role - Site Reliability Engineer (SRE)/ Platform Engineering/ or Dev Ops Engineering roles Location – Fully Remote Type - 6 months Contract Work Ex - 5+ Yrs We’re working with a AI product company that’s building the next generation of Gen AI powered developer platforms . We’re looking for an experienced Site Reliability Engineer to join...


  • Bengaluru, Karnataka, India, Karnataka Tata Consultancy Services Full time

    Role**: Manager, Site Reliability EngineeringRequired Technical Skill Set: Manager, Site Reliability EngineeringDesired Experience Range: 12 - 18 yrsNotice Period: Immediate to 90Days onlyLocation of Requirement: BangaloreWe are currently planning to do a Virtual Interview Job Description:Describe what the person will do in the role - how he/she will impact...


  • India Zensar Technologies Full time

    Candidate having skilled and proactive Site Reliability Engineer (SRE) with 10 Years experience The SRE will be responsible for ensuring the reliability, scalability, and performance of our systems and infrastructure. This role blends software engineering with IT operations to build fault-tolerant, self-healing systems and drive continuous improvement across...


  • Delhi, India Weekday (YC W21) Full time

    Job Description This role is for one of our clients Company Name: Neemtree Industry: Technology, Information and Media Seniority level: Mid-Senior level Min Experience: 4 years Location: Gurugram, Delhi, NCR JobType: full-time We're looking for a Site Reliability & Automation Engineer who thrives at the intersection of infrastructure, automation, and...


  • India Akamai Full time

    Do you want to grow your career in Linux and Site Reliability Engineering? Would you like to contribute to the foundation of a new public cloud platform? Join our IaaS Site Reliability Engineering (SRE) team. We design, develop, and operate infrastructure and services that power the backbone of our cloud platform. This is a rare opportunity to help build a...


  • Bengaluru, Karnataka, India, Karnataka HDFC Limited Full time

    Hiring for Lead / Sr Site Reliability Engineer for Mumbai & Bangalore LocationExperience - 8 - 14 Years Job PurposeAnalysing, troubleshooting, and designing vital services, platforms, and infrastructure on GCP while always thinking about reliability, scalability, resilience, security, and performance. Job Responsibilities: Help build a Site Reliability...


  • Bengaluru, Karnataka, India, Karnataka JRD Systems Full time

    Position: Site Reliability Engineer (SRE) Role Overview: We are seeking an experienced Site Reliability Engineer (SRE) with a strong background in Windows infrastructure to manage and optimize our cloud and on-premises environments. The ideal candidate will partner with development teams to improve service reliability, implement automation, and ensure...


  • Hyderabad, India Lilly Full time

    Job Description About the Role We are seeking a highly experienced Site Reliability Engineer (SRE) who will play a key role in designing, building, and scaling reliable, automated, and self-healing infrastructure and applications. This role requires someone who is not only strong in system operations but also in engineering mindset, coding, and automation...