Lead - Site Reliability Engineering

3 weeks ago


Bengaluru, India Fidelity Investments Full time

The Purpose of This Role

The team comes from diverse technical backgrounds, and the responsibilities provide the opportunity for a variety of challenges. Ideal candidates will have a background in critical production support maintaining FBSI systems and systems engineering with a desire to learn the other or previous experience as an SRE. We are looking for a Systems Thinking, SRE who has helped teams scale through production insights, on-call support, operational automation, developer guidance, real-time metrics & DC-DR or vice-versa switch over.

The Value You Deliver

Passion for technology and the financial domain with demonstrated ability to learn quickly

Building quality solutions that align with the technology blueprint and best practices to solve business problems by driving design, development and ongoing support.

Work with our global team and provide technical direction in building solutions.

Actively participating in knowledge sharing sessions, code and design reviews etc.

The Skills that are Key to this role

Technical / Behavioral

You have extensive knowledge on Production on-call support for Cloud Infrastructure which is running in EKS platform

You have extensive experiences in Change, Incident, Problem Management & on-call support

You have extensive knowledge on observability tools (Preferable – DataDog), Grafana & Prometheus

You have experience in monitoring various aspects like Log, Metrics, APM, Event, Infrastructure & including of Dashboard creation

You have experience in multiple AWS services like EC2, EBS, S3, NLB, IAM, Lambda, Cloud-Watch, Cloud Trail & VPC. Rehydration or Patching knowledge's in cloud infrastructure

You have experience in Microservices Architecture like API Gateway or APIGEE

You have the ability to triage, execute root cause analysis and be decisive under pressure

You have strong communication skills with the ability to clearly and concisely put forth concepts and ideas.

You are capable to work with a variety of individuals and groups, both in-person and virtually, in a constructive and collaborative manner to build and maintain effective relationships

The Skills that are Good To Have for this role

You are able to do automation using Shell or Python

You have exposure to CFM/IaaC like Ansible & Terraform

You know on CI/CD tool like Jenkins

You know Kafka/MQ administration skill-set

You are familiar with Redis on AWS would be a plus

You have exposure to database administration (especially on NoSQL like Mongo or Maria-DB or CrDB)

You have exposure to document creation for knowledge & process will be an added advantage

You have the ability to drive the install calls like Monthly, rehydration/Upgradation & DC/DR switch


  • Site Reliability Lead

    4 weeks ago


    Bengaluru, India Domnic Lewis International Full time

    Purpose: As a Site Reliability Engineering Lead, you will bridge the gap between Development, Cloud Platform Engineering Teams and Product Owners of different Digital Offerings. Defining and implementing the SRE-concepts with our teams, and aligning the service quality with the business objectives and user expectations will be at the core of your - Define...

  • Site Reliability Lead

    3 months ago


    Bengaluru, India Domnic Lewis International Full time

    Purpose: As a Site Reliability Engineering Lead, you will bridge the gap between Development, Cloud Platform Engineering Teams and Product Owners of different Digital Offerings. Defining and implementing the SRE-concepts with our teams, and aligning the service quality with the business objectives and user expectations will be at the core of your - Define...


  • Bengaluru, India JPMorgan Chase Full time

    Assume a critical role in defining the future of a globally recognized firm and have a direct and significant effect in a realm tailored for top achievers in site reliability.As a Lead Site Reliability Engineer at JPMorgan Chase within the Finance Technology, you hold a leadership role in your team, demonstrate strong knowledge across multiple technical...


  • Bengaluru, Karnataka, India Connectio IT Pvt Ltd Full time

    Overview : The Lead Site Reliability Engineer (SRE) - Observability KPI at Newrelic plays a crucial role in ensuring the reliability, availability, and performance of Newrelic's observability platform. This role is essential in maintaining and improving the observability key performance indicators (KPIs) to meet customer expectations and support Newrelic's...


  • Bengaluru, Karnataka, India Yo HR Consultancy Full time

    Role : Lead Site Reliability Engineer Location : Bangalore, Karnataka, IndiaExperience : 8-12 YearsMust Have : Site Reliability EngineeringSkills : Troubleshooting On call support Linux Monitoring tools AWS services Scripting(Python/Shell/Bash) CI/CD pipleine + & Responsibilities :Responsibilities : Collaborating with customer success managers and...


  • Bengaluru, India Tyson Foods India Full time

    Job Description – Lead Site Reliability Engineer (Cloud Engineering)The role as Lead Site Reliability Engineer in the Data & Analytics organization, is to lead efforts in ensuring the reliability, scalability, and performance of our cloud-based systems in like GCP/AWS. The role will play a crucial part in designing and implementing robust, scalable...


  • Bengaluru, Karnataka, India Waytogo Consultants Full time

    Job Description :As an SRE Lead (Site Reliability Engineering Lead), you will play a crucial role in ensuring the reliability, scalability, and performance of our systems and services. He/ She will lead a team of SREs (Site Reliability Engineers) and collaborate closely with development teams to build and maintain highly available and resilient systems. Your...


  • Bengaluru, India Tyson Foods India Full time

    Job Description – Lead Site Reliability Engineer (Cloud Engineering) The role as Lead Site Reliability Engineer in the Data & Analytics organization, is to lead efforts in ensuring the reliability, scalability, and performance of our cloud-based systems in like GCP/AWS. The role will play a crucial part in designing and implementing robust, scalable...


  • Bengaluru, India Delta Air Lines Full time

    Key Responsibilities: Execute on the Incident, Change Management, Problem Management processes Building and supporting a reliable application suite for the environment in order to meet the development and maintenance requirements of systems/platforms. Provide consultation and direct technical support in life cycle planning, problem management, integration,...


  • Bengaluru, Karnataka, India Delta Air Lines Full time

    Key Responsibilities:Execute on the Incident, Change Management, Problem Management processesBuilding and supporting a reliable application suite for the environment in order to meet thedevelopment and maintenance requirements of systems/platforms.Provide consultation and direct technical support in life cycle planning, problem management,integration, and...


  • Bengaluru, India Delta Air Lines Full time

    Key Responsibilities:Execute on the Incident, Change Management, Problem Management processesBuilding and supporting a reliable application suite for the environment in order to meet thedevelopment and maintenance requirements of systems/platforms.Provide consultation and direct technical support in life cycle planning, problem management,integration, and...


  • Bengaluru, India CloudBees Full time

    J ob Title - Manager, Site Reliability Engineer Location - Bangalore and Chennai Year of Experience - 10+ Years About CloudBees CloudBees is the leading software delivery platform that enables enterprises to deliver scalable, compliant, and secure software, empowering developers to do their best work. Seamlessly integrating into any hybrid and...


  • Bengaluru, India Cricbuzz.com Full time

    Site Reliability Engineer We are looking for a highly skilled and motivated Web Server Site Reliability Engineer to join our team. As a Web Server Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our web server infrastructure and CDN services. Experience - 3 - 5 years Responsibilities: ●...


  • Bengaluru, India Encora Inc. Full time

    Position: Site Reliability EngineerLocation: BangaloreExperience: 4+ YearsJob Mode: Full-timeWork Mode: RemoteResponsibilities and DutiesCollaborate with cross-functional teams to design, implement, and maintain reliable and scalable infrastructure solutions on the Azure cloud platform.Implement and maintain monitoring and logging solutions using Azure...


  • Bengaluru, India Protoporos Staffing Services Private Limited Full time

    Opportunity with a leading B2B SaaS product client specializing in cutting-edge data integration solutions Position Overview: We are seeking a highly skilled and experienced Staff Site Reliability Engineer to join our team. As a Staff SRE, you will play a critical role in ensuring the reliability, scalability, and performance of our data integration...


  • Bengaluru, India Integra Connect Full time

    About IntegraConnect Integra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud platform, the company’s core applications span population health including care...


  • Bengaluru, India First American (India) Full time

    The Role:A SRE Manager is ultimately responsible for system reliability, developer productivity and reducing time to market by striving to reduce technical debt of the services your SRE team supports. We seek managers who are passionate about site reliability to influence and drive the strategic SRE mission.As a Site Reliability Engineering Manager working...


  • Bengaluru, Karnataka, India Ensono Full time

    About RoleEnsono is continuing its growth and building a cloud-native managed service offering for our clients. We are looking for energetic and skilled remote Site Reliability Engineers to join us on this exciting new journey. As a Site Reliability Engineer, you and your team will be responsible for between four and ten of Ensono cloud-native managed...


  • Bengaluru, Karnataka, India Cyitechsearch Full time

    We are hiring for Site Reliability Engineer Skills : Develop and provide operational support for fullstack software applications. Relevant industry certifications, such as through the Site Reliability Engineering (SRE) Foundation. Five years' experience as a site reliability engineer or similar role. Collaborate with development operations staff to create,...


  • Bengaluru, India Cyitechsearch Full time

    We are hiring for Site Reliability Engineer Skills : - Develop and provide operational support for full-stack software applications.- Relevant industry certifications, such as through the Site Reliability Engineering (SRE) Foundation.- Five years' experience as a site reliability engineer or similar role.- Collaborate with development operations staff...