Site Reliability Engineering Manager

2 months ago


Mumbai, India Talent Socio Full time

Job Description :


- Lead and mentor a team of Site Reliability Engineers (SREs) responsible for ensuring the reliability, availability, and performance of critical systems.


- Establish and enforce engineering practices focused on automation, monitoring, and process improvement to enhance system reliability and operational efficiency.


- Conduct thorough and transparent blameless postmortems for incidents, ensuring clear Root Cause Analyses (RCAs) and actionable follow-ups.


- Drive the implementation of non-functional requirements including capacity planning, cost analysis, and instrumentation integration throughout the development lifecycle.


- Define and prioritize SRE initiatives, tasks, and projects, collaborating closely with stakeholders to align with business objectives.


- Implement a metrics-driven approach to monitor and improve service quality targets, leveraging data-driven insights to drive continuous improvement.


- Lead the migration of data and services from traditional to cloud-based environments, ensuring seamless integration and optimal :


- Bachelor's or Master's Degree in Computer Science, Information Systems, or a related field.


- 9-12 years of experience in Site Reliability Engineering or related roles, with proven leadership

experience managing teams.


- Expertise in automating infrastructure, testing, and deployments using tools such as Terraform, CloudFormation, Ansible, Jenkins, and other industry-standard tools.


- Hands-on experience with AutoSys Workload Automation, including configuration and management of Job Scheduler, Event Server, and Application Server.


In-depth knowledge of AWS cloud infrastructure, including VPC, EC2, EKS, ELB, RDS, Lambda, S3, SES, SNS, and Containers.


- Proficiency in scripting and programming languages such as Python, Bash, PowerShell, JavaScript, .NET, Java, and SQL.


- Experience designing and implementing technical roadmaps, project plans, and architectural designs in AWS environments.

(ref:hirist.tech)

  • Mumbai, India dentsu Full time

    The purpose of this role is to ensure the availability and stability of production and test platforms. Job Title: Site Reliability Engineer Job Description: Key responsibilities:Troubleshoots and owns issues in our development, test and production environments. Including performance optimisation and continuous tuningWorks alongside the DevOps team in...


  • Mumbai, India IDFC FIRST Bank Full time

    Role/ Job Title:  Senior Site Reliability Engineering Manager Function/ Department:  Information Technology Job Purpose: Site Reliability Engineering (SRE) department plays a pivotal role in providing seamless experience for our customers. With state-of-the-art technology and tools, we are transforming the overall application development and...


  • mumbai, India IDFC FIRST Bank Full time

    Role/ Job Title:  Senior Site Reliability Engineering Manager Function/ Department:  Information Technology Job Purpose: Site Reliability Engineering (SRE) department plays a pivotal role in providing seamless experience for our customers. With state-of-the-art technology and tools, we are transforming the overall application development and...


  • Mumbai, India IMC Full time

      As a Site Reliability Engineer at IMC, you'll be an integral member of a highly experienced team, responsible for maintaining a robust, best in class, low latency trading environment. The skills necessary to excel could range from system administration, network troubleshooting, database optimization, software development, release management and...


  • Mumbai, India CimpressVista Full time

    Senior Site Reliability Engineer You have successfully completed a degree in computer science or comparable training (e.g. as an ITspecialist) or have gained several years of relevant professional experience in the DevOpsenvironment.Experience working with:Agile methods and cloud technologies/architecture in AWS.Database administration to a small extent...


  • mumbai, India RELX India (Pvt) Ltd Risk div Company Full time

    About the role We are seeking a highly motivated and experienced Site Reliability Engineer (SRE) to manage and optimize our AWS cloud resources. The ideal candidate will have a strong background in AWS, Terraform, Kubernetes, and scripting, with proficiency in monitoring and CI/CD tools. Experience with Hashicorp Vault is a plus. Responsibilities: ...


  • Mumbai, India Jio Full time

    Site Reliability Engineer (SRE) with Automation Job OverviewAs a Site Reliability (SRE)/DevOps Automation Engineer, you will be responsible for the availability, automation, performance, efficiency, Scaling, monitoring and emergency response for any incidents/issues in Applications. You will use your deep understanding of platforms, architecture, people,...


  • mumbai, India Jio Full time

    Site Reliability Engineer (SRE) with Automation Job Overview As a Site Reliability (SRE)/DevOps Automation Engineer, you will be responsible for the availability, automation, performance, efficiency, Scaling, monitoring and emergency response for any incidents/issues in Applications. You will use your deep understanding of platforms, architecture,...

  • Site Engineer

    3 weeks ago


    Mumbai, India Aquamech Engineering Corporation Full time

    Job description :Having experience in handling sites for Water Treatment Plant & Pharmaceutical piping for Purified Water, Water for injection, and other Clean Utilities.Should be able to coordinate with client and project manager to manage the site independently.Successful installation, commissioning & handling of all systems.

  • Site Engineer

    3 weeks ago


    Mumbai, India Aquamech Engineering Corporation Full time

    Job description :Having experience in handling sites for Water Treatment Plant & Pharmaceutical piping for Purified Water, Water for injection, and other Clean Utilities.Should be able to coordinate with client and project manager to manage the site independently.Successful installation, commissioning & handling of all systems.


  • Mumbai, India Cyber Sphere LLC Full time

    Site Reliability Engineer (SRE) to join our team. Qualifications :- 4+ years of Software Engineering experience- BS Engineering/Computer Science or equivalent experience requiredResponsibilities :- Design, deploy, and maintain a highly available and scalable data infrastructure on Azure open ai , databases and event driven services- Monitor and optimize the...


  • mumbai, India Antal International Full time

    Job Description A major player in the tech industry, which specializes in retail technology, AI, ML, and big data, is seeking new talent. Established by alumni from a top engineering institute, this organization manages a vast network of brands and stores. Headquartered in Mumbai, it is recognized for its innovation and expertise across multiple tech...


  • Mumbai, India Cyber Sphere LLC Full time

    SALARY : 40LPA - 60LPAWe are seeking a talented and experienced Site Reliability Engineer (SRE) to join our team. As an SRE, you will play a crucial role in ensuring the reliability, scalability, and performance of our Azure AI Services platform. You will work closely with cross-functional teams to design, implement, and maintain robust infrastructure and...


  • Mumbai, India Ztek Consulting INC Full time

    Job Title: Senior Site Reliability Engineer(SRE) Duration: 612 months Location: HybridFort Worth TX Work Type: Rate: Pay rangeoffered to a successful candidate will be based on several factorsincluding the candidates education work experience work locationspecific job duties certifications etc. JobSummary: A Site Reliability Engineer is responsible...

  • Senior Site Engineer

    3 weeks ago


    Mumbai, India Aquamech Engineering Corporation Full time

    Tips: We are looking for a Senior Site Engineer to join our team and play a key role in the installation and commissioning of pharma water systems, including RO EDI, HSRO EDI, DM, and distribution systemsResponsibilitiesLead and supervise a team of engineers and technicians in the installation and commissioning of pharma water systems.Ensure that all systems...

  • Senior Site Engineer

    3 weeks ago


    Mumbai, India Aquamech Engineering Corporation Full time

    Tips: We are looking for a Senior Site Engineer to join our team and play a key role in the installation and commissioning of pharma water systems, including RO EDI, HSRO EDI, DM, and distribution systemsResponsibilitiesLead and supervise a team of engineers and technicians in the installation and commissioning of pharma water systems.Ensure that all systems...


  • Mumbai, India Aquamech Engineering Corporation Full time

    Tips: We are looking for a Senior Site Engineer to join our team and play a key role in the installation and commissioning of pharma water systems, including RO EDI, HSRO EDI, DM, and distribution systemsResponsibilitiesLead and supervise a team of engineers and technicians in the installation and commissioning of pharma water systems.Ensure that all systems...


  • Mumbai, India Session AI Full time

    Are you ready to make your mark with a true industry disruptor? ZineOne, a subsidiary of Session AI, the pioneer of in-session marketing, is looking to add talented team members to help us grow into the premier revenue tool for e-commerce. We work with some of the leading brands nationwide and we innovate how brands connect with and convert customers.Job...


  • mumbai, India Session AI Full time

    Are you ready to make your mark with a true industry disruptor? ZineOne, a subsidiary of Session AI, the pioneer of in-session marketing, is looking to add talented team members to help us grow into the premier revenue tool for e-commerce. We work with some of the leading brands nationwide and we innovate how brands connect with and convert customers. Job...


  • Navi Mumbai, India Capabiliq IT Services (OPC) Private Limited Full time

    Responsibilities :- Define processes for the DevOps program and align to best practice standards- Support of Product delivery teams integrating into existing pipelines and platforms.- Plan for and manage operational resilience for network and application while minimizing the effect on the business- Develop and extend DevOps tooling and automation efforts...