Site Reliability Engineering Lead

3 weeks ago


Bengaluru, India ZEISS Group Full time

CARL ZEISS

Carl Zeiss AG branded as ZEISS, is a German manufacturer of optical systems and optoelectronics, founded in Jena, Germany in 1846 by optician Carl Zeiss.

ZEISS is headquartered in Oberkochen, Germany and enjoys a global presence and rich heritage of being in business for more than 170 years.

ZEISS today operates in the following businesses:

• Semiconductor Manufacturing Technology

• Industrial Quality & Research

• Medical Technology

• Consumer Markets

We are located today internationally in almost 50 countries and have 25 research & development sites, 60 sales & services locations and 30 production sites.

In India, ZEISS is headquartered in Bangalore and has been present in India for 20+ years with an employee strength 1000+ has been one of the Top 10 markets of ZEISS. We have all the above Business Groups & 3 Global Centers in India. The global centers include:

- Global IT center

- Global R&D Center

- Global Production and Assembly Facility

Our R&D and IT teams have seen tremendous growth in the last couple of years with some exciting projects in hand which provide global exposure via global stakeholders while working with one of the best German companies in the world.

In India, other than Bangalore we have a production unit in Delhi and offices in Delhi, Mumbai, Kolkata, etc.

MANDATORY:

To know more about ZEISS and to understand the careers that ZEISS offers we urge you to please log onto our careers page to see the careers ZEISS offers and read our employees stories which will give you insights of the work, culture and careers offered

We would like to mention ZEISS does not offer you a job it offers you a career full of learning, global experience and exposure and challenging work and a chance to not be a part of the process but to manage and experience the entire process end to end.

You can also go through our page:


Site Reliability Engineering Lead


We are currently seeking a highly motivated and skilled Site Reliability Engineering Lead to join our dynamic team. If you are passionate about setting up SRE practices, we want to hear from you.

As a Site Reliability Engineering Lead, you will bridge the gap between Development, Cloud Platform Engineering Teams and Product Owners of different Digital Offerings. Defining and implementing the SRE-concepts with our teams, and aligning the service quality with the business objectives and user expectations will be at the core of your responsibilities:


What you will do:

Define and measure the reliability of the service using SLI, SLOs and consider the risk minimization of service degradation.


Enable the development team to bring new software or new features (Digital Offering) to production as quickly as possible, while also ensuring an agreed-upon acceptable level of IT operations performance and error risk in line with the service level agreements (SLAs) agreed.


  • Closely cooperate with different Product Owners, Site Reliability Engineers, and the Cloud Platform Teams to define processes to migrate between different Cloud Platforms while ensuring reliability for business offerings.


  • Work with multiple Site Reliability Engineers for operations and system administration tasks - analyzing logs, performance tuning, applying patches, testing production environments, identify opportunities and drive the design and implementation of end-to-end observability, alerting, self-healing and automation capabilities to improve service health, manageability, and reliability.


  • Closely work with with different stake holders (POs, SREs and Platform Team) to define Incident Management Process as required for responding to incidents, drive postmortems reviews for improving the service quality.


  • Closely work with Dev and SRE team to select appropriate metrics related to observability and reliability as well as defining SLIs and SLOs


  • Define and drive observability for self-developed software and the managed cloud components by collecting appropriate observability data for insights and alerting including setting up proper alerting for critical components.


  • Ensure availability and responsiveness of application by setting up and maintaining the required documentation method and tools. Building Playbooks for troubleshooting techniques to effectively identify and investigate issues that can be used by SREs.


  • Handle resolution of blockers, escalation to stakeholders, and provisioning of resources.


  • Own availability, performance, and supportability targets for the service.


  • Author functional and technical documentation and remain current on relevant technologies and procedures.



What you should have:


  • 8- 12 years of relevant industry experience.
  • Minimum of 3 years as a Site Reliability Engineering Lead.
  • Minimum of 5 years’ experience as a Site Reliability Engineer
  • Minimum of 8 years’ experience with cloud computing platforms like Azure and related services.
  • In depth knowledge of system architecture, networking, and microservice based distributed systems.
  • Expertise in designing and implementing reliable, scalable, and fault-tolerant systems using container Orchestration Technologies like Docker and Kubernetes.
  • Proficiency in setting up and managing monitoring, alerting, and logging systems for early detection and resolution of issues for container orchestrators like Kubernetes using Tools like Prometheus, Grafana, Open Telemetry Collector or similar tools.
  • Hands on experience in incident management, including incident response, troubleshooting, and post-mortem analysis.
  • Proficiency in coding/scripting languages commonly used in infrastructure automation and monitoring (such as Terraform).
  • Knowledge of best practices in disaster recovery planning and execution for cloud-based Systems.
  • Ability to lead and mentor a team of SREs, providing guidance, support, and coaching.
  • Capability to advocate for SRE best practices and principles within the organization and drive cultural changes as needed.
  • Willingness to stay updated with the latest trends, tools, and technologies in the field of site reliability engineering.
  • Strong communication skills to effectively collaborate with cross-functional teams, including Software Developers, Product Owners, and Cloud Platform Engineers.


At ZEISS we encourage creative thinking and innovation. We work in dynamic and interdisciplinary, cross-functional teams and offer individual development perspectives as well as flexibility in organizing your work. We care about our employees and take responsibility for improving society and preserving our environment. These core values have shaped our corporate culture at ZEISS for over 170 years. Take responsibility and shape the future of ZEISS



  • Bengaluru, India ZEISS Group Full time

    CARL ZEISSCarl Zeiss AG branded as ZEISS, is a German manufacturer of optical systems and optoelectronics, founded in Jena, Germany in 1846 by optician Carl Zeiss.ZEISS is headquartered in Oberkochen, Germany and enjoys a global presence and rich heritage of being in business for more than 170 years.ZEISS today operates in the following businesses:•...

  • Site Reliability Lead

    3 weeks ago


    Bengaluru, India Domnic Lewis International Full time

    Purpose: As a Site Reliability Engineering Lead, you will bridge the gap between Development, Cloud Platform Engineering Teams and Product Owners of different Digital Offerings. Defining and implementing the SRE-concepts with our teams, and aligning the service quality with the business objectives and user expectations will be at the core of your - Define...


  • Bengaluru, India ZEISS Group Full time

    CARL ZEISSCarl Zeiss AG branded as ZEISS, is a German manufacturer of optical systems and optoelectronics, founded in Jena, Germany in 1846 by optician Carl Zeiss.ZEISS is headquartered in Oberkochen, Germany and enjoys a global presence and rich heritage of being in business for more than 170 years.ZEISS today operates in the following businesses:•...


  • Bengaluru, India ZEISS Group Full time

    CARL ZEISSCarl Zeiss AG branded as ZEISS, is a German manufacturer of optical systems and optoelectronics, founded in Jena, Germany in 1846 by optician Carl Zeiss.ZEISS is headquartered in Oberkochen, Germany and enjoys a global presence and rich heritage of being in business for more than 170 years.ZEISS today operates in the following businesses:•...


  • Bengaluru, India ZEISS Group Full time

    CARL ZEISSCarl Zeiss AG branded as ZEISS, is a German manufacturer of optical systems and optoelectronics, founded in Jena, Germany in 1846 by optician Carl Zeiss.ZEISS is headquartered in Oberkochen, Germany and enjoys a global presence and rich heritage of being in business for more than 170 years.ZEISS today operates in the following businesses:•...


  • Bengaluru, India ZEISS Group Full time

    CARL ZEISS Carl Zeiss AG branded as ZEISS, is a German manufacturer of optical systems and optoelectronics, founded in Jena, Germany in 1846 by optician Carl Zeiss. ZEISS is headquartered in Oberkochen, Germany and enjoys a global presence and rich heritage of being in business for more than 170 years. ZEISS today operates in the following businesses: •...


  • Bengaluru, India HCLSoftware Full time

    Position – SRE Architect/ Lead Site Reliability EngineerLocation – Pune/Bangalore/Chennai/NoidaExp – 14+We are busy, growing quickly and have an incredible workforce who are committed to becoming the #1 Software company in the world. Come join HCL s fast-growing, $2B software business and make an impact from Day 1!This is an exciting time to be joining...


  • Bengaluru, India HCLSoftware Full time

    Position – SRE Architect/ Lead Site Reliability EngineerLocation – Pune/Bangalore/Chennai/NoidaExp – 14+We are busy, growing quickly and have an incredible workforce who are committed to becoming the #1 Software company in the world. Come join HCL s fast-growing, $2B software business and make an impact from Day 1!This is an exciting time to be joining...


  • Bengaluru, India The HRBPs Full time

    Lead Site Reliability Engineer - BangaloreExperience - 8 to 12 yearsResponsibilities :- Collaborating with customer success managers and solutions engineers to bring deep technical expertise in implementing intelligent automation solutions for customers.- Providing customers and solution engineers with ongoing technical support for complex issues and support...


  • Bengaluru, India JPMorgan Chase & Co. Full time

    Elevate your engineering prowess to unprecedented levels by joining a team of exceptionally gifted professionals and position yourself among the top echelon in site reliability. As a Principal Site Reliability Engineer at JPMorgan Chase within the Asset and Wealth management, you work with your fellow stakeholders to define non-functional requirements...


  • Bengaluru, India HCLSoftware Full time

    Position – SRE Architect/ Lead Site Reliability Engineer Location – Pune/Bangalore/Chennai/Noida Exp – 14+ We are busy, growing quickly and have an incredible workforce who are committed to becoming the #1 Software company in the world. Come join HCL s fast-growing, $2B software business and make an impact from Day 1! This is an exciting time to be...


  • Bengaluru, India Ensono Full time

    About RoleEnsono is continuing its growth and building a cloud-native managed service offering for our clients. We are looking for energetic and skilled remote Site Reliability Engineers to join us on this exciting new journey. As a Site Reliability Engineer, you and your team will be responsible for between four and ten of Ensono cloud-native managed...


  • Bengaluru, India Maynor Consulting Full time

    CompanyOverview:MaynorConsulting is a leading Information Technology & Servicescompany dedicated to providing innovative solutions in the field.With a strong focus on reliability and efficiency we helpbusinesses optimize their operations and achieve their goals. Joinour dynamic team and contribute to transforming businesses throughcuttingedge...


  • Bengaluru, India Ensono Full time

    About RoleEnsono is continuing its growth and building a cloud-native managed service offering for our clients. We are looking for energetic and skilled remote Site Reliability Engineers to join us on this exciting new journey. As a Site Reliability Engineer, you and your team will be responsible for between four and ten of Ensono cloud-native managed...


  • Bengaluru, India Cyitechsearch Full time

    We are hiring for Site Reliability Engineer Skills : - Develop and provide operational support for full-stack software applications.- Relevant industry certifications, such as through the Site Reliability Engineering (SRE) Foundation.- Five years' experience as a site reliability engineer or similar role.- Collaborate with development operations staff...


  • Bengaluru, India Integra Connect Full time

    About IntegraConnectIntegra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud platform, the company’s core applications span population health including care...


  • Bengaluru, India Ensono Full time

    About RoleEnsono is continuing its growth and building a cloud-native managed service offering for our clients. We are looking for energetic and skilled remote Site Reliability Engineers to join us on this exciting new journey. As a Site Reliability Engineer, you and your team will be responsible for between four and ten of Ensono cloud-native managed...


  • Bengaluru, India Ensono Full time

    About RoleEnsono is continuing its growth and building a cloud-native managed service offering for our clients. We are looking for energetic and skilled remote Site Reliability Engineers to join us on this exciting new journey. As a Site Reliability Engineer, you and your team will be responsible for between four and ten of Ensono cloud-native managed...


  • Bengaluru, India ViewSonic Full time

    Job Requirements:1. Bachelor's degree in Computer Science, Engineering, or a related field.2. 1+ year of experience in a relevant role, such as Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory.3. Basic understanding of AWS solutions including EC2, S3, CloudWatch, Lambda, and RDS.4. Interest and understanding of Platform...


  • Bengaluru, India Ultrabot Innovations Full time

    Position Overview :As a Senior Site Reliability Engineer with 5-8 years of experience, you will play a key role in ensuring the reliability, scalability, and performance of our systems and infrastructure. You will leverage your expertise in Site Reliability Engineering (SRE) to implement best practices and methodologies, effectively troubleshoot complex...