Current jobs related to Site Reliability Engineer/Architect - BangaloreAny Location - Grizmo Labs


  • bangalore, India Cricbuzz.com Full time

    Site Reliability EngineerWe are looking for a highly skilled and motivated Web Server Site Reliability Engineer to join our team. As a Web Server Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our web server infrastructure and CDN services.Experience - 3 - 5 yearsResponsibilities:● Design,...


  • bangalore, India Cricbuzz.com Full time

    Site Reliability EngineerWe are looking for a highly skilled and motivated Web Server Site Reliability Engineer to join our team. As a Web Server Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our web server infrastructure and CDN services.Experience - 4 - 5 yearsResponsibilities:● Design,...


  • bangalore, India Cricbuzz.com Full time

    Site Reliability Engineer We are looking for a highly skilled and motivated Web Server Site Reliability Engineer to join our team. As a Web Server Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our web server infrastructure and CDN services. Experience - 4 - 5 years Responsibilities: ●...


  • bangalore, India Moveworks Full time

    Who We Are Moveworks is the universal AI copilot for search and automation across all your business applications. We give employees one place to go to find information and get support while reducing costs for your business. The Moveworks Copilot is powered by an industry-leading Reasoning Engine that uses a combination of public and proprietary language...


  • Bangalore, India Signify Netherlands B.V. Full time

    Signify, the new company name of Philips Lighting, is the global leader in lighting building on 125+ years of innovations. Our purpose is to unlock the extraordinary potential of light for brighter lives and a better world.We are proud to be ahead of the game in the Internet of Things and being carbon neutral. We learn through disruptive challenges and our...


  • bangalore, India tsworks Full time

    Who We Are tsworks Technologies India Private Limited (subsidiary of The Software Works, Inc, USA) is a technology product and services company. Our mission is to provide domain expertise, innovative solutions and thought leadership to empower businesses to thrive in a digital world. We value our employees, take pride in providing best value in customer...


  • bangalore, India BETSOL Full time

    Description Senior Site Reliability Engineer Roles & Responsibilities: Create and Review Design architecture solutions to meet the technical and functional requirements.Work towards Setting up pipelines and Writing Terraform Scripts for provisioning Infrastructure and automating workflows.Identify, communicate and mitigate Risks,...


  • bangalore, India BETSOL Full time

    Description Senior Site Reliability Engineer Roles & Responsibilities: Create and Review Design architecture solutions to meet the technical and functional requirements. Work towards Setting up pipelines and Writing Terraform Scripts for provisioning Infrastructure and automating workflows. Identify, communicate and mitigate Risks, Assumptions,...


  • bangalore, India Integra Connect Full time

    About IntegraConnect Integra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud platform, the company’s core applications span population health including care...


  • bangalore, India Tyson Foods India Full time

    Job Description – Lead Site Reliability Engineer (Cloud Engineering) The role as Lead Site Reliability Engineer in the Data & Analytics organization, is to lead efforts in ensuring the reliability, scalability, and performance of our cloud-based systems in like GCP/AWS. The role will play a crucial part in designing and implementing robust, scalable...


  • bangalore, India Tyson Foods India Full time

    Job Description – Lead Site Reliability Engineer (Cloud Engineering) The role as Lead Site Reliability Engineer in the Data & Analytics organization, is to lead efforts in ensuring the reliability, scalability, and performance of our cloud-based systems in like GCP/AWS. The role will play a crucial part in designing and implementing robust, scalable...


  • bangalore, India Tyson Foods India Full time

    Job Description – Lead Site Reliability Engineer (Cloud Engineering) The role as Lead Site Reliability Engineer in the Data & Analytics organization, is to lead efforts in ensuring the reliability, scalability, and performance of our cloud-based systems in like GCP/AWS. The role will play a crucial part in designing and implementing robust, scalable...


  • bangalore, India Microsoft Full time

    Overview Looking to join an exciting industry and organization at the forefront of the next Tech industry transformation? Are you ready to join a team of the world’s best technical experts to enable the success of Microsoft solutions for our commercial & enterprise customers? We are seeking to build out the team of next generation Site Reliability...


  • bangalore, India Microsoft Full time

    Overview Looking to join an exciting industry and organization at the forefront of the next Tech industry transformation? Are you ready to join a team of the world’s best technical experts to enable the success of Microsoft solutions for our commercial & enterprise customers? We are seeking to build out the team of next generation Site Reliability...


  • bangalore, India Zensar Technologies Full time

    About the Role: Site Reliability Engineer Experience: 5-8Yrs Location: Bangalore Required Skills: Must have skills: - High level of experience using cloud log management and monitoring data platforms ( Dynatrace, Azure Monitor ) Hands on experience in Azure Bicep Experience working with Infrastructure as Code and Containerization tools ( Terraform , Docker,...


  • Bangalore, India Qure.ai Full time

    About the job Job Title: Site Reliability Engineer Department: Engineering Location: Bangalore Years of experience: 2-5 years Type: Full Time Employment About Qure.ai: Qure.ai is one of the fastest-growing startups in India, which develops Artificial Intelligence enabled products and platforms for healthcare diagnostics. We create...


  • bangalore, India Integra Connect Full time

    About IntegraConnectIntegra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud platform, the company’s core applications span population health including care...


  • bangalore, India Zensar Technologies Full time

    About the Role: Site Reliability EngineerExperience: 5-8YrsLocation: BangaloreRequired Skills:Must have skills: -High level of experience using cloud log management and monitoring data platforms ( Dynatrace, Azure Monitor )Hands on experience in Azure BicepExperience working with Infrastructure as Code and Containerization tools ( Terraform , Docker,...


  • bangalore, India tsworks Full time

    Who We Are tsworks Technologies India Private Limited (subsidiary of The Software Works, Inc, USA) is a technology product and services company. Our mission is to provide domain expertise, innovative solutions and thought leadership to empower businesses to thrive in a digital world. We value our employees, take pride in providing best value in customer...


  • bangalore, India tsworks Full time

    Who We Aretsworks Technologies India Private Limited (subsidiary of The Software Works, Inc, USA) is a technology product and services company. Our mission is to provide domain expertise, innovative solutions and thought leadership to empower businesses to thrive in a digital world. We value our employees, take pride in providing best value in customer...

Site Reliability Engineer/Architect

4 months ago


BangaloreAny Location, India Grizmo Labs Full time

Responsibilities :

- Own the Infrastructure, and APM and work with Developers and Systems engineers to Build, Release, Monitor, and run the reliability of the service exceeding the agreed SLAs.

- Write software to automate API-driven tasks at scale and contribute to the product codebase in Java, JS, React, Node, Go, and Python.

- Write automation to reduce toil and eliminate manual, repeatable tasks.

- Work with Ansible, Puppet, Chef, Terraform, or another config management/orchestration suite, know where it's broken, work toward fixing them, and explore new alternatives.

- Define and accelerate the implementation of support processes, tools, and best practices Maintain services once they are live by measuring and monitoring availability, latency, and overall system reliability.

- Handle cross-team performance issues from identification of the cause, to determining the areas of improvement and driving those actions to closure.

- Performance and maturity baselining of Systems, tools maturity, coverage, metrics, technology, and engineering practices.

- Define, Measure, and Improve Reliability Metrics (SLO/SLI), Observability (Monitoring, Logging-Tracing solutions), Ops process (Incident, Problem Mgmt) and streamline - automate release management.

- Build dashboards to provide visibility into the performance of the applications.

- Create chaos in the production environment purposefully in a controlled manager to validate the reliability of systems.

- Mentor and coach other SREs in the organization.

- Provide written and verbal updates to executives and the stakeholders of the application in the organization.

- Understand the current process, and system setup and propose the improvements needed in the processes, and technology so that the application exceeds the desired Service Level Objective.

- Troubleshoot, debug and diagnose operational issues and drive them to closure.

- Understanding of software delivery life cycles, particularly Agile/Lean, and DevOps.

Requirements :

- A strong believer in automation to bring in sustained continuous improvement by automating Toil, and Runbooks, improving the ability of the applications to auto-heal leading to improved reliability.

- 15+ years of experience in the Development and Operations of applications/services in production that have uptime over 99.9%.

- 8+ years of experience as a SRE in handling web-scale applications.

- Strong hands-on coding experience in one or more programming languages such as Python, Golang, Java, Bash, etc.

- Good understanding of Observability (monitoring, logging, tracing, metrics) and chaos engineering concepts.

- Proficiency in using Observability tools (for example : New Relic, Datadog, etc) for monitoring, logging, and tracing.

- Expert level hands-on knowledge in public cloud platform AWS and/or Google Cloud Platform.

- A professional-level certificate in one of the public clouds is highly desirable.

- Must have hands-on experience in using configuration management systems such as Ansible or SaltStack and infrastructure automation tools like Terraform or CloudFormation.

- Should have used altering systems such as Pager Duty.

- Should have implemented solutions around Service Level Indicators (SLIs) and Service Level Objectives (SLOs) for services.

- Measurement should have been within a system and across systems in distributed systems.

- Should have supported Production Incidents (PIs) on critical applications of a company.

- Proven experience in handling large-scale and growing infrastructure across Data Centers and heterogeneous Cloud platforms.

- Experience as a service owner in managing large - geographically diverse stakeholders.

- Ability to work with creative - fast-growing engineering teams and motivate them to deliver their best work.

- History of driving innovation.

(ref:hirist.tech)