Observability/Site Reliability Lead

2 months ago


Bangalore, India Connectio IT Pvt Ltd Full time

Job Description :


The Observability/SRE Lead will be a key member of the Engineering team.

They will have to focus on driving reliability, scalability, performance, and observability of production systems, and ensure a consistent end-user experience across all products.

They will also define the overall strategy for deployment and operations of systems and the related infrastructure, ensuring our systems stay up and running 24/7.

Key Responsibilities :

- Design and implement observability solutions using tools like PLG Stack, ELASTIC, APM tools, and New Relic.

- Define SRE (Site Reliability Engineering) and observability KPIs (Key Performance Indicators) and generate reports for monitoring system health and performance.

- Collaborate with cross-functional teams to establish best practices for observability and reliability in cloud-based environments.

- Develop automation scripts and programs to streamline observability processes and enhance efficiency.

- Manage the ITSM (IT Service Management) lifecycle, including incident, problem, and change management processes.

- Lead the implementation of observability and SRE practices in cloud-based environments for large enterprise systems.

- Provide expertise and guidance on observability tools, techniques, and best practices to internal teams.

- Continuously evaluate and optimize observability solutions to meet evolving business needs and technology trends

Qualifications :

- Tools - PLG Stack, cloud native monitor, ELASTIC, APM tools etc

- SRE and observability kpi and reports

- Good communication

- Automation skills - scripting, programming

- ITSM lifecycle management

- Experience of implementing Observability , SRE in cloud based environment for a large enterprise

- New Relic is mandatory(Min 2 to 3 Yrs of Relevant exp)

(ref:hirist.tech)

  • bangalore, India ALTERYX Full time

    We’re looking for problem solvers, innovators, and dreamers who are searching for anything but business as usual. Like us, you’re a high performer who’s an expert at your craft, constantly challenging the status quo. You value inclusivity and want to join a culture that empowers you to show up as your authentic self. You know that success hinges on...


  • Bangalore, India Connectio IT Pvt Ltd Full time

    Overview : The Lead Site Reliability Engineer (SRE) - Observability KPI at Newrelic plays a crucial role in ensuring the reliability, availability, and performance of Newrelic's observability platform. This role is essential in maintaining and improving the observability key performance indicators (KPIs) to meet customer expectations and support...


  • bangalore, India ViewSonic Full time

    Job Requirements:Bachelor's degree in Computer Science, Engineering, or a related field.1+ year of experience in a relevant role, such as Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory.Basic understanding of AWS solutions including EC2, S3, CloudWatch, Lambda, and RDS.Interest and understanding of Platform Engineering...


  • bangalore, India Central Business Solutions Inc. Full time

    The Enterprise Computing (EC) Core Infrastructure Services organization is looking for a Site Reliability Engineering to manage the operations, reliability and services for Morgan Stanley's suite of Software Distribution product ecosystem products that are part of Artifact Curation and Distribution Control squad. This squad is responsible for providing...


  • bangalore, India Fidelity Investments Full time

    The Purpose of This Role Our Site Reliability Engineering group within Enterprise Infrastructure combines Operations Excellence with the Development Experience to deliver services at high scale, high availability with resilience by using automation and Infrastructure Code. We build reliability into our ecosystem by applying best practices in...

  • Engineering Director

    4 weeks ago


    Bangalore, India CareerNet Technologies Full time

    Job Description : Site Reliability Engineers (SREs) at Coupang is a mission-critical role that combines software and system engineering to build, run, and scale our complex, large-scale ecommerce systems. As part of the Site Reliability Engineering team, you will be responsible for ensuring all our customer-facing services are healthy, monitored, automated,...

  • Engineering Director

    4 weeks ago


    bangalore, India CareerNet Technologies Full time

    Job Description : Site Reliability Engineers (SREs) at Coupang is a mission-critical role that combines software and system engineering to build, run, and scale our complex, large-scale ecommerce systems. As part of the Site Reliability Engineering team, you will be responsible for ensuring all our customer-facing services are healthy, monitored, automated,...

  • Engineering Director

    1 month ago


    Bangalore, Karnataka, India CareerNet Technologies Full time

    Job Description :Site Reliability Engineers (SREs) at Coupang is a mission-critical role that combines software and system engineering to build, run, and scale our complex, large-scale ecommerce systems. As part of the Site Reliability Engineering team, you will be responsible for ensuring all our customer-facing services are healthy, monitored, automated,...


  • bangalore, India JPMorgan Chase & Co. Full time

    Play a key role in ensuring system reliability at one of the world’s most iconic and largest financial institutions. As a Site Reliability Engineer II at JPMorgan Chase within the Risk technology, you will use technology to solve business problems and leverage software engineering best practices as we strive towards excellence. This role often works...


  • bangalore, India JPMorgan Chase & Co. Full time

    Assume a critical role in defining the future of a globally recognized firm and have a direct and significant effect in a realm tailored for top achievers in site reliability.  As a Lead Site Reliability Engineer at JPMorgan Chase within the Corporate and Investment Banking - Payments Technology team, you hold a leadership role in your team, demonstrate...


  • bangalore, India Squareroot Consulting Pvt Ltd. Full time

    Site Reliability EngineerLocation : Bangalore, IndiaDomain : CybersecurityBudget : 30 to 50 Lacks - We are looking for a hands-on devops engineer leading the design, implementation of devops/SRE practice for our infrastructure for data privacy.- The successful candidate will have experience implementing advanced DevOps & SRE techniques such as Auto...


  • Bangalore, India Squareroot Consulting Pvt Ltd. Full time

    Site Reliability EngineerLocation : Bangalore, IndiaDomain : CybersecurityBudget : 30 to 50 Lacks - We are looking for a hands-on devops engineer leading the design, implementation of devops/SRE practice for our infrastructure for data privacy.- The successful candidate will have experience implementing advanced DevOps & SRE techniques such as Auto...


  • Bangalore, India Squareroot Consulting Pvt Ltd. Full time

    Job Title : Senior Site Reliability Engineer (SRE)Location : Bangalore (Hybrid)Company Overview :We are Hiring for a dynamic and innovative FinTech company committed to delivering cutting-edge solutions to their clients. As part of our growth strategy, we are seeking a talented and experienced Hands-On Site Reliability Engineer (SRE) to join our...

  • Lead Engineer

    4 weeks ago


    Bangalore, India Seikor Full time

    Designation: Lead Engineer - Observability - Platform EngineeringSkill Set: Opentelemetry (No kubernets, working on distributed tracing and custom matrics), Java Scripts /type scripts knowledge, AWS RDS, ARORA, SQS, SNS, Kinesis, API Gateway, DynamoDB, AWS LAMBDAExperience: 3-10 YearsCTC: 18-25LpaLocation: BangaloreWork Mode: WFOCompany: Motherson Technology...


  • bangalore, India TalentOla Full time

    Monitoring and Automation: Proactively monitor software systems to prevent incidents and automate routine tasks. 2. Effective Monitoring: Build monitoring systems that alert based on symptoms rather than outages. 3. Application Performance Monitoring (APM): Implement and utilize APM tools such as New Relic or Dynatrace to monitor application...


  • bangalore, India Meesho Full time

    Site Reliability Engineer II Bangalore, Karnataka Tech Infrastructure /Full Time Employee /On-Site About the Team : When 5% of Indian households shop with us, its important to build resilient systems to manage millions of orders every day. Weve done this with zero downtime! Sounds impossible? Well, thats the kind of Engineering muscle that has helped...

  • Assistant Manager

    1 month ago


    bangalore, India Genpact Full time

    Genpact (NYSE: G) is a global professional services and solutions firm delivering outcomes that shape the future. Our 125,000+ people across 30+ countries are driven by our innate curiosity, entrepreneurial agility, and desire to create lasting value for clients. Powered by our purpose – the relentless pursuit of a world that works better for people –...


  • bangalore, India BNC Global Services Pvt. Ltd. Full time

    Job Description : 1. Lead team of Observability Engineers to design and implement and maintain scalable, reliable, performant, and efficient systems.2. Drive continuous improvements in our operating processes.3. Nurture engineer growth through mentorship4. Create site reliability best-practices to ensure platform resiliency, performance and...


  • bangalore, India Infogain Full time

    You can send your applications onThis Job is available at multiply locations in India like Mumbai, Pune, Bangalore, Noida & Gurgaon.Title:"SRE developers responsible for Design and implementation details reviewed/approved by SRE / Reliability Engineer (Lead): A SRE/Reliability Engineer at a Lead level is responsible for maintaining the reliability,...


  • Bangalore, India Meesho Full time

    Site Reliability Engineer II Bangalore, Karnataka Tech Infrastructure /Full Time Employee /On-Site About the Team : When 5% of Indian households shop with us, its important to build resilient systems to manage millions of orders every day. Weve done this with zero downtime! Sounds impossible? Well, thats the kind of Engineering muscle that has helped...