Current jobs related to Expert in SRE and Observability - Chennai, Tamil Nadu - Talent500


  • Chennai, Tamil Nadu, India beBee Careers Full time

    Job DescriptionThis role requires a highly experienced Observability Engineer with a strong focus on observability and Site Reliability Engineering (SRE). The ideal candidate will have extensive hands-on experience deploying, managing, and optimizing enterprise-level observability platforms using the Grafana OSS stack Mimir, Loki, Tempo, Grafana Agent. They...


  • Chennai, Tamil Nadu, India beBee Careers Full time

    Job Title: SRE Technical SpecialistAbout the Role:The successful candidate will be responsible for developing and implementing site reliability engineering practices to improve the observability and reliability of security platforms. This includes designing and developing APIs, implementing architecture for integrating various Observability platforms,...

  • SRE Team Lead

    4 days ago


    Chennai, Tamil Nadu, India beBee Careers Full time

    Site Reliability Engineering Team LeadWe're seeking an exceptional Site Reliability Engineering (SRE) Team Lead to join our team. As a key member of our SRE team, you'll be responsible for designing, deploying, and managing enterprise-level observability platforms using the Grafana OSS stack. With your expertise in Azure and GCP cloud environments, you'll...

  • Cloud SRE

    4 weeks ago


    Chennai, Tamil Nadu, India Ford Motor Company Full time

    Job DescriptionJOB DESCRIPTIONEnterprise Technologyplays a critical part in shaping the future of mobility. If you're looking for the chance to leverage advanced technology to redefine the transportation landscape, enhance the customer experience and improve people's lives, this is the opportunity for you. Join us and challenge your IT expertise and...


  • Chennai, Tamil Nadu, India beBee Careers Full time

    DevOps Automation ExpertAbout the Role:We are seeking a seasoned DevOps Automation Expert to drive the development and implementation of automation solutions that enhance the performance and stability of our security platforms. This individual will collaborate with cross-functional teams to integrate various Observability platforms, develop and maintain...

  • SRE Lead

    4 days ago


    Chennai, Tamil Nadu, India beBee Careers Full time

    About the RoleThis is a challenging and rewarding role that requires a unique blend of technical and business skills. As an SRE, you will be responsible for building and maintaining systems that efficiently handle production traffic.You will have hands-on experience with Dynatrace - On Prem and SaaS, Observability skills, and familiarity with SLI/SLO/SLA...

  • SRE – SES

    2 weeks ago


    Chennai, Tamil Nadu, India N Consulting Ltd Full time

    Hi Jobseekers,We're HiringAre you passionate, driven, and ready for an exciting new challenge? We're looking for talented individuals to join our teamWe are happy to announce that , We are hiring for one of the reputed MNC company with direct payroll of "Natobotics Technologies private Ltd" for the role of -SRE – SES (Server Engg. Support) /  Client-...

  • DevOps Engineer

    3 weeks ago


    Chennai, Tamil Nadu, India LION AND ELEPHANTS Full time

    Job Description : This role requires a highly experienced DevOps Engineer with a strong focus on observability and Site Reliability Engineering (SRE). The ideal candidate will have extensive hands-on experience deploying, managing, and optimizing enterprise-level observability platforms using the Grafana OSS stack (Mimir, Loki, Tempo, Grafana Agent). They...


  • Chennai, Tamil Nadu, India beBee Careers Full time

    Job Description:This role involves ensuring the reliability and performance of cloud-based systems. To achieve this, the successful candidate will monitor and troubleshoot issues, utilizing various tools and technologies.Responsibilities:Monitor and report system performance using a range of tools.Analyze and resolve technical issues efficiently.Communicate...

  • DevOps Engineer

    4 weeks ago


    Chennai, Tamil Nadu, India LION AND ELEPHANTS Full time

    Job Description :This role requires a highly experienced DevOps Engineer with a strong focus on observability and Site Reliability Engineering (SRE). The ideal candidate will have extensive hands-on experience deploying, managing, and optimizing enterprise-level observability platforms using the Grafana OSS stack (Mimir, Loki, Tempo, Grafana Agent). They...

Expert in SRE and Observability

3 weeks ago


Chennai, Tamil Nadu, India Talent500 Full time

Talent500 Overview:

Talent500 is dedicated to connecting top talent with cutting-edge organizations. Our mission is to empower businesses by delivering exceptional professionals who drive innovation and growth.

Job Description:

The EPEO team at Talent500 is seeking a skilled Technical Specialist who can leverage their expertise in multiple technologies to enhance the reliability and observability of security platforms. Key responsibilities include designing and developing APIs, implementing architecture for observability platforms, evaluating new tools, and developing automation frameworks.

Main Responsibilities:

  • Designing and developing APIs using Java or Python, deploying them on GCP Services
  • Designing and implementing architecture for integrating Observability platforms like Dynatrace, GCP/Azure Monitoring, Azure Monitors
  • Evaluating new tools and technologies, performing Proof-of-Concepts
  • Developing and implementing an automation framework for SRE tasks
  • Designing and building observability dashboards using Dynatrace, Grafana, Looker
  • Developing and implementing automation solutions to improve efficiency, reduce manual intervention in infrastructure management and deployment process
  • Defining and implementing best practices for monitoring, alerting, and incident response to proactively identify potential issues.

Requirements:

  • Bachelor's or Master's degree in Computer Science/Engineering
  • 5+ years of experience as a DevOps/SRE Engineer, Solution Architect, or similar role with hands-on coding experience in Python/Java
  • Experience with SIAM/ITSM processes and best practices
  • Hands-on experience with Dynatrace, Grafana, SPLUNK tools and deep understanding of SRE concepts, including monitoring, alerting, automation, and incident management
  • Knowledge of cybersecurity, IAM toolsets, and Multi-cloud SaaS platform integrations
],