SRE - Hyderabad

2 weeks ago


Hyderabad, India Spectrum Consultants India Private Limited Full time
SRE Summary
Experience Required:
8 - 15 Years
Job Term:
Permanent
Location:
Bangalore; Hyderabad; Pune
Category:
Cloud Computing /Design/Support
World leader in visual and AI Computing.
Site Reliability Engineering is an engineering discipline to design, build and maintain large scale production systems with high efficiency and availability using the combination of software and systems engineering practices. SRE ensures reliability and uptime as promised to the users and at the same time enabling developers to make changes to the existing system through careful preparation and planning while keeping an eye on capacity, latency and performance. SRE is also a mindset and a set of engineering approaches to running better production systems and optimizations. The person in this position will be responsible for Service Response and Workflows and will drive tools/service development to maintain and improve service SLOs.
What you’ll be doing:
Working on building tools to improve the SRE Observability.
Rapidly debug and triage incidents and user-reported issues.
Make valuable contribution to the overall health, performance, and reliability of Cloud Data Science platform and Infrastructure Services.
Taking ownership of automating, scripting, and tooling of new/existing scripts to help the team achieve 100% automation of daily tasks.
Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity management and launch reviews.
Clear SRE Observability understanding and experience in building new tools and automation using Python/GO.
Maintain services once they are live by measuring and monitoring availability, latency and overall system health.
Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.
Practice balanced incident response and blameless postmortems.
Be part of an on call rotation to support production systems.
What we need to see:
MS or BS in Computer Science/Engineering or a related field or equivalent experience.
5+ years Site reliability engineering experience working on large scale distributed micro services in a production environment with a real passion for automation and tooling.
SRE approach and who can understand Error budgeting, SLO’s, SLA’s.
Clear understanding on Incident management, change management and problem management process. Ability to detect all service-impacting issues, accurate triage, partner communication, impact containment, service restoration, and post-incident follow-up.
Proven strengths in problem-solving and root causing issues, while continuously seeking ways to drive optimization, efficiency and the bottom line.
Strong experience on streaming data infra services involving web services, Kafka, Spark etc.
Expert knowledge with building and operating large scale observability platforms for monitoring and logging (ELK, Prometheus etc)
Excellent interpersonal skills including the ability to identify and communicate data driven insights
Ways to stand out from the crowd:
Experience with operating large scale distributed systems with strong SLAs.
Excellent scripting: Python, GO.
Strong experience on operating data platforms.
.
  • SRE - Hyderabad

    2 months ago


    hyderabad, India Virtusa Full time

    SRE - CREQ185851 Description SRE Engineer Require experience on CI /CD., AWS/Azure, Shell scripting, Cloud Computing Provide end-to-end data administration and optimization, and monitoring tools (Elasticsearch , Data Dog, Grafana and Promotheus), Motivated with a passion for excellence, quality and attention to detail, and be able to manage the entire...

  • SRE - Hyderabad

    3 weeks ago


    hyderabad, India Spectrum Consultants India Private Limited Full time

    SRE Summary Experience Required: 8 - 15 YearsJob Term: PermanentLocation: Bangalore; Hyderabad; PuneCategory: Cloud Computing /Design/SupportWorld leader in visual and AI Computing.Site Reliability Engineering is an engineering discipline to design, build and maintain large scale production systems with high efficiency and availability using the combination...


  • Hyderabad, India Virtusa Full time

    SRE - AIOP and Dynatrace - CREQ180980 Description Knowledge & Experience:Minimum of 6 years of relevant work experience in critical production environmentsExperience in enabling observability within applications to extract appropriate telemetry into suitable back ends like DynatraceHands-on experience of curating Service Level Objectives, defining Error...


  • Hyderabad, India Virtusa Full time

    SRE with AIOP and Dynatrace - CREQ181003 Description Knowledge & Experience:Minimum of 6 years of relevant work experience in critical production environmentsExperience in enabling observability within applications to extract appropriate telemetry into suitable back ends like DynatraceHands-on experience of curating Service Level Objectives, defining Error...


  • hyderabad, India Virtusa Full time

    SRE - AIOP and Dynatrace - CREQ180980 Description Knowledge & Experience:Minimum of 6 years of relevant work experience in critical production environmentsExperience in enabling observability within applications to extract appropriate telemetry into suitable back ends like DynatraceHands-on experience of curating Service Level Objectives, defining Error...


  • hyderabad, India Virtusa Full time

    SRE with AIOP and Dynatrace - CREQ181003 Description Knowledge & Experience:Minimum of 6 years of relevant work experience in critical production environmentsExperience in enabling observability within applications to extract appropriate telemetry into suitable back ends like DynatraceHands-on experience of curating Service Level Objectives, defining Error...


  • Hyderabad, India Virtusa Full time

    SRE with AIOP and Dynatrace - CREQ181003 Description Knowledge & Experience: Minimum of 6 years of relevant work experience in critical production environments Experience in enabling observability within applications to extract appropriate telemetry into suitable back ends like Dynatrace Hands-on experience of curating Service Level Objectives, defining...


  • Hyderabad, India Virtusa Full time

    SRE - AIOP and Dynatrace - CREQ180980 Description Knowledge & Experience: Minimum of 6 years of relevant work experience in critical production environments Experience in enabling observability within applications to extract appropriate telemetry into suitable back ends like Dynatrace Hands-on experience of curating Service Level Objectives, defining Error...

  • Sr SRE

    5 days ago


    hyderabad, India Innova Solutions India Full time

    JOB DESCRIPTION Innova Solutions is immediately hiring for a Senior SRE Position type: Full Time Location: Hyderabad As a Senior SRE Validity you will: Details of the role: Sr. Site Reliability Engineer monitors all aspects of the Hyundai Connectivity Services connected car platforms. Identifies anomalies, addressed failures,...

  • Sr SRE

    4 days ago


    Hyderabad, India Innova Solutions India Full time

    JOB DESCRIPTION Innova Solutions is immediately hiring for a Senior SRE Position type: Full Time Location: Hyderabad As a Senior SRE Validity you will: Details of the role: Sr. Site Reliability Engineer monitors all aspects of the Hyundai Connectivity Services connected car platforms. Identifies anomalies, addressed failures, communicates and...

  • SRE - Cloud Engineer

    4 weeks ago


    Hyderabad, India Solugenix Full time

    OverviewSolugenix is an information technology services firm that has a rich history of providing comprehensive technology services and solutions for more than five decades.As a pioneer in IT services, we’ve partnered with some of the biggest global corporations across many industries. Our history was built on a foundation of partnerships with global...

  • IT&D Manager

    3 days ago


    Hyderabad, India Reckitt Full time

    IT&D Manager - GCP SRE City: Hyderabad We are ReckittHome to the world's best loved and trusted hygiene, health, and nutrition brands. Our purpose defines why we exist: to protect, heal and nurture in the relentless pursuit of a cleaner, healthier world. We are a global team united by this purpose.Join us in our fight to make access to the highest quality...

  • IT&D Manager

    19 hours ago


    hyderabad, India Reckitt Full time

    IT&D Manager - GCP SRE City: Hyderabad We are Reckitt Home to the world's best loved and trusted hygiene, health, and nutrition brands. Our purpose defines why we exist: to protect, heal and nurture in the relentless pursuit of a cleaner, healthier world. We are a global team united by this purpose.Join us in our fight to make access to the highest...

  • SRE - Cloud Engineer

    4 weeks ago


    Hyderabad, India Solugenix Full time

    Overview Solugenix is an information technology services firm that has a rich history of providing comprehensive technology services and solutions for more than five decades. As a pioneer in IT services, we’ve partnered with some of the biggest global corporations across many industries. Our history was built on a foundation of partnerships with...

  • SRE - Cloud Engineer

    4 weeks ago


    hyderabad, India Solugenix Full time

    Overview Solugenix is an information technology services firm that has a rich history of providing comprehensive technology services and solutions for more than five decades. As a pioneer in IT services, we’ve partnered with some of the biggest global corporations across many industries. Our history was built on a foundation of partnerships...


  • Hyderabad, India Virtusa Full time

    Site Reliability engineer - CREQ188641 Description Position : SRE Primary skills: devops CI/CD pipeline Location: Hyderabad Should have proficiency in understanding of application monitoring stack(Logs, Events, Metrics and Alerts) and ability to visualize and setup end-to-end observability. Should have proficiency in industry standard monitoring tools...


  • hyderabad, India Virtusa Full time

    Site Reliability engineer - CREQ188641 Description Position : SRE Primary skills: devops CI/CD pipeline Location: Hyderabad Should have proficiency in understanding of application monitoring stack(Logs, Events, Metrics and Alerts) and ability to visualize and setup end-to-end observability.Should have proficiency in industry standard monitoring...

  • Sysops Engineer

    2 weeks ago


    hyderabad, India Intense Technologies Limited Full time

    Experience: 35 yearsWorkLocation: HyderabadRequired Skills: DevOps SRE AWS AzureKubernetesMandatoryskills Knowledgeon DevOps Cloud technologies and SREskillsKnowledge with AWS AzureCI/CD tools Kubernetes OpenShift Jenkins Docker andGitKnowledge on Linux Shellscripting and AnsibleSQLprogramming PLSQL programming andoptimizationsKnowledgeable aboutDatabase...

  • Sysops Engineer

    1 month ago


    Hyderabad, India Intense Technologies Limited Full time

    Experience:35 yearsWorkLocation:HyderabadRequired Skills:DevOps SRE AWS AzureKubernetesMandatoryskillsKnowledgeon DevOps Cloud technologies and SREskillsKnowledge with AWS AzureCI/CD tools Kubernetes OpenShift Jenkins Docker andGitKnowledge on Linux Shellscripting and AnsibleSQLprogramming PLSQL programming andoptimizationsKnowledgeable aboutDatabase Network...

  • Data Architect

    4 weeks ago


    Hyderabad, Telangana, India Avadhesh India Adivsory Services LLP Full time

    **Data Architect**: **Few important notes**: **Description**: **Grafana Architect** **Relocation Eligible: Role is based in Hyderabad** **Job Description for the post: - ** Hiring for one of our customer. They are a global leader in food and beverage, & are undergoing a digital transformation. We seek a strategic and **visionary Observability Architect...