SRE with AIOP and Dynatrace

4 weeks ago


hyderabad, India Virtusa Full time
SRE with AIOP and Dynatrace - CREQ181003 Description Knowledge & Experience:
Minimum of 6 years of relevant work experience in critical production environments
Experience in enabling observability within applications to extract appropriate telemetry into suitable back ends like Dynatrace
Hands-on experience of curating Service Level Objectives, defining Error Budgets and refining the change management lifecycle to accommodate the same
Knowledge and experience with CI CD pipelines and deployment patterns like Canary
Analytics of application telemetry and AIOps enablement using Dynatrace Davis or an alternative product in combination with any other tools for orchestration
Has experience defining an SRE capability charter and roadmap for all dependent teams
Has experience successfully running and providing leadership to DevOps or SRE teams (preferred)
Working knowledge of SQL and troubleshooting by writing queries is key
Knowledge of containerized solutions and orchestration tools like Kubernetes
Core Capabilities:
Understand and demonstrate application of SRE principles, particularly toil reduction, blameless post mortems, monitoring distributed systems and release engineering
Indepth knowledge of any observability product like Dynatrace, Splunk or ELK stack covering synthetic monitoring, RUM and APM
Ability to instrument microservices applications via OpenTelemetry to extract traces is beneficial
Experience administering applications and infrastructure services in hyperscaler environments such as AWS, Azure or GCP is key
Hands-on experience in writing Python scripts and Ansible templates for application deployment automation or other automations is important
Ability to diagnose and debug systems at the code level (Java preferred) is beneficial
Qualification:
ITIL4 certification is mandatory. Achieving Practitioner or Intermediate level certifications are preferred
SRE Foundation certification via PeopleSoft or DevOps Institute is beneficial
AWS Solutions Architect Associate qualification or alternative from another Cloud Service Provider is preferred
Role & Responsibilities:
Formulate the detailed SRE rollout plan and execute a transformation roadmap
Continuously seek to uplift the maturity of SRE implementation and improve SLO, MTTR, MTTD as well as any other relevant KPIs identified
Engage in on call and critical operations support activities while leading blameless post mortems
Direct liaison with customers remotely and face to face for stakeholder management
Formulate a plan to eliminate toil by lowering incident volume, eliminating noise from alerts, automating manual processes, and converting workarounds into system features
Work with Development, QA and other squads to design, build and rollout reliability features into the applications being delivered
Lead a team of SREs deployed on the ground while being engaged hands on Primary Location Hyderabad, Andhra Pradesh, India Job Type Experienced Primary Skills AWS - SRE, CI/CD, SRE Years of Experience 10 Qualification

Knowledge & Experience:
Minimum of 6 years of relevant work experience in critical production environments
Experience in enabling observability within applications to extract appropriate telemetry into suitable back ends like Dynatrace
Hands-on experience of curating Service Level Objectives, defining Error Budgets and refining the change management lifecycle to accommodate the same
Knowledge and experience with CI CD pipelines and deployment patterns like Canary
Analytics of application telemetry and AIOps enablement using Dynatrace Davis or an alternative product in combination with any other tools for orchestration
Has experience defining an SRE capability charter and roadmap for all dependent teams
Has experience successfully running and providing leadership to DevOps or SRE teams (preferred)
Working knowledge of SQL and troubleshooting by writing queries is key
Knowledge of containerized solutions and orchestration tools like Kubernetes
Core Capabilities:
Understand and demonstrate application of SRE principles, particularly toil reduction, blameless post mortems, monitoring distributed systems and release engineering
Indepth knowledge of any observability product like Dynatrace, Splunk or ELK stack covering synthetic monitoring, RUM and APM
Ability to instrument microservices applications via OpenTelemetry to extract traces is beneficial
Experience administering applications and infrastructure services in hyperscaler environments such as AWS, Azure or GCP is key
Hands-on experience in writing Python scripts and Ansible templates for application deployment automation or other automations is important
Ability to diagnose and debug systems at the code level (Java preferred) is beneficial
Qualification:
ITIL4 certification is mandatory. Achieving Practitioner or Intermediate level certifications are preferred
SRE Foundation certification via PeopleSoft or DevOps Institute is beneficial
AWS Solutions Architect Associate qualification or alternative from another Cloud Service Provider is preferred
Role & Responsibilities:
Formulate the detailed SRE rollout plan and execute a transformation roadmap
Continuously seek to uplift the maturity of SRE implementation and improve SLO, MTTR, MTTD as well as any other relevant KPIs identified
Engage in on call and critical operations support activities while leading blameless post mortems
Direct liaison with customers remotely and face to face for stakeholder management
Formulate a plan to eliminate toil by lowering incident volume, eliminating noise from alerts, automating manual processes, and converting workarounds into system features
Work with Development, QA and other squads to design, build and rollout reliability features into the applications being delivered
Lead a team of SREs deployed on the ground while being engaged hands on

Travel No

  • hyderabad, India Virtusa Full time

    SRE - AIOP and Dynatrace - CREQ180980 Description Knowledge & Experience:Minimum of 6 years of relevant work experience in critical production environmentsExperience in enabling observability within applications to extract appropriate telemetry into suitable back ends like DynatraceHands-on experience of curating Service Level Objectives, defining Error...


  • Hyderabad, India Virtusa Full time

    SRE with AIOP and Dynatrace - CREQ181003 Description Knowledge & Experience:Minimum of 6 years of relevant work experience in critical production environmentsExperience in enabling observability within applications to extract appropriate telemetry into suitable back ends like DynatraceHands-on experience of curating Service Level Objectives, defining Error...


  • Hyderabad, India Virtusa Full time

    SRE - AIOP and Dynatrace - CREQ180980 Description Knowledge & Experience:Minimum of 6 years of relevant work experience in critical production environmentsExperience in enabling observability within applications to extract appropriate telemetry into suitable back ends like DynatraceHands-on experience of curating Service Level Objectives, defining Error...


  • Hyderabad, India Virtusa Full time

    SRE with AIOP and Dynatrace - CREQ181003 Description Knowledge & Experience: Minimum of 6 years of relevant work experience in critical production environments Experience in enabling observability within applications to extract appropriate telemetry into suitable back ends like Dynatrace Hands-on experience of curating Service Level Objectives, defining...


  • Hyderabad, India Virtusa Full time

    SRE - AIOP and Dynatrace - CREQ180980 Description Knowledge & Experience: Minimum of 6 years of relevant work experience in critical production environments Experience in enabling observability within applications to extract appropriate telemetry into suitable back ends like Dynatrace Hands-on experience of curating Service Level Objectives, defining Error...


  • Hyderabad, India Wipro Full time

    Requirement- SREExperience: 6+YearsLocation : Pan IndiaKey responsibilities Review Monitoring & alerts to provide recommendations for enhancement towards 360° coverage Create dashboards, setup synthetic and real user monitoring, visualize large data sets with interactive custom dashboards, setup alerts, reports, self-remediation actions, leverage AIOps...


  • Hyderabad, India Wipro Full time

    Requirement- SREExperience: 6+YearsLocation : Pan IndiaKey responsibilities Review Monitoring & alerts to provide recommendations for enhancement towards 360° coverage Create dashboards, setup synthetic and real user monitoring, visualize large data sets with interactive custom dashboards, setup alerts, reports, self-remediation actions, leverage AIOps...


  • Hyderabad, India Wipro Full time

    Requirement- SRE Experience: 6+Years Location : Pan India Key responsibilities Review Monitoring & alerts to provide recommendations for enhancement towards 360° coverage Create dashboards, setup synthetic and real user monitoring, visualize large data sets with interactive custom dashboards, setup alerts, reports, self-remediation actions, leverage AIOps...

  • Cloud Architect

    3 days ago


    Hyderabad, India Grid Dynamics Full time

    Senior Cloud Consultant/Architect - (SRE / Observability / AIOPs)Job DescriptionProfessionals who specialize in cloud operations with focus on observability (logging, tracing, alerting) with a vision for AIOPs and a strong understanding in practice of site reliability. A consultant with a mix of knowledge and skills in software development and cloud...


  • Hyderabad, India Grid Dynamics Full time

    Senior Cloud Consultant/Architect - (SRE / Observability / AIOPs)Job DescriptionProfessionals who specialize in cloud operations with focus on observability (logging, tracing, alerting) with a vision for AIOPs and a strong understanding in practice of site reliability. A consultant with a mix of knowledge and skills in software development and cloud...

  • Cloud Architect

    4 weeks ago


    hyderabad, India Grid Dynamics Full time

    Senior Cloud Consultant/Architect - (SRE / Observability / AIOPs)Job DescriptionProfessionals who specialize in cloud operations with focus on observability (logging, tracing, alerting) with a vision for AIOPs and a strong understanding in practice of site reliability. A consultant with a mix of knowledge and skills in software development and cloud...

  • Cloud Architect

    4 weeks ago


    Hyderabad, India Grid Dynamics Full time

    Senior Cloud Consultant/Architect - (SRE / Observability / AIOPs)Job DescriptionProfessionals who specialize in cloud operations with focus on observability (logging, tracing, alerting) with a vision for AIOPs and a strong understanding in practice of site reliability. A consultant with a mix of knowledge and skills in software development and cloud...

  • Cloud Architect

    4 weeks ago


    hyderabad, India Grid Dynamics Full time

    Senior Cloud Consultant/Architect - (SRE / Observability / AIOPs) Job Description Professionals who specialize in cloud operations with focus on observability (logging, tracing, alerting) with a vision for AIOPs and a strong understanding in practice of site reliability. A consultant with a mix of knowledge and skills in software development and cloud...

  • Cloud Architect

    4 weeks ago


    Hyderabad, India Grid Dynamics Full time

    Senior Cloud Consultant/Architect - (SRE / Observability / AIOPs)Job DescriptionProfessionals who specialize in cloud operations with focus on observability (logging, tracing, alerting) with a vision for AIOPs and a strong understanding in practice of site reliability. A consultant with a mix of knowledge and skills in software development and cloud...


  • Hyderabad/ Secunderabad, India timesjobs Full time

    Role & ResponsibilitiesWe are seeking a hands-on Manager who has experience leading large Big Data environments spread across thousands of nodes and petabytes of data.We look forward to a dynamic manager with a background & experience that looks like this:Grown into leadership roles after proving technical skills in individual contributor roles but still...

  • Sre Ops

    4 weeks ago


    Hyderabad, India Apps Associates Full time

    Site Reliability Engineer 12 to 16 years - Strong scripting skills in Bash and Python to automate routine tasks and improve operational workflows. - Strict to follow Change management and SLA’s - Hands-on experience in provisioning infrastructure using CloudFormation and Terraform to deploy and manage cloud-based services. - Strong proficiency in AWS...


  • Hyderabad, India Virtusa Full time

    Site Reliability engineer - CREQ188641 Description Position : SRE Primary skills: devops CI/CD pipeline Location: Hyderabad Should have proficiency in understanding of application monitoring stack(Logs, Events, Metrics and Alerts) and ability to visualize and setup end-to-end observability. Should have proficiency in industry standard monitoring tools...


  • hyderabad, India Virtusa Full time

    Site Reliability engineer - CREQ188641 Description Position : SRE Primary skills: devops CI/CD pipeline Location: Hyderabad Should have proficiency in understanding of application monitoring stack(Logs, Events, Metrics and Alerts) and ability to visualize and setup end-to-end observability.Should have proficiency in industry standard monitoring...


  • Hyderabad, India PepsiCo Full time

    Overview: **Main purpose of the Role**: PepsiCo's approach to IT operations extends beyond the typical ITIL framework. We enhance user experience and maintain high stability, reliability, and availability through an SRE framework and an AIOps-based IT operations model. This role, part of the SRE-led/AI Ops services model, involves skillfully managing the...


  • Hyderabad, India Virtusa Full time

    Site Reliability engineer - CREQ188641 DescriptionPosition : SREPrimary skills: devops CI/CD pipelineLocation: HyderabadShould have proficiency in understanding of application monitoring stack(Logs, Events, Metrics and Alerts) and ability to visualize and setup end-to-end observability.Should have proficiency in industry standard monitoring tools (Dynatrace,...