Incident & Availability Manager

5 days ago


Thiruvananthapuram, Kerala, India UST Full time ₹ 6,00,000 - ₹ 18,00,000 per year

5 - 7 Years

1 Opening

Trivandrum

Role description

1. Role Purpose

The Incident & Availability Manager is responsible for managing the complete lifecycle of incidents, including coordinating Major Incident Management (MIM), to restore normal service as quickly as possible and minimize business impact. The role also governs service availability and reliability, ensuring that agreed SLAs, OLAs, and uptime targets are consistently met.

2. Key Responsibilities

Incident Management

  • Manage end-to-end handling of high-priority (P1/P2) incidents across infrastructure, applications, and business services.
  • Oversee triage, impact assessment, and stakeholder communication throughout the incident lifecycle.
  • Ensure incidents are logged, prioritized, and resolved per ITIL standards.
  • Lead technical bridge calls with resolver groups and vendors for quick restoration.
  • Conduct post-incident reviews and track corrective/preventive actions.
  • Analyze incident trends and recommend improvement measures.
  • Provide timely updates to users, management, and stakeholders.

Major Incident Management (MIM)

  • Lead all Major Incidents (P1) to ensure fast recovery and effective communication.
  • Act as the single point of accountability during critical outages.
  • Manage Major Incident bridges, coordinate technical teams, and update leadership in real time.
  • Prepare and share MIM communications: initial notifications, progress updates, and closure summaries.
  • Produce post-MIM reports including business impact, RCA summary, and recovery actions.
  • Ensure RCA and preventive actions are completed in coordination with Problem Management.

Availability Management

  • Monitor and report on availability of critical IT systems and services.
  • Define, measure, and track SLAs, OLAs, and uptime metrics.
  • Identify and address recurring availability issues with Problem and Capacity teams.
  • Support proactive monitoring, redundancy, and resilience improvements.
  • Participate in DR testing, failover validation, and service continuity initiatives.

Governance & Reporting

  • Maintain dashboards and reports for incident and availability KPIs.
  • Present weekly/monthly operations reviews to leadership and stakeholders.
  • Work with Change and Problem Management to reduce incidents and operational risks.
  • Contribute to ITSM process improvement and service maturity initiatives.

Stakeholder Communication

  • Act as the main point of contact for stakeholders during major incidents.
  • Provide timely and clear updates to leadership, clients, and users.
  • Deliver executive summaries and post-incident reports.
  • Manage escalation paths and vendor coordination effectively.

3. Required Skills & Experience

Technical & Process Skills

  • Strong experience in Incident and Major Incident Management in a 24x7 enterprise environment.
  • Hands-on experience with ITSM tools (ManageEngine, ServiceNow, Jira Service Management).
  • Sound understanding of ITIL processes (Incident, Problem, Change, Availability, Capacity).
  • Familiarity with key infrastructure areas (Cloud, Network, Server, End User).
  • Proven ability to coordinate multiple technical teams during high-severity incidents.
  • Knowledge of monitoring tools (SolarWinds, Dynatrace, CloudWatch, Splunk, etc.).

Soft Skills

  • Excellent communication and stakeholder management skills.
  • Calm, decisive, and effective under pressure.
  • Strong analytical and problem-solving abilities.
  • Proven leadership and team coordination skills.
  • Highly organized and process-driven.

4. Qualifications

  • Bachelor's Degree in Information Technology or equivalent.
  • 8–12 years of IT Operations experience, including 3+ years in Major Incident or Availability Management.
  • Certifications:
      • ITIL v4 Intermediate or Expert (mandatory)
      • Major Incident / Problem Management certification (preferred)
      • AWS or Azure Foundations certification (desirable)

5. Tools & Platforms

  • ITSM: ManageEngine, ServiceNow, Jira Service Management
  • Monitoring: SolarWinds, Dynatrace, CloudWatch, PRTG, Splunk
  • Collaboration: Microsoft Teams, Outlook, SharePoint

Support Coverage: 24x7 (On-call rotation for Major Incident & Problem Management support)

Skills

ServiceNow, Incident Management, ManageEngine, Jira Service Management

About UST

UST is a global digital transformation solutions provider. For more than 20 years, UST has worked side by side with the world's best companies to make a real impact through transformation. Powered by technology, inspired by people and led by purpose, UST partners with their clients from design to operation. With deep domain expertise and a future-proof philosophy, UST embeds innovation and agility into their clients' organizations. With over 30,000 employees in 30 countries, UST builds for boundless impact—touching billions of lives in the process.


