Incident Manager

1 week ago


India Talentoj Full time

As Incident Manager IV, you will be the link between our Support, Engineering and Infrastructure teams. You will enable a better experience for our customers by organizing and driving the investigation of production issues in our application, which is a SaaS product consisting of Spring based microservices, ML models and data pipelines hosted within the AWS infrastructure, and report on these to Engineering, Support and other stakeholders. In doing so, you will also have a positive impact on the quality of the product. We are looking for somebody who is passionate about product quality, has extreme customer empathy, and is constantly looking to improve the quality of our services.

This is an engineering position, not a management position.

Role Value:

Your work will directly contribute to greater customer satisfaction by providing information about product issues in a timely manner. You will also help our Sales teams by answering technical questions about our infrastructure in customer RFP's.

Key Responsibilities

  • Investigate production issues raised by customers, Support and Engineering
  • Work as a liaison between Support and Engineering to facilitate issue resolution, root cause analysis (RCA), and drive the implementation of learnings
  • Create and track progress of problem tickets in Jira
  • Create incident analysis reports with the support of Engineering teams
  • Perform log file analysis with Datadog
  • Debugging of basic REST API calls for investigations
  • Execute SQL database queries to provide more information for investigations
  • Create and update knowledge base articles in Confluence
  • Participate in security audits (PCI DSS, ISO 27001, SOC2) and preparing supporting evidence

Skills & Qualifications

Must-Have Skills:

  • Working experience of at least 8 years in IT (SRE, sysadmin, developer, QA, technical support, or similar)
  • University degree in a relevant field
  • Strong analytical, problem-solving and collaboration skills
  • Basic understanding of systems architecture of cloud hosted applications
  • Data analysis skills - creating and interpreting dashboards to distinguish between real issues and false positives
  • Project management and documentation skills such as Jira and Confluence
  • Excellent written and verbal communication skills in English
  • Knowledge of cloud, preferably AWS, infrastructure components
  • Experience with REST APIs and tools e.g. Postman
  • Experience with application logging/monitoring tools e.g. Kibana, Datadog;
  • Experience with SQL, Linux & Network environments
  • Willingness to learn new technical skills

Nice-to-Have Skills:

  • Understanding of basic ML concepts and LLM's
  • experience with Git or similar version control system
  • experience with agile software development process
  • Jenkins or similar CI pipeline
  • Bash scripting for Linux
  • basic skills in software development e.g. Java, Python, JavaScript, Go;
  • experience with Docker & Microservices
  • network and application security
  • working within a PCI DSS environment

  • Incident Manager

    2 weeks ago


    India Akamai Full time

    Do you like working on high impact incidents and problem solvingWould you like the opportunity to solve critical technical challengesAct as a trusted AdvisorThe Incident Coordination team is part of the Infrastructure Engineering Operations group We re a team whose goals are ensuring that incidents are quickly mitigated incident status is well...


  • India Akamai Full time

    Are you excited about an opportunity to manage some of the biggest major incidents on the internetDo you enjoy working in a fast-paced cross-functional global environmentJoin our world class Incident Response And Prevention Team IRAPT Join Akamai s Incident Response And Prevention Team IRAPT part of the Support Services group You will manage major...

  • Incident Commander

    4 days ago


    India Smarsh Full time

    **Who are we?** Smarsh empowers its customers to manage risk and unleash intelligence in their digital communications. Our growing community of over 6500 organizations in regulated industries counts on Smarsh every day to help them spot compliance, legal or reputational risks in 80+ communication channels before those risks become regulatory fines or...

  • Incident Manager

    4 weeks ago


    India Talentoj Full time

    As Incident Manager IV, you will be the link between our Support, Engineering and Infrastructure teams. You will enable a better experience for our customers by organizing and driving the investigation of production issues in our application, which is a SaaS product consisting of Spring based microservices, ML models and data pipelines hosted within the AWS...


  • India beBeeCybersecurity Full time ₹ 1,80,00,000 - ₹ 2,50,00,000

    Incident Response Specialist JobThis is a highly critical role that involves leading and coordinating the response to information security incidents. The ideal candidate will have a strong understanding of various attack vectors, threat intelligence, and incident response methodologies.The selected individual will drive the full incident lifecycle from...


  • India Zensar Technologies Full time ₹ 1,04,000 - ₹ 1,30,878 per year

    Candidate with 6-8 years of experience of Incident, Major Incident & Service request management processes of ITSMThe role requires proactively run incident processes and analyze incident metrics.Identifying changes in the support processes and change the incident management process respectivelyConduct regular review of incident management process and drive...


  • India beBeeIncident Full time US$ 7,50,000 - US$ 15,00,000

    Job Summary:The Incident Coordination team is a part of the Infrastructure Engineering Operations group, responsible for quickly mitigating incidents and ensuring necessary steps are taken to reduce their recurrence. We aim to provide timely incident updates and foster a collaborative environment where team members can work together efficiently.About Us:As...


  • India Optel Group Full time

    OPTEL Responsible Agile Innovative OPTEL is a global company that develops transformative software middleware and hardware solutions to secure and ensure supply chain compliance in major industry sectors such as pharmaceuticals and food with the goal of reducing the effects of climate change and enabling sustainable living If you are driven by the...


  • India AiiR Response Full time

    Company Description AiiR is the first AI-driven breach response and extortion management platform that automates negotiations, investigations, and recovery, reducing incident costs and response times. At the core of AiiR is CEIRA, an AI-powered virtual breach response analyst that streamlines ransom negotiations, tracks cryptocurrency payments, conducts...


  • India beBeeCybersecurity Full time ₹ 80,00,000 - ₹ 1,20,00,000

    Cybersecurity Threat HunterJob Summary:The ideal candidate will lead and coordinate the response to information security incidents, safeguarding our organization by driving the full incident lifecycle from detection and analysis through containment, eradication, and recovery.This individual will collaborate closely with various internal teams and external...