Lead Engineer - Enterprise Monitoring and Observability

6 days ago


Hyderabad, India Micron Full time

Our vision is to transform how the world uses information to enrich life for all.

Micron Technology is a world leader in innovating memory and storage solutions that accelerate the transformation of information into intelligence, inspiring the world to learn, communicate and advance faster than ever.

JR57951 Lead Engineer - Enterprise Monitoring and Observability

As a Monitoring and Observability Subject Matter Expert (SME), your expertise should cover the following areas:
 

Responsibilities:

1. Technical Proficiency:
- Proficient in using monitoring and observability tools such as OpsRamp (currently in use) and Cisco ThousandEyes (NetScout is a plus).
- Demonstrate the ability to monitor, analyze, and optimize the performance of large-scale, distributed systems and applications.
- Diagnose and resolve performance issues, ensuring high availability and reliability of services.
- Integrate data across domains (applications, infrastructure, network) for comprehensive insights.
- Collaborate with cross-functional teams to design and implement AIOps solutions.
- Leverage AIOps tools to intelligently filter and prioritize alerts, reducing Mean Time to Recover (MTTR).
- Swiftly identify patterns and correlations to pinpoint root causes of incidents, minimizing downtime.
- Strong expertise in monitoring on-premises data center components and cloud infrastructure.
- Solid understanding of AWS, Azure, GCP, OpenShift, and Nutanix infrastructure monitoring.
- Proven expertise in monitoring and troubleshooting network infrastructure, ensuring optimal performance and reliability.
- Proficient at creating comprehensive service maps that visualize application dependencies across the enterprise.

2. Data Analysis and Reporting:
- Excel in data collection, analysis, and visualization to identify trends, patterns, and anomalies.
- Create and maintain dashboards, synthetic transaction monitoring, and alerts that deliver actionable insights.
- Integrate with other solutions, such as Syslog, other monitoring tools, and ServiceNow ITSM.
- Basic knowledge of Grafana, Prometheus, or other reporting tools.

3. Automation and Scripting:
- Proficient in Python, Bash, or PowerShell for automating monitoring tasks and integrating observability tools with CI/CD pipelines.
- Understand Kubernetes and containerization concepts.
- Develop and implement automated solutions to enhance proactive monitoring and alerting.

4. Adherence to ITIL Standards:
- Follow ITIL standards for incident, change, and problem management to ensure efficient network operations.

5. Required Skills:
- Bachelor’s degree in Computer Science, Information Technology, or a related field.
- 8+ years of experience in monitoring and observability tools implementation and support.
- Proven hands-on experience with one or more of these tools: OpsRamp, BMC TrueSight, Datadog, or Dynatrace.
- Strong understanding of network protocols (TCP/IP, SNMP, ICMP) and the ability to troubleshoot them.
- Willingness to work in rotational shifts and provide on-call support.
- Required to be on-site for 4 days (Work from Office).
 




