
Site Reliability Engineer
1 week ago
Job Title : Datadog developer.
Location : PAN INDIA.
Experience : 6 To 10 Years.
Job Type : Contract to hire.
Notice Period : Immediate joiners.
Mandatory Skills : Datadog, Kubernetes, Docker.
Job description.
Must have skill :
- Datadog.
- Kubernetes.
- Docker.
You will be responsible for designing, implementing, and managing Datadog solution, ensuring seamless integration with Kubernetes, cloud providers, and CI/CD tools to achieve traceability and observability.
Key Responsibilities :.
- Creating Customer centric Use cases and providing consulting for custom metrics, alerts, log monitoring, analysis and visualizations in Datadog.
- Set up and configure Datadog for monitoring infrastructure, applications, and logs via automation.
- Develop a consolidated view for transaction health and include synthetic health checks for APIs.
- Analyze and correlate data across various services for troubleshooting and optimization.
- Implement AIOps for event management by centralizing and correlating events, enabling anomaly detection, and predictive incident notifications.
- Enhance proactive issue resolution and minimize downtime using advanced AI/ML tools, including LLMs for event clustering.
Technical Skills :
- Minimum 4 years of hands-on experience with Datadog, including integration with Kubernetes, cloud providers, and CI/CD tools, with an overall experience of 6+ years
- Proficiency in Datadogs Service Map, Service Catalog, and other observability tools.
- Strong understanding of cloud infrastructure (AWS, Azure, GCP) and container orchestration (Kubernetes, Docker).
- Experience with CI/CD pipelines and tools such as Jenkins, Git.
- Knowledge of application development frameworks and languages (e., Java, .NET, Node.js).
- Familiarity with scripting and automation (e., Python, Bash).
- Proven track record of setting up and managing data source integrations.
Preferred Qualifications :
- Datadog Foundation certification.
- Experience with other monitoring tools.
- Site Reliability Engineering (SRE) experience, focusing on high availability, performance, and scalability of systems.
(ref:hirist.tech)
-
Specialist - Site Reliability Engineer
2 days ago
Pune, Maharashtra, India Accelya Group Full time ₹ 20,00,000 - ₹ 25,00,000 per yearFor more than 40 years, Accelya has been the industry's partner for change, simplifying airline financial and commercial processes and empowering the air transport community to take better control of the future. Whether partnering with IATA on industry-wide initiatives or enabling digital transformation to simplify airline processes, Accelya drives the...
-
Specialist - Site Reliability Engineer
2 days ago
Pune, Maharashtra, India Accelya Group Full time ₹ 15,00,000 - ₹ 25,00,000 per yearFor more than 40 years, Accelya has been the industry's partner for change, simplifying airline financial and commercial processes and empowering the air transport community to take better control of the future. Whether partnering with IATA on industry-wide initiatives or enabling digital transformation to simplify airline processes, Accelya drives the...
-
Site Reliability Engineer
2 weeks ago
Pune, India ENGEL Full timeCompany Description ENGEL is a global leader in the production of injection moulding machines and their automation. The company produces systems that manufacture plastic parts used in various industries such as automotive, packaging, and consumer goods. With nine production plants worldwide and subsidiaries and representatives in over 85 countries, ENGEL...
-
Site Reliability Engineer
2 weeks ago
Pune, Maharashtra, India ENGEL Full time ₹ 6,00,000 - ₹ 18,00,000 per yearCompany DescriptionENGEL is a global leader in the production of injection moulding machines and their automation. The company produces systems that manufacture plastic parts used in various industries such as automotive, packaging, and consumer goods. With nine production plants worldwide and subsidiaries and representatives in over 85 countries, ENGEL...
-
SRE (Site Reliability Engineer)
2 weeks ago
Pune, India Apex One Full timeJob Overview We are looking for a detail-oriented and experienced Site Reliability Engineer to join our team. The Site Reliability Engineer will be responsible for creating and implementing scalable software solutions in order to meet system and application performance goals. You will also be responsible for troubleshooting system errors and resolving any...
-
Site Reliability Engineer
2 days ago
Pune, Maharashtra, India Idox Full time ₹ 9,00,000 - ₹ 12,00,000 per yearSite Reliability Engineer (AWS)Pune, IndiaAbout the roleWe are seeking a driven and detail-oriented Site Reliability Engineer (SRE) with a strong passion for building resilient, scalable cloud infrastructure. This role offers an exciting opportunity for professionals with 2 to 4 years of experience in DevOps, Cloud, or Infrastructure to deepen their...
-
Site Reliability Engineer
4 weeks ago
Chennai, Bengaluru, Pune, India Infosys Limited Full timeJob Description- As a Senior Site Reliability Engineer, you will play a critical role in supporting application developers by providing expert guidance on Application and infrastructure best practices from reliability perspective.- Your role covers the entire life cycle of a product/application. Your primary focus will be Automation, Observability,...
-
Site Reliability Engineer
4 weeks ago
Pune, Maharashtra, India Reveille Technologies Full timeJob Summary :We are seeking a skilled and proactive Site Reliability Engineer (SRE) with a strong DevOps mindset and hands-on experience in application troubleshooting. The ideal candidate will be responsible for ensuring the reliability, scalability, and performance of our applications and infrastructure. This role requires a blend of software engineering,...
-
Specialist - Site Reliability Engineer
2 days ago
Pune/Pimpri-Chinchwad Area, India Accelya Full time ₹ 15,00,000 - ₹ 25,00,000 per yearFor more than 40 years, Accelya has been the industry's partner for change, simplifying airline financial and commercial processes and empowering the air transport community to take better control of the future. Whether partnering with IATA on industry-wide initiatives or enabling digital transformation to simplify airline processes, Accelya drives the...
-
Site Reliability Engineer
4 weeks ago
Pune, Maharashtra, India Allianz Full timeSite Reliability Engineer (SRE) - One Identity Access ManagementThe primary objective of the Site Reliability Engineer (SRE) specializing in One Identity Access Management is to ensure the seamless operation, reliability, and scalability of IAM systems within the organization.This role is critical in maintaining system integrity, optimizing performance, and...