Datadog Implementation Engineer
7 days ago
About the Role :
We are seeking a highly skilled Datadog Implementation Engineer to join our team and lead the design, implementation, and maintenance of Datadog monitoring and observability solutions.
The ideal candidate will have extensive hands-on experience with the Datadog platform, including APM, infrastructure monitoring, and cloud observability, enabling us to ensure application performance, reliability, and security across diverse environments.
Key Responsibilities :
- Design, implement, configure, and maintain Datadog monitoring solutions across infrastructure, applications, cloud services, and security domains.
- Build and optimize application performance monitoring (APM) using Datadog modules such as Spans and Traces to detect and diagnose issues proactively.
- Develop comprehensive dashboards and alerts tailored to business and technical requirements to provide actionable insights.
- Manage and optimize Datadog billing and resource usage for cost-effective monitoring.
- Integrate Datadog with incident management and collaboration tools such as PagerDuty, ServiceNow, Slack, and Jira to streamline alerting and resolution workflows.
- Collaborate with DevOps, SRE, and engineering teams to implement Datadog agents and custom integrations for cloud platforms including AWS, Azure, and Google Cloud Platform (GCP).
- Tune Linux systems, network configurations, and application performance to enhance monitoring accuracy and response times.
- Extend Datadog functionality through custom plugins, scripts, and configurations as required.
- Analyze system and application logs to detect anomalies and ensure system health and security monitoring.
- Provide expert-level guidance on application platforms, architecture, and monitoring best practices, covering networking, databases, runtime environments, and user interfaces.
- Develop and maintain technical documentation related to Datadog implementations and monitoring standards.
- Communicate effectively with stakeholders, troubleshoot complex issues, and provide resolution recommendations.
- Stay current with the latest Datadog features, cloud technologies, and monitoring industry trends.
- Automate monitoring deployment and configuration tasks using Ansible or similar configuration management tools.
- Leverage scripting skills in Python or to enhance monitoring workflows and automation.
Required Skills and Qualifications
years of experience designing, implementing, and managing Datadog monitoring solutions.
- Strong hands-on experience with Datadog modules: Infrastructure Monitoring, APM, RUM, Logs, Synthetics, Cloud Monitoring, Database, Network, and Security Monitoring.
- Deep understanding of distributed tracing concepts including spans and traces.
- Expertise in creating interactive, insightful dashboards and configuring alerting systems.
- Experience integrating Datadog with ITSM and incident management tools such as PagerDuty, ServiceNow, Slack, and Jira.
- Proficient with cloud platforms AWS, Azure, and GCP, including deployment and monitoring strategies.
- Strong knowledge of Linux operating systems, networking, and system performance tuning.
- Familiarity with scripting languages like Python and to create custom monitoring solutions and automation.
- Working knowledge of Ansible or similar automation/configuration management tools.
- Solid understanding of application architecture, including databases, middleware, front-end/back-end layers, and networking.
- Excellent communication, teamwork, and problem-solving skills.
- Ability to work independently and collaboratively in a fast-paced, agile environment
-
Enterprise Sales Engineer
3 weeks ago
India Datadog Full timeAs an Enterprise Sales Engineer you will provide technical expertise through sales presentations product demonstrations and supporting technical evaluations POVs Sales Engineers help qualify and close opportunities with customers and partners and have a voice with the product team to help prioritize features based on input from customers competitors and...
-
Datadog Developer
4 weeks ago
Gurugram, India Minutes to Seconds Full timeJob Description Job Title: Datadog Developer Experience: 5+ Years - 8 Years Location: Delhi/NCR, Gurgaon, Noida, Bangalore, Pune (Onsite with client) Job Type: Full-Time / Permanent Interview Mode - Virtual/Video Call Budget - up to - 18 LPA About the Role: We are seeking a skilled and experienced Datadog Developer to join our engineering team. The ideal...
-
Founding Automation Engineer
1 week ago
Anywhere in India/Multiple Locations Talent Socio Full time ₹ 15,00,000 - ₹ 25,00,000 per yearAbout the Role : We are looking for a skilled DevOps Engineer to join our technology team and streamline the development, deployment, and maintenance of our software systems. You will work closely with developers, QA, and operations teams to implement CI/CD pipelines, automate infrastructure, monitor applications, and ensure high availability and...
-
Lead DevOps Engineer
2 weeks ago
Anywhere in India/Multiple Locations Scaling Theory Technologies Pvt Ltd Full time ₹ 12,00,000 - ₹ 36,00,000 per yearDescription : As the Lead DevOps Engineer, you'll own our cloud infrastructure and CI/CD ecosystem end-to-end. You'll design scalable, secure, and automated cloud systems, mentor engineers, and collaborate closely with backend, data, and product teams to deliver resilient, high-performance environments. Responsibilities : - Design, implement, and...
-
Test Automation Engineer
7 days ago
Anywhere in India/Multiple Locations Omni Reach Full time ₹ 6,00,000 - ₹ 18,00,000 per yearRole Description : We are looking for 8 years experience Lead QA Automation Engineers (Engineering background mandatory, Excellent communication, Ability to interact with Customer. Devops automation mindset). This is a full-time remote role for a Lead QA Test Automation Engineer SDET. The individual will be responsible for developing and executing...
-
Senior Software Engineer
1 week ago
Anywhere in India/Multiple Locations Obrimo Technologies (Formerly known as Salecino) Full time ₹ 12,00,000 - ₹ 36,00,000 per yearDescription : Responsibilities : - Design, build, and maintain highly scalable, distributed backend services and cloud-based platforms. - Architect and implement microservices and API-driven solutions to support internal and external product needs. - Develop and deploy cloud-native applications on AWS, leveraging services such as Lambda, API Gateway,...
-
Anywhere in India/Multiple Locations Omni Reach Full time ₹ 12,00,000 - ₹ 36,00,000 per yearDescription : Role Description : - We are looking for 8 years experience Lead QA Automation Engineers (Engineering background mandatory, Excellent communication, Ability to interact with Customer Devops automation mindset). - This is a full-time remote role for a Lead QA Test Automation Engineer SDET. - The individual will be responsible for...
-
AWS Engineer
1 week ago
Anywhere in India/Multiple Locations TALENT VELOCITY Full time ₹ 6,00,000 - ₹ 18,00,000 per yearDescription : Role : AWS Engineer (Chaos Engineering & Python) Experience : 4 Years, Location : Pan India, Mode : Remote / Hybrid Shift : 12 : 00 PM 9 : 00 PM IST, Type : Contractual (6 Months Extendable)About the Role We are seeking a highly skilled AWS Engineer with hands-on experience in Python development and Chaos Engineering to design,...
-
Site Reliability Engineer/Architect
1 week ago
Anywhere in India/Multiple Locations Cling Multi Solutions Full time ₹ 15,00,000 - ₹ 25,00,000 per yearJob Description : Role : Site Reliability Engineer (SRE) Location : Bangalore / Chennai / Pune (Hybrid) Experience : 5 years Role Overview : We are looking for a skilled SRE to ensure the reliability, scalability, and performance of our cloud-native applications. The ideal candidate has hands-on experience in cloud environments, container...
-
Multiple Locations, India MULTISTREAM TECHNOLOGIES PRIVATE LIMITED Full time ₹ 12,00,000 - ₹ 24,00,000 per yearKey Responsibilities : - Design, build, and maintain multi-region infrastructure using Terraform and Atlantis. - Continuously optimize system performance, scalability, and cost efficiency. - Implement infrastructure automation and self-healing capabilities. - Develop and maintain Datadog dashboards, SLOs, SLIs, and alerting mechanisms. -...