Site Reliability
2 weeks ago
Job Title: Observability & Automation Specialist
Location: Pune /Mumbai/ Noida/Udaipur
Job Type: Full-Time/Hybrid
Experience: 6 -15yrs
Job Summary:
We are seeking a skilled Observability & Automation Specialist with hands-on experience in building observability practices and implementing end-to-end automation, including AI and GenAI capabilities. The ideal candidate will be responsible for configuring and optimizing observability platforms to deliver actionable insights into system performance and reliability across various industry use cases.
Key Responsibilities:
- Design, implement, and maintain heterogeneous observability solutions using infrastructure, logs, synthetic monitoring, automation, AI, and GenAI.
- Create and manage dashboards, monitors, alerts, service maps, and user interfaces.
- Collaborate with DevOps, Development, and Security teams to define and maintain SLIs, SLOs, and SLAs.
- Develop integrations between observability platforms and other systems (e.g., hybrid cloud, on-prem data centers, end-user assets, Kubernetes, Terraform, CI/CD tools).
- Optimize alerting mechanisms to reduce false positives and improve incident response.
- Provide support during incidents, including root cause analysis and post-mortem reviews.
- Conduct training sessions for internal teams on effective platform usage.
Required Skills and Qualifications:
- 6+ years of experience in development, automation, system monitoring, and DevOps.
- 3+ years of hands-on experience with advanced automation and observability platforms such as ;Dynatrace, Datadog, AppDynamics, New Relic, Zabbix, ELK(Elasticsearch, Logstash, and Kibana.), AI/GenAI, and Machine Learning.
- Strong understanding of infrastructure components including cloud platforms (AWS, Azure, GCP), containers (Docker, Kubernetes), networking, and operating systems.
- Proficiency in scripting languages such as Python, Bash, or Shell.
- Experience with CI/CD pipelines and automation tools (e.g., Jenkins, GitHub Actions, Terraform, Packer).
- Familiarity with log collection, parsing, and automation using observability platforms.
- Strong analytical and problem-solving skills with a product-oriented mindset.
Preferred Qualifications:
- Certifications in observability platforms (e.g., Datadog Certified Monitoring Professional, Dynatrace, AppDynamics, ELK).
- Experience with additional monitoring tools (e.g., Prometheus, Grafana, New Relic, Nagios, ManageEngine).
- Familiarity with ITIL processes and incident management tools (e.g., PagerDuty, ServiceNow,
Why Join BXI Technologies?
- Lead innovation in AI, Cloud, and Cybersecurity with top-tier partners.
- Be part of a forward-thinking team driving digital transformation.
- Access to cutting-edge technologies and continuous learning opportunities.
- Competitive compensation and performance-based incentives.
- Flexible and dynamic work environment based in India.
About BXI Tech
BXI Tech is a purpose-driven technology company, backed by private equity and focused on delivering innovation inengineering, AI, cybersecurity, and cloud solutions. We combine deep tech expertise with a commitment to creating value for both businesses and communities.
Our ecosystem includesBXI Ventures, which invests acrosstechnology, healthcare, real estate, and hospitality, andBXI Foundation, which leads impactful initiatives ineducation, healthcare, and care homes. Together, we aim to drivesustainable growth and meaningful social impact.
-
Site Reliability Engineer
4 weeks ago
Noida, India CorroHealth Full timeWe are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a deep understanding of both software engineering and systems administration, with a focus on creating scalable and reliable systems. You will work closely with development and operations teams to ensure the reliability, availability, and...
-
Site Reliability Engineer
3 weeks ago
Pune, India Talent Worx Full timeSite Reliability Engineer (SRE) At Talent Worx, we are looking for a dedicated Site Reliability Engineer (SRE) to join our team. This role involves maintaining high availability and reliability of our services through the application of software engineering practices and systems administration skills. The ideal candidate will bridge the gap between...
-
Site Reliability Engineer
4 weeks ago
Noida, India CorroHealth Full timeWe are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a deep understanding of both software engineering and systems administration, with a focus on creating scalable and reliable systems. You will work closely with development and operations teams to ensure the reliability, availability, and...
-
Site Reliability Engineer
3 weeks ago
Noida, India CorroHealth Full timeWe are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a deep understanding of both software engineering and systems administration, with a focus on creating scalable and reliable systems. You will work closely with development and operations teams to ensure the reliability, availability, and...
-
Site Reliability Engineer
14 hours ago
Noida, Uttar Pradesh, India CorroHealth Full time ₹ 15,00,000 - ₹ 25,00,000 per yearWe are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a deep understanding of both software engineering and systems administration, with a focus on creating scalable and reliable systems. You will work closely with development and operations teams to ensure the reliability, availability, and...
-
Site Reliability Engineer
3 weeks ago
Pune, India Talent Worx Full timeSite Reliability Engineer (SRE) At Talent Worx, we are looking for a dedicated Site Reliability Engineer (SRE) to join our team. This role involves maintaining high availability and reliability of our services through the application of software engineering practices and systems administration skills. The ideal candidate will bridge the gap between...
-
Site Reliability Engineer
6 days ago
Pune, Maharashtra, India Talent Worx Full time ₹ 15,00,000 - ₹ 25,00,000 per yearSite Reliability Engineer (SRE)At Talent Worx, we are looking for a dedicated Site Reliability Engineer (SRE) to join our team. This role involves maintaining high availability and reliability of our services through the application of software engineering practices and systems administration skills. The ideal candidate will bridge the gap between...
-
Site Reliability Engineer
3 days ago
Noida, Uttar Pradesh, India Times Internet Full time ₹ 1,04,000 - ₹ 1,30,878 per yearRole:Site Reliability EngineerExperience:8-14 yearsLocation:Sector 16, NoidaNotice Period:Immediate / Serving onlyAbout Times InternetAt Times Internet, we create premium digital products that simplify and enhance the lives ofmillions. As India's largest digital products company, we have a significant presence across awide range of categories, including...
-
Site Reliability Engineer
2 weeks ago
Noida, Uttar Pradesh, India Cloud Angles Digital Transformation Full time ₹ 15,00,000 - ₹ 25,00,000 per yearAbout the Role:We are seeking a skilled and proactive Site Reliability Engineer I & II (SRE II) to join our growing infrastructure team. As an SRE II, you will play a critical role in ensuring the reliability, scalability, and performance of our systems. Youll work independently and collaboratively to design, implement, and maintain robust infrastructure...
-
Site Reliability Engineer
2 weeks ago
Pune, Maharashtra, India ENGEL Full time ₹ 6,00,000 - ₹ 18,00,000 per yearCompany DescriptionENGEL is a global leader in the production of injection moulding machines and their automation. The company produces systems that manufacture plastic parts used in various industries such as automotive, packaging, and consumer goods. With nine production plants worldwide and subsidiaries and representatives in over 85 countries, ENGEL...