Site Reliability
3 days ago
Job Title: Observability & Automation Specialist
Location: Pune /Mumbai/ Noida/Udaipur
Job Type: Full-Time/Hybrid
Experience: 6 -15yrs
Job Summary:
We are seeking a skilled Observability & Automation Specialist with hands-on experience in building observability practices and implementing end-to-end automation, including AI and GenAI capabilities. The ideal candidate will be responsible for configuring and optimizing observability platforms to deliver actionable insights into system performance and reliability across various industry use cases.
Key Responsibilities:
- Design, implement, and maintain heterogeneous observability solutions using infrastructure, logs, synthetic monitoring, automation, AI, and GenAI.
- Create and manage dashboards, monitors, alerts, service maps, and user interfaces.
- Collaborate with DevOps, Development, and Security teams to define and maintain SLIs, SLOs, and SLAs.
- Develop integrations between observability platforms and other systems (e.g., hybrid cloud, on-prem data centers, end-user assets, Kubernetes, Terraform, CI/CD tools).
- Optimize alerting mechanisms to reduce false positives and improve incident response.
- Provide support during incidents, including root cause analysis and post-mortem reviews.
- Conduct training sessions for internal teams on effective platform usage.
Required Skills and Qualifications:
- 6+ years of experience in development, automation, system monitoring, and DevOps.
- 3+ years of hands-on experience with advanced automation and observability platforms such as ;Dynatrace, Datadog, AppDynamics, New Relic, Zabbix, ELK(Elasticsearch, Logstash, and Kibana.), AI/GenAI, and Machine Learning.
- Strong understanding of infrastructure components including cloud platforms (AWS, Azure, GCP), containers (Docker, Kubernetes), networking, and operating systems.
- Proficiency in scripting languages such as Python, Bash, or Shell.
- Experience with CI/CD pipelines and automation tools (e.g., Jenkins, GitHub Actions, Terraform, Packer).
- Familiarity with log collection, parsing, and automation using observability platforms.
- Strong analytical and problem-solving skills with a product-oriented mindset.
Preferred Qualifications:
- Certifications in observability platforms (e.g., Datadog Certified Monitoring Professional, Dynatrace, AppDynamics, ELK).
- Experience with additional monitoring tools (e.g., Prometheus, Grafana, New Relic, Nagios, ManageEngine).
- Familiarity with ITIL processes and incident management tools (e.g., PagerDuty, ServiceNow,
Why Join BXI Technologies?
- Lead innovation in AI, Cloud, and Cybersecurity with top-tier partners.
- Be part of a forward-thinking team driving digital transformation.
- Access to cutting-edge technologies and continuous learning opportunities.
- Competitive compensation and performance-based incentives.
- Flexible and dynamic work environment based in India.
About BXI Tech
BXI Tech is a purpose-driven technology company, backed by private equity and focused on delivering innovation inengineering, AI, cybersecurity, and cloud solutions. We combine deep tech expertise with a commitment to creating value for both businesses and communities.
Our ecosystem includesBXI Ventures, which invests acrosstechnology, healthcare, real estate, and hospitality, andBXI Foundation, which leads impactful initiatives ineducation, healthcare, and care homes. Together, we aim to drivesustainable growth and meaningful social impact.
-
Site Reliability Engineer
2 weeks ago
pune, India Talent Worx Full timeSite Reliability Engineer (SRE)At Talent Worx, we are looking for a dedicated Site Reliability Engineer (SRE) to join our team. This role involves maintaining high availability and reliability of our services through the application of software engineering practices and systems administration skills. The ideal candidate will bridge the gap between...
-
Site Reliability Engineer
7 days ago
Pune, India Talent Worx Full timeSite Reliability Engineer (SRE) At Talent Worx, we are looking for a dedicated Site Reliability Engineer (SRE) to join our team. This role involves maintaining high availability and reliability of our services through the application of software engineering practices and systems administration skills. The ideal candidate will bridge the gap between...
-
Site Reliability Engineer
2 weeks ago
Pune, Maharashtra, India Relanto Full time ₹ 12,00,000 - ₹ 36,00,000 per yearJob Title: Site Reliability EngineerSummaryWe are looking for a Site Reliability Engineer to join our Digital & Transformation department. The ideal candidate will have 4 years of experience in this field and will be responsible for ensuring the reliability, availability, and performance of our systems and applications.Roles And Responsibilities4 years of...
-
Site Reliability Engineer
3 weeks ago
Bengaluru, Mumbai, Pune, India Xoriant Corporation Full timeJob Description Job description Site Reliability Engineer Pune, Mumbai, Bangalore, Gurgaon , Chennai Full Time Hybrid (3 dyas a week) As a Site Reliability Engineer (SRE), you will play a crucial role in maintaining and improving the reliability and performance of our systems and applications. You will leverage Datadogs monitoring and observability platform...
-
Site Reliability Engineer
4 days ago
Noida, Uttar Pradesh, India Cloud Angles Digital Transformation Full time ₹ 15,00,000 - ₹ 25,00,000 per yearJob SummarySite Reliability Engineers (SRE's) cover the intersection of Software Engineer and Systems Administrator. In other words, they can both create code and manage the infrastructure on which the code runs. This is a very wide skillset, but the end goal of an SRE is always the same: to ensure that all SLAs are met, but not exceeded, so as to balance...
-
Site Reliability Engineer
2 weeks ago
noida, India ALIQAN Technologies Full timeGreetings from ALIQAN TechnologiesWe are hiring Site Reliability & DevOps Engineer for one of our client MNCs.Job Title:Devops EngineerExp: 4-6 YrsLocation:Remote Key ResponsibilitiesInfrastructure & Platform Engineering Design, implement, and maintain scalable cloud infrastructure using Infrastructure as Code (IaC) principles Architect and manage...
-
Site Reliability Engineer
7 days ago
Pune, Maharashtra, India Fiserv Full time ₹ 8,00,000 - ₹ 24,00,000 per yearSite Reliability EngineerExp. Range-8 to14 YearsWhat does a successful Site Reliability Engineer (SRE) Expert do at Fiserv?The Site reliability engineer blends the principles of software engineering with the discipline of operations to create high-performing and reliable software systems. They are tasked with designing and implementing tools, processes, and...
-
Site Reliability Engineer
2 weeks ago
Noida, India ALIQAN Technologies Full timeGreetings from ALIQAN Technologies! We are hiring Site Reliability & DevOps Engineer for one of our client MNCs. Job Title:Devops Engineer Exp: 4-6 Yrs Location:Remote Key Responsibilities Infrastructure & Platform Engineering Design, implement, and maintain scalable cloud infrastructure using Infrastructure as Code (IaC) principles Architect and manage...
-
Specialist - Site Reliability Engineer
2 weeks ago
Pune, Maharashtra, India Accelya Group Full time ₹ 20,00,000 - ₹ 25,00,000 per yearFor more than 40 years, Accelya has been the industry's partner for change, simplifying airline financial and commercial processes and empowering the air transport community to take better control of the future. Whether partnering with IATA on industry-wide initiatives or enabling digital transformation to simplify airline processes, Accelya drives the...
-
Specialist - Site Reliability Engineer
2 weeks ago
Pune, Maharashtra, India Accelya Group Full time ₹ 15,00,000 - ₹ 25,00,000 per yearFor more than 40 years, Accelya has been the industry's partner for change, simplifying airline financial and commercial processes and empowering the air transport community to take better control of the future. Whether partnering with IATA on industry-wide initiatives or enabling digital transformation to simplify airline processes, Accelya drives the...