Senior Site Reliability Engineer- ELK Expert

1 day ago


New Delhi, India iVedha Inc. Full time

Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering PracticeLocation: India (Remote) - Must be available to work in the EST (US/Canada) Time Zone.Role Summary:Are you a Senior Site Reliability Engineer (SRE) with deep ELK expertise, ready to take ownership of large-scale observability infrastructure?We're looking for an SRE with 7+ years of experience, including 4+ years specializing in the ELK stack (Elasticsearch, Logstash, Kibana), to join our Platform Engineering Practice. In this role, you’ll design, manage, and scale ELK clusters ingesting 2–3+ TB/day, enhance reliability across distributed systems, and drive automation within Azure cloud environments. This is a high-impact engineering opportunity focused on performance, observability, and operational excellence at scale.Why Join Us- Career Growth: Work alongside industry experts on cutting-edge cloud technologies - Competitive Compensation and Benefits: We recognize and reward top talent - Exciting, Impactful Work: Design and build scalable, resilient cloud environments - Strategic Platform Role: Contribute to the foundation of next-gen observability and reliability infrastructureWhat You Will Do- Design and Optimize Cloud Infrastructure: Architect scalable, fault-tolerant systems on Microsoft Azure - Automate Everything: Use Terraform, Ansible, and GitHub Actions to streamline deployment and configuration - Ensure Reliability and Performance: Proactively monitor, troubleshoot, and resolve production issues using Prometheus, Grafana, and Azure Monitor - Enhance Security and Compliance: Implement security best practices across DevOps workflows - Collaborate and Innovate: Work closely with engineering, security, and operations teams to drive automation and efficiency - Manage and scale large ELK clusters handling 2–3+ TB/day log volumes, ensuring high availability and performance - Optimize ELK architecture: Implement efficient index lifecycle policies, shard strategies, and hot-warm-cold tiered storage - Build and tune log pipelines: Scale Logstash and Beats pipelines across distributed environments - Support Kibana observability layers: Create dashboards, visualizations, and custom alerting frameworks (e.g., Watcher, ElastAlert)What You Bring- 7+ years of experience in Site Reliability Engineering, DevOps, or Cloud Engineering - 4+ years of dedicated, hands-on experience with ELK (Elasticsearch, Logstash, Kibana) - Strong experience managing large-scale ELK clusters in production with heavy ingestion (multi-TB/day) - Deep knowledge of index tuning, shard allocation, ILM policies, and scaling ELK components - Expertise in GitHub Actions, Terraform, Ansible, and Infrastructure as Code (IaC) - Proficiency in Python, Go, or Bash for automation and scripting - Deep understanding of Kubernetes, Docker, and cloud-native architectures - Experience with observability tools such as Prometheus, Grafana, Azure Monitor - Ability to work in a fast-paced, collaborative environment and solve complex operational issuesEducation- Bachelor’s or Master’s degree in Computer Science, Information Technology, or a related fieldCertifications (Nice to Have)- Microsoft Azure certifications: AZ-104, AZ-400



  • Delhi, India iVedha Inc. Full time

    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering PracticeLocation: India (Remote) - Must be available to work in the EST (US/Canada) Time Zone.Role Summary:Are you a Senior Site Reliability Engineer (SRE) with deep ELK expertise, ready to take ownership of large-scale observability infrastructure?We're looking for an SRE with 7+...


  • Delhi, NCR, New Delhi, Pune, India Ithena Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    Senior Site Reliability Engineer (SRE) Backend SystemsLocation: Remote (India) Pune/Delhi/Delhi NCRMumbaiExperience: 8+ years Were looking for a Senior SRE to join our backend team and help scale our real-time, event-driven platform. This role goes beyond traditional DevOps we're seeking engineers who can write high-quality code, debug complex distributed...


  • New Delhi, India AutoRABIT Full time

    AutoRABIT ProfileAutoRABIT is the leader in DevSecOps for SaaS platforms such as Salesforce. Its unique metadata-aware capability makes Release Management, Version Control, and Backup & Recovery complete, reliable, and effective. AutoRABIT’s highly scalable framework covers the entire DevSecOps cycle, which makes it the favourite platform for companies,...


  • Delhi, India Sapaad Full time

    WHO WE ARESapaad is a global leader in unified commerce platforms, delivering world-class software solutions for the food and beverage industry. Our flagship product, also named Sapaad, has achieved remarkable success over the past decade, empowering thousands of F&B businesses across 40+ countries —with many more coming onboard each day.Driven by a...


  • Delhi, India Sapaad Full time

    WHO WE ARESapaad is a global leader in unified commerce platforms, delivering world-class software solutions for the food and beverage industry. Our flagship product, also named Sapaad, has achieved remarkable success over the past decade, empowering thousands of F&B businesses across 40+ countries—with many more coming onboard each day.Driven by a...


  • New Delhi, India Tata Consultancy Services Full time

    Dear Candidates,Greetings from TCS!!!TCS is looking for Senior Site Reliability Engineer – AWSExperience: 8-12 yearsLocation: ChennaiMust have skills:- Design, implement, and maintain scalable, secure, and highly available infrastructure on AWS - Develop and improve CI/CD pipelines, Infrastructure as Code (IaC) using Terraform, Harness - Own and implement...


  • New Delhi, India iVedha Inc. Full time

    Role: Senior ELK Full Stack Developer (Python & Node.JS)Location: India (Remote)Type: Full-time or ConsultantPractice: iVedha Platform Engineering PracticeJob Description:iVedha is seeking a Senior ELK Full Stack Developer to join our Platform Engineering Practice. This strategic role will drive the development of observability solutions using the ELK stack,...


  • Bengaluru, Delhi, Mumbai, NCR, India Avom Consultants Full time ₹ 8,00,000 - ₹ 12,00,000 per year

    Experience in Site Reliability Engineering, DevOps,managing teams, including mentoring and developing engineers.Prometheus, Grafana, ELK Stack, Splunk, Datadog, New Relic, AWS, GCP, Azure,Docker, Kubernetes,Python, Go, Bash, or simila.


  • New Delhi, India ANSR Full time

    ANSR is hiring for one of its clients.About T-Mobile:T-Mobile US, Inc. (NASDAQ: TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mobile. Customers benefit from an unmatched combination of value, quality, and exceptional...


  • New Delhi, India ANSR Full time

    ANSR is hiring for one of its clients. About T-Mobile: T-Mobile US, Inc. (NASDAQ: TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mobile. Customers benefit from an unmatched combination of value, quality, and...