Senior Site Reliability Engineer - ELK Expert

22 hours ago


Delhi, India iVedha Inc. Full time
Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice
Location: India (Remote) - Must be available to work in the EST (US/Canada) Time Zone.
Role Summary:
Are you a Senior Site Reliability Engineer (SRE) with deep ELK expertise, ready to take ownership of large-scale observability infrastructure?
We're looking for an SRE with 7+ years of experience, including 4+ years specializing in the ELK stack (Elasticsearch, Logstash, Kibana), to join our Platform Engineering Practice. In this role, you’ll design, manage, and scale ELK clusters ingesting 2–3+ TB/day, enhance reliability across distributed systems, and drive automation within Azure cloud environments. This is a high-impact engineering opportunity focused on performance, observability, and operational excellence at scale.
Why Join Us
- Career Growth: Work alongside industry experts on cutting-edge cloud technologies
- Competitive Compensation and Benefits: We recognize and reward top talent
- Exciting, Impactful Work: Design and build scalable, resilient cloud environments
- Strategic Platform Role: Contribute to the foundation of next-gen observability and reliability infrastructure
What You Will Do
- Design and Optimize Cloud Infrastructure: Architect scalable, fault-tolerant systems on Microsoft Azure
- Automate Everything: Use Terraform, Ansible, and GitHub Actions to streamline deployment and configuration
- Ensure Reliability and Performance: Proactively monitor, troubleshoot, and resolve production issues using Prometheus, Grafana, and Azure Monitor
- Enhance Security and Compliance: Implement security best practices across DevOps workflows
- Collaborate and Innovate: Work closely with engineering, security, and operations teams to drive automation and efficiency
- Manage and scale large ELK clusters handling 2–3+ TB/day log volumes, ensuring high availability and performance
- Optimize ELK architecture: Implement efficient index lifecycle policies, shard strategies, and hot-warm-cold tiered storage
- Build and tune log pipelines: Scale Logstash and Beats pipelines across distributed environments
- Support Kibana observability layers: Create dashboards, visualizations, and custom alerting frameworks (e.g., Watcher, ElastAlert)
What You Bring
- 7+ years of experience in Site Reliability Engineering, DevOps, or Cloud Engineering
- 4+ years of dedicated, hands-on experience with ELK (Elasticsearch, Logstash, Kibana)
- Strong experience managing large-scale ELK clusters in production with heavy ingestion (multi-TB/day)
- Deep knowledge of index tuning, shard allocation, ILM policies, and scaling ELK components
- Expertise in GitHub Actions, Terraform, Ansible, and Infrastructure as Code (IaC)
- Proficiency in Python, Go, or Bash for automation and scripting
- Deep understanding of Kubernetes, Docker, and cloud-native architectures
- Experience with observability tools such as Prometheus, Grafana, Azure Monitor
- Ability to work in a fast-paced, collaborative environment and solve complex operational issues
Education
- Bachelor’s or Master’s degree in Computer Science, Information Technology, or a related field
Certifications (Nice to Have)
- Microsoft Azure certifications: AZ-104, AZ-400

  • Delhi, Delhi, India beBeeELKexpert Full time US$ 1,50,000 - US$ 2,10,000

    Senior Site Reliability Engineer ELK Expert. We are seeking an exceptional Senior Site Reliability Engineer with in-depth expertise in the ELK stack to join our team. This role requires a highly skilled professional who can design, manage, and scale large-scale observability infrastructure, enhancing reliability across distributed systems. The ideal candidate...


  • Delhi, India Employ Full time

    Role: Site Reliability Engineer (SRE) / Platform Engineering / DevOps Engineering roles. Location: Fully Remote. Type: 6-month contract. Work experience: 5+ years. We’re working with an AI product company that’s building the next generation of GenAI-powered developer platforms. We’re looking for an experienced Site Reliability Engineer to join their Platform...


  • Delhi, Delhi, India beBeeSenior Full time US$ 1,50,000 - US$ 2,10,000

    We're looking for an experienced Senior Cloud Reliability Engineer to join our team. As a key member of the engineering team, you will be responsible for designing, managing, and scaling large-scale observability infrastructure using the ELK stack (Elasticsearch, Logstash, Kibana). Key Responsibilities: Cloud Infrastructure Design: Architect scalable,...


  • Delhi, NCR, New Delhi, Pune, India Ithena Full time ₹ 15,00,000 - ₹ 20,00,000 per year

    Senior Site Reliability Engineer (SRE), Backend Systems. Location: Remote (India), Pune/Delhi/Delhi NCR/Mumbai. Experience: 8+ years. We're looking for a Senior SRE to join our backend team and help scale our real-time, event-driven platform. This role goes beyond traditional DevOps: we're seeking engineers who can write high-quality code, debug complex distributed...

  • Site Engineer

    5 days ago


    Delhi, India Engineer Department Full time

    Company Description: Engineer Department is a company dedicated to providing efficient and effective engineering solutions for public infrastructure and services. Our team is committed to ensuring the highest standards in project management and execution, serving the community with integrity and professionalism. Role Description: This is a full-time...


  • Delhi, India Elgebra Full time

    Hiring: Site Reliability Engineer – 7+ Years. Location: Bangalore / Chennai. Payroll: Elgebra. Client: Qincline. Joining: Immediate to 15 Days. Role Overview: We are looking for an experienced Site Reliability Engineer (SRE) with over 6 years of expertise to join our team. The ideal candidate will have strong technical skills, a problem-solving mindset, and...


  • Delhi, Delhi, India beBeeSite Full time ₹ 18,00,000 - ₹ 25,50,000

    Job Summary: We are seeking an experienced Site Reliability Engineer to drive the reliability and performance of our systems. About the Role: The ideal candidate will have a strong understanding of distributed systems, cloud platforms (AWS, Azure or GCP), and microservices architecture. They will be responsible for ensuring scalability and availability of...