Senior Site Reliability Engineer- ELK Expert

2 days ago


Delhi, India iVedha Inc. Full time

Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering PracticeLocation: India (Remote) -Must be available to work in the EST (US/Canada) Time Zone.Role Summary:Are you a Senior Site Reliability Engineer (SRE) with deep ELK expertise, ready to take ownership of large-scale observability infrastructure?We're looking for an SRE with7+ years of experience , including4+ years specializing in the ELK stack (Elasticsearch, Logstash, Kibana) , to join ourPlatform Engineering Practice . In this role, you’ll design, manage, and scale ELK clusters ingesting2–3+ TB/day , enhance reliability across distributed systems, and drive automation within Azure cloud environments. This is a high-impact engineering opportunity focused on performance, observability, and operational excellence at scale.Why Join UsCareer Growth:Work alongside industry experts on cutting-edge cloud technologiesCompetitive Compensation and Benefits:We recognize and reward top talentExciting, Impactful Work:Design and build scalable, resilient cloud environmentsStrategic Platform Role:Contribute to the foundation of next-gen observability and reliability infrastructureWhat You Will DoDesign and Optimize Cloud Infrastructure:Architect scalable, fault-tolerant systems on Microsoft AzureAutomate Everything:Use Terraform, Ansible, and GitHub Actions to streamline deployment and configurationEnsure Reliability and Performance:Proactively monitor, troubleshoot, and resolve production issues using Prometheus, Grafana, and Azure MonitorEnhance Security and Compliance:Implement security best practices across DevOps workflowsCollaborate and Innovate:Work closely with engineering, security, and operations teams to drive automation and efficiencyManage and scale large ELK clustershandling2–3+ TB/daylog volumes, ensuring high availability and performanceOptimize ELK architecture:Implement efficient index lifecycle policies, shard strategies, and hot-warm-cold tiered storageBuild and tune log pipelines:Scale Logstash and Beats pipelines across distributed environmentsSupport Kibana observability layers:Create dashboards, visualizations, and custom alerting frameworks (e.g., Watcher, ElastAlert)What You Bring7+ years of experiencein Site Reliability Engineering, DevOps, or Cloud Engineering4+ years of dedicated, hands-on experience with ELK (Elasticsearch, Logstash, Kibana)Strong experience managinglarge-scale ELK clusters in productionwith heavy ingestion (multi-TB/day)Deep knowledge ofindex tuning, shard allocation, ILM policies , and scaling ELK componentsExpertise in GitHub Actions, Terraform, Ansible, and Infrastructure as Code (IaC)Proficiency inPython, Go, or Bashfor automation and scriptingDeep understanding ofKubernetes, Docker , and cloud-native architecturesExperience withobservability toolssuch as Prometheus, Grafana, Azure MonitorAbility to work in a fast-paced, collaborative environment and solve complex operational issuesEducationBachelor’s or Master’s degree in Computer Science, Information Technology, or a related fieldCertifications (Nice to Have)Microsoft Azure certifications:AZ-104 ,AZ-400



  • Delhi, Delhi, India beBeeELKexpert Full time US$ 1,50,000 - US$ 2,10,000

    Senior Site Reliability Engineer ELK ExpertWe are seeking an exceptional Senior Site Reliability Engineer with in-depth expertise in the ELK stack to join our team.This role requires a highly skilled professional who can design, manage, and scale large-scale observability infrastructure, enhancing reliability across distributed systems. The ideal candidate...


  • Delhi, Delhi, India beBeeSystem Full time ₹ 1,80,00,000 - ₹ 2,52,00,000

    Job Description:We are seeking a skilled and experienced Senior Site Reliability Engineer to join our organization. The ideal candidate will have a strong background in engineering and experience working with cross-functional teams.Key Responsibilities:Bridge the gap between development and operations teams by developing scripts, implementing tools, and...


  • Delhi, India Employ Full time

    Role -Site Reliability Engineer (SRE)/ Platform Engineering/ or DevOps Engineering rolesLocation –Fully RemoteType -6 months ContractWork Ex -5+ YrsWe’re working with aAI product companythat’s building the next generation ofGenAI powered developer platforms .We’re looking for anexperienced Site Reliability Engineerto join theirPlatform Engineering...


  • Delhi, Delhi, India beBeeSenior Full time US$ 1,50,000 - US$ 2,10,000

    We're looking for an experienced Senior Cloud Reliability Engineer to join our team. As a key member of the engineering team, you will be responsible for designing, managing, and scaling large-scale observability infrastructure using the ELK stack (Elasticsearch, Logstash, Kibana).Key Responsibilities:Cloud Infrastructure Design: Architect scalable,...


  • Delhi, NCR, New Delhi, Pune, India Ithena Full time ₹ 15,00,000 - ₹ 20,00,000 per year

    Senior Site Reliability Engineer (SRE) Backend SystemsLocation: Remote (India) Pune/Delhi/Delhi NCRMumbaiExperience: 8+ years Were looking for a Senior SRE to join our backend team and help scale our real-time, event-driven platform. This role goes beyond traditional DevOps we're seeking engineers who can write high-quality code, debug complex distributed...


  • Bengaluru, Delhi, Mumbai, NCR, India Avom Consultants Full time ₹ 8,00,000 - ₹ 12,00,000 per year

    Experience in Site Reliability Engineering, DevOps,managing teams, including mentoring and developing engineers.Prometheus, Grafana, ELK Stack, Splunk, Datadog, New Relic, AWS, GCP, Azure,Docker, Kubernetes,Python, Go, Bash, or simila.

  • Site Engineer

    1 week ago


    Delhi, Delhi, India Engineer Department Full time ₹ 15,00,000 - ₹ 28,00,000 per year

    Company DescriptionEngineer Department is a company We are dedicated to providing efficient and effective engineering solutions for public infrastructure and services. Our team is committed to ensuring the highest standards in project management and execution, serving the community with integrity and professionalism.Role DescriptionThis is a full-time...


  • Delhi, India Xebia Full time

    We are looking for ahighly skilled AWS Engineer with strong Python development and Chaos Engineering expertiseto design, build, and validate resilient, scalable, and automated cloud-native environments. The ideal candidate will combine cloud engineering, DevOps, and chaos experimentation to improve reliability, fault tolerance, and operational efficiency of...


  • Delhi, Delhi, India beBeeSite Full time ₹ 18,00,000 - ₹ 25,50,000

    Job SummaryWe are seeking an experienced Site Reliability Engineer to drive the reliability and performance of our systems.About the Role:The ideal candidate will have a strong understanding of distributed systems, cloud platforms (AWS, Azure or GCP), and microservices architecture.They will be responsible for ensuring scalability and availability of...


  • Delhi, India Concord Full time

    SRE Sr. Engineers (Individual Contributors)Key Attributes :Strong SRE (Site Reliability Engineering)experienceDevOps skills– CI/CD, monitoring, automation, infrastructure as code, etc.Excellenttroubleshooting and debuggingskills (infrastructure + application level)Perseverance– must push through complex/challenging issues without giving upAble to"figure...