
Senior Site Reliability Engineer - ELK Expert
2 days ago
Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice
Location: India (Remote). Must be available to work in the EST (US/Canada) time zone.

Role Summary:
Are you a Senior Site Reliability Engineer (SRE) with deep ELK expertise, ready to take ownership of large-scale observability infrastructure? We're looking for an SRE with 7+ years of experience, including 4+ years specializing in the ELK stack (Elasticsearch, Logstash, Kibana), to join our Platform Engineering Practice. In this role, you’ll design, manage, and scale ELK clusters ingesting 2–3+ TB/day, enhance reliability across distributed systems, and drive automation within Azure cloud environments. This is a high-impact engineering opportunity focused on performance, observability, and operational excellence at scale.

Why Join Us
Career Growth: Work alongside industry experts on cutting-edge cloud technologies
Competitive Compensation and Benefits: We recognize and reward top talent
Exciting, Impactful Work: Design and build scalable, resilient cloud environments
Strategic Platform Role: Contribute to the foundation of next-gen observability and reliability infrastructure

What You Will Do
Design and Optimize Cloud Infrastructure: Architect scalable, fault-tolerant systems on Microsoft Azure
Automate Everything: Use Terraform, Ansible, and GitHub Actions to streamline deployment and configuration
Ensure Reliability and Performance: Proactively monitor, troubleshoot, and resolve production issues using Prometheus, Grafana, and Azure Monitor
Enhance Security and Compliance: Implement security best practices across DevOps workflows
Collaborate and Innovate: Work closely with engineering, security, and operations teams to drive automation and efficiency
Manage and scale large ELK clusters handling 2–3+ TB/day log volumes, ensuring high availability and performance
Optimize ELK architecture: Implement efficient index lifecycle policies, shard strategies, and hot-warm-cold tiered storage
Build and tune log pipelines: Scale Logstash and Beats pipelines across distributed environments
Support Kibana observability layers: Create dashboards, visualizations, and custom alerting frameworks (e.g., Watcher, ElastAlert)

What You Bring
7+ years of experience in Site Reliability Engineering, DevOps, or Cloud Engineering
4+ years of dedicated, hands-on experience with ELK (Elasticsearch, Logstash, Kibana)
Strong experience managing large-scale ELK clusters in production with heavy ingestion (multi-TB/day)
Deep knowledge of index tuning, shard allocation, ILM policies, and scaling ELK components
Expertise in GitHub Actions, Terraform, Ansible, and Infrastructure as Code (IaC)
Proficiency in Python, Go, or Bash for automation and scripting
Deep understanding of Kubernetes, Docker, and cloud-native architectures
Experience with observability tools such as Prometheus, Grafana, Azure Monitor
Ability to work in a fast-paced, collaborative environment and solve complex operational issues

Education
Bachelor’s or Master’s degree in Computer Science, Information Technology, or a related field

Certifications (Nice to Have)
Microsoft Azure certifications: AZ-104, AZ-400
-
Delhi, Delhi, India beBeeELKexpert Full time US$ 1,50,000 - US$ 2,10,000
Senior Site Reliability Engineer ELK Expert
We are seeking an exceptional Senior Site Reliability Engineer with in-depth expertise in the ELK stack to join our team. This role requires a highly skilled professional who can design, manage, and scale large-scale observability infrastructure, enhancing reliability across distributed systems. The ideal candidate...
-
Senior Site Reliability Expert
2 weeks ago
Delhi, Delhi, India beBeeSystem Full time ₹ 1,80,00,000 - ₹ 2,52,00,000
Job Description: We are seeking a skilled and experienced Senior Site Reliability Engineer to join our organization. The ideal candidate will have a strong background in engineering and experience working with cross-functional teams.
Key Responsibilities: Bridge the gap between development and operations teams by developing scripts, implementing tools, and...
-
Site Reliability Engineer
2 days ago
Delhi, India Employ Full time
Role: Site Reliability Engineer (SRE), Platform Engineering, or DevOps Engineering roles
Location: Fully Remote
Type: 6-month contract
Work Ex: 5+ Yrs
We’re working with an AI product company that’s building the next generation of GenAI-powered developer platforms. We’re looking for an experienced Site Reliability Engineer to join their Platform Engineering...
-
Senior Cloud Reliability Engineer
1 week ago
Delhi, Delhi, India beBeeSenior Full time US$ 1,50,000 - US$ 2,10,000
We're looking for an experienced Senior Cloud Reliability Engineer to join our team. As a key member of the engineering team, you will be responsible for designing, managing, and scaling large-scale observability infrastructure using the ELK stack (Elasticsearch, Logstash, Kibana).
Key Responsibilities:
Cloud Infrastructure Design: Architect scalable,...
-
Senior Site Reliability Engineer
1 week ago
Delhi, NCR, New Delhi, Pune, India Ithena Full time ₹ 15,00,000 - ₹ 20,00,000 per year
Senior Site Reliability Engineer (SRE) Backend Systems
Location: Remote (India), Pune/Delhi/Delhi NCR/Mumbai
Experience: 8+ years
We're looking for a Senior SRE to join our backend team and help scale our real-time, event-driven platform. This role goes beyond traditional DevOps: we're seeking engineers who can write high-quality code, debug complex distributed...
-
Site Reliability Engineer Lead
2 weeks ago
Bengaluru, Delhi, Mumbai, NCR, India Avom Consultants Full time ₹ 8,00,000 - ₹ 12,00,000 per year
Experience in Site Reliability Engineering, DevOps, and managing teams, including mentoring and developing engineers.
Prometheus, Grafana, ELK Stack, Splunk, Datadog, New Relic, AWS, GCP, Azure, Docker, Kubernetes, Python, Go, Bash, or similar.
-
Site Engineer
1 week ago
Delhi, Delhi, India Engineer Department Full time ₹ 15,00,000 - ₹ 28,00,000 per year
Company Description: Engineer Department is a company dedicated to providing efficient and effective engineering solutions for public infrastructure and services. Our team is committed to ensuring the highest standards in project management and execution, serving the community with integrity and professionalism.
Role Description: This is a full-time...
-
Site Reliability Engineer
2 days ago
Delhi, India Xebia Full time
We are looking for a highly skilled AWS Engineer with strong Python development and Chaos Engineering expertise to design, build, and validate resilient, scalable, and automated cloud-native environments. The ideal candidate will combine cloud engineering, DevOps, and chaos experimentation to improve reliability, fault tolerance, and operational efficiency of...
-
Reliability Systems Expert
5 days ago
Delhi, Delhi, India beBeeSite Full time ₹ 18,00,000 - ₹ 25,50,000
Job Summary: We are seeking an experienced Site Reliability Engineer to drive the reliability and performance of our systems.
About the Role: The ideal candidate will have a strong understanding of distributed systems, cloud platforms (AWS, Azure or GCP), and microservices architecture. They will be responsible for ensuring scalability and availability of...
-
Site Reliability Engineer
2 days ago
Delhi, India Concord Full time
SRE Sr. Engineers (Individual Contributors)
Key Attributes:
Strong SRE (Site Reliability Engineering) experience
DevOps skills: CI/CD, monitoring, automation, infrastructure as code, etc.
Excellent troubleshooting and debugging skills (infrastructure + application level)
Perseverance: must push through complex/challenging issues without giving up
Able to "figure...