
Senior Site Reliability Engineer - ELK Expert
6 hours ago
Location: India (Remote) - Must be available to work in the EST (US/Canada) Time Zone.
Role Summary:
Are you a Senior Site Reliability Engineer (SRE) with deep ELK expertise, ready to take ownership of large-scale observability infrastructure?
We're looking for an SRE with 7+ years of experience, including 4+ years specializing in the ELK stack (Elasticsearch, Logstash, Kibana), to join our Platform Engineering Practice. In this role, you’ll design, manage, and scale ELK clusters ingesting 2–3+ TB/day, enhance reliability across distributed systems, and drive automation within Azure cloud environments. This is a high-impact engineering opportunity focused on performance, observability, and operational excellence at scale.
- Career Growth: Work alongside industry experts on cutting-edge cloud technologies
- Competitive Compensation and Benefits: We recognize and reward top talent
- Exciting, Impactful Work: Design and build scalable, resilient cloud environments
- Strategic Platform Role: Contribute to the foundation of next-gen observability and reliability infrastructure
Key Responsibilities:
- Design and Optimize Cloud Infrastructure: Architect scalable, fault-tolerant systems on Microsoft Azure
- Automate Everything: Use Terraform, Ansible, and GitHub Actions to streamline deployment and configuration
- Ensure Reliability and Performance: Proactively monitor, troubleshoot, and resolve production issues using Prometheus, Grafana, and Azure Monitor
- Enhance Security and Compliance: Implement security best practices across DevOps workflows
- Collaborate and Innovate: Work closely with engineering, security, and operations teams to drive automation and efficiency
- Manage and scale large ELK clusters handling 2–3+ TB/day log volumes, ensuring high availability and performance
- Optimize ELK architecture: Implement efficient index lifecycle policies, shard strategies, and hot-warm-cold tiered storage
- Build and tune log pipelines: Scale Logstash and Beats pipelines across distributed environments
- Support Kibana observability layers: Create dashboards, visualizations, and custom alerting frameworks (e.g., Watcher, ElastAlert)
- 7+ years of experience in Site Reliability Engineering, DevOps, or Cloud Engineering
- 4+ years of dedicated, hands-on experience with ELK (Elasticsearch, Logstash, Kibana)
- Strong experience managing large-scale ELK clusters in production with heavy ingestion (multi-TB/day)
- Deep knowledge of index tuning, shard allocation, ILM policies, and scaling ELK components
- Expertise in GitHub Actions, Terraform, Ansible, and Infrastructure as Code (IaC)
- Proficiency in Python, Go, or Bash for automation and scripting
- Deep understanding of Kubernetes, Docker, and cloud-native architectures
- Experience with observability tools such as Prometheus, Grafana, Azure Monitor
- Ability to work in a fast-paced, collaborative environment and solve complex operational issues
- Bachelor’s or Master’s degree in Computer Science, Information Technology, or a related field
- Microsoft Azure certifications: AZ-104, AZ-400
-
Senior Site Reliability Engineer - ELK Expert
2 weeks ago
India iVedha Inc. Full time Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice Location: India (Remote) - Must be available to work in the EST (US/Canada) Time Zone. Role Summary: Are you a Senior Site Reliability Engineer (SRE) with deep ELK expertise, ready to take ownership of large-scale observability infrastructure? We're looking for an SRE with 7+...
-
Senior Cloud Reliability Engineer
2 weeks ago
India beBeeExpertise Full time ₹ 24,56,888 - ₹ 32,13,625 Job Title: Senior Cloud Reliability Engineer About the Role: We are seeking a highly skilled and experienced Senior Cloud Reliability Engineer to join our team. As a key member of our Platform Engineering Practice, you will design, manage, and scale large-scale observability infrastructure. Your primary focus will be on ensuring the high availability and...
-
India beBeeEngineering Full time ₹ 1,04,000 - ₹ 1,30,878 Job Description: Senior Site Reliability Engineer - ELK Expert. Design and manage large-scale observability infrastructure, ensuring high availability and performance. Collaborate with engineering teams to drive automation and efficiency. Key Responsibilities: Manage and scale large ELK clusters handling 2–3+ TB/day log volumes. Optimize ELK architecture,...
-
Highly Skilled ELK Expert
2 weeks ago
India beBeesite Full time ₹ 16,50,000 - ₹ 20,29,000 Job Overview: We are seeking a highly skilled Site Reliability Engineer with expertise in ELK to join our team.
-
Site Reliability Engineer
2 weeks ago
India Employ Full time Role - Site Reliability Engineer (SRE)/ Platform Engineering/ or DevOps Engineering roles Location – Bangalore/ Remote Type - Contract Work Ex - 4-6 yrs We're working with an AI product company that's building the next generation of GenAI-powered developer platforms. We're looking for an experienced Site Reliability Engineer to join their Platform Engineering...
-
Site Reliability Engineer
6 hours ago
India Employ Full time Role - Site Reliability Engineer (SRE)/ Platform Engineering/ or DevOps Engineering roles Location – Fully Remote Type - 6 months Contract Work Ex - 5+ Yrs We’re working with an AI product company that’s building the next generation of GenAI-powered developer platforms. We’re looking for an experienced Site Reliability...
-
Site Reliability Engineer
2 weeks ago
India CES Full time We're looking for a highly skilled Site Reliability Engineer to help us build, manage, and scale modern infrastructure systems for high-availability applications. If you're passionate about automation, cloud platforms, and solving tough operational challenges, we would love to hear from you. Key Skills and Competencies - 3+ years of extensive experience with...