Senior Site Reliability Engineer- ELK Expert
4 weeks ago
Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice Location: India (Remote) - Must be available to work in the EST (US/Canada) Time Zone. Role Summary: Are you a Senior Site Reliability Engineer (SRE) with deep ELK expertise, ready to take ownership of large-scale observability infrastructure? We're looking for an SRE with 7+ years of experience , including 4+ years specializing in the ELK stack (Elasticsearch, Logstash, Kibana) , to join our Platform Engineering Practice . In this role, you’ll design, manage, and scale ELK clusters ingesting 2–3+ TB/day , enhance reliability across distributed systems, and drive automation within Azure cloud environments. This is a high-impact engineering opportunity focused on performance, observability, and operational excellence at scale.Why Join Us Career Growth: Work alongside industry experts on cutting-edge cloud technologies Competitive Compensation and Benefits: We recognize and reward top talent Exciting, Impactful Work: Design and build scalable, resilient cloud environments Strategic Platform Role: Contribute to the foundation of next-gen observability and reliability infrastructure What You Will Do Design and Optimize Cloud Infrastructure: Architect scalable, fault-tolerant systems on Microsoft Azure Automate Everything: Use Terraform, Ansible, and GitHub Actions to streamline deployment and configuration Ensure Reliability and Performance: Proactively monitor, troubleshoot, and resolve production issues using Prometheus, Grafana, and Azure Monitor Enhance Security and Compliance: Implement security best practices across DevOps workflows Collaborate and Innovate: Work closely with engineering, security, and operations teams to drive automation and efficiency Manage and scale large ELK clusters handling 2–3+ TB/day log volumes, ensuring high availability and performance Optimize ELK architecture: Implement efficient index lifecycle policies, shard strategies, and hot-warm-cold tiered storage Build and tune log pipelines: Scale Logstash and Beats pipelines across distributed environments Support Kibana observability layers: Create dashboards, visualizations, and custom alerting frameworks (e.g., Watcher, ElastAlert) What You Bring 7+ years of experience in Site Reliability Engineering, DevOps, or Cloud Engineering 4+ years of dedicated, hands-on experience with ELK (Elasticsearch, Logstash, Kibana) Strong experience managing large-scale ELK clusters in production with heavy ingestion (multi-TB/day) Deep knowledge of index tuning, shard allocation, ILM policies , and scaling ELK components Expertise in GitHub Actions, Terraform, Ansible, and Infrastructure as Code (IaC) Proficiency in Python, Go, or Bash for automation and scripting Deep understanding of Kubernetes, Docker , and cloud-native architectures Experience with observability tools such as Prometheus, Grafana, Azure Monitor Ability to work in a fast-paced, collaborative environment and solve complex operational issues Education Bachelor’s or Master’s degree in Computer Science, Information Technology, or a related field Certifications (Nice to Have) Microsoft Azure certifications: AZ-104 , AZ-400
-
Senior Site Reliability Engineer- ELK Expert
4 weeks ago
India iVedha Inc. Full timeSenior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering PracticeLocation: India (Remote) - Must be available to work in the EST (US/Canada) Time Zone.Role Summary:Are you a Senior Site Reliability Engineer (SRE) with deep ELK expertise, ready to take ownership of large-scale observability infrastructure?We're looking for an SRE with 7+...
-
Senior Site Reliability Engineer
4 weeks ago
India iVoyant Full timeOne of our clients is looking for an experienced Senior Site Reliability Engineer (SRE) - Mission-Critical SaaS Cloud Products to join their team.Key Responsibilities:Reliability and Performance Management:Design, implement, and maintain highly available, scalable, and resilient cloud-native architectures for mission-critical SaaS products. Develop and...
-
Site Reliability Engineer
2 weeks ago
India InOrg Full time ₹ 12,00,000 - ₹ 36,00,000 per yearAbout VivaOps :VivaOps is a leading DevSecOps platform company specializing in GitLab - The comprehensive DevOps platform, to transform and secure software development processes. We help organizations to streamline their DevSecOps journey by offering a complete range of GitLab services, from advisory, to implementation and managed services, to accelerate...
-
ELK Developer
1 week ago
Bengaluru, India Dicetek LLC Full timeJob Description The ELK Developer is responsible to enhance the monitoring of the Business-Critical applications of the enterprise and align with the standards and policies defined by the Enterprise Tools and CSI department. An ELK Developer is a specialist who uses data to monitor and improve the performance, reliability, and security of infrastructure and...
-
Senior Site Reliability Engineer
3 weeks ago
Hyderabad, India AutoRABIT Full timeJob Description AutoRABIT Profile AutoRABIT is the leader in DevSecOps for SaaS platforms such as Salesforce. Its unique metadata-aware capability makes Release Management, Version Control, and Backup & Recovery complete, reliable, and effective. AutoRABIT's highly scalable framework covers the entire DevSecOps cycle, which makes it the favourite platform...
-
Site Reliability Engineer
3 weeks ago
India CareerUS Solutions Full timeJob Description Position Overview: The Site Reliability Engineer (SRE) is responsible for ensuring the stability, scalability, performance, and reliability of production systems and services. This role bridges software development and operations, using automation, monitoring, and performance optimization to build resilient systems that can scale efficiently...
-
India Weekday AI Full time ₹ 15,00,000 - ₹ 25,00,000 per yearThis role is for one of Weekday's clientsMin Experience: 4 yearsJobType: full-timeWe are looking for an experienced and motivated Site Reliability Engineer (SRE) – Platform Engineering to join our growing technology team. In this role, you will be responsible for designing, building, and maintaining scalable, resilient, and secure infrastructure platforms...
-
Senior Site Reliability Engineer
2 weeks ago
India Akamai Full time ₹ 12,00,000 - ₹ 24,00,000 per yearDescriptionDo you like collaborating across teams to solve complex problems?Do you enjoy solving large scale systems problems?Join our Site Reliability Engineering teamThe Senior Site Performance and Reliability Engineer ensures optimal performance and uptime of Akamai's portal services and infrastructure. Responsibilities include analyzing system...
-
Senior Site Reliability Engineer
1 week ago
India Jobgether Full time ₹ 12,00,000 - ₹ 24,00,000 per yearThis position is posted by Jobgether on behalf of a partner company. We are currently looking for a Senior Site Reliability Engineer in India.We are seeking an experienced Senior Site Reliability Engineer to ensure the reliability, scalability, and performance of critical security infrastructure. In this role, you will lead initiatives for operational...
-
Site Reliability Engineer
6 days ago
India techolution Full timeWe are seeking a highly skilled Site Reliability Engineer - AWS to enhance the reliability, scalability, and security of our cloud infrastructure. The ideal candidate will be responsible for designing, implementing, and maintaining high-availability systems, automating processes, and ensuring seamless operations on AWS. This role requires expertise in...