
Cloud-Oriented Observability Specialist
1 day ago
Senior Site Reliability Engineer - ELK Expert
We are seeking a seasoned Senior Site Reliability Engineer with expertise in the ELK stack to join our Platform Engineering Practice. As a key member of our team, you will design, manage, and scale large-scale observability infrastructure, enhancing reliability across distributed systems and driving automation within Azure cloud environments.
In this high-impact engineering role, you will be responsible for designing and optimizing cloud infrastructure, automating everything using Terraform, Ansible, and GitHub Actions, ensuring reliability and performance by proactively monitoring, troubleshooting, and resolving production issues using Prometheus, Grafana, and Azure Monitor, and collaborating closely with engineering, security, and operations teams to drive automation and efficiency.
Key Responsibilities
- Design and Optimize Cloud Infrastructure: Architect scalable, fault-tolerant systems on Microsoft Azure.
- Automate Everything: Use Terraform, Ansible, and GitHub Actions to streamline deployment and configuration.
- Ensure Reliability and Performance: Proactively monitor, troubleshoot, and resolve production issues using Prometheus, Grafana, and Azure Monitor.
- Enhance Security and Compliance: Implement security best practices across DevOps workflows.
- Collaborate and Innovate: Work closely with engineering, security, and operations teams to drive automation and efficiency.
- Manage and scale large ELK clusters handling 2–3+ TB/day log volumes, ensuring high availability and performance.
- Optimize ELK architecture: Implement efficient index lifecycle policies, shard strategies, and hot-warm-cold tiered storage.
- Build and tune log pipelines: Scale Logstash and Beats pipelines across distributed environments.
- Support Kibana observability layers: Create dashboards, visualizations, and custom alerting frameworks (e.g., Watcher, ElastAlert).
Requirements
- 7+ years of experience in Site Reliability Engineering, DevOps, or Cloud Engineering.
- 4+ years of dedicated, hands-on experience with ELK (Elasticsearch, Logstash, Kibana).
- Strong experience managing large-scale ELK clusters in production with heavy ingestion (multi-TB/day).
- Deep knowledge of index tuning, shard allocation, ILM policies, and scaling ELK components.
- Expertise in GitHub Actions, Terraform, Ansible, and Infrastructure as Code (IaC).
- Proficiency in Python, Go, or Bash for automation and scripting.
- Deep understanding of Kubernetes, Docker, and cloud-native architectures.
- Experience with observability tools such as Prometheus, Grafana, Azure Monitor.
Benefits
- Career Growth: Work alongside industry experts on cutting-edge cloud technologies.
- Competitive Compensation and Benefits: We recognize and reward top talent.
- Exciting, Impactful Work: Design and build scalable, resilient cloud environments.
- Strategic Platform Role: Contribute to the foundation of next-gen observability and reliability infrastructure.
-
Senior Cloud Security
4 days ago
Bengaluru, Karnataka, India beBeeSecurity Full time ₹ 9,00,000 - ₹ 12,00,000Cloud Site Reliability Engineer (SRE) wanted. We are seeking a skilled Cloud SRE with expertise in Cloud Security and Observability to design, build, and scale resilient cloud platforms.ResponsibilitiesCloud Platform Architect: Design and optimize Terraform modules for multi-environment deployments.DevSecOps Lead: Drive DevSecOps practices and strengthen...
-
SRE - Cloud Security & Observability
23 hours ago
Bengaluru, Karnataka, India Xebia Full timeSRE – Cloud Security & Observability # ; Location: Bangalore (Hybrid – 3 days office per week)We are looking for a Cloud Site Reliability Engineer (SRE) with strong expertise in Cloud Security and Observability to design, build, and scale resilient cloud platforms. #Architect and optimize Terraform modules for multi-environment deployments....
-
Cloud Security and Observability Expert
11 hours ago
Bengaluru, Karnataka, India beBeeSre Full time ₹ 1,50,00,000 - ₹ 2,50,00,000Cloud Security Engineer (SRE) Job DescriptionAs a Cloud Site Reliability Engineer (SRE), you will be responsible for designing, building, and scaling resilient cloud platforms with strong expertise in Cloud Security and Observability. This role is ideal for someone who thrives in a fast-paced environment and has a passion for automating deployments, CI/CD...
-
Bengaluru, Karnataka, India Pegasystems Full timeMeet Our Team:Cloud Observability Engineering collaborates with all the engineering teams at Pega and advocate for Observability solutions, establish standards and processes. Cloud Observability Engineering team is responsible for designing, developing and maintaining Observability solutions for Pega Cloud.Picture Yourself at Pega:You will be part of a...
-
Bengaluru, Karnataka, India Pegasystems Full timeMeet Our Team: Cloud Observability Engineering collaborates with all the engineering teams at Pega and advocate for Observability solutions, establish standards and processes. Cloud Observability Engineering team is responsible for designing, developing and maintaining Observability solutions for Pega Cloud. Picture Yourself at Pega: You will be part of a...
-
Cloud Engineer III-Observability
3 weeks ago
Bengaluru, Karnataka, India Smarsh Full timeJob DescriptionWho are weSmarsh empowers its customers to manage risk and unleash intelligence in their digital communications. Our growing community of over 6500 organizations in regulated industries counts on Smarsh every day to help them spot compliance, legal or reputational risks in 80+ communication channels before those risks become regulatory fines...
-
Observability Expert
2 hours ago
Bengaluru, Karnataka, India beBeeobservability Full time ₹ 1,50,00,000 - ₹ 2,50,00,000Observability ExpertAs an Observability Expert, you will play a pivotal role in crafting end-to-end visibility solutions for our applications.Create observability strategies that cater to diverse stakeholder needs.Collaborate with cross-functional teams to integrate observability tools and platforms.Design and maintain monitoring and logging infrastructure...
-
Observability Solutions Architect
11 hours ago
Bengaluru, Karnataka, India beBeeCloud Full time ₹ 1,80,00,000 - ₹ 2,50,00,000Our mission is to deliver exceptional cloud-based solutions.Job DescriptionWe are seeking a talented engineer to join our team and contribute to the development of cutting-edge observability solutions. As a Cloud Development Engineer, you will design and develop observability solutions for Pega Cloud-hosted applications, ensuring seamless monitoring and...
-
Cloud Support Specialist
3 days ago
Bengaluru, Karnataka, India beBeeOperations Full time ₹ 6,00,000 - ₹ 8,00,000Business Operations Specialist RoleOverviewWe are seeking a detail-oriented Business Operations Specialist to support application workloads across multi-cloud environments. This role involves monitoring systems, handling incidents, managing tickets, and ensuring smooth communication with internal teams and external vendors.Key Responsibilities:Monitoring and...
-
Bengaluru, Karnataka, India beBeeObservability Full time ₹ 1,20,00,000 - ₹ 2,40,00,000We are seeking a highly skilled Monitoring and Observability Expert with expertise in monitoring and observability frameworks. The ideal candidate will have experience in setting up performance monitoring solutions, integrating new monitoring tools, executing migrations, and delivering large-scale projects.Key Responsibilities:Design and implement effective...