Senior Site Reliability Engineer - ELK Expert
3 weeks ago
Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice
Location: India (Remote) - Must be available to work in the EST (US/Canada) Time Zone.

Role Summary:
Are you a Senior Site Reliability Engineer (SRE) with deep ELK expertise, ready to take ownership of large-scale observability infrastructure?
We're looking for an SRE with 7+ years of experience, including 4+ years specializing in the ELK stack (Elasticsearch, Logstash, Kibana), to join our Platform Engineering Practice. In this role, you’ll design, manage, and scale ELK clusters ingesting 2–3+ TB/day, enhance reliability across distributed systems, and drive automation within Azure cloud environments. This is a high-impact engineering opportunity focused on performance, observability, and operational excellence at scale.

Why Join Us
Career Growth: Work alongside industry experts on cutting-edge cloud technologies
Competitive Compensation and Benefits: We recognize and reward top talent
Exciting, Impactful Work: Design and build scalable, resilient cloud environments
Strategic Platform Role: Contribute to the foundation of next-gen observability and reliability infrastructure

What You Will Do
Design and Optimize Cloud Infrastructure: Architect scalable, fault-tolerant systems on Microsoft Azure
Automate Everything: Use Terraform, Ansible, and GitHub Actions to streamline deployment and configuration
Ensure Reliability and Performance: Proactively monitor, troubleshoot, and resolve production issues using Prometheus, Grafana, and Azure Monitor
Enhance Security and Compliance: Implement security best practices across DevOps workflows
Collaborate and Innovate: Work closely with engineering, security, and operations teams to drive automation and efficiency
Manage and scale large ELK clusters handling 2–3+ TB/day log volumes, ensuring high availability and performance
Optimize ELK architecture: Implement efficient index lifecycle policies, shard strategies, and hot-warm-cold tiered storage
Build and tune log pipelines: Scale Logstash and Beats pipelines across distributed environments
Support Kibana observability layers: Create dashboards, visualizations, and custom alerting frameworks (e.g., Watcher, ElastAlert)

What You Bring
7+ years of experience in Site Reliability Engineering, DevOps, or Cloud Engineering
4+ years of dedicated, hands-on experience with ELK (Elasticsearch, Logstash, Kibana)
Strong experience managing large-scale ELK clusters in production with heavy ingestion (multi-TB/day)
Deep knowledge of index tuning, shard allocation, ILM policies, and scaling ELK components
Expertise in GitHub Actions, Terraform, Ansible, and Infrastructure as Code (IaC)
Proficiency in Python, Go, or Bash for automation and scripting
Deep understanding of Kubernetes, Docker, and cloud-native architectures
Experience with observability tools such as Prometheus, Grafana, Azure Monitor
Ability to work in a fast-paced, collaborative environment and solve complex operational issues

Education
Bachelor’s or Master’s degree in Computer Science, Information Technology, or a related field

Certifications (Nice to Have)
Microsoft Azure certifications: AZ-104, AZ-400
-
Senior Site Reliability Engineer
3 weeks ago
India, IN Sapaad Full time
WHO WE ARE
Sapaad is a global leader in unified commerce platforms, delivering world-class software solutions for the food and beverage industry. Our flagship product, also named Sapaad, has achieved remarkable success over the past decade, empowering thousands of F&B businesses across 40+ countries, with many more coming onboard each day. Driven by a...
-
Senior Site Reliability Engineer
2 weeks ago
Bangalore Urban, Karnataka, India, IN GigSky Full time
We're Hiring: Site Reliability Engineer (5–10 Years Experience)
Location: Bangalore, India | Gigsky India Private Limited
Are you passionate about building resilient, scalable, and secure infrastructure? Gigsky is looking for a seasoned Site Reliability Engineer to join our Bangalore team and help drive operational excellence across our global platform....
-
Site Reliability Engineer
2 weeks ago
India, IN Sonata Software Full time
We're Hiring: Senior Site Reliability Engineer
Location: Onsite (Office: Hyderabad – Mandatory from Day 1)
Employment Type: Full-time
Notice Period: Immediate to 15 Days Only
Experience: 8+ Years
About the Role
We’re looking for a Senior Site Reliability Engineer (SRE) to lead reliability initiatives across our production systems. This is a high-impact...
-
Software Engineer, Site Reliability Engineering
2 weeks ago
India, IN Ecoh Full time
Minimum qualifications:
Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
1 year of experience with software development in one or more programming languages during coursework/projects, research, internships, or practical experience in school, work, or Open Source projects.
Strong problem-solving and analytical...
-
Site Reliability Engineer
3 weeks ago
Bangalore Urban, Karnataka, India, IN Trantor Full time
Job Title - Site Reliability Engineer
Role - Contract (9 months, extendable)
Exp - 5+ years
Loc - Bangalore (Hybrid)
Notice - Immediate joiner only
Duties:
Responsible for maintaining and scaling production services and servers across multiple datacenters for complex and data-intensive cloud services
Improve scalability, service reliability, capacity, and performance...
-
Site Reliability Engineer
2 weeks ago
India, IN TalentBridge Full time
Lead SRE and DevOps initiatives, supporting development teams with CI/CD, automation, and infrastructure design across Azure environments.
Maintain Infrastructure as Code (IaC) standards; automate key rotation, backups, and configuration drift detection using pipelines.
Design and execute SRE projects including API versioning, security tool integrations...
-
Senior DevOps Engineer
3 weeks ago
India, IN CareerXperts Consulting Full time
We are seeking a highly skilled Senior DevOps Engineer to drive the design, implementation, and optimization of our infrastructure and deployment pipelines. The ideal candidate will bring expertise in automation, cloud technologies, and CI/CD practices, ensuring high availability, scalability, and security of mission-critical systems. You will collaborate...
-
Senior DevOps Engineer
3 weeks ago
India, IN Glowingbud Full time
Glowingbud is a rapidly growing eSIM services platform that simplifies connectivity with powerful APIs, robust B2B and B2C interfaces, and seamless integrations with Telna. Our platform enables global eSIM lifecycle management, user onboarding, secure payment systems, and scalable deployments. Recently acquired by Telna, we are expanding our product...
-
Senior Site Reliability Engineer
3 weeks ago
Bangalore Urban, Karnataka, India, IN RecRoots Full time
The core premise for the SRE lies in treating operational issues as a software problem. We code our way out of problems where operations are concerned, addressing availability, scalability, latency, and efficiency challenges within the vast infrastructure here.
Responsibilities:
Design, develop, and implement software that improves the stability, scalability,...
-
Expert Senior Full Stack Dev, MERN
3 weeks ago
India, IN ProcureNetworks Full time
Are you an expert senior full stack MERN engineer?
If you’re an expert senior full stack MERN engineer, this job could become a long-term engagement.
NOTE: If you have LESS THAN 7 YEARS OF EXPERIENCE as a MERN Stack engineer, do not waste your time applying. Under no circumstances will you be selected for an interview.
We need an expert senior full stack MERN...