Site Reliability Engineer
3 weeks ago
Job Description As an SRE-2 at MoEngage, you'll be a critical member of our SRE team, responsible for the health and performance of key services and contributing directly to the evolution of our infrastructure at a scale that few engineers get to experience. This is your chance to deepen your technical expertise, take on more ownership, and mentor emerging talent while working on a platform that operates at the cutting edge. What You'll Do to Keep Our Engines Roaring - Be a Reliability Champion: Take ownership of the reliability, performance, and efficiency of critical services. - Automate, Automate, Automate: Design, develop, and implement robust automation solutions to eliminate toil, streamline operations, and improve system resilience. - Battle Incidents (and Win): Lead troubleshooting efforts for complex production incidents, perform in-depth root cause analysis, and implement sustainable preventative measures. - Sculpt Our Infrastructure: Actively contribute to the design, implementation, and optimization of our cloud infrastructure on AWS and GCP, leveraging your expertise in technologies like Kubernetes. - Enhance Observability: Implement and refine advanced monitoring, alerting, and logging solutions to gain deep insights into system behavior and predict potential issues. - Collaborate for Success: Partner closely with development teams to influence architectural decisions, ensuring reliability, scalability, and security are built in from the start. - Strengthen Our Security Posture: Implement and advocate for advanced security practices within our infrastructure and operational workflows. - Drive Efficiency: Analyze and optimize cloud infrastructure spend, identifying and implementing cost-saving opportunities. - Guide the Next Wave: Mentor and guide SRE-1 engineers, contributing to the growth and knowledge sharing within the team. - Be Ready for Action: Participate in our on-call rotation, acting as a key point of escalation and resolution for critical issues. What Makes You the Ideal Candidate - 3-5 years of hands-on experience in Site Reliability Engineering, DevOps, or a similar role with a strong focus on production systems. - Demonstrated expertise in Python or Goyou have a proven track record of automating complex tasks. - Strong command of AWS and/or GCP cloud platforms. - In-depth experience with containerization and orchestration using Kubernetes (K8s, ArgoCD, Helm/Kustomize). - Experience with infrastructure as code tools like Terraform or Ansible is highly valued. - Solid understanding and experience with monitoring and observability stacks (VictoriaMetrics, Prometheus, Grafana, ELK stack, etc.). - Deep knowledge of Linux/Unix systems internals and advanced networking concepts. - Proven ability to diagnose and resolve complex issues in large-scale distributed systems. - A strong understanding of Cloud Security and Information Security principles and best practices. - Experience with cloud cost analysis and optimization techniques. - Familiarity with CI/CD pipelines and GitOps methodologies. - Experience with messaging queues and distributed systems (Celery, Kafka) is a plus. - Excellent communication, collaboration, and problem-solving skills. - A desire to mentor and lead by example.
-
Site Reliability Engineering
1 week ago
Bengaluru, Karnataka, India Thakral One Full time US$ 60,000 - US$ 1,20,000 per yearCompany DescriptionThakral One, headquartered in Singapore, is a technology consulting and services company with a strong presence across Asia. The company specializes in technology-driven consulting, custom solution development, data analytics, and leveraging cloud capabilities to deliver enhanced decision support and practical outcomes. Collaborating...
-
Site Reliability Engineer
3 weeks ago
Bengaluru, India Whatjobs IN C2 Full timeSite Reliability Engineer (SRE) Level 3 Overview: A Site Reliability Engineer (SRE) Level 3 is a senior technical leadership role focused on designing, implementing, and maintaining large-scale, complex, and highly reliable systems. This role emphasizes a blend of software and systems engineering to ensure the availability, latency, performance, and capacity...
-
Site Reliability Engineering
1 week ago
Bengaluru, Karnataka, India Viraaj HR Solutions Private Limited Full time ₹ 12,00,000 - ₹ 36,00,000 per yearSite Reliability Engineer (SRE)About The OpportunityA fast-growing organization in the Enterprise Cloud Infrastructure & SaaS sector delivering highly available, mission-critical services to enterprise customers. We are hiring an on-site Site Reliability Engineer in India to own reliability, automation, and operational excellence across cloud-native...
-
Site Reliability Engineer
3 weeks ago
Bengaluru, India ViewSonic Full timeJob Requirements:Bachelor's degree in Computer Science, Engineering, or a related field.3+ year of experience in a relevant role, such as Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory.Basic understanding of AWS solutions including EC2, S3, CloudWatch, Lambda, and RDS.Interest and understanding of Platform Engineering...
-
Site Reliability Engineer
3 weeks ago
Bengaluru, India ViewSonic Full timeJob Requirements:Bachelor's degree in Computer Science, Engineering, or a related field.3+ year of experience in a relevant role, such as Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory.Basic understanding of AWS solutions including EC2, S3, CloudWatch, Lambda, and RDS.Interest and understanding of Platform Engineering...
-
Site Reliability Engineer
3 weeks ago
Bengaluru, India ViewSonic Full timeJob Requirements: 1. Bachelor's degree in Computer Science, Engineering, or a related field. 2. 3+ year of experience in a relevant role, such as Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory. 3. Basic understanding of AWS solutions including EC2, S3, CloudWatch, Lambda, and RDS. 4. Interest and understanding of...
-
Site Reliability Engineer
1 day ago
Bengaluru, India eBay Full timeThis job is with eBay, an inclusive employer and a member of myGwork – the largest global platform for the LGBTQ+ business community. Please do not contact the recruiter directly.At eBay, we're more than a global ecommerce leader - we're changing the way the world shops and sells. Our platform empowers millions of buyers and sellers in more than 190...
-
Site Reliability Engineer
3 days ago
Bengaluru, Karnataka, India super Full time ₹ 12,00,000 - ₹ 24,00,000 per yearSite Reliability Engineer (SRE) Level 3Overview:A Site Reliability Engineer (SRE) Level 3 is a senior technical leadership role focused on designing, implementing, and maintaining large-scale, complex, and highly reliable systems. This role emphasizes a blend of software and systems engineering to ensure the availability, latency, performance, and capacity...
-
Site reliability engineer
4 weeks ago
Bengaluru, India WhiteLotus Talent Partners Full timeWe are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by Open Stack and Kubernetes. In this role, you will focus on monitoring , basic troubleshooting , and incident response , helping to maintain high system...
-
site reliability engineer
2 weeks ago
bengaluru, India Randstad Full timeRole: Site Reliability Engineer SummaryThe Network Engineer 2 provides technical design, planning, operation, maintenance, and advanced troubleshooting of the Bread Financials' network infrastructure. This position ensures continuity and alignment of the network administration/engineering direction. This position supports Bread Financials' strategies and...