Site Reliability Engineer
4 days ago
- As a Site Reliability Engineer, you will play a key role in ensuring our systems remain reliable, available, and performant for both our customers and internal teams. Your expertise will directly impact our users' experience and the success of our business.
- In this role, you'll collaborate closely with our product development and platform engineering teams to build scalable systems and create robust automation that supports our company's goals. Your day-to-day work will make a meaningful difference in how efficiently and effectively our technology operates.
- We're looking for someone who has hands-on experience with technologies like AWS, CDN, Terraform, Packer, and Splunk. Keen troubleshooting abilities will be essential as you identify and solve complex issues in the critical applications our customers rely on daily.
- The ideal candidate thrives on learning new technologies and approaches challenges with enthusiasm. You'll be joining a collaborative environment where your problem-solving skills will shine as you work across multiple teams. If you're self-motivated, passionate about quality, and ready to make an impact, we want to hear from you.
- Collaborate with development teams to implement and deploy new features that meet high standards for reliability, security, and performance.
- Partner with cross-functional teams to establish and enhance enterprise standards and best practices.
- Develop and maintain effective monitoring tools, alerts, and dashboards that provide clear visibility into system health and performance.
- Analyze metrics and logs to proactively detect anomalies, optimize performance, plan capacity, and isolate issues before customer impact occurs.
- Identify innovative solutions to complex problems and implement corrective actions decisively.
- Mentor junior team members while documenting and sharing solutions to build team knowledge.
- Minimum 5 years' experience in DevOps engineering roles such as SRE, DevOps, or CloudOps.
- Advanced proficiency with Terraform for infrastructure as code implementation (required).
- Extensive experience with AWS technologies and services, including EC2, S3, RDS, and IAM (required).
- Comprehensive understanding of HTTP protocols, web server technologies, and troubleshooting.
- Strong experience with load balancing solutions such as AWS ELB, NGINX, or HAProxy.
- Practical knowledge of caching technologies and CDN implementations.
- Working experience with Redis for in-memory data storage and caching.
- Demonstrated ability to implement and optimize CDN solutions for global content delivery (preferred).
- Expertise in monitoring and troubleshooting web application performance and availability.
- Practical experience with observability solutions such as Splunk, Datadog, or similar.
- Proficiency in one or more languages such as Java, Go, Python, or Linux Shell.
- Proven experience operating effectively in an agile software development environment.
- Strong understanding of AWS pricing/cost models across compute, storage, and database offerings.
- Experience implementing and maintaining CI/CD pipelines.
- Ability to multitask and adapt to changing priorities in a fast-paced, 24x7 environment.
- Collaborative approach to working with cross-functional teams of both technical and business professionals.
- Excellent communication, problem-solving, and customer service skills.
- Bachelor's degree in computer science, science, or engineering, or equivalent technical certifications, preferred.
-
Site Reliability Engineer
2 weeks ago
Remote (India) Luma Financial Technologies Full time ₹ 30,000 - ₹ 60,000 per year
About the role: At Luma, our Site Reliability Engineer (SRE) team keeps our platform reliable, secure, and lightning fast. They own everything from AWS infrastructure and Kubernetes clusters to CI/CD pipelines, monitoring, and alerting. If you're passionate about tackling big challenges, automating at scale, and making systems more resilient, we'd love to have...
-
Site Reliability Engineer
7 days ago
India Grootan Technologies Full time
About the Role: We are seeking a skilled Site Reliability Engineer (SRE) with 4–5 years of hands-on experience to join our engineering team. In this role, you will be responsible for building and maintaining reliable, scalable, and secure infrastructure to support our applications. You will leverage your expertise in automation, cloud platforms, and...
-
Site Reliability Engineer
17 hours ago
India Datum Technologies Group Full time
Job Title: Site Reliability Engineer (SRE) – AWS
Experience: 8+ years
Location: Chennai / Mumbai
Work Mode: Hybrid
Key Skills: AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, Datadog
Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experience and a solid background in DevOps, automation, observability, and...
-
Site Reliability Engineer
2 weeks ago
India InOrg Full time ₹ 12,00,000 - ₹ 36,00,000 per year
About VivaOps: VivaOps is a leading DevSecOps platform company specializing in GitLab, the comprehensive DevOps platform, to transform and secure software development processes. We help organizations streamline their DevSecOps journey by offering a complete range of GitLab services, from advisory to implementation and managed services, to accelerate...
-
Site Reliability Engineer
3 weeks ago
India Akamai Technologies Full time
Job Description: Do you like collaborating across teams to solve complex problems? Do you enjoy solving large scale distributed content delivery challenges? Join our highly skilled Compute Site Reliability team. Our team designs, develops, and manages applications and infrastructure that support Akamai's Compute products and services. We...
-
Site Reliability Engineer
4 weeks ago
Bengaluru, India Relanto Full time
Job Description
Job Title: Site Reliability Engineer
Summary: We are looking for a Site Reliability Engineer to join our Digital & Transformation department. The ideal candidate will have 2-3 years of experience in this field and will be responsible for ensuring the reliability, availability, and performance of our systems and applications.
Roles And...
-
Site Reliability Engineer
1 week ago
India Akamai Full time ₹ 12,00,000 - ₹ 36,00,000 per year
Description: Do you like collaborating across teams to solve complex problems? Do you enjoy solving large scale distributed content delivery challenges? Join our highly skilled Compute Site Reliability team. Our team designs, develops, and manages applications and infrastructure that support Akamai's Compute products and services. We specialize in creating...
-
Site Reliability Engineer
2 weeks ago
Gurgaon Office, India Fidelity International Full time ₹ 8,00,000 - ₹ 12,00,000 per year
Technical Specialist - Site Reliability Engineer
About the Opportunity
Job Type: Permanent
Application Deadline: 22 October
Job Description
Title: Technical Specialist - Site Reliability Engineer
Department: ISS Production Services
Location: Gurgaon & Bangalore
Level: Application Support - 4
We're proud to have been helping our clients build...
-
Site Reliability Engineer
4 weeks ago
India CareerUS Solutions Full time
Job Description
Position Overview: The Site Reliability Engineer (SRE) is responsible for ensuring the stability, scalability, performance, and reliability of production systems and services. This role bridges software development and operations, using automation, monitoring, and performance optimization to build resilient systems that can scale efficiently...
-
Site Reliability Engineer
2 weeks ago
India CitNOW Group Full time
About us: Founded in 2008, CitNOW is an innovative, enterprise-level software product suite that allows automotive dealerships globally to sell more vehicles and parts more profitably. CitNOW’s app-based platform provides a secure, brand-compliant solution for dealers to build trust, transparency and long-lasting relationships. CitNOW Group was formed...