Site Reliability Engineer
6 days ago
- As a Site Reliability Engineer, you will play a key role in ensuring our systems remain reliable, available, and performant for both our customers and internal teams. Your expertise will directly impact our users' experience and the success of our business.
- In this role, you'll collaborate closely with our product development and platform engineering teams to build scalable systems and create robust automation that supports our company's goals. Your day-to-day work will make a meaningful difference in how efficiently and effectively our technology operates.
- We're looking for someone who has hands-on experience with technologies like AWS, CDN, Terraform, Packer, and Splunk. Keen troubleshooting abilities will be essential as you identify and solve complex issues in the critical applications our customers rely on daily.
- The ideal candidate thrives on learning new technologies and approaches challenges with enthusiasm. You'll be joining a collaborative environment where your problem-solving skills will shine as you work across multiple teams. If you're self-motivated, passionate about quality, and ready to make an impact, we want to hear from you
- Collaborate with development teams to implement and deploy new features that meet high standards for reliability, security, and performance.
- Partner with cross-functional teams to establish and enhance enterprise standards and best practices.
- Develop and maintain effective monitoring tools, alerts, and dashboards that provide clear visibility into system health and performance.
- Analyze metrics and logs to proactively detect anomalies, optimize performance, plan capacity, and isolate issues before customer impact occurs.
- Identify innovative solutions to complex problems and implement corrective actions decisively.
- Mentor junior team members while documenting and sharing solutions to build team knowledge.
- Minimum 5 years' experience in DevOps engineering roles such as SRE, DevOps, CloudOps.
- Advanced proficiency with Terraform for infrastructure as code implementation (required)
- Extensive experience with AWS technologies and services, including EC2, S3, RDS, and IAM (required).
- Comprehensive understanding of HTTP protocols, web server technologies, and troubleshooting.
- Strong experience with load balancing solutions such as AWS ELB, NGINX, or HAProxy.
- Practical knowledge of caching technologies and CDN implementations.
- Working experience with Redis for in-memory data storage and caching.
- Demonstrated ability implementing and optimizing CDN solutions for global content delivery (Preferred).
- Expertise in monitoring and troubleshooting web application performance and availability.
- Practical experience with observability solutions such as Splunk, Datadog, or similar.
- Proficiency in one or more languages such as Java, Go, Python, or Linux Shell.
- Proven experience operating effectively in an agile software development environment.
- Strong understanding of AWS pricing/cost models across compute, storage, and database offerings.
- Experience implementing and maintaining CI/CD pipelines.
- Ability to multitask and adapt to changing priorities in a fast-paced, 24x7 environment.
- Collaborative approach to working with cross-functional teams of both technical and business professionals.
- Excellent communication, problem-solving, and customer service skills.
- Bachelor's degree in computer science, science, engineering or equivalent technical certifications preferred.
-
Site Reliability Engineer
3 weeks ago
India Pagos Consultants Full timewe are looking for experienced site reliability engineers to join a founding team of startup-minded individuals that will lay the groundwork for our new fintech offering. This team will play a pivotal role in spearheading innovation. As such, you will have the opportunity to shape the early architecture and design of the system and set the trajectory for its...
-
Site Reliability Engineer
3 weeks ago
India Pagos Consultants Full timewe are looking for experienced site reliability engineers to join a founding team of startup-minded individuals that will lay the groundwork for our new fintech offering. This team will play a pivotal role in spearheading innovation. As such, you will have the opportunity to shape the early architecture and design of the system and set the trajectory for its...
-
Site Reliability Engineer
3 weeks ago
India Pagos Consultants Full timewe are looking for experienced site reliability engineers to join a founding team of startup-minded individuals that will lay the groundwork for our new fintech offering. This team will play a pivotal role in spearheading innovation. As such, you will have the opportunity to shape the early architecture and design of the system and set the trajectory for its...
-
Site Reliability Engineer
4 weeks ago
India Datum Technologies Group Full timeJob Title: Site Reliability Engineer (SRE) – AWS Experience: 8+ years Location: Chennai / Mumbai Work Mode: Hybrid Key Skills: AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, Datadog Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experience and a solid background in DevOps, automation, observability, and...
-
Site Reliability Engineer
2 weeks ago
India Insight Global Full timeCompany: Insight Global Duration: Approved for 1 year 📍 Location: Remote (India) 💼 Type: Contract with Insight Global Client 💰 Compensation: 14 LPA – 20 LPA 🕒 Working Hours: Normal IST hours 🚀 Start Date: Immediate (No notice period) About the Role Join our Site Reliability Engineering (SRE) team as a Fullstack Developer, focused on building...
-
Site Reliability Engineer
2 weeks ago
India Insight Global Full timeCompany: Insight GlobalDuration: Approved for 1 year📍 Location: Remote (India)💼 Type: Contract with Insight Global Client💰 Compensation: 14 LPA – 20 LPA🕒 Working Hours: Normal IST hours🚀 Start Date: Immediate (No notice period)About the RoleJoin our Site Reliability Engineering (SRE) team as a Fullstack Developer, focused on building and...
-
Site Reliability Engineer
4 days ago
India pythian Full timeRemote Site Reliability Engineering - Site Reliability Engineering Full Time Remote Site Reliability Engineer India Multiple Timezones Remote Work from Home Why Pythian At Pythian we are experts in strategic database and analytics services driving digital transformation and operational excellence Pythian a multinational company was founded in 1997 and...
-
Site Reliability Engineer
1 week ago
India - Remote StarTree Full time US$ 12,12,000 - US$ 60,48,000 per yearAt StarTree we're a group of passionate individuals that desire to improve the lives of many by developing tools and technologies that support availability and speed in the world of real-time analytics. Our aim is to make it simple for every company to delight their users - external and internal - and create new revenue streams from their data, by building...
-
Site Reliability Engineer
2 weeks ago
india Hydrolix Full timeAbout the jobAt Hydrolix, we are revolutionizing the world of data management and analytics with our innovative cloud data platform, purpose-built for petabyte-scale datasets. Our mission is to help organizations drastically reduce data costs while increasing their data retention.We are looking for a Site Reliability Engineer (SRE) with 8 to 10+ years...
-
Site Reliability Engineer
2 weeks ago
india, IN Insight Global Full timeCompany: Insight GlobalDuration: Approved for 1 year Location: Remote (India) Type: Contract with Insight Global Client Compensation: 14 LPA – 20 LPA Working Hours: Normal IST hours Start Date: Immediate (No notice period)About the RoleJoin our Site Reliability Engineering (SRE) team as a Fullstack Developer, focused on building and maintaining highly...