Senior Site Reliability Engineer-SRE
2 days ago
About Rocketlane
Rocketlane is a fast-growing, innovative SaaS company making waves in customer onboarding and professional services automation.
Our mission? To empower B2B companies with a smooth, consistent, and efficient way to onboard customers and manage client projects—reducing chaos and boosting customer satisfaction across industries.
We're a close-knit team of close 200 passionate professionals, all focused on building a product that teams love to use. Our journey has been fueled by $45M in funding from top investors, including 8VC, Matrix Partners, and Nexus Venture Partners.
What will you do?
We're looking for a self-motivated, enthusiastic, and hands-on engineer to set up solid DevOps and SRE foundations. If you thrive in a small, high-energy team and want to play a key role in shaping infrastructure and reliability at scale, this is the place for you.
We're looking for a hands-on engineer with 3–6 years of experience who has a solid grasp of cloud infrastructure, a strong foundation in Infrastructure as Code (IaC), and a keen eye for choosing the right tools for the job. You'll help design, build, and scale resilient infrastructure for a fast-growing, product-driven team.
- Design, build, and manage cloud infrastructure using Infrastructure as Code (IaC) tools like Terraform, Ansible, Chef, or CloudFormation.
- Champion observability by defining SLIs, SLOs, and building robust monitoring, logging, and alerting systems using tools like Prometheus, Grafana, and custom telemetry.
- Ensure availability, scalability, and resilience of our SaaS platform and platform services in production.
- Proven ability to improve system observability through the design and instrumentation of system-level metrics, enhancing visibility into system health, performance, and bottlenecks.
- Dive deep into complex system architectures to solve critical performance and reliability challenges.
- Work with developers and product teams to embed NFR (Non-functional Requirements) into every product and feature release.
- Conduct root cause analysis and system-level debugging (primarily on Linux).
- Build and maintain CI/CD pipelines, automating deployments and infrastructure operations across environments.
- Scale infrastructure to meet growth needs while optimizing cost and performance.
- Take ownership of incident response, on-call rotations, and blameless postmortems.
- Collaborate cross-functionally to drive technical and architectural decision
- Highly self-driven, accountable, and eager to own initiatives end-to-end. Comfortable working in startups or small teams, where flexibility, speed, and autonomy are key. Strong communication and cross-team collaboration skills.
You should apply if
- Proficient in at least one programming language — Python, Java, or similar.
- Demonstrated experience with performance optimization, latency reduction, and scaling services.
- Strong analytical skills for incident debugging, log analysis, and system troubleshooting.
- Understanding of service-level metrics (SLIs, SLOs, error budgets) and how to operationalize them.
- Experience building large-scale, distributed, resilient systems.
- Strong understanding of core infrastructure components such as load balancers, firewalls, and databases — including their internal workings and operational fundamentals.
- Solid understanding of infrastructure cost management — proactively identifies cost drivers, implements optimization strategies, and contributes to cost reduction initiatives without compromising reliability or performance.
- Familiarity with on-call responsibilities, incident management, and root cause analysis.
- Strong experience with Infrastructure as Code (IaC): Terraform, Ansible, Chef, or CloudFormation and other orchestration tools
- Ability to deep-dive into third-party or internal library codebases to understand internal behavior, debug complex issues, and contribute insights or fixes when needed.
- Solid understanding of cloud platforms — preferably AWS, but Azure or GCP is also acceptable.
Why join us?
At Rocketlane, we're all about building a great product and a great place to work. Here's why you'll actually look forward to Mondays:
- Impact and ownership: You won't just be another cog in the machine; here, you're more like a turbocharged engine part. Bring your ideas, make them happen.
- Work with the best: We're a team of passionate, quirky, and ridiculously talented people. Come for the work, stay for the memes.
- Celebrate wins: Whether we're hitting major milestones or celebrating new funding, we like to mix it up. From rap videos to team outings, we believe in celebrating big.
- Learn and grow: We're all about learning—and we're not just talking about the latest SaaS trends. You'll grow your career, pick up new skills, and maybe even learn to love Excel (or at least tolerate it).
- Flexibility and balance: While we love collaborating in the office five days a week, we know everyone has their own rhythm. That's why we offer flexibility around hours—so you can bring your best energy, whether you're an early bird or a night owl. Pajamas optional (at least outside the office).
-
Cloud Site Reliability Engineer
6 days ago
Chennai, Tamil Nadu, India Ford Global Career Site Full time ₹ 1,04,000 - ₹ 1,30,878 per yearBe at the Forefront of Mobility's Future: Join Ford as a Site Reliability EngineerEnterprise Technology is the engine driving the future of transportation, and we're looking for a talented Site Reliability Engineer (SRE) to help us redefine mobility. In this role, you'll leverage cutting-edge technology to enhance customer experiences, improve lives, and...
-
Site Reliability Engineer
1 week ago
Chennai, Tamil Nadu, India Zyoin Group Full timePosition: Site Reliability Engineer (SRE)Experience: 4 – 10 YearsLocation: Chennai (Hybrid – 2 days in office)Role Overview:We are seeking a Site Reliability Engineer (SRE) responsible for leading reliability practices, ensuring scalable systems, and collaborating with development teams to maintain highly available services.Key Responsibilities- Design,...
-
Site Reliability Engineer
1 week ago
Chennai, Tamil Nadu, India Zyoin Group Full timePosition: Site Reliability Engineer (SRE) Experience: 4 – 10 Years Location: Chennai (Hybrid – 2 days in office) Role Overview: We are seeking a Site Reliability Engineer (SRE) responsible for leading reliability practices, ensuring scalable systems, and collaborating with development teams to maintain highly available services. Key Responsibilities ...
-
Site Reliability Engineering
5 days ago
Chennai, Tamil Nadu, India ti Steps Full time US$ 60,000 - US$ 1,20,000 per yearSite Reliability Engineering (SRE) InternJob Description:Support the SRE team in ensuring the reliability, scalability, and performance of production systems. Learn incident response, monitoring, and automation techniques.Key Responsibilities:Monitor system health and respond to alerts.Help improve system reliability through automation.Participate in root...
-
Site Reliability Engineer
3 days ago
Chennai, Tamil Nadu, India Zyoin Group Full timeWork Mode: Hybrid (2 days Office)We are looking for a Site Reliability Engineer (SREs) who will lead the Site Reliability Engineering(SRE) side of each of our products. This position is responsible for making technical decisions, collaborating with development teams and platform engineers, and building and operating highly reliable and scalable products....
-
Site Reliability Engineer
5 days ago
Chennai, Tamil Nadu, India Zyoin Group Full timeJob DescriptionExp : 4- 10 Years Location : Chennai Work Mode: Hybrid (2 days Office)We are looking for a Site Reliability Engineer (SREs) who will lead the Site Reliability Engineering(SRE) side of each of our products. This position is responsible for making technical decisions, collaborating with development teams and platform engineers, and building and...
-
Site Reliability Engineer
1 week ago
Chennai, Tamil Nadu, India Zyoin Group Full timeJob Description Exp : 4- 10 Years Location : Chennai Work Mode: Hybrid (2 days Office) We are looking for a Site Reliability Engineer (SREs) who will lead the Site Reliability Engineering(SRE) side of each of our products. This position is responsible for making technical decisions, collaborating with development teams and platform engineers, and building...
-
Site Reliability Engineer
3 days ago
Chennai, Tamil Nadu, India Zyoin Group Full timeJob DescriptionExp : 4- 10 Years Location : Chennai Work Mode: Hybrid (2 days Office)We are looking for a Site Reliability Engineer (SREs) who will lead the Site Reliability Engineering(SRE) side of each of our products. This position is responsible for making technical decisions, collaborating with development teams and platform engineers, and building and...
-
Site Reliability Engineer
5 days ago
Chennai, Tamil Nadu, India Compunnel Full time ₹ 18,00,000 per yearJob Title: Site Reliability Engineer (SRE)Work Location: Chennai (Work from Office)Compensation: 30 LPAInterview Process: Final Round Face-to-Face Discussion follwed by Virtual round of interviewRequirements5-8 years of experience as an SRE/DevOps Engineer/Backend Engineer with SRE focus.Strong Python scripting and automation skills.Proven API integration...
-
Site Reliability Engineering Lead
1 week ago
Chennai, Tamil Nadu, India beBeeReliability Full time ₹ 2,00,00,000 - ₹ 2,50,00,000Job Overview:We are seeking an experienced Site Reliability Engineering Lead to oversee the reliability, scalability, and performance of our systems.As a Site Reliability Engineering Lead, you will establish and implement SRE practices, lead a team of engineers, and drive automation, monitoring, and incident response strategies.This position combines...