
Site Reliability Engineer
12 hours ago
You'll be our: Site Reliability Engineer
You'll be based at: Pune Zonal Office
You'll be aligned with: Cloud and Data Platform Lead / Cloud Architect
You'll be a member of: Cloud and Data Platform Team
Ather's fleet of smart scooters is growing rapidly, and so is the volume of data they generate. Our Vehicle Data Platform (VDP) is the core of this ecosystem, and its stability and scalability are critical to our success. We are looking for a foundational Site Reliability Engineer to join our VDP team, taking full ownership of our data infrastructure and building a robust reliability practice to support our rapid growth.
What you'll do at Ather:
- Run and own the production environment by managing alerts, leading incident response, conducting root cause analysis (RCA), and implementing permanent fixes.
- Take full ownership of our ClickHouse database clusters as we move off a managed service and start managing their performance, reliability, and scaling in-house.
- Build and maintain our core infrastructure using Infrastructure-as-Code principles (Terraform).
- Perform critical, periodic maintenance and upgrades for our infrastructure, with a strong focus on Kubernetes, Cloud SQL, and data workloads like Kafka.
- Partner with the Data Engineering team to support the underlying infrastructure for our new Databricks platform, ensuring robust and efficient data ingestion pipelines.
- Enhance observability by building and refining our monitoring, logging, and tracing systems to proactively identify performance bottlenecks.
- Lead capacity planning and forecasting for all cloud workloads, ensuring our platform can scale effectively for the next 6-12 months.
- Drive cloud cost optimization by monitoring spending, identifying and implementing savings opportunities, and ensuring resource governance.
Here's what we are looking for:
- Our ideal candidate is a strong software engineer at heart with deep expertise in cloud-native infrastructure.
- The main focus areas for this role are:
  - Significant Coding Experience: You must have a strong software engineering background with significant coding experience in a language like Python, Go, or Java, focusing on writing clean, scalable, and automated solutions.
  - Deep Cloud Proficiency: You need deep, hands-on experience with at least one major cloud provider (GCP, AWS, or Azure). A strong background in GCP is highly preferred.
  - Production Kubernetes Expertise: You must have proven, hands-on experience designing, running, and troubleshooting applications on Kubernetes in a production environment.
- Other key qualifications include:
  - Hands-on experience with infrastructure automation tools like Terraform or Ansible.
  - Strong expertise in building and managing CI/CD pipelines.
  - Experience administering, monitoring, and scaling ClickHouse clusters is highly desirable.
  - Familiarity with data platforms like Databricks and their infrastructure requirements.
  - Experience with message queues like Kafka.
  - Strong Linux administration, system internals, and network troubleshooting skills.
You Bring to Ather:
- A Bachelor's or Master's degree in Computer Science or a related engineering field.
- 3 to 6 years of relevant experience as a Site Reliability Engineer, DevOps Engineer, or Software Engineer with a focus on infrastructure.
-
Specialist - Site Reliability Engineer
4 days ago
Pune, Maharashtra, India Accelya Group Full time ₹ 20,00,000 - ₹ 25,00,000 per year
For more than 40 years, Accelya has been the industry's partner for change, simplifying airline financial and commercial processes and empowering the air transport community to take better control of the future. Whether partnering with IATA on industry-wide initiatives or enabling digital transformation to simplify airline processes, Accelya drives the...
-
Specialist - Site Reliability Engineer
4 days ago
Pune, Maharashtra, India Accelya Group Full time ₹ 15,00,000 - ₹ 25,00,000 per year
For more than 40 years, Accelya has been the industry's partner for change, simplifying airline financial and commercial processes and empowering the air transport community to take better control of the future. Whether partnering with IATA on industry-wide initiatives or enabling digital transformation to simplify airline processes, Accelya drives the...
-
SRE (Site Reliability Engineer)
19 hours ago
Pune, Maharashtra, India Apex One Full time ₹ 6,00,000 - ₹ 18,00,000 per year
Job Overview: We are looking for a detail-oriented and experienced Site Reliability Engineer to join our team. The Site Reliability Engineer will be responsible for creating and implementing scalable software solutions in order to meet system and application performance goals. You will also be responsible for troubleshooting system errors and resolving any...
-
Site Reliability Engineer
4 days ago
Pune, Maharashtra, India Idox Full time ₹ 9,00,000 - ₹ 12,00,000 per year
Site Reliability Engineer (AWS)
Pune, India
About the role: We are seeking a driven and detail-oriented Site Reliability Engineer (SRE) with a strong passion for building resilient, scalable cloud infrastructure. This role offers an exciting opportunity for professionals with 2 to 4 years of experience in DevOps, Cloud, or Infrastructure to deepen their...
-
Site Reliability Engineer
4 weeks ago
Pune, Maharashtra, India Reveille Technologies Full time
Job Summary: We are seeking a skilled and proactive Site Reliability Engineer (SRE) with a strong DevOps mindset and hands-on experience in application troubleshooting. The ideal candidate will be responsible for ensuring the reliability, scalability, and performance of our applications and infrastructure. This role requires a blend of software engineering,...
-
Site Reliability Engineer
4 weeks ago
Pune, Maharashtra, India Allianz Full time
Site Reliability Engineer (SRE) - One Identity Access Management
The primary objective of the Site Reliability Engineer (SRE) specializing in One Identity Access Management is to ensure the seamless operation, reliability, and scalability of IAM systems within the organization. This role is critical in maintaining system integrity, optimizing performance, and...
-
Site Reliability Engineer
4 weeks ago
Pune, Maharashtra, India Uplers Full time
Job Description
Must-have skills: Azure DevOps, SRE concepts, TerraData, CDC, CDC tool, NEWREL
Good-to-have skills: AWS CloudWatch
Reflections Info Systems (one of Uplers' clients) is looking for: a Site Reliability Engineer who is passionate about their work, eager to learn and grow, and who is committed to delivering exceptional results. If you are a...
-
Site Reliability Engineer
3 days ago
Pune, Maharashtra, India Creospan Inc. Full time ₹ 15,00,000 - ₹ 28,00,000 per year
Creospan is a growing tech collective of makers, shakers, and problem solvers, offering solutions today that will propel businesses into a better tomorrow. "Tomorrow's ideas, built today." In addition to being able to work alongside equally brilliant and motivated developers, our consultants appreciate the opportunity to learn and apply new skills and...
-
Site Reliability Engineering
5 days ago
Pune, Maharashtra, India Deutsche Bank Full time ₹ 10,00,000 - ₹ 25,00,000 per year
Site Reliability Engineering (SRE) Lead, VP
Job ID: R0402474
Full/Part-Time: Full-time
Regular/Temporary: Regular
Listed:
Location: Pune
Position Overview
Job Title: Site Reliability Engineering (SRE) Lead
Corporate Title: Vice President
Location: Pune, India
Role Description
We are seeking an experienced and highly capable Site Reliability Engineering (SRE) Lead to...
-
Site Reliability Engineer
3 weeks ago
Pune, Maharashtra, India LanceSoft, Inc Full time
Role and Responsibilities: Reporting to Engineering, the Site Reliability Engineer will play a critical role in driving innovation and growth for the Banking Solutions, Payments, and Capital Markets business. In this role, the candidate will have the opportunity to make a lasting impact on the company's transformation journey, drive customer-centric...