Site Reliability Engineer
19 hours ago
we are looking for experienced site reliability engineers to join a founding team of startup-minded individuals that will lay the groundwork for our new fintech offering. This team will play a pivotal role in spearheading innovation. As such, you will have the opportunity to shape the early architecture and design of the system and set the trajectory for its future direction. If you’re looking for a challenging and rewarding role where your decisions will make a profound impact on the product’s success - this team is for you. We are seeking an experienced Senior Site Reliability Engineer to design, scale, and optimize the reliability and performance of our production systems. The Sr. SRE will play a key role in managing observability, implementing incident response procedures, managing SLAs, and facilitating automated deployments. Key responsibilities: ● Build, design, and manage a Site Reliability program from the ground up. ● Own all aspects of incident response including on-call rotation, system alerting, escalation, remediations, and post-incident reviews. ● Work with engineering and infrastructure teams to facilitate deployments. ● Design, implement, and maintain scalable systems to meet uptime SLAs (Service Level Agreements). ● Develop, implement, and own platform orchestration (AirFlow, Prefect, etc.) ● Take ownership of platform observability. ● Lead the development of new tools to foster automation, reliability, and scalability. ● Actively participate in code reviews, providing constructive feedback to enhance code quality. Minimum qualifications: ● 5 to 8+ years in SRE, DevOps, or Systems Engineering roles in high volume, 24x7 B2B production environments. ● Proficiency in AWS cloud services. ● Strong experience with Linux systems, containerization (Docker, Kubernetes), and networking fundamentals. ● Deep knowledge of observability, logging, tracing, and metrics systems. ● Proficiency in Python or Bash for automation and tooling.● CI/CD experience with GitHub Actions, ArgoCD, or similar. ● Strong problem-solving abilities and attention to detail ● Excellent communication and collaboration skills. Preferred qualifications: ● Experience working at startups and/or fintech companies ● Familiarity with TypeScript/Node.js ● Familiarity with Kafka, Redis, EKS, and RDS
-
Site Reliability Engineer
1 week ago
bangalore, India super Full timeSite Reliability Engineer (SRE) Level 3Overview:A Site Reliability Engineer (SRE) Level 3 is a senior technical leadership role focused on designing, implementing, and maintaining large-scale, complex, and highly reliable systems. This role emphasizes a blend of software and systems engineering to ensure the availability, latency, performance, and capacity...
-
Site Reliability Engineer
7 days ago
bangalore, India Datum Technologies Group Full timeJob Title: Site Reliability Engineer (SRE) – AWS Experience: 8+ years Location: Chennai / Mumbai Work Mode: Hybrid Key Skills: AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, Datadog Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experience and a solid background in DevOps, automation, observability, and...
-
Site Reliability Engineer
2 weeks ago
Bangalore, India CodeKarma Full timeSite Reliability Engineer (Multi-Cloud Deployments) Location: Bangalore / Remote Experience: 4–10 years Type: Full-time (6-month probation) About CodeKarma CodeKarma is redefining how engineering teams understand and evolve complex systems — bringing production context directly into the developer’s workflow. Our platform runs both as SaaS and as sub-
-
Site Reliability Engineer
2 weeks ago
Bangalore, India Flipkart Full timeHiring Site Reliability Engineers Exp : 2.5 +years (Excluding internship) Location : Bangalore Apply Here : The engineer will work in the Reliability and Productivity Engineering team and is responsible for building industry standard large scale platforms to be utilised across FK that helps to significantly improve the reliability of systems and bring...
-
Site Reliability Engineer
1 week ago
bangalore, India Andor Tech Full timeHiring!!🏢 About AndorTechAndorTech is a global IT services and consulting firm founded in 2009, headquartered in Bangalore. The company specializes in software engineering, AI-enabled IT services, application support, analytics, and test automation. With a presence across India, the USA, Europe, and the UAE, AndorTech partners with Global Capability...
-
Site Reliability Engineer
1 week ago
Bangalore, India Andor Tech Full timeHiring!! About AndorTech AndorTech is a global IT services and consulting firm founded in 2009, headquartered in Bangalore. The company specializes in software engineering, AI-enabled IT services, application support, analytics, and test automation . With a presence across India, the USA, Europe, and the UAE, AndorTech partners with Global Capability Centers...
-
Site Reliability Engineer
1 week ago
bangalore, India Karix Full timeRole: Site Reliability Engineer Location: Bangalore (WFO) About the role: We are seeking an experienced professional Site Reliability Engineer who acts as a bridge between development and IT operations, taking operational tasks to ensure the efficient functioning of Service platforms. They are responsible for monitoring, automating, and improving the...
-
Site Reliability Engineer
7 days ago
bangalore, India Karix Full timeRole: Site Reliability Engineer Location: Bangalore (WFO) About the role: We are seeking an experienced professional Site Reliability Engineer who acts as a bridge between development and IT operations, taking operational tasks to ensure the efficient functioning of Service platforms. They are responsible for monitoring, automating, and improving the...
-
Site Reliability Engineer
6 days ago
bangalore, India Karix Full timeRole: Site Reliability Engineer Location: Bangalore (WFO) About the role: We are seeking an experienced professional Site Reliability Engineer who acts as a bridge between development and IT operations, taking operational tasks to ensure the efficient functioning of Service platforms. They are responsible for monitoring, automating, and improving the...
-
Site Reliability Engineer
4 days ago
bangalore, India Glocomms Full timeWe are currently looking for an SRE Lead - to join our customer - an IT consultancy with urgent projects on board.This will be a 6 month contract initially with an option to extend further.Must have 10+ years exp.Responsibilities:- Assess application architecture and implement patterns for reliability and performance.- Automate workflows and reduce manual...