Site Reliability Engineer

4 weeks ago


New Delhi, India CodeKarma Full time

Site Reliability Engineer (Multi-Cloud Deployments)

Location: Bangalore / Remote
Experience: 4–10 years
Type: Full-time (6-month probation)

About CodeKarma

CodeKarma is redefining how engineering teams understand and evolve complex systems by bringing production context directly into the developer’s workflow. Our platform runs both as SaaS and as sub-account / on-prem deployments within our customers’ cloud environments. We’re looking for engineers who can take ownership of these deployments end-to-end, from setup to monitoring, upgrades, and ongoing reliability.

What You’ll Do

You’ll be responsible for managing CodeKarma’s distributed deployments across client environments, ensuring reliability, security, and performance at scale.

- Deploy and manage CodeKarma clusters across AWS, GCP, and Azure customer sub-accounts.
- Monitor, upgrade, and maintain Kubernetes clusters and related infrastructure.
- Implement observability, alerting, and disaster recovery for each deployment.
- Handle CI/CD automation for platform releases, patches, and version upgrades.
- Work closely with client engineering teams to adapt deployments to their environments, policies, and security constraints.
- Diagnose and resolve environment-specific issues across networking, storage, and configuration layers.
- Build and maintain infrastructure playbooks, Helm charts, and Terraform modules for standardized deployment.

What We’re Looking For

- Strong experience managing Kubernetes clusters (EKS, GKE, AKS, or on-prem equivalents).
- Deep understanding of Kubernetes internals, Helm, ingress controllers, networking, and storage classes.
- Hands-on experience with CI/CD tools (GitHub Actions, ArgoCD, or similar).
- Familiarity with monitoring and alerting stacks (Prometheus, Grafana, Loki, ELK, etc.).
- Working knowledge of cloud infrastructure across AWS / GCP / Azure.
- Ability to work directly with client engineering and DevOps teams, understanding their constraints and helping them integrate CodeKarma.
- Strong debugging and communication skills: you’ll often be the bridge between CodeKarma and client infrastructure.

Why Join Us

- Manage real, large-scale production environments across multiple enterprises.
- Work directly with founders and senior engineers to shape how CodeKarma scales across clients.
- High ownership, fast-moving environment, and exposure to deep-tech systems.

How to Apply

Please share:

- A short summary of your Kubernetes experience (cluster management, scaling, debugging, etc.).
- Any automation or deployment tooling you’ve built or maintained.
- Links to your GitHub / GitLab / blog posts (if available).



  • New Delhi, India IntraEdge Full time

    Job Title: Site Reliability Engineer (SRE) – Production Support
    Location: Bengaluru
    Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong experience in production support, DevOps practices, and cloud infrastructure management. The ideal candidate will be responsible for maintaining the reliability, performance, and scalability...


  • New Delhi, India WhiteLotus Talent Partners Full time

    We are looking for an L0 and L1 Site Reliability Engineer (SRE) Support engineer to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...


  • New Delhi, India Tata Consultancy Services Full time

    Role: Site Reliability Engineer Experience: 4 to 7 Years Locations: Chennai/Pune/Kolkata


  • New Delhi, India Datum Technologies Group Full time

    Job Title: Site Reliability Engineer (SRE) – Azure & AI
    Experience: 7+ years
    Work Mode: Hybrid
    Work Location: Chennai/Mumbai/Gurgaon
    Job Summary: We are looking for an experienced Site Reliability Engineer (SRE) with strong expertise in Microsoft Azure, AI infrastructure, and automation. The ideal candidate will have a solid background in managing cloud...


  • New Delhi, India Grootan Technologies Full time

    About the Role: We are seeking a skilled Site Reliability Engineer (SRE) with 4–5 years of hands-on experience to join our engineering team. In this role, you will be responsible for building and maintaining reliable, scalable, and secure infrastructure to support our applications. You will leverage your expertise in automation, cloud platforms, and...


  • New Delhi, India SID Global Solutions Full time

    Job Role: Site Reliability Engineer (SRE) – GCP
    Experience: 3+ years
    Location: Hyderabad
    About SIDGS: SIDGS is a premium global systems integrator and global implementation partner of Google, providing digital solutions and services to Fortune 500 companies. Our digital solutions span the following domains: User Experience, CMS, API Management,...


  • New Delhi, India JRD Systems Full time

    Site Reliability Engineer (Windows / Cloud / Automation) Job Summary: We are seeking an experienced Site Reliability Engineer with a strong background in managing Windows infrastructure and cloud environments. The ideal candidate will be responsible for designing, implementing, automating, and maintaining scalable infrastructure solutions across AWS, Azure,...


  • New Delhi, India Tata Consultancy Services Full time

    Role: Manager, Site Reliability Engineering
    Required Technical Skill Set: Manager, Site Reliability Engineering
    Desired Experience Range: 12–18 yrs
    Notice Period: Immediate to 90 days only
    Location of Requirement: Bangalore
    We are currently planning to do a virtual interview.
    Job Description: Describe what the person will do in the role - how he/she will impact...