Site Reliability Engineer

3 weeks ago


India COZZERA INTERNATIONAL LLP Full time

Job Title: SRE / DevOps Engineer Location: Remote (India) Experience: 5+ Years Role Overview: We are looking for an experienced Site Reliability / DevOps Engineer to design, build, and operate highly reliable, scalable, and secure cloud infrastructure on Google Cloud Platform (GCP) . You will work closely with development, platform, and security teams to ensure high availability, performance, and continuous delivery of cloud-native microservices. This role demands deep expertise in Kubernetes (GKE) , Terraform , CI/CD automation , and observability , along with strong troubleshooting and communication skills. Key Responsibilities: Design, implement, and manage highly available GCP infrastructure using Terraform (IaC). Build and operate Kubernetes (GKE) clusters, including deployments, ingress, autoscaling, and Helm-based releases. Develop and maintain CI/CD pipelines using GitHub Actions and Google Cloud Build . Implement SRE best practices : SLIs, SLOs, SLAs, error budgets, and incident response. Containerize and deploy microservices using Docker and Kubernetes. Implement monitoring, logging, and observability using Cloud Monitoring, Cloud Logging, Prometheus, and Grafana. Troubleshoot production issues, perform root cause analysis, and drive permanent fixes. Manage networking and security including VPCs, load balancers, DNS, SSL/TLS, firewalls, IAM, and VPNs. Collaborate with engineering teams to improve system reliability, performance, and scalability. Automate operational tasks using Python, Bash, or Go . Participate in on-call rotations and incident management processes. Required Skills & Qualifications: 5+ years of experience as an SRE / DevOps / Cloud Engineer . Strong hands-on experience with Google Cloud Platform (GCP) : Compute Engine, GKE, Cloud Functions Cloud Storage, VPC, IAM Cloud Logging & Cloud Monitoring Expert-level Kubernetes experience (preferably GKE ): Deployments, Services, Ingress Autoscaling (HPA) Helm charts Strong experience with Terraform for Infrastructure as Code. Proven experience building CI/CD pipelines using GitHub Actions and Cloud Build . Strong understanding of Docker, containers, microservices , and service mesh concepts . Experience with observability tools : Stackdriver (Cloud Ops), Prometheus, Grafana Solid understanding of networking & cloud security : Load balancers, DNS, SSL VPNs, firewalls, IAM best practices Hands-on scripting experience in Python, Bash, or Go . Excellent problem-solving, debugging, and communication skills . Nice to Have: Experience with service mesh (Istio, Linkerd). Experience with SRE metrics and reliability engineering practices. Knowledge of cost optimization (FinOps) on GCP. Experience working in remote, globally distributed teams .



  • India Insight Global Full time

    Site Reliability Engineer Location : Mumbai, India - working onsite 1x a week Salary : 22-25 LPA Target Start Date : January 2026 Join our dynamic and highly collaborative agile team, where you'll play a pivotal role in ensuring the reliability, scalability, and efficiency of our premier InsurTech solution. Our platform enables clients to obtain quotes and...


  • India InOrg Full time

    About VivaOps :VivaOps is a leading DevSecOps platform company specializing in GitLab - The comprehensive DevOps platform, to transform and secure software development processes. We help organizations to streamline their DevSecOps journey by offering a complete range of GitLab services, from advisory, to implementation and managed services, to accelerate...


  • Bengaluru, India Collabera Full time

    Job Description Job Description As a Principal/Chief Site Reliability Engineer, you will play a critical role in designing, developing, and maintaining scalable and highly reliable systems. You'll work closely with development teams to improve system reliability, monitor critical applications, and design fail-proof infrastructure. Responsibilities - Design...


  • India CodeVyasa Full time

    Job Description We are looking for a skilled SRE Engineer ll Bangalore ll 4-7 yrs. of exp to design, develop, and maintain scalable backend applications. The ideal candidate should have strong experience in Laravel framework , RESTful APIs , and database-driven applications , with a focus on clean code and performance. About Us CodeVyasa is a mid-sized...


  • Hyderabad, India GSPANN Technologies, Inc Full time

    Job Description Dynatrace, Splunk, Datadog, Grafana, New Relic, Dashboards, Azure, Python, Kubernetes, Docker, GitLab, Jenkins, Ansible, Terraform, DevOps, Troubleshooting, SLO/SLAs Monitoring, Incident Response, Root Cause Analysis (RCA), E2E Implementation Description GSPANN is hiring a Site Reliability Engineer (SRE) for its Pune or Hyderabad location....


  • Chennai, India TECEZE Full time

    Job Description Job Title: Site Reliability Engineer (SRE) Core IT Infrastructure Location: Chennai/ pune/ bangalore Company: Teceze About Teceze Teceze is a global IT services and consulting organization delivering innovative, scalable, and secure technology solutions. We specialize in infrastructure services, cloud transformation, DevOps, and managed...


  • Noida, India NTT DATA North America Full time

    Job Description Req ID: 350360 NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now. We are currently seeking a Site Reliability Engineer to join our team in Noida, Uttar Pradesh (IN-UP), India (IN). Role Overview...


  • India COZZERA INTERNATIONAL LLP Full time

    Job Title: SRE / DevOps EngineerLocation: Remote (India)Experience: 5+ YearsRole Overview:We are looking for an experienced Site Reliability / DevOps Engineer to design, build, and operate highly reliable, scalable, and secure cloud infrastructure on Google Cloud Platform (GCP). You will work closely with development, platform, and security teams to ensure...


  • India COZZERA INTERNATIONAL LLP Full time

    Job Title: SRE / DevOps Engineer Location: Remote (India) Experience: 5+ Years Role Overview: We are looking for an experienced Site Reliability / DevOps Engineer to design, build, and operate highly reliable, scalable, and secure cloud infrastructure on Google Cloud Platform (GCP). You will work closely with development, platform, and security teams to...


  • India COZZERA INTERNATIONAL LLP Full time

    Job Description Job Title: SRE / DevOps Engineer Location: Remote (India) Experience: 5+ Years Role Overview: We are looking for an experienced Site Reliability / DevOps Engineer to design, build, and operate highly reliable, scalable, and secure cloud infrastructure on Google Cloud Platform (GCP). You will work closely with development, platform, and...