Site Reliability Engineer

3 weeks ago


Bangalore Karnataka, India Betsol Full time

Company Description BETSOL is a cloud-first digital transformation and data management company offering products and IT services to enterprises in over 40 countries BETSOL team holds several engineering patents and is recognized with industry awards and BETSOL maintains a net promoter score that is 2x the industry average BETSOL s open source backup and recovery product line Zmanda Zmanda com delivers up to 50 savings in total cost of ownership TCO and best-in-class performance BETSOL Global IT Services BETSOL com builds and supports end-to-end enterprise solutions reducing time-to-market for its customers BETSOL offices are set against the vibrant backdrops of Broomfield Colorado and Bangalore India We take pride in being an employee-centric organization offering comprehensive health insurance competitive salaries 401K volunteer programs and scholarship opportunities Office amenities include a fitness center cafe and recreational facilities Own the reliability availability performance and scalability of customer and employee facing platforms Partner with application infrastructure security and NOC teams to engineer resilient services and automate operations across Azure and on-prem environments Drive incident response and post-incident reviews implement observability and continuously improve service health through automation and best practices Responsibilities Build and operate production platforms across Azure e g AKS App Services Functions Windows Linux and networking layers in partnership with Platform Server Network teams Engineer end-to-end observability metrics logs and traces via Azure Monitor Application Insights Log Analytics Prometheus Grafana and centralized logging Automate provisioning and configuration using Infrastructure as Code Terraform Bicep and configuration management Ansible PowerShell DSC Design and maintain CI CD pipelines Azure DevOps GitHub Actions with automated testing canary blue-green deployments and change control alignment Establish runbooks SOPs and self-healing automations to reduce MTTR and ticket volume from the NOC and Service Desk Harden platform security identity secrets certificates network segmentation leveraging Azure Key Vault managed identities and policy guardrails Perform capacity planning performance tuning and cost optimization FinOps for compute storage and networking Partner with Data ETL teams to ensure reliability of batch and streaming jobs scheduling and dependencies Create and maintain documentation architecture runbooks dashboards and support audits and compliance requirements Qualifications Bachelor s degree in Computer Science Engineering or equivalent experience 2-5 years in SRE DevOps Platform Engineering with hands-on production ownership Proficiency with Azure services AKS App Services Functions Azure Monitor Log Analytics Application Insights Strong Kubernetes Docker skills Helm ingress service mesh e g Istio Linkerd experience is a plus IaC Terraform or Bicep and scripting PowerShell and or Python Git-based workflows CI CD Azure DevOps or GitHub Actions artifact management and release strategies canary blue-green Observability tooling Prometheus Grafana ELK OpenSearch Azure Monitor and alert design to minimize noise Experience with ITIL processes incident change problem and tools ServiceNow Jira Knowledge of networking DNS TLS certificates load balancers and security fundamentals Excellent troubleshooting communication and cross-functional collaboration skills Certifications such as Microsoft Azure Administrator DevOps CKA CKAD or ITIL Foundation are a plus Additional Information All your information will be kept confidential according to EEO guidelines



  • Bangalore, Karnataka, India NatWest Group Full time

    Join us as a Site Reliability Engineer In this key role you ll support the improvement of non-functional and operational characteristics such as availability performance efficiency change management monitoring security incident response and capacity planning of our products and services You ll enjoy significant stakeholder interaction working in...


  • Bangalore, Karnataka, India Akamai Full time

    Job Category Site Reliability Do you like collaborating across teams to solve complex problems Do you enjoy solving large scale distributed content delivery challenges Join our critical Platform and Reliability Engineering Team The Platform Reliability Engineering team defines measures and optimizes key performance indicators for Akamai s global network This...


  • Bangalore, Karnataka, India Deutsche Bank Full time

    Job Title Site Reliability EngineerLocation Bangalore IndiaCorporate Title AssociateRole Description You will work closely with application teams to ensure stable well monitored applications that are resilient to faults You will agree and review Service Level Objectives SLOs to achieve high availability for applications based on their criticality ...


  • bangalore, India IntraEdge Full time

    Job Title: Site Reliability Engineer (SRE) – Production Support Location: Bengaluru Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong experience in production support, DevOps practices, and cloud infrastructure management . The ideal candidate will be responsible for maintaining the reliability, performance, and...


  • Bangalore, Karnataka, India NatWest Group Full time

    Join us as a Site Reliability Engineer Youll manage the provision of stable resilient reliable applications with the end goal of minimising disruption to Customer Colleague Journeys CCJ Well look to you to identify and automate manual tasks and implement observability solutions ensuring a thorough understanding of CCJ across applications This...


  • Bangalore, India ViewSonic Full time

    Job Requirements: Bachelor's degree in Computer Science, Engineering, or a related field. 3+ year of experience in a relevant role, such as Site Reliability Engineer, Dev Ops Engineer, or similar, is preferred but not mandatory. Basic understanding of AWS solutions including EC2, S3, Cloud Watch, Lambda, and RDS. Interest and understanding of Platform...


  • Bangalore, India CodeKarma Full time

    Site Reliability Engineer (Multi-Cloud Deployments) Location: Bangalore / Remote Experience: 4–10 years Type: Full-time (6-month probation) About CodeKarma CodeKarma is redefining how engineering teams understand and evolve complex systems — bringing production context directly into the developer’s workflow. Our platform runs both as SaaS and as sub-


  • bangalore, India JRD Systems Full time

    Position: Site Reliability Engineer (SRE) Role Overview: We are seeking an experienced Site Reliability Engineer (SRE) with a strong background in Windows infrastructure to manage and optimize our cloud and on-premises environments. The ideal candidate will partner with development teams to improve service reliability, implement automation, and ensure...


  • Bangalore, India Synechron Full time

    We have immediate opportunity for SRE (Senior Site Reliability Engineer) 5 to 9 years. Synechron – Bangalore Job Role: - SRE (Senior Site Reliability Engineer) Job Location: - Bangalore Notice Period: Within 30days About Synechron We began life in 2001 as a small, self-funded team of technology specialists. Since then, we’ve grown our organization to...


  • Bangalore, India Synechron Full time

    We have immediate opportunity for SRE (Senior Site Reliability Engineer) 5 to 9 years. Synechron – Bangalore Job Role: - SRE (Senior Site Reliability Engineer) Job Location: - Bangalore Notice Period: Within 30days About Synechron We began life in 2001 as a small, self-funded team of technology specialists. Since then, we’ve grown our organization to...