Site Reliability Engineer

12 hours ago


Chennai, Tamil Nadu, India GAVS Technologies Pvt. Ltd. (GAVS) Full time ₹ 8,00,000 - ₹ 12,00,000 per year

Site Reliability Engineer:

Site Reliability Engineer (SRE) - Azure Tech Stack

Experience: 3-5 years

Location: [Chennai Work from office ]

About the Role:

We are seeking a highly motivated and experienced Site Reliability Engineer (SRE) to join our growing team. As an SRE, you will be instrumental in ensuring the reliability, scalability, and performance of our critical applications and infrastructure built on Microsoft Azure. You will leverage your expertise in Azure services, automation, and incident management to drive operational excellence and continuous improvement.

Key Responsibilities:

System Reliability & amp; Performance:

Design, implement, and maintain highly available, scalable, and resilient systems on Azure.

Proactively monitor system health, performance, and availability using Azure Monitor, Application Insights, Log Analytics, and other monitoring tools (e.g., Grafana, Prometheus, Splunk).

Define, track, and report on Service Level Indicators (SLIs) and Service Level Objectives (SLOs) to ensure adherence to service availability and performance targets.

Conduct root cause analysis (RCA) for incidents and implement preventive measures to avoid recurrence.

Participate in on-call rotation to provide 24/7 support for production systems, diagnosing and resolving critical issues promptly.

Automation & Infrastructure as Code (IaC):

Develop and maintain automation scripts and tools using PowerShell, Python, Bash, or Go to automate repetitive tasks, deployments, and infrastructure provisioning.

Implement and manage infrastructure using IaC principles with tools like Terraform or Azure Bicep.

Contribute to the design and implementation of robust CI/CD pipelines using Azure DevOps, GitHub Actions, or similar tools to ensure efficient and reliable application deployments.

Azure Ecosystem Management:

Hands-on experience deploying, configuring, and managing a wide range of Azure services, including:

Compute: Azure Virtual Machines, Azure Kubernetes Service (AKS), Azure Functions, Azure App Service

Networking: Azure Virtual Networks, Load Balancers, Azure Front Door, DNS

Storage: Azure Storage Accounts (Blob, File, Queue, Table), Azure SQL Database, Azure Cosmos DB

Monitoring & Logging: Azure Monitor, Application Insights, Log Analytics, Kusto Query Language (KQL)

Security: Azure Active Directory (AAD), Azure Security Center, Azure Policy, Key Vault, Network Security Groups (NSGs)

Optimize Azure resource utilization for cost efficiency and performance.

Collaboration & Best Practices:

Collaborate closely with development teams (DevOps culture) to integrate reliability practices into the software development lifecycle

Promote and implement SRE best practices, including error budgets, blameless post-mortems, and continuous improvement.

Contribute to documentation of system architecture, operational procedures, and troubleshooting guides.

Stay up-to-date with emerging Azure technologies and SRE trends, proposing and adopting relevant innovations.

Required Skills; Qualifications:

Bachelors degree in Computer Science, Information Technology, or a related field, or equivalent practical experience.

3-5 years of hands-on experience in a Site Reliability Engineering, DevOps, or similar role with a strong focus on Microsoft Azure.

Proficiency in at least one scripting or programming language (e.g., Python, PowerShell, Go, Bash).

Solid understanding of Infrastructure as Code (IaC) principles and experience with tools like Terraform or Azure Bicep.

Demonstrated experience with CI/CD pipelines (Azure DevOps preferred).

Strong experience with Azure monitoring and logging solutions (Azure Monitor, Application Insights, Log Analytics, KQL).

Experience with containerization and orchestration technologies, particularly Azure Kubernetes Service (AKS).

Good understanding of networking concepts (TCP/IP, DNS, Load Balancing).

Familiarity with database systems (SQL and NoSQL).

Strong problem-solving, analytical, and troubleshooting skills.

Excellent communication and collaboration skills, with the ability to work effectively in a team environment.

Ability to work independently and manage multiple priorities in a fast-paced environment.

Preferred Skills & Certifications:

Microsoft Certified: Azure Administrator Associate (AZ-104)

Microsoft Certified: Azure DevOps Engineer Expert (AZ-400)

Certified Kubernetes Administrator (CKA)

Experience with other monitoring tools like Grafana, Prometheus, Splunk, Datadog.

Familiarity with security best practices in cloud environments.

Experience with Git and version control systems.



  • Chennai, Tamil Nadu, India Cstream, Inc. Full time ₹ 2,50,000 - ₹ 5,00,000 per year

    Company Description, headquartered in Irvine, California, is a technology-driven company that provides innovative solutions for technology governance, risk management, and compliance. Utilising automation and AI, Cstream's platform helps organisations streamline compliance frameworks like SOC 2, ISO 27001, HIPAA, and PCI-DSS through guided workflows,...


  • Chennai, Tamil Nadu, India GSR Business Services Full time ₹ 6,00,000 - ₹ 12,00,000 per year

    Dear Aspirants,Urgent HiringSite reliability Engineer3-5 YearsChennaiRole Summary:Supports the reliability and performance of systems and infrastructure. Assists in monitoring, troubleshooting, and automating tasks to maintain high-availability environments.Key Responsibilities:Assist in managing VMware and Linux servers.Monitor system health and respond to...


  • Chennai, Tamil Nadu, India Neurealm Full time ₹ 12,00,000 - ₹ 24,00,000 per year

    Chennai, Tamil Nadu, IndiaPracticeCloud Platform-As-A-ServiceJob posted onNov 10, 2025Employee TypeFull Time EmployeeExperience range (Years)3 years - 8 yearsClientProjectsSite Reliability Engineer (SRE) - Azure Tech StackExperience: 3-5 yearsLocation: [Chennai Work from office ]About the Role:We are seeking a highly motivated and experienced Site...


  • Chennai, Tamil Nadu, India Datum Technologies Group Full time ₹ 18,00,000 - ₹ 22,00,000 per year

    Job Details:Job Title: Lead Site Reliability Engineer (SRE)Duration: Contract to Hire (On the Payroll of Datum Technology Group)Location: Chennai || Mumbai || GurugramInterview Process: Virtual (2 Rounds) + 1 Technical screening.Job Description:We are seeking a highly skilled and experienced Lead Site Reliability Engineer (SRE) to drive reliability,...


  • Chennai, Tamil Nadu, India Datum Technologies Group Full time ₹ 3,00,000 - ₹ 4,50,000 per year

    Job Details:Job Title: Sr. Site Reliability Engineer (SRE)Duration: Contract to Hire (On the Payroll of Datum Technology Group)Location: Chennai || Mumbai || GurugramInterview Process: Virtual (2 Rounds) + 1 Technical screening.Job Description:We are seeking a highly skilled Senior Site Reliability Engineer (SRE) to enhance reliability, scalability, and...


  • Chennai, Tamil Nadu, India Flex Full time ₹ 8,00,000 - ₹ 24,00,000 per year

    Experience:3.5 to 7 yearsLocation:ChennaiWork mode:Hybrid.Role Overview:As a Site Reliability Engineer (SRE) on the Factory Applications team, you will help maintain and scale Brix" - a cloud-native, containerized, microservices-based platform used to build global shop floor systems. Your focus will be on automation, reliability, and performance.Key...


  • Chennai, Tamil Nadu, India Talent Worx Full time ₹ 15,00,000 - ₹ 35,00,000 per year

    EXP required - 5 to 8 years.Role and ResponsibilitiesReporting to Engineering, the Site Reliability Engineer will play a critical role in driving innovation and growth for the Banking Solutions, Payments and Capital Markets business.  In this role, the candidate will have the opportunity to make a lasting impact on the company's transformation journey,...


  • Chennai, Tamil Nadu, India NatWest Group Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Join us as a Site Reliability EngineerYou'll manage the provision of stable, resilient, reliable applications with the end goal of minimising disruption to Customer & Colleague Journeys (CCJ)We'll look to you to identify and automate manual tasks and implement observability solutions, ensuring a thorough understanding of CCJ across applicationsThis is a...


  • Chennai, Tamil Nadu, India, Tamil Nadu Tata Consultancy Services Full time

    Role: Site Reliability EngineerLocation: Chennai/Bangalore/HyderabadExp- 5-11 years1.Exposure to any APM tool like Dynatrace, Appdynamics, Splunk, etc2.DBA or Infra admin 3.Gremlin or Chaos Monkey or Simian Army or Litmus expertise4.Exposure to ITSM tools like Service Now, etc5.Understanding of Automation and Chaos Engineering6.Exposure to Devops tools and...


  • Chennai, Tamil Nadu, India HTC Global Services Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    HTC – A brief profileEstablished in 1990, HTC Inc., a company with headquarters in Troy, Michigan, is a leading global Information Technology solution and BPO provider. HTC assists clients across multiple industry verticals, offering turnkey project lifecycle in, e-business, data warehousing, embedded systems, ECM, SCM, CRM, and ERP solutions. HTC Inc....