Current jobs related to Site Reliability Engineer - Ahmedabad, Gujarat - VOLANSYS (An ACL Digital Company)
-
Ahmedabad, Gujarat, India beBeeEngineering Full time ₹ 1,50,00,000 - ₹ 2,50,00,000About the Role:We are seeking an experienced and dynamic Site Reliability Engineering leader to oversee the reliability, scalability, and performance of our critical systems.As a Site Reliability Engineering leader, you will play a pivotal role in establishing and implementing SRE practices, leading a team of engineers, and driving automation, monitoring,...
-
Site Reliability Engineer
4 weeks ago
Ahmedabad, Gujarat, India Core Minds Tech SOlutions Full timeJob Description :- Engage with our product teams to understand requirements, design, and implement resilient and scalable infrastructure solutions- Operate, monitor, and triage all aspects of our production and non-production environments- Collaborate with other engineers on code, infrastructure, design reviews, and process enhancements.- Evaluate and...
-
Site Reliability Engineer
4 weeks ago
Ahmedabad, Gujarat, India ACL Digital Full timeJob Description :- Continuous monitoring of system performance and identify potential issues before they impact users.- Experience working with Industry leading monitoring tools.- Respond to incidents related to monitoring systems, troubleshooting Level 1 issues and resolving issues promptly.- Analyze monitoring data to identify trends, anomalies, to...
-
Site Reliability Engineer
2 weeks ago
Ahmedabad, Gujarat, India ACL Digital Full timeJob Description : - Continuous monitoring of system performance and identify potential issues before they impact users. - Experience working with Industry leading monitoring tools. - Respond to incidents related to monitoring systems, troubleshooting Level 1 issues and resolving issues promptly. - Analyze monitoring data to identify trends, anomalies, to...
-
Site Reliability Manager
3 days ago
Ahmedabad, Gujarat, India beBeeReliability Full time ₹ 9,00,000 - ₹ 12,00,000Job DescriptionWe are seeking a skilled IT Operations professional to join our team. In this role, you will be responsible for delivering high-quality IT services to ensure the smooth operation of our site.Deliver site IT services in line with quality, reliability, and cost expectations.Lead Margin Improvement Projects delivery and monitor site incidents and...
-
Senior Site Reliability Engineer
5 days ago
Ahmedabad, Gujarat, India beBeeReliability Full time ₹ 2,00,00,000 - ₹ 2,50,00,000Job Title: Senior Site Reliability EngineerAbout the JobWe are seeking a seasoned Senior Site Reliability Engineer to join our team as a technical leader, coach, and hands-on problem solver.Key Responsibilities:Investigate and resolve high-impact production issues across infrastructure and applications.Educate and guide development teams on performance,...
-
Site Reliability Engineer
4 days ago
Ahmedabad, Gujarat, India Azilen Technologies Full timeJob PurposeTo ensure the reliability, performance, and resilience of our systems by managing Windows and Linux servers, SQL Server, .NET applications, and Azure services, while bridging development and operations teams to foster a culture of reliability.Who you are:● Lead incident management processes, carry out on-call duties, and effectively use incident...
-
Site Reliability Engineer
2 days ago
Ahmedabad, Gujarat, India Azilen Technologies Full timeJob Purpose To ensure the reliability, performance, and resilience of our systems by managing Windows and Linux servers, SQL Server, .NET applications, and Azure services, while bridging development and operations teams to foster a culture of reliability. Who you are: ● Lead incident management processes, carry out on-call duties, and effectively use...
-
Site Engineer
3 weeks ago
Ahmedabad, Gujarat, India Devashish Infrastructure Pvt Ltd Full timeAbout the RoleDevashish Infrastructure Pvt. Ltd. is seeking a skilled and dedicated Project Site Engineer to oversee the on-site execution of Pre-Engineered Building (PEB) projects. The ideal candidate will have hands-on experience in managing site-level activities, coordinating with teams and vendors, and ensuring timely and quality execution as per...
-
Site Reliability Engineer
2 days ago
Ahmedabad, Gujarat, India Uplers Full timeUplers is hiring for one of the clients. It is a remote opportunity. Role Details:Position: SRE (Oracle Cloud Infrastructure)Type: 10-month contract (possible extension)Mode: Remote | Mon–Fri | 10:30 AM – 7:30 PM ISTPolicy: Use of personal device requiredExperience: 7–10 yrs (min. 7–8 yrs in OCI)Skills: OCI, Terraform, GitLabRounds: 2About the...

Site Reliability Engineer
4 weeks ago
Experience: 5+ Years
Work Mode: Work from office only
Job Description:
1. AWS Cloud Infrastructure
Design, deploy, and manage scalable, secure, and highly available systems on AWS.
Optimize cloud costs, enforce tagging, and implement security best practices (IAM, VPC, GuardDuty, etc.).
Automate infrastructure provisioning using Terraform or AWS CDK.
Ensure backup, disaster recovery, and high availability (HA) strategies are in place.
2. Kubernetes (EKS preferred)
Manage and scale Kubernetes clusters (preferably Amazon EKS).
Implement CI/CD pipelines with GitOps (e.g., ArgoCD or Flux) or traditional tools (e.g., Jenkins, GitLab).
Enforce RBAC policies, namespaces isolation, and pod security policies.
Monitor cluster health, optimize pod scheduling, autoscaling, and resource limits/requests.
3. Monitoring and Observability (Datadog)
Build and maintain Datadog dashboards for real-time visibility across systems and services.
Set up alerting policies, SLOs, SLIs, and incident response workflows.
Integrate Datadog with AWS, Kubernetes, and applications for full-stack observability.
Conduct post-incident reviews using Datadog analytics to reduce MTTR.
4. Automation and DevOps
Automate manual processes (e.g., server setup, patching, scaling) using Python, Bash, or Ansible.
Maintain and improve CI/CD pipelines (Jenkins) for faster and more reliable deployments.
Drive Infrastructure-as-Code (IaC) practices using Terraform to manage cloud resources.
Promote GitOps and version-controlled deployments.
5. Linux Systems Administration
Administer Linux servers (Ubuntu, RHEL, Amazon Linux) for stability and performance.
Harden OS security, configure SELinux, firewalls, and ensure timely patching.
Troubleshoot system-level issues: disk, memory, network, and processes.
Optimize system performance using tools like top, htop, iotop, netstat, etc.