Site Reliability Engineer
4 days ago
Key Responsibilities :
- Incident Management : Provide L2 support for critical production incidents, perform root cause analysis, and implement effective solutions to minimize downtime.
- Automation and Infrastructure as Code (IaC) : Develop and maintain automation scripts using Python, Bash, and Go to streamline operational tasks. Implement and manage IaC using Terraform and Ansible to automate infrastructure provisioning and configuration.
- UNIX Systems Administration : Manage and troubleshoot critical applications running in a UNIX environment, ensuring system stability and performance.
- Database Management : Administer and optimize production databases (Postgres, MySQL, Oracle) in both cloud and on-premise environments.
Perform database backups, restores, and performance tuning.
- Cloud Infrastructure Management : Design, deploy, and manage infrastructure on AWS and/or Azure cloud platforms.
Implement best practices for security, scalability, and cost optimization.
- Containerization and Orchestration : Deploy, manage, and troubleshoot Kubernetes clusters.
Ensure high availability and scalability of containerized applications.
- Monitoring and Logging : Implement and maintain monitoring and logging solutions using the ELK stack (Elasticsearch, Logstash, Kibana) to proactively identify and resolve issues.
- Performance Tuning and Optimization : Analyze system performance metrics, identify bottlenecks, and implement solutions to optimize performance.
- Collaboration and Communication : Collaborate with cross-functional teams to resolve issues and implement improvements.
Communicate effectively with stakeholders and provide clear and concise documentation.
- On-Call Support : Participate in an on-call rotation to provide 24/7 support for critical systems.
- Documentation : Create and maintain detailed documentation of systems, procedures, and troubleshooting steps.
Required Skills and Experience :
- Experience : 5-8 years of experience in an L2 Site Reliability Engineer, DevOps Engineer, or similar role.
- Scripting : Proficiency in scripting languages such as Python, Bash, and Go.
- Infrastructure as Code : Hands-on experience with Terraform and Ansible for infrastructure automation.
- UNIX Systems : Strong experience supporting critical applications in a UNIX environment.
- Database Management : Expertise in managing production databases (Postgres, MySQL, Oracle) in cloud and on-premise environments.
- Cloud Platforms : Extensive experience with AWS and/or Azure cloud environments.
- Containerization : Solid understanding of Kubernetes and containerization technologies.
- Monitoring and Logging : Experience with the ELK stack for monitoring and logging.
- Education : Bachelor's or Master's degree in Computer Science or a related field with 5+ years of relevant experience.
- Problem-Solving : Excellent problem-solving and troubleshooting skills.
- Communication : Strong communication and collaboration skills.
Preferred Qualifications :
- Relevant certifications (e.g., AWS Certified DevOps Engineer, Kubernetes Administrator, Oracle Database Administrator).
- Experience with CI/CD pipelines and tools (e.g., Jenkins, GitLab CI).
- Knowledge of networking concepts and protocols (TCP/IP, DNS, HTTP).
- Experience with configuration management tools (e.g., Chef, Puppet).
- Experience with other monitoring tools (Prometheus, Grafana).
-
Site Reliability Engineer
5 days ago
Bengaluru, Karnataka, India AppHelix Full time ₹ 9,00,000 - ₹ 12,00,000 per year Role Description: This is a full-time on-site role located in Bengaluru for a Site Reliability Engineer. The Site Reliability Engineer will be responsible for maintaining and improving the reliability of AppHelix's systems. Daily tasks include monitoring system performance, troubleshooting issues, managing infrastructure, and supporting software development....
-
Site Reliability Engineer
5 days ago
Bengaluru, Karnataka, India FIS Full time ₹ 12,00,000 - ₹ 36,00,000 per year About the Role : Site Reliability Engineer (SRE) with deep expertise in Mainframe technologies like COBOL, JCL, etc. to support and enhance our Card Management & Payment processing functions. This role will be responsible for ensuring reliability, high availability, scalability, stability and performance of mission-critical mainframe software applications and...
-
Site Reliability Engineer
14 hours ago
Bengaluru, Karnataka, India FOSS United Full time ₹ 12,00,000 - ₹ 36,00,000 per year Site Reliability Engineer at ZEISS India. Posted on September 11, 2025. ZEISS India, Kadubeesanahalli, Bengaluru. Full Time. Job Description: ZEISS in India is headquartered in Bengaluru and present in the fields of Industrial Quality Solutions, Research Microscopy Solutions, Medical Technology, Vision Care and Sports...
-
Site Reliability Engineer
4 weeks ago
Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time We are looking for an L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...
-
Site Reliability Engineer
3 weeks ago
Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time We are looking for an L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...
-
Site Reliability Engineer
4 weeks ago
Bengaluru, Karnataka, India Randstad Full time Role: Site Reliability Engineer. Summary: The Network Engineer 2 provides technical design, planning, operation, maintenance, and advanced troubleshooting of the Bread Financials' network infrastructure. This position ensures continuity and alignment of the network administration/engineering direction. This position supports Bread Financials' strategies and...
-
Site Reliability Engineer
3 weeks ago
Bengaluru, Karnataka, India ViewSonic Full time Job Requirements: 1. Bachelor's degree in Computer Science, Engineering, or a related field. 2. 3+ years of experience in a relevant role, such as Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory. 3. Basic understanding of AWS solutions including EC2, S3, CloudWatch, Lambda, and RDS. 4. Interest and understanding of Platform...
-
Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India ViewSonic Full time Job Requirements: Bachelor's degree in Computer Science, Engineering, or a related field. 3+ years of experience in a relevant role, such as Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory. Basic understanding of AWS solutions including EC2, S3, CloudWatch, Lambda, and RDS. Interest and understanding of Platform...
-
Site Reliability Engineer
11 hours ago
Bengaluru, Karnataka, India ViewSonic Full time ₹ 15,00,000 - ₹ 25,00,000 per year Job Requirements: Bachelor's degree in Computer Science, Engineering, or a related field. 3+ years of experience in a relevant role, such as Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory. Basic understanding of AWS solutions including EC2, S3, CloudWatch, Lambda, and RDS. Interest and understanding of Platform Engineering...
-
Site Reliability Engineer
7 days ago
Bengaluru, Karnataka, India HDFC Limited Full time ₹ 15,00,000 - ₹ 25,00,000 per year Hiring for Lead / Sr Site Reliability Engineer for Mumbai & Bangalore Location. Experience Years. Job Purpose: Analysing, troubleshooting, and designing vital services, platforms, and infrastructure on GCP while always thinking about reliability, scalability, resilience, security, and performance. Job Responsibilities: Help build a Site Reliability Engineering...