
Senior Site Reliability Engineer
6 days ago
Role & responsibilities
- Deploy, manage, and optimize storage solutions using ZFS and iSCSI across global data centers.
- Implement and maintain automation and monitoring tools such as Puppet, Grafana, Zabbix, and Jenkins to enhance system performance and reliability.
- Utilize storcli for managing server storage configurations.
- Linux Systems Expertise:
  - Manage and maintain Ubuntu-based systems, ensuring security and compliance.
  - Conduct performance tuning and capacity planning for Linux servers.
  - Develop and implement self-healing systems and automated recovery processes on Linux platforms.
- Reliability Engineering:
  - Develop and implement strategies for improving system availability and performance.
  - Conduct root-cause analysis and incident response for storage-related issues.
  - Collaborate with SDEs to support software development infrastructure and deploy new product features.
Preferred candidate profile
- Proven experience in site reliability engineering, with a focus on storage solutions and Linux systems.
- Strong knowledge of ZFS, iSCSI, and Ubuntu.
- Expertise in automation and configuration management tools (e.g., Bash, Ansible, Puppet).
- Familiarity with Hashicorp tools, SSH, and LDAP.
- Experience with storcli for storage configuration.
- Experience with monitoring tools such as Grafana, Zabbix, InfluxDB.
- Ability to conduct root-cause analysis and implement effective solutions.
- High level of ownership for the assigned team problem space, including driving predictable delivery, continuous iteration and improvement, consistent and effective communication with the team, graceful coordination with upstream and downstream stakeholders, and reporting on project status.
- Project management skills, including experience with task estimation, scheduling, Gantt charts, unblocking dependencies, and Agile methodologies (such as sprint planning or Scrum), along with attention to detail and the ability to keep projects on track. Ability to define broad, complex problems and break them into discrete, specific tasks that can be delegated.
- Documentation skills including writing standard operating procedures, design docs, policy documents, runbooks.
-
Senior Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India Akamai Full time
Job Category: Site Reliability. Would you like to lead modernization initiatives while building a public cloud platform from scratch? Would you like to own critical services in a new public cloud platform? Join our IaaS Site Reliability Engineering (SRE) team. We design, develop, and operate infrastructure and services that power the backbone of our...
-
Site Reliability Engineer
3 weeks ago
Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time
We are looking for an L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...
-
Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time
We are looking for an L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...
-
Site Reliability Engineer
1 week ago
Bengaluru, Karnataka, India Synechron Full time
We have an immediate opportunity for an SRE (Senior Site Reliability Engineer) with 5 to 9 years of experience. Job Role: SRE (Senior Site Reliability Engineer). We began life in 2001 as a small, self-funded team of technology specialists. Innovative tech solutions for business. We're now a leading global digital consulting firm, providing innovative technology solutions for...
-
Site Reliability Engineer
3 days ago
Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time ₹ 9,00,000 - ₹ 12,00,000 per year
We are looking for an L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system availability,...
-
Site Reliability Engineer
1 week ago
Bengaluru, India WhiteLotus Talent Partners Full time
We are looking for an L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...
-
Site Reliability Engineer
1 week ago
Bengaluru, India Synechron Full time
We have an immediate opportunity for an SRE (Senior Site Reliability Engineer) with 5 to 9 years of experience. Synechron – Bangalore. Job Role: SRE (Senior Site Reliability Engineer). Job Location: Bangalore. Notice Period: Within 30 days. About Synechron: We began life in 2001 as a small, self-funded team of technology specialists. Since then, we’ve grown our organization to...
-
Site Reliability Engineer
2 days ago
Bengaluru, India Synechron Full time
We have an immediate opportunity for an SRE (Senior Site Reliability Engineer) with 5 to 9 years of experience. Synechron – Bangalore. Job Role: SRE (Senior Site Reliability Engineer). Job Location: Bangalore. Notice Period: Within 30 days. About Synechron: We began life in 2001 as a small, self-funded team of technology specialists. Since then, we’ve grown our organization to...
-
Senior Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India LanceSoft, Inc. Full time ₹ 6,00,000 - ₹ 8,00,000 per year
Role Description: This is a full-time on-site role for a Senior Site Reliability Engineer based in Bangalore/Chennai/Pune. The Senior Site Reliability Engineer will be responsible for maintaining and enhancing the reliability and performance of the company's IT infrastructure & Development. Daily tasks include troubleshooting system issues, ensuring system...