
Senior Site Reliability Engineer
15 hours ago
Role & responsibilities
- Deploy, manage, and optimize storage solutions using ZFS and iSCSI across global data centers.
- Implement and maintain automation and monitoring tools such as Puppet, Grafana, Zabbix, and Jenkins to enhance system performance and reliability.
- Utilize storcli for managing server storage configurations.
- Linux Systems Expertise:
- Manage and maintain Ubuntu-based systems, ensuring security and compliance.
- Conduct performance tuning and capacity planning for Linux servers.
- Develop and implement self-healing systems and automated recovery processes on Linux platforms.
- Reliability Engineering:
- Develop and implement strategies for improving system availability and performance.
- Conduct root-cause analysis and incident response for storage-related issues.
- Collaborate with SDEs to support software development infrastructure and deploy new product features.
Preferred candidate profile
- Proven experience in site reliability engineering, with a focus on storage solutions and Linux systems.
- Strong knowledge of ZFS, iSCSI, and Ubuntu.
- Expertise in automation and configuration management tools (e.g., Bash, Ansible, Puppet).
- Familiarity with Hashicorp tools, SSH, and LDAP.
- Experience with storcli for storage configuration.
- Experience with monitoring tools such as Grafana, Zabbix, InfluxDB.
- Ability to conduct root-cause analysis and implement effective solutions.
- High level of ownership for assigned team problem space, including driving predictable delivery, continuous iteration and improvement, consistent and effective communication team, gracefully coordinating with upstream and downstream stakeholders, and project status.
- Project management skills, including experience with task estimation, scheduling, Gantt charts, unblocking dependencies, Agile methodologies (such as sprint planning or Scrum), being detail-oriented, and keeping projects on track. Ability to define broad, complex problems and break into discrete, specific tasks that can be delegated.
- Documentation skills including writing standard operating procedures, design docs, policy documents, runbooks.
-
Senior Site Reliability Engineer
3 weeks ago
Bengaluru, Karnataka, India Akamai Full timeJob Category Site Reliability Would you like to lead modernization initiatives while building a public cloud platform from scratch Would you like to own critical services in a new public cloud platform Join our IaaS Site Reliability Engineering SRE team We design develop and operate infrastructure and services that power the backbone of our...
-
Site Reliability Engineer
4 weeks ago
Bengaluru, Karnataka, India WhiteLotus Talent Partners Full timeWe are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...
-
Site Reliability Engineer
3 weeks ago
Bengaluru, Karnataka, India WhiteLotus Talent Partners Full timeWe are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...
-
Site Reliability Engineer
1 week ago
Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time ₹ 9,00,000 - ₹ 12,00,000 per yearWe are looking for aL0 and L1 Site Reliability Engineer (SRE) Supportto join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered byOpenStackandKubernetes. In this role, you will focus onmonitoring,basic troubleshooting, andincident response, helping to maintain high system availability,...
-
Senior Site Reliability Engineer
4 days ago
Bengaluru, Karnataka, India Saviynt Full time ₹ 12,00,000 - ₹ 36,00,000 per yearAbout the job Saviynt's AI-powered identity platform manages and governs human and non-human access to all of an organization's applications, data, and business processes. Customers trust Saviynt to safeguard their digital assets, drive operational efficiency, and reduce compliance costs. Built for the AI age, Saviynt is today helping organizations safely...
-
Site Reliability Engineer
4 days ago
Bengaluru, Karnataka, India AppHelix Full time ₹ 9,00,000 - ₹ 12,00,000 per yearRole DescriptionThis is a full-time on-site role located in Bengaluru for a Site Reliability Engineer. The Site Reliability Engineer will be responsible for maintaining and improving the reliability of AppHelix's systems. Daily tasks include monitoring system performance, troubleshooting issues, managing infrastructure, and supporting software development....
-
Senior Site Reliability Engineer
5 days ago
Bengaluru, Karnataka, India Saviynt Full time ₹ 15,00,000 - ₹ 25,00,000 per yearAbout the jobSaviynt's AI-powered identity platform manages and governs human and non-human access to all of an organization's applications, data, and business processes. Customers trust Saviynt to safeguard their digital assets, drive operational efficiency, and reduce compliance costs. Built for the AI age, Saviynt is today helping organizations safely...
-
Site Reliability Engineer
2 days ago
Bengaluru, Karnataka, India HireAlpha Full time ₹ 8,00,000 - ₹ 24,00,000 per yearWe're Hiring | Senior Site Reliability Engineer (SRE)Bangalore | HybridPermanent RoleAre you ready to help shape the future of cloud contact centers? we're building scalable, reliable, and cutting-edge infrastructure for world-class customer experiences — and we're looking for aSenior SREto join our teamWhat you'll do:Lead efforts in building a seamless ...
-
Site Reliability Engineer
9 hours ago
Bengaluru, Karnataka, India Luxoft Full time ₹ 20,00,000 - ₹ 25,00,000 per yearProject descriptionLuxoft partner with next-generation digital bank, built from the ground up to deliver seamless, secure, and scalable financial services. Our platform is cloud-native, API-first, and focused on reliability, speed, and security. We are growing fast and looking for top-tier Site Reliability / Ops Engineers to join our core team and help run...
-
Senior Site Reliability Engineer
2 days ago
Bengaluru, Karnataka, India Aerospike Full time ₹ 15,00,000 - ₹ 20,00,000 per yearAbout Aerospike Aerospike is the real-time database for mission-critical use cases and workloads, including machine learning, generative, and agentic AI. Aerospike powers millions of transactions per second with millisecond latency, at a fraction of the total cost of ownership compared to other databases. Global leaders, including Adobe, Airtel, Barclays,...