SRE - Infrastructure Support Engineer
6 days ago
SRE - Infrastructure Support Engineer – JD
We are hiring a "SRE [Site Reliability Engineer] Infrastructure Support" engineer with deep expertise in Linux,
Kubernetes, and hardware infrastructure management for our "Enterprise-grade high-performance
supercomputing" platform. We are helping enterprises and service providers build their AI inference platforms
for end users, powered by our state-of-the-art RDU (Reconfigurable Dataflow Unit) hardware architecture. This
is a high-impact, high-visibility role. The ideal candidate will play a pivotal role in supporting and maintaining
our enterprise infrastructure stack, ensuring high availability and optimal performance across mission-critical AI
& ML environments. This role involves close collaboration with global SRE and Platform teams to manage and
troubleshoot enterprise systems and clusters.
Location: Remote and open to traveling to KSA or Turkey for 1 year.
Exp: 10+ years
Key Responsibilities:
- Linux Administration: Manage, configure, and optimize Linux servers (RHEL, Ubuntu, or similar),
including patching, security hardening, and performance tuning.
- Kubernetes Administration: Deploy, manage, and troubleshoot Kubernetes clusters, ensuring
reliability and scalability.
- Hardware Infrastructure Management: Oversee physical data center infrastructure, including servers,
storage, and networking hardware.
- Security & Compliance: Apply security patches and upgrades for Linux-based Kubernetes
environments and ensure compliance with organizational policies.
- Collaboration & Support: Work closely with SRE and Platform teams worldwide to support enterprise
systems and clusters.
- Ticket-Based Case Management: Handle tickets efficiently using tools such as Salesforce or
ServiceNow.
Required Qualifications:
- Strong hands-on experience with Linux system administration (RHEL, Ubuntu, or similar).
RHCSA/RHCE certification is a plus.
Solid understanding of Kubernetes administration; CKA/CKS certification is a plus.
Hands-on experience with bare-metal and hardware infrastructure (servers, storage, networking).
Good understanding of networking concepts (TCP/IP, DNS, Load Balancers, Firewalls); knowledge of
Juniper OS is a plus.
Strong troubleshooting skills across hardware, OS, and Kubernetes environments.
Knowledge of automation tools such as Ansible, Python, Bash, or similar is a plus.
Familiarity with monitoring and observability tools (Prometheus, Grafana, ELK) is a plus.
Soft Skills:
Strong communication, problem-solving, and collaboration abilities.
Ability to work effectively in fast-paced, dynamic environments and adapt to evolving AI & ML
technologies.
- Proactive mindset with a focus on automation, scalability, and operational excellence.
Why Join Us:
Work on cutting-edge AI & ML infrastructure supporting mission-critical applications.
Collaborate with global teams and gain exposure to advanced cloud-native and enterprise
technologies.
- Opportunity to grow your expertise in Linux, Kubernetes, and data center operations
Job Type: Full-time
Pay: ₹200, ₹1,540,374.87 per year
Benefits:
- Provident Fund
Work Location: In person
-
Infrastructure Sre
3 days ago
Pune, India Barclays Full timeJob title :Infrastructure SRE Location: Pune About Barclays Barclays is a British universal bank. We are diversified by business, by different types of customers and clients, and by geography. Our businesses include consumer banking and payments operations around the world, as well as a top-tier, full service, global corporate and investment bank, all of...
-
Infrastructure Sre
1 week ago
Pune, India Barclays Full timeJob title :Infrastructure SRE Location: Pune About Barclays Barclays is a British universal bank. We are diversified by business, by different types of customers and clients, and by geography. Our businesses include consumer banking and payments operations around the world, as well as a top-tier, full service, global corporate and investment bank, all of...
-
Sre Support
21 hours ago
Pune, Maharashtra, India Virtusa Full timeProduction Engineering/ SRE Support Engineer Experience: 2-4 years Location: Pune / Hyderabad Employment **Responsibilities**: Must have a basic understanding of Pega app support, database management & networking concepts Hands-on experience using Linux/ Unix commands. Monitor system performance and availability using tools like Prometheus, Grafana,...
-
Sre & Devops Engineer
3 weeks ago
Pune, Maharashtra, India METRO Global Solutions Center Full timeCompany Description Metro Global Solution Center MGSC is internal solution partner for METRO a EUR31 Billion international wholesaler with operations in more than 30 countries The store network comprises a total of 623 stores in 21 countries of which 522 offer out-of-store delivery OOS and 94 dedicated depots In 12 countries METRO runs only the delivery...
-
Sre
3 days ago
Pune, Maharashtra, India Hitachi Solutions Full timeCompany Description About Hitachi Solutions India Pvt Ltd Hitachi Solutions Ltd headquartered in Tokyo Japan is a core member of Information Telecommunication Systems Company of Hitachi Group and a recognized leader in delivering proven business and IT strategies and solutions to companies across many industries The company provides value-driven services...
-
Infrastructure Engineer
2 weeks ago
Pune, Maharashtra, India UBS Full timeBusiness Divisions Group Functions Your role Are you an analytic thinker Do you enjoy Site Reliability Engineering initiatives and proactive problem management across on-premises Cloud Database ensuring high availability stability of Database infrastructure services Do you want to play a key role in transforming our firm into an agile organization At UBS we...
-
SRE support
6 days ago
Pune, Maharashtra, India Virtusa Full time ₹ 15,00,000 - ₹ 25,00,000 per yearWe are seeking a skilled and proactive Site Reliability Engineer (SRE) to join our growing engineering team. The SRE will be responsible for ensuring the availability, performance, scalability, and reliability of our production systems. You will work at the intersection of software development and operations, driving best practices in observability,...
-
SRE Engineer
2 weeks ago
Pune, Maharashtra, India Techno Facts Solutions Full time ₹ 8,00,000 - ₹ 24,00,000 per yearRole OverviewWe are seeking an experienced Site Reliability Engineer (SRE) with a strong background in automation, monitoring, and performance optimization. The ideal candidate will be proficient in scripting (Python, Bash, ), observability tools, and incident response, ensuring reliability and scalability of enterprise applications.Key...
-
Sre
2 weeks ago
Pune, Maharashtra, India Hitachi Solutions Full time**Company Description** About Hitachi Solutions India Pvt Ltd**: Hitachi Solutions, Ltd., headquartered in Tokyo, Japan, is a core member of Information & Telecommunication Systems Company of Hitachi Group and a recognized leader in delivering proven business and IT strategies and solutions to companies across many industries. The company provides...
-
SRE Migration Engineer
2 days ago
Pune, Maharashtra, India Procallisto solution Full time ₹ 12,00,000 - ₹ 36,00,000 per yearWe are seeking an experienced DevOps Engineer with proven expertise in GitHub to GitLab migration, strong hands-on skills in Python programming, AWS, and Site Reliability Engineering (SRE) practices. The ideal candidate will play a key role in modernizing our CI/CD pipelines, improving cloud infrastructure, and ensuring high system reliability and...