
SRE - Infrastructure Support Engineer
2 days ago
SRE - Infrastructure Support Engineer – JD
We are hiring a "SRE [Site Reliability Engineer] Infrastructure Support" engineer with deep expertise in Linux,
Kubernetes, and hardware infrastructure management for our "Enterprise-grade high-performance
supercomputing" platform. We are helping enterprises and service providers build their AI inference platforms
for end users, powered by our state-of-the-art RDU (Reconfigurable Dataflow Unit) hardware architecture. This
is a high-impact, high-visibility role. The ideal candidate will play a pivotal role in supporting and maintaining
our enterprise infrastructure stack, ensuring high availability and optimal performance across mission-critical AI
& ML environments. This role involves close collaboration with global SRE and Platform teams to manage and
troubleshoot enterprise systems and clusters.
Location: Remote and open to traveling to KSA or Turkey for 1 year.
Exp: 10+ years
Key Responsibilities:
- Linux Administration: Manage, configure, and optimize Linux servers (RHEL, Ubuntu, or similar),
including patching, security hardening, and performance tuning.
- Kubernetes Administration: Deploy, manage, and troubleshoot Kubernetes clusters, ensuring
reliability and scalability.
- Hardware Infrastructure Management: Oversee physical data center infrastructure, including servers,
storage, and networking hardware.
- Security & Compliance: Apply security patches and upgrades for Linux-based Kubernetes
environments and ensure compliance with organizational policies.
- Collaboration & Support: Work closely with SRE and Platform teams worldwide to support enterprise
systems and clusters.
- Ticket-Based Case Management: Handle tickets efficiently using tools such as Salesforce or
ServiceNow.
Required Qualifications:
- Strong hands-on experience with Linux system administration (RHEL, Ubuntu, or similar).
RHCSA/RHCE certification is a plus.
Solid understanding of Kubernetes administration; CKA/CKS certification is a plus.
Hands-on experience with bare-metal and hardware infrastructure (servers, storage, networking).
Good understanding of networking concepts (TCP/IP, DNS, Load Balancers, Firewalls); knowledge of
Juniper OS is a plus.
Strong troubleshooting skills across hardware, OS, and Kubernetes environments.
Knowledge of automation tools such as Ansible, Python, Bash, or similar is a plus.
Familiarity with monitoring and observability tools (Prometheus, Grafana, ELK) is a plus.
Soft Skills:
Strong communication, problem-solving, and collaboration abilities.
Ability to work effectively in fast-paced, dynamic environments and adapt to evolving AI & ML
technologies.
- Proactive mindset with a focus on automation, scalability, and operational excellence.
Why Join Us:
Work on cutting-edge AI & ML infrastructure supporting mission-critical applications.
Collaborate with global teams and gain exposure to advanced cloud-native and enterprise
technologies.
- Opportunity to grow your expertise in Linux, Kubernetes, and data center operations
Job Type: Full-time
Pay: ₹200, ₹1,540,374.87 per year
Benefits:
- Provident Fund
Work Location: In person
-
Infrastructure Sre
2 weeks ago
Pune, India Barclays Full timeJob title :Infrastructure SRE Location: Pune About Barclays Barclays is a British universal bank. We are diversified by business, by different types of customers and clients, and by geography. Our businesses include consumer banking and payments operations around the world, as well as a top-tier, full service, global corporate and investment bank, all of...
-
senior SRE engineer
2 days ago
Pune, Maharashtra, India Biyani Technologies Full time ₹ 6,61,000 - ₹ 22,00,801 per yearRole: Senior SRE Engineer (AWS/GCP)We are looking for a Senior Site Reliability Engineer (SRE) with expertise in AWS/GCP, Kubernetes, CI/CD, and automation. The role involves designing, building, and scaling cloud infrastructure, leading DevOps best practices, and ensuring system reliability in a modern cloud-based environment.Responsibilities Lead the...
-
SRE Engineer
1 week ago
Pune, Maharashtra, India Techno Facts Solutions Full time ₹ 5,00,000 - ₹ 15,00,000 per yearRole OverviewWe are seeking an experienced Site Reliability Engineer (SRE) with a strong background in automation, monitoring, and performance optimization. The ideal candidate will be proficient in scripting (Python, Bash, ), observability tools, and incident response, ensuring reliability and scalability of enterprise applications.Key...
-
SRE Engineer
7 days ago
Pune, India Techno Facts Solutions Full timeRole Overview We are seeking an experienced Site Reliability Engineer (SRE) with a strong background in automation, monitoring, and performance optimization. The ideal candidate will be proficient in scripting (Python, Bash, ), observability tools, and incident response, ensuring reliability and scalability of enterprise applications. Key Responsibilities...
-
SRE support
2 days ago
Pune, Maharashtra, India Virtusa Full time ₹ 15,00,000 - ₹ 25,00,000 per yearWe are seeking a skilled and proactive Site Reliability Engineer (SRE) to join our growing engineering team. The SRE will be responsible for ensuring the availability, performance, scalability, and reliability of our production systems. You will work at the intersection of software development and operations, driving best practices in observability,...
-
Sre & Devops Engineer
1 week ago
Pune, Maharashtra, India METRO Global Solutions Center Full timeCompany Description Metro Global Solution Center MGSC is internal solution partner for METRO a EUR31 Billion international wholesaler with operations in more than 30 countries The store network comprises a total of 623 stores in 21 countries of which 522 offer out-of-store delivery OOS and 94 dedicated depots In 12 countries METRO runs only the...
-
Sre
3 weeks ago
Pune, Maharashtra, India Hitachi Solutions Full timeCompany Description About Hitachi Solutions India Pvt Ltd Hitachi Solutions Ltd headquartered in Tokyo Japan is a core member of Information Telecommunication Systems Company of Hitachi Group and a recognized leader in delivering proven business and IT strategies and solutions to companies across many industries The company provides value-driven...
-
Sre Support
4 days ago
Pune, Maharashtra, India Virtusa Full timeHands-on experience using Linux/ Unix commands. Monitor system performance and availability using tools like Prometheus, Grafana, AppDynamics, or similar. Actively participate in on-call rotation, incident response, and root cause analysis to maintain system health. Able to manage & maintain infrastructure in cloud or hybrid environments i.e., GCP &...
-
SRE Engineer
7 days ago
Pune, India GSPANN Full timeAbout GSPANN :Founded in 2004, GSPANN is a fast-growing IT services and consulting company based in Milpitas, California, USA.We provide end-to-end content, e-commerce, information analytics, quality assurance, and digital transformation solutions to our global clients across retail, finance, healthcare, manufacturing, and high-technology domains. We support...
-
SRE & DevOps Engineer
2 weeks ago
Pune, Maharashtra, India METROMAKRO Full time ₹ 1,04,000 - ₹ 1,30,878 per yearCompany Description Metro Global Solution Center (MGSC) is internal solution partner for METRO, a €31 Billion international wholesaler with operations in more than 30 countries. The store network comprises a total of 623 stores in 21 countries, of which 522 offer out-of-store delivery (OOS), and 94 dedicated depots. In 12 countries, METRO runs only the...