
SRE - Infrastructure Support Engineer
5 days ago
SRE - Infrastructure Support Engineer – JD
We are hiring a "SRE [Site Reliability Engineer] Infrastructure Support" engineer with deep expertise in Linux,
Kubernetes, and hardware infrastructure management for our "Enterprise-grade high-performance
supercomputing" platform. We are helping enterprises and service providers build their AI inference platforms
for end users, powered by our state-of-the-art RDU (Reconfigurable Dataflow Unit) hardware architecture. This
is a high-impact, high-visibility role. The ideal candidate will play a pivotal role in supporting and maintaining
our enterprise infrastructure stack, ensuring high availability and optimal performance across mission-critical AI
& ML environments. This role involves close collaboration with global SRE and Platform teams to manage and
troubleshoot enterprise systems and clusters.
Location: Remote and open to traveling to KSA or Turkey for 1 year.
Exp: 10+ years
Key Responsibilities:
- Linux Administration: Manage, configure, and optimize Linux servers (RHEL, Ubuntu, or similar),
including patching, security hardening, and performance tuning.
- Kubernetes Administration: Deploy, manage, and troubleshoot Kubernetes clusters, ensuring
reliability and scalability.
- Hardware Infrastructure Management: Oversee physical data center infrastructure, including servers,
storage, and networking hardware.
- Security & Compliance: Apply security patches and upgrades for Linux-based Kubernetes
environments and ensure compliance with organizational policies.
- Collaboration & Support: Work closely with SRE and Platform teams worldwide to support enterprise
systems and clusters.
- Ticket-Based Case Management: Handle tickets efficiently using tools such as Salesforce or
ServiceNow.
Required Qualifications:
- Strong hands-on experience with Linux system administration (RHEL, Ubuntu, or similar).
RHCSA/RHCE certification is a plus.
Solid understanding of Kubernetes administration; CKA/CKS certification is a plus.
Hands-on experience with bare-metal and hardware infrastructure (servers, storage, networking).
Good understanding of networking concepts (TCP/IP, DNS, Load Balancers, Firewalls); knowledge of
Juniper OS is a plus.
Strong troubleshooting skills across hardware, OS, and Kubernetes environments.
Knowledge of automation tools such as Ansible, Python, Bash, or similar is a plus.
Familiarity with monitoring and observability tools (Prometheus, Grafana, ELK) is a plus.
Soft Skills:
Strong communication, problem-solving, and collaboration abilities.
Ability to work effectively in fast-paced, dynamic environments and adapt to evolving AI & ML
technologies.
- Proactive mindset with a focus on automation, scalability, and operational excellence.
Why Join Us:
Work on cutting-edge AI & ML infrastructure supporting mission-critical applications.
Collaborate with global teams and gain exposure to advanced cloud-native and enterprise
technologies.
- Opportunity to grow your expertise in Linux, Kubernetes, and data center operations
Job Type: Full-time
Pay: ₹200, ₹1,540,374.87 per year
Benefits:
- Provident Fund
Work Location: In person
-
Infrastructure Sre
7 days ago
Pune, India Barclays Full timeJob title :Infrastructure SRE Location: Pune About Barclays Barclays is a British universal bank. We are diversified by business, by different types of customers and clients, and by geography. Our businesses include consumer banking and payments operations around the world, as well as a top-tier, full service, global corporate and investment bank, all of...
-
Sre Support
6 days ago
Pune, Maharashtra, India Virtusa Full timeProduction Engineering/ SRE Support Engineer Experience: 2-4 years Location: Pune / Hyderabad Employment **Responsibilities**: Must have a basic understanding of Pega app support, database management & networking concepts Hands-on experience using Linux/ Unix commands. Monitor system performance and availability using tools like Prometheus, Grafana,...
-
Sre- Ai Infrastructure
2 weeks ago
Pune, India Ellicium Full timeWhat are we about? At Ellicium be ready for contagious excitement, worthy challenges and enriching learning experience every day. We trust in the process of failing fast and learning fast. You will find ‘Ellicians’ putting their heart to get the perfect dish during our monthly potlucks or arguing over best Nolan movie during lunch breaks. It is all about...
-
SRE
2 weeks ago
Pune, India Virtusa Full timeSRE - CREQ Description Minimum 5 years of work experience as an SRE (not Traditional Production Support) covering integration platforms on cloud-based deployments. Coding experience in any programming language, particularly for integration tier and middleware. Working in a 24x7 operations support model for mission critical applications and infrastructure...
-
SRE
3 weeks ago
Pune, India Virtusa Full timeSRE - CREQ Description Minimum 5 years of work experience as an SRE (not Traditional Production Support) covering integration platforms on cloud-based deployments. Coding experience in any programming language, particularly for integration tier and middleware. Working in a 24x7 operations support model for mission critical applications and infrastructure...
-
senior SRE engineer
3 days ago
Pune, Maharashtra, India Biyani Technologies Full time ₹ 6,61,000 - ₹ 22,00,801 per yearRole: Senior SRE Engineer (AWS/GCP)We are looking for a Senior Site Reliability Engineer (SRE) with expertise in AWS/GCP, Kubernetes, CI/CD, and automation. The role involves designing, building, and scaling cloud infrastructure, leading DevOps best practices, and ensuring system reliability in a modern cloud-based environment.Responsibilities Lead the...
-
SRE support
5 days ago
Pune, Maharashtra, India Virtusa Full time ₹ 15,00,000 - ₹ 25,00,000 per yearWe are seeking a skilled and proactive Site Reliability Engineer (SRE) to join our growing engineering team. The SRE will be responsible for ensuring the availability, performance, scalability, and reliability of our production systems. You will work at the intersection of software development and operations, driving best practices in observability,...
-
SRE Engineer
2 weeks ago
Pune, Maharashtra, India Techno Facts Solutions Full time ₹ 8,00,000 - ₹ 24,00,000 per yearRole OverviewWe are seeking an experienced Site Reliability Engineer (SRE) with a strong background in automation, monitoring, and performance optimization. The ideal candidate will be proficient in scripting (Python, Bash, ), observability tools, and incident response, ensuring reliability and scalability of enterprise applications.Key...
-
SRE & DevOps Engineer
2 weeks ago
Pune, India METRO Global Solution Center IN Full timeJob DescriptionWe are looking for… An experienced SRE & DevOps Engineer with deep expertise in cloud infrastructure, automation, and observability A hands-on engineer who ensures reliability, performance, and scalability of systems A proactive problem solver with a strong focus on operational excellence and continuous improvement A collaborator who...
-
SRE & DevOps Engineer
2 weeks ago
Pune, India METRO Global Solution Center IN Full timeJob DescriptionWe are looking for… An experienced SRE & DevOps Engineer with deep expertise in cloud infrastructure, automation, and observability A hands-on engineer who ensures reliability, performance, and scalability of systems A proactive problem solver with a strong focus on operational excellence and continuous improvement A collaborator who...