Platform Reliability Engineer

1 week ago


Bengaluru, Karnataka, India Millennium Full time
Platform Reliability Engineer

Millennium's Infrastructure organization is dedicated to designing, engineering, supporting, and managing a robust server estate, systems virtualization, and core enterprise services. We are seeking a Platform Reliability Engineer to join a highly specialized team of exceptionally talented yet refreshingly humble individuals from diverse disciplines. We believe that delivering exceptional services requires the ability to make meaningful changes across the entire stack. Our mission is to solve real business challenges, reduce operational complexities, and foster a collaborative, team-driven environment that promotes mutual growth and success.

As a Platform Reliability Engineer, you will play a key role in managing and optimizing the operational aspects of the server and network infrastructure for a large financial buy-side organization. Your primary focus will be on reducing operational overhead, optimizing systems, managing configurations, and ensuring the reliability and performance of critical infrastructure.

Key Responsibilities:  

  • Ensure the production reliability of the firm's Linux-based research and trading platform as part of a globally distributed engineering team.

  • Provide rapid emergency response to production infrastructure issues.

  • Proactively understand internal clients' needs and effectively communicate them to leadership at both regional and global levels.

  • Identify risks, develop contingency plans, and implement solutions to mitigate them.

  • Develop and enhance the observability platform to monitor the performance and health of critical computing environments.

  • Participate in occasional (monthly) on-call rotations and support on-call staff during their shifts.

  • Contribute to organizational knowledge through documentation, education, and writing maintainable code.

Qualifications/Skills: We are looking for individuals with experience in at least some of the following areas:

  • 2+ years of experience in SRE, DevOps, or other infrastructure engineering roles, preferably within the financial industry.

  • Strong understanding of Linux system internals, including kernel operations, memory management, and performance optimization.

  • In-depth knowledge of storage technologies, particularly those used in high-performance computing (GPFS experience is a plus).

  • Broad understanding of IT infrastructure components, such as networking, DNS, NTP/PTP, and NIS.

  • Proficiency in system automation, monitoring, and self-healing (experience with Salt is a plus).

  • Experience with container orchestration and virtualization technologies (e.g., Kubernetes, Nomad, VMware).

  • Familiarity with on-premises and cloud-based HPC infrastructure (operational knowledge of Slurm and GPU is a plus).

  • Understanding of AI technologies and their applications in infrastructure automation and management. Experience with or a strong interest in implementing AI/ML solutions for infrastructure optimization, anomaly detection, or predictive analytics.

  • A passion for technology and automation, with a deep sense of curiosity and ownership.

  • A hands-on approach to problem-solving and a demonstrable enthusiasm for technology.

  • Excellent verbal and written communication skills.



  • Bengaluru, Karnataka, India Weekday Full time ₹ 35,00,000 - ₹ 45,00,000

    This role is for one of our clientsIndustry: Technology, Information and MediaSeniority level: Mid-senior levelMin Experience: 10 yearsLocation: BengaluruJobType: full-time ₹35,00,000 - ₹45,00,000 a year We are looking for a Principal DevOps & Platform Reliability Engineer to own the design, automation, and reliability of our cloud platforms. In this...


  • Bengaluru, Karnataka, India Weekday Full time

    This role is for one of our clients Industry: Technology, Information and Media Seniority level: Mid-senior level Min Experience: 10 years Location: Bengaluru JobType: full-time ₹35,00,000 - ₹45,00,000 a year We are looking for a Principal DevOps & Platform Reliability Engineer to own the design, automation, and reliability of our cloud platforms. In...


  • Bengaluru, Karnataka, India Weekday Full time

    This role is for one of our clientsIndustry: Technology, Information and MediaSeniority level: Mid-senior levelMin Experience: 10 yearsLocation: BengaluruJobType: full-timeWe are looking for a Principal DevOps & Platform Reliability Engineer to own the design, automation, and reliability of our cloud platforms. In this role, you will build highly available,...


  • Bengaluru, Karnataka, India airisDATA Full time ₹ 50,00,000 - ₹ 2,50,00,000 per year

    Job Title: Site Reliability Engineer/Plotfrom Engineer/Observability EngineeringExperience: 5+ yearsLocations: Remote________________________________________Job Description:We are looking for a Data Engineer with strong experience in Python automation, observability (Prometheus), MongoDB, ETL workflows, and cloud data platforms. The ideal candidate should...


  • Bengaluru, Karnataka, India Innovative Information Technologies, Inc. Full time US$ 60,000 - US$ 1,20,000 per year

    Job Title:Site Reliability Engineer/ platform engineerWork mode :Notice: Immediate to 15days________________________________________Job Description:We are looking for a Data Engineer with strong experience in Python automation, observability (Prometheus),MongoDB,ETLworkflows, and cloud data platforms. The ideal candidate should have hands-on skills in...


  • Bengaluru, Karnataka, India Innovative Information Technologies, Inc. Full time

    Job Title: Site Reliability Engineer/ platform engineerWork mode : RemoteNotice: Immediate to 15days________________________________________Job Description:We are looking for a Data Engineer with strong experience in Python automation, observability (Prometheus),MongoDB,ETLworkflows, and cloud data platforms. The ideal candidate should have hands-on skills...


  • Bengaluru, Karnataka, India airisDATA Full time ₹ 7,00,000 - ₹ 14,00,000 per year

    Job Title: Site Reliability Engineer/Plotfrom Engineer/Observability EngineeringExperience: 5+ yearsClient: AT&TPayroll: Innovative Information technologies Pvt Ltd (For 6 months)- RemotelyLocations: "Initially, the candidate will work with Innovative payroll Remotely. Based on performance, they will be converted to AT&T payroll (After 4 to 6 months), at...


  • Bengaluru, Karnataka, India MathWorks Full time

    SummaryMathWorks has a hybrid work model that enables staff members to split their time between office and home. The hybrid model provides the advantage of having both in-person time with colleagues and flexible at-home life optimizations. Learn More: As a Senior Platform Site Reliability Engineer (SRE) for the IT Observability and Automation Team, you will...


  • Bengaluru, Karnataka, India GlobalLogic Full time ₹ 8,00,000 - ₹ 12,00,000 per year

    DescriptionSame As aboveRequirementsSite Reliability Engineer (SRE) – Platform Reliability & Operational ExcellenceOverviewAt Client, we're scaling a mission‑critical safety and automation platform as we evolve from a monolith into distributed, event‑driven and microservice-based systems. Reliability, latency, and operational efficiency are...


  • Bengaluru, Karnataka, India Apna Full time

    About Blue MachinesBlue Machines powers large-scale, real-time Voice AI and Agentic Workflows across BFSI,Healthcare, HRTech, and Global Enterprises.Role: SRE & MLOps Engineer (3–6 Years Experience)Location: Bangalore (Hybrid)What You Will Own1. Platform Uptime & Reliability- Maintain 99.9%+ uptime.- Monitor and optimize latency for voice agents.2....