SRE - AI Infrastructure

6 days ago


Pune, India Ellicium Full time

**What are we about?**:

- At Ellicium, be ready for contagious excitement, worthy challenges, and an enriching learning experience every day. We trust in the process of failing fast and learning fast.

You will find ‘Ellicians’ putting their hearts into getting the perfect dish for our monthly potlucks or arguing over the best Nolan movie during lunch breaks. It is all about having fun.
We are passionate people with immense love for what we do, and we are proud of what we have created.

**Our Key Values**:
**AMBITION**:

Businesses need to make better and faster decisions by analyzing data to stay competitive and future-ready

**TEAMWORK**:


**GROWTH**:


**COMMUNITY**:


**Perks and Benefits**:

- Businesses need to make better and faster decisions by analyzing data to stay competitive and future-ready

**Targeted Bonus Program**:

**Health Care**:

**Competitive Salary**:

The Service Operations team at “Product Platform” Systems is responsible for building and operating the platform and infrastructure that enables us to deliver our groundbreaking capabilities to enterprise customers.
As a site reliability engineer on this team, you will lead key system engineering and automation functions, enhancing our capabilities to provide a reliable and scalable service for customers, in a hybrid deployment pattern.

**How you will make an impact**:

- Assume broad responsibility for the successful delivery of our “Product Platform” services in a hybrid model, including, but not limited to, deployment, configuration, integrations, and ongoing operations.
- Take ownership of ongoing updates, upgrades, and patches on customer environments.
- Augment ongoing efforts to design and develop automation for deployments, updates and upgrades of the entire “Product Platform” software stack.
- Build the systems and tools for centralized command and control of distributed environments.
- Partner and collaborate with product and engineering teams to improve the security posture and operational readiness of our systems with the flexibility to integrate into unique customer environments.
- Participate in on-call rotation responsibilities.

**Basic qualifications**:

- Bachelor's and/or Master's degree in CS/EE or a related field.
- 5+ years of hands-on experience as an SRE with a focus on systems and infrastructure for cloud/SaaS production requirements.
- Extensive experience building, configuring, securing, and administering Linux systems in large-scale production environments.
- Strong scripting/programming skills (Python preferred) and experience with automated deployment systems such as Ansible and Terraform.
- Systematic problem-solving approach to troubleshooting, and the desire to solve the root cause of common problems in 24×7 environments.

**Preferred Qualifications**:

- Deep understanding of DNS, DHCP, LDAP, NFS, Kerberos, PAM, PXE, SNMP, SSH, HTTP/S, and NTP, and of troubleshooting network performance issues.
- Knowledge of software development processes and methods, CI/CD pipelines, and experience with common version control software.
- Knowledge of virtualization, multiple hypervisor technologies, and Kubernetes cluster administration and management.
- Experience with monitoring and logging systems and the ability to identify new technologies as appropriate.
- Experience configuring and maintaining web servers, load balancers, databases, storage systems, and messaging systems.
- A passion for designing for high availability and scale, with the discipline and desire for extensive automation.
- Strong communication skills, with the ability and willingness to work with diverse teams and customers across multiple time zones.

