SRE - Cloud Native

4 days ago


Pune, India Ellicium Full time

**What are we about?**:

- At Ellicium, be ready for contagious excitement, worthy challenges, and an enriching learning experience every day. We trust in the process of failing fast and learning fast.

You will find ‘Ellicians’ putting their hearts into getting the perfect dish during our monthly potlucks or arguing over the best Nolan movie during lunch breaks. It is all about having fun.
We are passionate people with immense love for what we do, and we are proud of what we have created.

**Our Key Values**:
**AMBITION**:

Businesses need to make better and faster decisions by analyzing data to stay competitive and future-ready

**TEAMWORK**:

Businesses need to make better and faster decisions by analyzing data to stay competitive and future-ready

**GROWTH**:

Businesses need to make better and faster decisions by analyzing data to stay competitive and future-ready

**COMMUNITY**:

Businesses need to make better and faster decisions by analyzing data to stay competitive and future-ready

**Perks and Benefits**:

- Businesses need to make better and faster decisions by analyzing data to stay competitive and future-ready

**Targeted Bonus Program**:
**Health Care**:
**Competitive Salary**:

The Service Operations team at “Product Platform” Systems is responsible for building and operating the platform and infrastructure that enables us to deliver our groundbreaking capabilities to enterprise customers.
As a site reliability engineer on this team, you will work closely alongside the platform engineering team to deploy and manage our Kubernetes-based platform at a global scale. You will lead multiple initiatives to enhance our capabilities and provide a reliable, scalable service for customers in a hybrid deployment pattern.

**How you will make an impact**:

- Assume broad responsibilities for the successful delivery of our “Product Platform” services in a hybrid model, including but not limited to deployment, configuration, integrations, and ongoing operations
- Deploy, administer, and manage multiple Kubernetes clusters, both on-prem and in private cloud environments
- Develop and continuously improve platform capabilities for observability, monitoring, notifications, logging, tracing, and continuous delivery with reduced toil
- Develop standard solutions that enable consistency in service delivery, and proactively engage with multiple cross-functional teams to solve problems that impact service levels
- Determine and set SLOs for the service, build the processes and tools to measure and implement them, and prevent recurring problems and undesirable service conditions
- Participate in on-call rotation responsibilities

**Basic qualifications**:

- Bachelor's and/or Master's in CS/EE or a related field
- 5+ years of hands-on experience as an SRE with a focus on cloud native technologies
- Hands-on experience deploying, managing, and troubleshooting Kubernetes clusters and components
- Strong experience configuring and administering Linux systems in cloud/SaaS production environments
- Systematic problem-solving approach to troubleshooting, and the desire to solve the root cause of common problems in 24×7 environments

**Preferred qualifications**:

- Software programming experience in one or more languages, including Go/Python
- Experience delivering infrastructure as code: Ansible, Terraform, Git, Jenkins, Helm, ArgoCD
- Good understanding of DNS, DHCP, LDAP, NFS, Kerberos, PAM, PXE, SNMP, SSH, HTTP/S, and NTP, and of troubleshooting network performance issues
- Experience with monitoring and logging systems such as Prometheus, Grafana, Nagios, and ELK, and the ability to identify new technologies as appropriate
- Experience tuning and optimizing storage solutions, including object storage and NFS
- Knowledge of virtualization and multiple hypervisor technologies, as well as cloud computing technologies like AWS, Azure, and GCP
- Configuration and maintenance of web servers, load balancers, databases, storage systems, and messaging systems
- Good understanding of test-driven development, and of continuous integration and delivery
- A passion to design for high availability and scale, with the discipline and desire for extensive automation
- Strong communication skills, with the ability and willingness to work with diverse teams and customers across multiple time zones


  • Senior SRE Engineer

    6 days ago


    Pune, Maharashtra, India Biyani Technologies Full time ₹ 6,61,000 - ₹ 22,00,801 per year

    Role: Senior SRE Engineer (AWS/GCP). We are looking for a Senior Site Reliability Engineer (SRE) with expertise in AWS/GCP, Kubernetes, CI/CD, and automation. The role involves designing, building, and scaling cloud infrastructure, leading DevOps best practices, and ensuring system reliability in a modern cloud-based environment. Responsibilities: Lead the...

  • SRE

    3 weeks ago


    Pune, India Virtusa Full time

    SRE - CREQ Description Minimum 5 years of work experience as an SRE (not Traditional Production Support) covering integration platforms on cloud-based deployments. Coding experience in any programming language, particularly for integration tier and middleware. Working in a 24x7 operations support model for mission critical applications and infrastructure...


  • Bengaluru, Gurugram, Pune, India Epam Systems Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    We are seeking a highly skilled Senior DevOps & Cloud Engineer to join our Site Reliability Engineering (SRE) & Automation team. This role is ideal for someone with deep expertise in AWS infrastructure, DevOps automation, and Infrastructure as Code (IaC). You will be responsible for designing, implementing, and maintaining scalable and secure cloud...

  • SRE & DevOps Engineer

    3 weeks ago


    Pune, India METRO Global Solution Center IN Full time

    Job Description: We are looking for… An experienced SRE & DevOps Engineer with deep expertise in cloud infrastructure, automation, and observability. A hands-on engineer who ensures reliability, performance, and scalability of systems. A proactive problem solver with a strong focus on operational excellence and continuous improvement. A collaborator who...

  • SRE support

    7 days ago


    Pune, Maharashtra, India Virtusa Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    We are seeking a skilled and proactive Site Reliability Engineer (SRE) to join our growing engineering team. The SRE will be responsible for ensuring the availability, performance, scalability, and reliability of our production systems. You will work at the intersection of software development and operations, driving best practices in observability,...

  • SRE & DevOps Engineer

    3 weeks ago


    Pune, India METRO LOGISTICS Full time

    Company Description: Metro Global Solution Center (MGSC) is the internal solution partner for METRO, a €31 billion international wholesaler with operations in more than 30 countries. The store network comprises a total of 623 stores in 21 countries, of which 522 offer out-of-store delivery (OOS), and 94 dedicated depots. In 12 countries, METRO runs only...

  • Software Engineer

    3 weeks ago


    Pune, India Maersk Full time

    About the Role: We are looking for a highly skilled Software Engineer with strong AI/ML expertise and a foundational understanding of SRE principles to help transform reliability engineering through intelligent, automation-driven solutions. This role is not just about applying AI; it’s about applying an engineering mindset and AI capabilities to...