SRE Principal

1 week ago


Pune, Maharashtra, India Systems Plus Full time ₹ 15,00,000 - ₹ 25,00,000 per year

SystemsPlus is hiring a Principal SRE. Experience: 10 to 15 years. Location: Pune (hybrid). The client's Direct-to-Consumer Engineering team is responsible for creating, maintaining, and providing customer service for its branded eCommerce websites.

We seek talented individuals that fit into our team-oriented atmosphere and are proud to have an environment that offers the comfort of a true work/life balance. The Principal Site Reliability Engineer will play a lead role in the production environment by monitoring availability and taking a holistic view of system health. They will build software and systems to manage platform infrastructure and applications; improve reliability, quality, and time-to-market of our suite of software solutions; and measure and optimize system performance - all with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve.

Responsibilities

  • Ensure availability, latency, performance, and efficiency of our global eCommerce sites.
  • Drive change management and incident management.
  • Promote best practices and innovative observability to guide product delivery teams in achieving operational excellence for new product deliveries.
  • Drive operational excellence and evangelize best practices in observability.
  • Develop unified observability dashboards and implement E2E observability requirements.
  • Design innovative observability solutions for internal and external stakeholders.
  • Contribute to observability instrumentation standards and create repeatable patterns for engineering teams.
  • Define and implement E2E observability requirements and lead teams to support E2E best practices.
  • Collaborate with cross-functional teams to achieve objectives and drive high reliability into systems.
  • Build proprietary tools to mitigate weaknesses in incident management or software delivery.
  • Implement SRE best practices to increase system reliability and performance.
  • Automate processes for improved collaborative response and prepare teams for incidents.
  • Maintain error budgets, meet SLOs, and support uptime and availability of critical platform components.
  • Automate technology stacks to improve operating costs while responding to traffic spikes.
  • Location: Pune, client office; in-person attendance is mandatory on Tuesday, Wednesday, and Thursday each week.
  • Work timings: EST hours for the first 3 months during onboarding and ramp-up, then an 8-hour IST schedule with a possible 1-hour evening overlap with the US team in EST (10am to 7pm).

Required Skills and Experience

  • Bachelor's Degree in Computer Science, Information Science, Engineering, or a related field.
  • 10+ years of experience in code management, deployment processes, procedures, and tools in a DevOps or SRE role.
  • Experience with monitoring tools (preferred: Dynatrace, Splunk, Datadog, Grafana, and New Relic).
  • Proficiency in state-of-the-art observability trends, tools, products, and technologies.
  • Ability to identify organization-wide gaps in the SRE practice and implement solutions that contribute to organizational transformation.
  • Experience driving cross-organization adoption of new technologies or initiatives.
  • Ability to influence senior management in selecting the right strategy, processes, and structures to transform the organization into a modern SRE team.
  • Proactive in identifying performance bottlenecks, anomalous system behavior, and addressing root causes of service issues.
  • Passionate about technology with a strong sense of curiosity and a desire to improve processes, automate everything, and continuously learn.
  • Successful experience supporting a cloud production environment (strong preference for Azure).
  • Competency in one or more programming languages for automation (Python strongly preferred).
  • Knowledge of cloud deployment tools and methodologies (ideally Ansible, but Terraform, Azure DevOps, etc. are also considered).
  • Deep understanding of Kubernetes and Docker architecture and associated tools.
  • Experience with at least one configuration management solution (e.g., Chef, Ansible, AWS CodeDeploy).
  • Proficiency with repository and pipeline-related tools (e.g., GitLab, Jenkins, Bamboo, Travis, CircleCI).
  • Experience with implementing and using various application and infrastructure monitoring tools.
  • Strong troubleshooting skills.
  • Ability to take ownership and deliver solutions autonomously.

Interested candidates may send their CV to ***********@systems-


  • SRE support

    2 weeks ago


    Pune, Maharashtra, India Virtusa Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    We are seeking a skilled and proactive Site Reliability Engineer (SRE) to join our growing engineering team. The SRE will be responsible for ensuring the availability, performance, scalability, and reliability of our production systems. You will work at the intersection of software development and operations, driving best practices in observability,...

  • SRE

    4 weeks ago


    Pune, Maharashtra, India Hitachi Solutions Full time

    Company Description: About Hitachi Solutions India Pvt Ltd. Hitachi Solutions Ltd, headquartered in Tokyo, Japan, is a core member of Information Telecommunication Systems Company of Hitachi Group and a recognized leader in delivering proven business and IT strategies and solutions to companies across many industries. The company provides value-driven...

  • SRE Manager

    6 days ago


    Pune, Maharashtra, India Virtusa Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Need a strong profile with good experience in stakeholder and SRE team management. Experience working on production engineering or production support projects is a must, including handling teams working in a 24/7 model. A good understanding of incident, change, and service request management is a daily routine, so the candidate should know how to manage the workload, rotate FTEs as...


  • Pune, Maharashtra, India Procallisto solution Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    We are seeking an experienced DevOps Engineer with proven expertise in GitHub to GitLab migration, strong hands-on skills in Python programming, AWS, and Site Reliability Engineering (SRE) practices. The ideal candidate will play a key role in modernizing our CI/CD pipelines, improving cloud infrastructure, and ensuring high system reliability and...

  • SRE & DevOps Engineer

    3 weeks ago


    Pune, Maharashtra, India METRO Global Solutions Center Full time

    Company Description: Metro Global Solution Center (MGSC) is internal solution partner for METRO, a €31 Billion international wholesaler with operations in more than 30 countries. The store network comprises a total of 623 stores in 21 countries, of which 522 offer out-of-store delivery (OOS), and 94 dedicated depots. In 12 countries, METRO runs only the...


  • Pune, Maharashtra, India procallisto solutions pvt Full time ₹ 20,40,000 per year

    We are seeking an experienced DevOps Engineer with proven expertise in GitHub to GitLab migration, strong hands-on skills in Python programming, AWS, and Site Reliability Engineering (SRE) practices. The ideal candidate will play a key role in modernizing our CI/CD pipelines, improving cloud infrastructure, and ensuring high system reliability and...


  • Pune, Maharashtra, India METRO Global Solution Center IN Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    Metro Global Solution Center (MGSC) is internal solution partner for METRO, a €31 Billion international wholesaler with operations in more than 30 countries. The store network comprises a total of 623 stores in 21 countries, of which 522 offer out-of-store delivery (OOS), and 94 dedicated depots. In 12 countries, METRO runs only the delivery business by...


  • Pune, Maharashtra, India METROMAKRO Full time ₹ 1,04,000 - ₹ 1,30,878 per year

    Company Description Metro Global Solution Center (MGSC) is internal solution partner for METRO, a €31 Billion international wholesaler with operations in more than 30 countries. The store network comprises a total of 623 stores in 21 countries, of which 522 offer out-of-store delivery (OOS), and 94 dedicated depots. In 12 countries, METRO runs only the...

  • SRE Engineer

    4 weeks ago


    Pune, Maharashtra, India InfoVision Inc. Full time

    Job Description. Critical Skills To Possess: 5+ years of Site Reliability Engineering, DevOps, or Infrastructure Engineering experience. SRE Principles: Deep understanding of SLOs, SLIs, error budgets, and reliability engineering practices. Incident Management: Proven experience with incident response, on-call rotations, and post-mortem processes. Automation:...

  • SRE Engineer

    5 days ago


    Pune, Maharashtra, India InfoVision Inc. Full time ₹ 12,00,000 - ₹ 24,00,000 per year

    Critical Skills To Possess: 5+ years of Site Reliability Engineering, DevOps, or Infrastructure Engineering experience. SRE Principles: Deep understanding of SLOs, SLIs, error budgets, and reliability engineering practices. Incident Management: Proven experience with incident response, on-call rotations, and post-mortem processes. Automation: Strong scripting...