Senior Site Reliability Engineer

2 weeks ago


Bengaluru, Karnataka, India Josys Full time US$ 1,50,000 - US$ 2,00,000 per year

Senior Site Reliability Engineer (SRE)
About JOSYS
Josys, a dynamic B2B SaaS platform startup, has embarked on a mission to revolutionize IT operations globally, following an exceptional launch in Japan and securing $125 million in Series A and B funding. Our platform enables businesses to conquer the complexities of work-from-anywhere setups, rapid digital transformation, and the proliferation of SaaS applications by simplifying, optimizing, and securing their IT operations.

With a presence in 9 countries, including Japan, India, and the USA, our cutting-edge product technology center is located in Bengaluru, India. As we continue our rapid expansion, we aim to double our full-time employee headcount in 2024, enhancing our capacity to innovate and deliver.

Josys was spun off from RAKSUL, a celebrated Japanese unicorn and Forbes Startup of the Year 2018, which is renowned for driving transformation through three pioneering B2B e-commerce platforms.

About The Role
We are seeking a
Senior Site Reliability Engineer (Senior SRE)
to drive the scalability, reliability, and efficiency of our critical systems and infrastructure. As a senior member of the team, you will lead SRE initiatives, mentor engineers, and architect solutions that enhance system resilience and operational excellence. You will work closely with engineering, DevOps, and security teams to implement automation, observability, and reliability best practices across the organization.

Reliability Engineering

  • Define and drive SRE best practices, including SLOs, SLAs and resilience engineering.
  • Lead incident management and post-mortem analysis, ensuring continuous improvement in system reliability.
  • Establish disaster recovery (DR) and high-availability (HA) strategies to meet business continuity goals.
  • Develop and optimize incident response playbooks, reducing mean time to resolution (MTTR).

Observability, Monitoring, and Performance Engineering

  • Define and implement advanced monitoring, logging, and tracing strategies using tools like Prometheus, Grafana, Datadog, or New Relic.
  • Perform capacity planning, load testing, and performance tuning to optimize system health.
  • Introduce AIOps and machine learning-based anomaly detection for proactive issue resolution.

Security, Compliance, and DevSecOps

  • Implement security best practices, including infrastructure hardening, zero-trust principles, and identity management.
  • Ensure compliance with SOC2 and ISO 27001.

Mentorship & Cross-Functional Collaboration

  • Mentor and coach junior SREs, fostering a culture of reliability.
  • Work closely with development teams to ensure reliability is built into the software lifecycle.
  • Advocate for chaos engineering, game days, and resilience testing to enhance system robustness.

Qualifications

  • Bachelor's or Master's degree in Computer Science or a related field.
  • 6+ years of experience in Site Reliability Engineering, DevOps, or Infrastructure Engineering.
  • Strong experience with cloud platforms (AWS, GCP, or Azure) and cloud-native technologies.
  • Expertise in Kubernetes and container orchestration.
  • Expertise with log management tools like ELK or Graylog.
  • Strong coding/scripting skills in Python, Go, or Bash for automation.
  • Deep understanding of networking, DNS, CDN, load balancing, and security.
  • Proven experience with observability tools (Prometheus, Grafana, ELK, OpenTelemetry).
  • Hands-on experience in performance tuning, high availability, and DR strategies.
  • Strong knowledge of incident management frameworks and reliability metrics (SLOs, SLIs, SLAs).
  • Experience leading cross-functional reliability initiatives.

Preferred Qualifications

  • Experience in a fast-paced, agile development environment.
  • SRE Certifications from Datadog/Google.
  • Experience with Chaos Engineering.
  • Exposure to AIOps and ML-based observability.
  • Experience in leading SRE transformations at scale.
  • Experience in multi-timezone support.
  • Experience working for B2B products from scratch would be a big plus.


  • Bengaluru, Karnataka, India Akamai Full time

    Job Category Site Reliability Would you like to lead modernization initiatives while building a public cloud platform from scratch Would you like to own critical services in a new public cloud platform Join our IaaS Site Reliability Engineering SRE team We design develop and operate infrastructure and services that power the backbone of our...


  • Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time

    We are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...


  • Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time

    We are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...


  • Bengaluru, Karnataka, India Synechron Full time

    We have immediate opportunity for SRE (Senior Site Reliability Engineer) 5 to 9 years. Job Role: - SRE (Senior Site Reliability Engineer)We began life in 2001 as a small, self-funded team of technology specialists. Innovative tech solutions for business We're now a leading global digital consulting firm, providing innovative technology solutions for...


  • Bengaluru, Karnataka, India LanceSoft, Inc. Full time ₹ 6,00,000 - ₹ 8,00,000 per year

    Role DescriptionThis is a full-time on-site role for a Senior Site Reliability Engineer based in Bangalore/Chennai/Pune. The Senior Site Reliability Engineer will be responsible for maintaining and enhancing the reliability and performance of the company's IT infrastructure & Development. Daily tasks include troubleshooting system issues, ensuring system...


  • Bengaluru, Karnataka, India CloudHire Full time

    Job SummaryThe Technical Manager for Site Reliability Engineering (SRE) will lead a remote team of Site Reliability Engineers, ensuring operational excellence and fostering a high-performing team culture. Reporting to the US-based Director of Systems and Security, this role is responsible for overseeing day-to-day operations, technical mentorship, and...


  • Bengaluru, Karnataka, India beBeeSiteReliability Full time ₹ 20,00,000 - ₹ 30,00,000

    As a senior site reliability engineer, you will play a critical role in ensuring the stability and scalability of financial platforms.Key Responsibilities:Ensure defined SLAs, SLOs, and SLIs are met for performance, reliability, and uptime.Build automation for deployments, monitoring, scaling, and self-healing capabilities to reduce manual effort and...


  • Bengaluru, Karnataka, India Procore Full time ₹ 5,00,000 - ₹ 8,00,000 per year

    Job DescriptionWe're looking for a Senior Site Reliability Engineer to join Procore's Product & Technology Team. Procore software solutions aim to improve the lives of everyone in construction and the people within Product & Technology are the driving force behind our innovative, top-rated global platform. We're a customer-centric group that encompasses...


  • Bengaluru, Karnataka, India Procore Technologies Full time ₹ 1,04,000 - ₹ 1,30,878 per year

    Job Description We're looking for a Senior Site Reliability Engineer to join Procore's Product & Technology Team. Procore software solutions aim to improve the lives of everyone in construction and the people within Product & Technology are the driving force behind our innovative, top-rated global platform. We're a customer-centric group that encompasses...


  • Bengaluru, Karnataka, India HireAlpha Full time ₹ 15,00,000 - ₹ 20,00,000 per year

    We're Hiring | Senior Site Reliability Engineer (SRE)Bangalore | HybridPermanent RoleAre you ready to help shape the future of cloud contact centers? we're building scalable, reliable, and cutting-edge infrastructure for world-class customer experiences — and we're looking for aSenior SREto join our teamWhat you'll do:Lead efforts in building a seamless ...