Site Reliability Engineer/Architect
2 weeks ago
Job Summary
We are seeking an experienced Site Reliability Engineer (SRE) Architect with over 10 years of IT experience, specializing in designing and implementing highly scalable, reliable, and automated systems.
The ideal candidate will have strong expertise in cloud-native architectures, automation, monitoring, and SRE practices.
This role requires excellent leadership, technical depth, and the ability to guide large-scale enterprise reliability initiatives.
Key Responsibilities
- Design and implement scalable, reliable, and automated infrastructure solutions.
- Lead SRE initiatives across multiple teams, ensuring adherence to SRE principles (SLIs, SLOs, SLAs).
- Drive incident management, root cause analysis, and postmortem processes.
- Define and implement observability standards (monitoring, logging, alerting).
- Collaborate with development and operations teams to improve system reliability and performance.
- Automate infrastructure provisioning and deployments using IaC (Terraform, Ansible, etc.).
- Build and optimize CI/CD pipelines for zero-downtime deployments.
- Ensure high availability, fault tolerance, and disaster recovery strategies.
- Establish performance benchmarks, load testing, and capacity planning.
- Provide leadership and mentorship to SRE and DevOps teams.
Required Skills & Qualifications
- 10+ years of IT experience with at least 5 years in SRE/DevOps roles.
- Expertise in cloud platforms: AWS, Azure, or GCP.
- Strong knowledge of Kubernetes, Docker, and microservices architecture.
- Hands-on experience with Infrastructure as Code (Terraform, Ansible, CloudFormation).
- Proficiency in programming/scripting languages such as Python, Go, or Bash.
- Experience with monitoring tools (Prometheus, Grafana, ELK, Datadog, Dynatrace).
- Strong background in CI/CD pipeline design and automation (Jenkins, GitHub Actions, GitLab CI).
- In-depth knowledge of networking, load balancers, DNS, and security best practices.
- Excellent problem-solving and incident management skills.
- Strong leadership and stakeholder management abilities.
Preferred Qualifications
- Certified Kubernetes Administrator (CKA) or AWS/Azure/GCP Cloud Architect certification.
- Experience in large-scale distributed systems design.
- Background in performance engineering and chaos engineering.
- Knowledge of ITIL practices for incident, problem, and change management.
)
-
Senior Site Reliability Engineer
5 days ago
Greater Kolkata Area, India Meazure Learning Full time ₹ 12,00,000 - ₹ 24,00,000 per yearAt Meazure Learning, we believe in transforming learning and assessment experiences to unlock human potential. As a global leader in online testing and exam services, we support credentialing, licensure, workforce education, and higher education through purpose-built solutions that are secure, accessible, and deeply human-centered. With a global footprint...
-
Principal Site Reliability Engineer
2 days ago
Greater Kolkata Area, India Atlassian Full time ₹ 1,20,000 - ₹ 2,60,000 per yearOverviewWe are looking for a reliability expert who is passionate about scaling Cloud services to join our growing Site Reliability Engineering (SRE) teams. You are someone who is aware of current industry trends (particularly those related to reliability) and who values working with a diverse set of partners, who can articulate the business impact of a...
-
Senior Site Reliability Engineer
2 weeks ago
Greater Kolkata Area, India N-iX Full time ₹ 12,00,000 - ₹ 24,00,000 per yearAbout The CompanyOur Client is defining the future of cybersecurity through our XDR platform that automatically prevents, detects, and responds to threats in real time. Singularity XDR ingests data and leverages our patented AI models to deliver autonomous protection. With the Client, organizations gain full transparency into everything happening across the...
-
Site Reliability
1 week ago
Greater Kolkata Area, India N-iX Full time ₹ 1,00,00,000 - ₹ 2,00,00,000 per yearDescriptionN-iX is a global software development company founded in 2002, connecting over 2,400+ tech professionals across 40+ countries.We deliver innovative technology solutions in cloud computing, data analytics, AI, embedded software,IoT, and more to global industry leaders and Fortune 500 companies.Join us to create technology that drives real change...
-
Site Reliability Engineer
1 day ago
Greater Bengaluru Area, India Relevance Lab Full timeAbout the roleWe are seeking an experienced Site Reliability Engineer to join our team at, a leader in blockchain technology and solutions. The ideal candidate will have a strong background in infrastructure management and a deep understanding of blockchain ecosystems. You will be responsible for designing, implementing, and maintaining the foundational...
-
Site Reliability Engineer
3 days ago
Greater Hyderabad Area, India Awign Expert Full timePosition: SRE Observability EngineerExp: 5+ YearsLocation: HyderabadMandatory Skills: Observability, Grafana and Writing queries using Prometheus and Loki. Job Description:We are seeking a highly experienced and driven Senior Observability Engineer to lead the design, development, and maintenance of observability solutions across our infrastructure,...
-
Site Reliability Engineer
23 hours ago
Greater Bengaluru Area, India Relevance Lab Full timeAbout the role We are seeking an experienced Site Reliability Engineer to join our team at, a leader in blockchain technology and solutions. The ideal candidate will have a strong background in infrastructure management and a deep understanding of blockchain ecosystems. You will be responsible for designing, implementing, and maintaining the foundational...
-
Site Reliability Engineer
21 hours ago
Greater Bengaluru Area, India Relevance Lab Full timeAbout the role We are seeking an experienced Site Reliability Engineer to join our team at, a leader in blockchain technology and solutions. The ideal candidate will have a strong background in infrastructure management and a deep understanding of blockchain ecosystems. You will be responsible for designing, implementing, and maintaining the foundational...
-
Sr Site Reliability Engineer
1 day ago
Greater Bengaluru Area, India Shell Recharge Solutions Full timeSenior Site Reliability Engineer Shell Recharge Solutions is a leader in delivering the new electric mobility future through innovative software, infrastructure, and professional services that empower utilities, cities, fleets, transit agencies, and automakers to deploy EV charging infrastructure at scale. Our technology is connecting EV infrastructure...
-
Sr Site Reliability Engineer
21 hours ago
Greater Bengaluru Area, India Shell Recharge Solutions Full timeSenior Site Reliability Engineer Shell Recharge Solutions is a leader in delivering the new electric mobility future through innovative software, infrastructure, and professional services that empower utilities, cities, fleets, transit agencies, and automakers to deploy EV charging infrastructure at scale. Our technology is connecting EV infrastructure...