Site Reliability Engineer

1 week ago

India Prophecy Full time ₹ 12,00,000 - ₹ 24,00,000 per year

About Prophecy

Prophecy is a rapidly growing startup enabling all the data users to visually build data pipelines with modern software practices including code on Github using its Low-Code Data Engineering Platform.

Prophecy is trusted by top Fortune 500 firms to replace their legacy ETL tools as they re-platform to the Cloud or Apache Spark. We're very well funded, backed by top VCs, and on the path to establishing ourselves as the leader on the cloud.

Prophecy is a Core Technology and Deep-IP company with engineering centered in India. Prophecy engineers often say that they have never worked in a more productive, higher-horsepower organization in their careers. The engineers love their work, are being challenged, and are doing the best work of their careers. To learn more, visit us on LinkedIn

Position Summary

As a Site Reliability Engineer (SRE), you will ensure the reliability, scalability, and performance of Prophecy's platform across multi-cloud and SaaS environments. You will provide technical expertise in Kubernetes, networking, identity, observability, and automation, working to resolve challenges that impact the availability and resilience of our platform. Customers and internal teams will look to you for solutions ranging from infrastructure troubleshooting to complex architectural designs spanning Kubernetes, cloud-native services, and enterprise security. You will partner closely with product engineering and support teams to deliver a highly reliable experience to our enterprise customers.

The Impact You Will Have

Operate and optimize Kubernetes platforms (EKS, AKS, GKE) with Helm, namespaces, pods, autoscaling, node pools.
Manage ingress & networking: NGINX, ALB/AGIC, DNS, TLS/certificates, proxies, VNET/VPC routing, PrivateLink/peering.
Implement identity & secrets management: SSO (OIDC/SAML), SCIM, service principals/managed identities, vaults, key rotation.
Maintain platform service health across UI, APIs, orchestrators, workflow services using readiness/liveness probes and capacity planning.
Enable storage & I/O: object stores (S3, ADLS, GCS), DBFS mounts, IAM roles, access connectors, throughput/quota optimization.
Execute release & upgrades: version rollouts, canary/blue-green strategies, rollback automation, image registries, SBOM/vulnerability scanning.
Deliver observability: build dashboards, log pipelines, SLO/SLA monitoring with Prometheus, Grafana, CloudWatch, Log Analytics, ELK.
Strengthen resilience & DR: multi-AZ architectures, backup/restore, chaos testing, RTO/RPO validation, recovery runbooks.
Drive release automation: GitOps (ArgoCD/Flux), pre-flight checks, automated smoke tests, post-upgrade validation suites.
Ensure cloud-specific reliability: IAM, private connectivity, security groups, application gateways across AWS, Azure, GCP.
Enforce security & compliance: CIS hardening, benchmarks, network segmentation, vulnerability management, auditability.
Support high-governance SaaS deployments: dedicated SaaS controls, change control, strict egress policies, artifact provenance, customer-owned KMS.

What We Look For

4+ years in SRE, platform engineering, or enterprise production support.
Strong hands-on experience with Kubernetes and multi-cloud (AWS, Azure, GCP).
Expertise in networking, identity, secrets, and platform automation.
Proven track record in observability, reliability engineering, and incident management.
Familiarity with GitOps/CI/CD pipelines and modern automation practices.
Strong problem-solving, ownership, and ability to work in a fast-moving startup culture.
Technical degree or the equivalent experience.

What You'll Have At Prophecy

Great company culture.
Competitive compensation.
Fair and Open Equity awards for everyone.
Flexible hybrid/remote work environment
Private medical insurance.
Learning and career development opportunities
End-to-end project ownership and high-growth career path

Our Commitment to Diversity and Inclusion

At Prophecy, we hire for merit and foster an inclusive culture where people from diverse backgrounds can excel and do their best work. We take great care to ensure that our hiring practices are inclusive and meet equal employment opportunity standards. Individuals looking for employment at Prophecy are considered without regard to age, color, disability, ethnicity, family or marital status, gender identity or expression, language, national origin, physical and mental ability, political affiliation, race, religion, sexual orientation, socio-economic status, veteran status, and any other protected characteristics under applicable laws.

Site Reliability Engineer

1 week ago

India Grootan Technologies Full time

About the Role We are seeking a skilled Site Reliability Engineer (SRE) with 4–5 years of hands-on experience to join our engineering team. In this role, you will be responsible for building and maintaining reliable, scalable, and secure infrastructure to support our applications. You will leverage your expertise in automation, cloud platforms, and...
Site Reliability Engineer

3 days ago

India Datum Technologies Group Full time

Job Title: Site Reliability Engineer (SRE) – AWS Experience: 8+ years Location: Chennai / Mumbai Work Mode: Hybrid Key Skills: AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, Datadog Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experience and a solid background in DevOps, automation, observability, and...
Site Reliability Engineer

3 weeks ago

India Akamai Technologies Full time

Job Description Job Description Do you like collaborating across teams to solve complex problems Do you enjoy solving large scale distributed content delivery challenges Join our highly skilled Compute Site Reliability team Our team designs, develops, and manages applications and infrastructure that support Akamai's Compute products and services. We...
Site Reliability Engineer

2 weeks ago

India Akamai Full time ₹ 12,00,000 - ₹ 36,00,000 per year

Description Do you like collaborating across teams to solve complex problems? Do you enjoy solving large scale distributed content delivery challenges?Join our highly skilled Compute Site Reliability teamOur team designs, develops, and manages applications and infrastructure that support Akamai's Compute products and services. We specialize in creating...
Site Reliability Engineer

1 day ago

Chennai, India Datum Technologies Group Full time

Job Description Job Title: Site Reliability Engineer (SRE) Azure & AI Experience: 7+ years Work Mode: Hybrid Work Location: Chennai/Mumbai/Gurgaon Job Summary: We are looking for an experienced Site Reliability Engineer (SRE) with strong expertise in Microsoft Azure, AI infrastructure, and automation. The ideal candidate will have a solid background in...
Site Reliability Engineer

4 weeks ago

India CareerUS Solutions Full time

Job Description Position Overview: The Site Reliability Engineer (SRE) is responsible for ensuring the stability, scalability, performance, and reliability of production systems and services. This role bridges software development and operations, using automation, monitoring, and performance optimization to build resilient systems that can scale efficiently...
Site Reliability Engineer

3 weeks ago

India CitNOW Group Full time

About us Founded in 2008, CitNOW is an innovative, enterprise-level software product suite that allows automotive dealerships globally to sell more vehicles and parts more profitably. CitNOW’s app-based platform provides a secure, brand-compliant solution – for dealers to build trust, transparency and long-lasting relationships. CitNOW Group was formed...
Site Reliability Engineer

1 week ago

Hyderabad, India UBS Full time

Job Description Job Reference # 322870BR Job Type Full Time Your role Are you an analytic thinker Do you enjoy Site Reliability Engineering initiatives and proactive problem management across on-premises & Cloud Database ensuring high availability & stability of Database infrastructure services Do you want to play a key role in transforming our firm into an...
Site Reliability Engineer

2 weeks ago

Bengaluru, Karnataka, , India Qure ai Technologies Full time ₹ 12,00,000 - ₹ 24,00,000 per year

About Qure.AI:Qure.AI is an equal opportunity employer. is a leading Healthcare Artificial Intelligence (AI) company disrupting the 'status quo' by enhancing diagnostic imaging and improving health outcomes with the assistance of machine -supported tools. taps deep learning technology to provide an automated interpretation of radiology exams like X -rays,...
Site Reliability Engineer

2 weeks ago

India Zensar Technologies Full time ₹ 20,00,000 - ₹ 25,00,000 per year

Candidate having skilled and proactive Site Reliability Engineer (SRE) with 10 Years experienceThe SRE will be responsible for ensuring the reliability, scalability, and performance of our systems and infrastructure.This role blends software engineering with IT operations to build fault-tolerant, self-healing systems and drive continuous improvement across...

Americas

Europe

Asia / Oceania

Africa

Site Reliability Engineer