Lead Site Reliability Engineer
2 weeks ago
Job Details:
Job Title: Lead Site Reliability Engineer (SRE)
Duration: Contract to Hire (On the Payroll of Datum Technology Group)
Location: Chennai || Mumbai || Gurugram
Interview Process: Virtual (2 Rounds) + 1 Technical screening.
Job Description:
- We are seeking a highly skilled and experienced Lead Site Reliability Engineer (SRE) to drive reliability, scalability, and performance across our cloud infrastructure, with a strong emphasis on cloud security, compliance, networking, and operating systems expertise.
- This role blends reliability engineering with security best practices to ensure our cloud infrastructure is not only scalable and resilient but also secure and compliant.
Responsibilities:
- Develop and maintain Infrastructure as Code (IaC) using Terraform, including advanced module design and best practices for highly complex environments.
- Design and optimize CI/CD pipelines with a focus on automation, scalability, and deployment efficiency, and be able to discuss and implement pipeline optimizations drawn from prior experience.
- Collaborate with development teams to integrate security and observability tools into CI/CD pipelines, automating security checks.
- Troubleshoot and debug networking issues, applying a deep understanding of networking layers, components, and configurations across cloud and hybrid environments.
- Administer and optimize Linux-based operating systems, including troubleshooting, performance tuning, and implementing best practices for security and reliability.
- Address vulnerabilities in code libraries and infrastructure (e.g., OS packages) through patching and remediation.
- Partner with application teams to resolve specific security findings and improve overall system resilience.
Requirements:
- 9+ years of experience in DevOps, Site Reliability Engineering (SRE), or Cloud Engineering.
- Some experience leading or managing a team of engineers.
- Deep knowledge of networking fundamentals, Linux operating systems, and CI/CD optimization strategies.
- Very strong expertise in writing complex Terraform code, including advanced module design and best practices for large-scale, highly complex environments.
- Proficiency in scripting or programming languages (e.g., Python, Bash, Go).
- Hands-on experience with the Azure cloud platform.
Bonus/Preferred Skills:
- Experience with Docker and Kubernetes for containerization and orchestration.
-
Site Reliability Engineer
3 weeks ago
Chennai, Tamil Nadu, India
Datum Technologies Group
Full time
Job Title: Site Reliability Engineer (SRE) – AWS
Experience: 8+ years
Location: Chennai / Mumbai
Work Mode: Hybrid
Key Skills: AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, Datadog
Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experience and a solid background in DevOps, automation, observability, and...
-
Site Reliability Engineer
3 weeks ago
Chennai, Tamil Nadu, India
Tata Consultancy Services
Full time
Role: Site Reliability Engineer
Location: Chennai/Bangalore/Hyderabad
Experience: 5-11 years
1. Exposure to any APM tool such as Dynatrace, AppDynamics, Splunk, etc.
2. DBA or infra admin
3. Gremlin, Chaos Monkey, Simian Army, or Litmus expertise
4. Exposure to ITSM tools such as ServiceNow, etc.
5. Understanding of automation and chaos engineering
6. Exposure to DevOps tools and...
-
Site Reliability Engineer
3 weeks ago
Chennai, Tamil Nadu, India
Grootan Technologies
Full time
About the Role
We are seeking a skilled Site Reliability Engineer (SRE) with 4–5 years of hands-on experience to join our engineering team. In this role, you will be responsible for building and maintaining reliable, scalable, and secure infrastructure to support our applications. You will leverage your expertise in automation, cloud platforms, and...
-
AWS Site Reliability Engineer
3 weeks ago
Chennai, Tamil Nadu, India
HTC Global Services
Full time
HTC – A brief profile
Established in 1990, HTC Inc., a company with headquarters in Troy, Michigan, is a leading global Information Technology solution and BPO provider. HTC assists clients across multiple industry verticals, offering turnkey project lifecycle in e-business, data warehousing, embedded systems, ECM, SCM, CRM, and ERP solutions. HTC Inc....
-
Site Reliability Engineer
2 weeks ago
Chennai, Tamil Nadu, India
Proglite
Full time
We have the following requirements for the Site Reliability Engineer role.
Skill Set:
AWS: EC2, Networking, Storage, autoscaling, CloudWatch, SSM, management (patching/upgrades/security) of OS (Windows/Linux) in EC2
GCP: GKE/Compute, Networking, storage, Cloud Monitoring, management (patching/upgrades/security) of OS (Windows/Linux) in compute
SRE Practices:...
-
Sr. Site Reliability Engineer
3 weeks ago
Chennai, Tamil Nadu, India
Datum Technologies Group
Full time
Job Details:
Job Title: Sr. Site Reliability Engineer (SRE)
Duration: Contract to Hire (On the Payroll of Datum Technology Group)
Location: Chennai || Mumbai || Gurugram
Interview Process: Virtual (2 Rounds) + 1 Technical screening.
Job Description:
We are seeking a highly skilled Senior Site Reliability Engineer (SRE) to enhance reliability, scalability, and...
-
Senior Site Reliability Engineer
2 weeks ago
Chennai, Tamil Nadu, India
Poshmark
Full time
We’re looking for an experienced Site Reliability Engineer to fill the mission-critical role of ensuring that our complex, web-scale systems are healthy, monitored, automated, and designed to scale. You will use your background as an operations generalist to work closely with our development teams from the early stages of design all the way through...
-
Staff Site Reliability Engineer
2 weeks ago
Chennai, Tamil Nadu, India
Poshmark
Full time
We’re looking for an experienced Site Reliability Engineer to fill the mission-critical role of ensuring that our complex, web-scale systems are healthy, monitored, automated, and designed to scale. You will use your background as an operations generalist to work closely with our development teams from the early stages of design all the way through...
-
Site Reliability Engineer
23 hours ago
Chennai, Tamil Nadu, India
TECEZE
Full time
Job Title: Site Reliability Engineer (SRE) – Core IT Infrastructure
Location: Chennai / Pune / Bangalore
Company: Teceze
About Teceze
Teceze is a global IT services and consulting organization delivering innovative, scalable, and secure technology solutions. We specialize in infrastructure services, cloud transformation, DevOps, and managed services, helping...