Site Reliability Engineer

14 hours ago


Chennai, Tamil Nadu, India TECEZE Full time

Job Title:


Site Reliability Engineer (SRE) – Core IT Infrastructure


Location:


Chennai / Pune / Bangalore


Company:


Teceze



About Teceze


Teceze is a global IT services and consulting organization delivering innovative, scalable, and secure technology solutions. We specialize in infrastructure services, cloud transformation, DevOps, and managed services, helping enterprises achieve operational excellence and digital resilience.



Job Summary


Teceze is looking for a highly skilled Site Reliability Engineer (SRE) to join our Core IT Infrastructure team. The ideal candidate will focus on designing, building, and maintaining reliable, scalable, and highly available infrastructure platforms. This role blends software engineering, systems engineering, and operational excellence to ensure stability, performance, and automation across enterprise environments.



Key Responsibilities


Infrastructure Reliability & Operations

• Design, implement, and maintain highly available and fault-tolerant infrastructure

• Ensure reliability, performance, scalability, and security of core IT systems

• Monitor system health, capacity, and performance using proactive observability practices

• Lead incident response, root cause analysis (RCA), and post-incident reviews


Automation & SRE Development

• Develop and maintain automation tools, scripts, and frameworks to reduce manual operations

• Apply Infrastructure as Code (IaC) principles using tools such as Terraform, Ansible, or CloudFormation

• Build self-healing systems and automate repetitive operational tasks

• Improve deployment pipelines and operational workflows through engineering solutions


DevOps & Platform Engineering

• Collaborate with DevOps, development, and security teams to support CI/CD pipelines

• Enable seamless application deployments with minimal downtime

• Support container and orchestration platforms (Docker, Kubernetes, OpenShift)

• Implement best practices for configuration management and environment consistency


Monitoring, Observability & Performance

• Design and maintain monitoring, logging, and alerting systems

• Define and track service level indicators (SLIs), objectives (SLOs), and agreements (SLAs)

• Optimize system performance, capacity planning, and cost efficiency

• Enhance observability using tools such as Prometheus, Grafana, ELK, Datadog, or similar


Security & Compliance

• Implement infrastructure security best practices

• Collaborate with security teams on vulnerability management and compliance requirements

• Ensure secure access, identity management, and audit readiness



Required Skills & Qualifications


Technical Skills

• Strong experience in Linux/Unix system administration

• Proficiency in programming/scripting (Python, Go, Bash or other shell scripting, or similar)

• Experience with cloud platforms (AWS, Azure, or GCP)

• Hands-on experience with containerization and orchestration

• Knowledge of networking concepts (DNS, TCP/IP, load balancing, firewalls)

• Experience with monitoring, logging, and alerting tools



  • Chennai, Tamil Nadu, India Tata Consultancy Services Full time

    Role: Site Reliability Engineer
    Location: Chennai/Bangalore/Hyderabad
    Experience: 5-11 years
    1. Exposure to any APM tool like Dynatrace, AppDynamics, Splunk, etc.
    2. DBA or Infra admin
    3. Gremlin or Chaos Monkey or Simian Army or Litmus expertise
    4. Exposure to ITSM tools like ServiceNow, etc.
    5. Understanding of Automation and Chaos Engineering
    6. Exposure to DevOps tools and...


  • Chennai, Tamil Nadu, India Datum Technologies Group Full time

    Job Title: Site Reliability Engineer (SRE) – AWS
    Experience: 8+ years
    Location: Chennai / Mumbai
    Work Mode: Hybrid
    Key Skills: AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, Datadog
    Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experience and a solid background in DevOps, automation, observability, and...


  • Chennai, Tamil Nadu, India Grootan Technologies Full time

    About the Role: We are seeking a skilled Site Reliability Engineer (SRE) with 4–5 years of hands-on experience to join our engineering team. In this role, you will be responsible for building and maintaining reliable, scalable, and secure infrastructure to support our applications. You will leverage your expertise in automation, cloud platforms, and...


  • Chennai, Tamil Nadu, India Proglite Full time

    We have the following requirements for the Site Reliability Engineer role.
    Skill Set:
    AWS: EC2, networking, storage, autoscaling, CloudWatch, SSM, management (patching/upgrades/security) of OS (Windows/Linux) in EC2
    GCP: GKE/Compute, networking, storage, Cloud Monitoring, management (patching/upgrades/security) of OS (Windows/Linux) in Compute
    SRE Practices:...


  • Chennai, Tamil Nadu, India HTC Global Services Full time

    HTC – A brief profile: Established in 1990, HTC Inc., a company with headquarters in Troy, Michigan, is a leading global Information Technology solution and BPO provider. HTC assists clients across multiple industry verticals, offering turnkey project lifecycle in e-business, data warehousing, embedded systems, ECM, SCM, CRM, and ERP solutions. HTC Inc....


  • Chennai, Tamil Nadu, India Datum Technologies Group Full time

    Job Details:
    Job Title: Lead Site Reliability Engineer (SRE)
    Duration: Contract to Hire (On the Payroll of Datum Technology Group)
    Location: Chennai || Mumbai || Gurugram
    Interview Process: Virtual (2 Rounds) + 1 Technical screening.
    Job Description: We are seeking a highly skilled and experienced Lead Site Reliability Engineer (SRE) to drive reliability,...


  • Chennai, Tamil Nadu, India Datum Technologies Group Full time

    Job Details:
    Job Title: Sr. Site Reliability Engineer (SRE)
    Duration: Contract to Hire (On the Payroll of Datum Technology Group)
    Location: Chennai || Mumbai || Gurugram
    Interview Process: Virtual (2 Rounds) + 1 Technical screening.
    Job Description: We are seeking a highly skilled Senior Site Reliability Engineer (SRE) to enhance reliability, scalability, and...


  • Chennai, Tamil Nadu, India Poshmark Full time

    We’re looking for an experienced Site Reliability Engineer to fill the mission-critical role of ensuring that our complex, web-scale systems are healthy, monitored, automated, and designed to scale. You will use your background as an operations generalist to work closely with our development teams from the early stages of design all the way through...


  • Chennai, Tamil Nadu, India Grootan Technologies Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    About the Role: We are seeking a skilled Site Reliability Engineer (SRE) with 4–5 years of hands-on experience to join our engineering team. In this role, you will be responsible for building and maintaining reliable, scalable, and secure infrastructure to support our applications. You will leverage your expertise in automation, cloud platforms, and monitoring...