Senior Site Reliability Engineer

3 days ago


Chennai Tamil Nadu, India Growfin Full time

About Job At Growfin ai we are seeking a highly motivated and detail-oriented Site Reliability Engineer to join our team As a Site Reliability Engineer you will play a critical role in transforming our infrastructure from manually managed EC2 instances to a modern automated containerized platform This is an exciting opportunity to work with cutting-edge technologies and contribute to the growth and reliability of our financial platform As a Site Reliability Engineer at Growfin ai you will have the chance to collaborate with cross-functional teams to understand application requirements and support deployment strategies that enable rapid development cycles You will also have the opportunity to learn from senior engineers and contribute to team knowledge sharing through documentation and runbooks Skills Qualification Hands-on experience with AWS cloud services including EC2 VPC IAM S3 and basic networking concepts Basic Linux system administration skills including scripting networking and troubleshooting in development or staging environments Familiarity with Infrastructure as Code concepts with eagerness to learn Terraform for automating infrastructure provisioning and management Understanding of containerization technologies particularly Docker with strong interest in learning Kubernetes and container orchestration Basic proficiency in scripting languages such as Python Bash or similar for automation and tool development Experience with CI CD pipeline concepts using tools like GitHub Actions GitLab CI Jenkins or similar platforms Strong problem-solving abilities and a growth mindset with demonstrated eagerness to learn new technologies and tackle infrastructure challenges Familiarity with monitoring and observability concepts with willingness to develop expertise in modern observability platforms Datadog experience is a plus Understanding of JVM-based applications and interest in learning performance monitoring Excellent communication skills with the ability to work effectively with cross-functional teams and document technical processes clearly Some exposure to configuration management tools Ansible Chef Puppet or willingness to learn for managing server infrastructure Basic understanding of networking concepts including DNS load balancing and security fundamentals Familiarity with database concepts and backup strategies for systems like MySQL PostgreSQL or similar technologies Passion for automation continuous improvement and building reliable systems that support business growth Enthusiasm for learning and knowledge sharing with ability to contribute to team collaboration and development initiatives Responsibilities Contribute to the transformation of our infrastructure from manually managed EC2 instances to a modern automated containerized platform under senior engineer guidance Learn and implement Infrastructure as Code solutions using Terraform to replace manual server management and enable version-controlled infrastructure Containerize existing applications using Docker and gain hands-on experience with container orchestration using Kubernetes Build and maintain CI CD pipelines and GitOps workflows to automate deployment processes Implement monitoring and alerting solutions across infrastructure components and learn comprehensive observability practices Collaborate with development teams to understand application requirements and support deployment strategies that enable rapid development cycles Assist in establishing backup disaster recovery and security protocols to ensure high availability and data protection for our financial platform Monitor AWS resource utilization and help implement cost optimization strategies Create documentation and runbooks for new infrastructure processes to support team knowledge sharing Learn DevOps best practices cloud technologies and modern infrastructure patterns through hands-on experience and team collaboration Support the adoption of container security best practices and vulnerability scanning processes Participate in incident response efforts and learn from post-mortem analysis to improve system reliability Stay current with emerging DevOps technologies and contribute ideas for infrastructure improvements Support architecture discussions for new applications and learn about infrastructure strategy and technology planning



  • tamil nadu, India Tata Consultancy Services Full time

    TCS has been a great pioneer in feeding the fire of young techies like you. We are a global leader in the technology arena and there’s nothing that can stop us from growing together. What we are looking for Role: Site Reliability Engineering (SRE) Experience Range: 5 – 15 Years Location: Chennai/Pune candidates should come to office for Walk in...


  • tamil nadu, India Grootan Technologies Full time

    About the RoleWe are seeking a skilled Site Reliability Engineer (SRE) with 4–5 years of hands-on experience to join our engineering team. In this role, you will be responsible for building and maintaining reliable, scalable, and secure infrastructure to support our applications. You will leverage your expertise in automation, cloud platforms, and...


  • tamil nadu, India Datum Technologies Group Full time

    Job Title: Site Reliability Engineer (SRE) – AWSExperience: 8+ yearsLocation: Chennai / MumbaiWork Mode: HybridKey Skills: AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, DatadogJob Summary:We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experience and a solid background in DevOps, automation, observability, and...


  • tamil nadu, India Datum Technologies Group Full time

    Job Title: Site Reliability Engineer (SRE) – AWS Experience: 8+ years Location: Chennai / Mumbai Work Mode: Hybrid Key Skills: AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, Datadog Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experience and a solid background in DevOps, automation, observability, and...


  • Chennai, Tamil Nadu, India, Tamil Nadu Tata Consultancy Services Full time

    Role: Site Reliability EngineerLocation: Chennai/Bangalore/HyderabadExp- 5-11 years1.Exposure to any APM tool like Dynatrace, Appdynamics, Splunk, etc2.DBA or Infra admin 3.Gremlin or Chaos Monkey or Simian Army or Litmus expertise4.Exposure to ITSM tools like Service Now, etc5.Understanding of Automation and Chaos Engineering6.Exposure to Devops tools and...


  • Chennai, Tamil Nadu, India, Tamil Nadu Datum Technologies Group Full time

    Job Title: Site Reliability Engineer (SRE) – AWSExperience: 8+ yearsLocation: Chennai / MumbaiWork Mode: HybridKey Skills: AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, DatadogJob Summary:We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experience and a solid background in DevOps, automation, observability, and...


  • Chennai, Tamil Nadu, India, Tamil Nadu Grootan Technologies Full time

    About the RoleWe are seeking a skilled Site Reliability Engineer (SRE) with 4–5 years of hands-on experience to join our engineering team. In this role, you will be responsible for building and maintaining reliable, scalable, and secure infrastructure to support our applications. You will leverage your expertise in automation, cloud platforms, and...


  • Chennai, Tamil Nadu, India, Tamil Nadu Poshmark Full time

    We’re looking for an experienced Site Reliability Engineer to fill the mission-critical role of ensuring that our complex, web-scale systems are healthy, monitored, automated, and designed to scale. You will use your background as an operations generalist to work closely with our development teams from the early stages of design all the way through...


  • Chennai, Tamil Nadu, India Growfin Full time

    About JobAt , we are seeking a highly motivated and detail-oriented Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in transforming our infrastructure from manually managed EC2 instances to a modern, automated, containerized platform. This is an exciting opportunity to work with cutting-edge...


  • Chennai, Tamil Nadu, India Pfizer Full time

    ROLE SUMMARY At Pfizer we make medicines and vaccines that change patients lives with a global reach of over 780 million patients Pfizer Digital is the organization charged with winning the digital race in the pharmaceutical industry We apply our expertise in technology innovation and our business to support Pfizer in this mission Our team the Global Supply...