Senior Site Reliability Engineer

1 day ago


Chennai, Tamil Nadu, India Growfin Full time
About Job

At , we are seeking a highly motivated and detail-oriented Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in transforming our infrastructure from manually managed EC2 instances to a modern, automated, containerized platform. This is an exciting opportunity to work with cutting-edge technologies and contribute to the growth and reliability of our financial platform.

As a Site Reliability Engineer at , you will have the chance to collaborate with cross-functional teams to understand application requirements and support deployment strategies that enable rapid development cycles. You will also have the opportunity to learn from senior engineers and contribute to team knowledge sharing through documentation and runbooks.

Skills & Qualification
  • Hands-on experience with AWS cloud services, including EC2, VPC, IAM, S3, and basic networking concepts.

  • Basic Linux system administration skills, including scripting, networking, and troubleshooting in development or staging environments.

  • Familiarity with Infrastructure as Code concepts, with eagerness to learn Terraform for automating infrastructure provisioning and management.

  • Understanding of containerization technologies, particularly Docker, with strong interest in learning Kubernetes and container orchestration.

  • Basic proficiency in scripting languages such as Python, Bash, or similar for automation and tool development.

  • Experience with CI/CD pipeline concepts using tools like GitHub Actions, GitLab CI, Jenkins, or similar platforms.

  • Strong problem-solving abilities and a growth mindset, with demonstrated eagerness to learn new technologies and tackle infrastructure challenges.

  • Familiarity with monitoring and observability concepts, with willingness to develop expertise in modern observability platforms (Datadog experience is a plus).

  • Understanding of JVM-based applications and interest in learning performance monitoring.

  • Excellent communication skills with the ability to work effectively with cross-functional teams and document technical processes clearly.

  • Some exposure to configuration management tools (Ansible, Chef, Puppet) or willingness to learn for managing server infrastructure.

  • Basic understanding of networking concepts, including DNS, load balancing, and security fundamentals.

  • Familiarity with database concepts and backup strategies for systems like MySQL, PostgreSQL, or similar technologies.

  • Passion for automation, continuous improvement, and building reliable systems that support business growth.

  • Enthusiasm for learning and knowledge sharing, with ability to contribute to team collaboration and development initiatives.

Responsibilities
  • Contribute to the transformation of our infrastructure from manually managed EC2 instances to a modern, automated, containerized platform under senior engineer guidance.

  • Learn and implement Infrastructure as Code solutions using Terraform to replace manual server management and enable version-controlled infrastructure.

  • Containerize existing applications using Docker and gain hands-on experience with container orchestration using Kubernetes.

  • Build and maintain CI/CD pipelines and GitOps workflows to automate deployment processes.

  • Implement monitoring and alerting solutions across infrastructure components and learn comprehensive observability practices.

  • Collaborate with development teams to understand application requirements and support deployment strategies that enable rapid development cycles.

  • Assist in establishing backup, disaster recovery, and security protocols to ensure high availability and data protection for our financial platform.

  • Monitor AWS resource utilization and help implement cost optimization strategies.

  • Create documentation and runbooks for new infrastructure processes to support team knowledge sharing.

  • Learn DevOps best practices, cloud technologies, and modern infrastructure patterns through hands-on experience and team collaboration.

  • Support the adoption of container security best practices and vulnerability scanning processes.

  • Participate in incident response efforts and learn from post-mortem analysis to improve system reliability.

  • Stay current with emerging DevOps technologies and contribute ideas for infrastructure improvements.

  • Support architecture discussions for new applications and learn about infrastructure strategy and technology planning.



  • Chennai, Tamil Nadu, India warrior tech solutions Full time

    Hi,Greetings from EWarriors Tech Solutions. We are hiring for the following position:Role: Cloud Site Reliability EngineerLocation: Chennai, Bangalore and Pune (Onsite)Experience: 14+ YearsEmployment: ContractNotice: Immediate Joiners / Less than 15 Days· 14+ years of experience with strong SRE principles and hands on experience in Azure cloud· Deep...


  • Chennai, Tamil Nadu, India Cstream, Inc. Full time ₹ 2,50,000 - ₹ 5,00,000 per year

    Company Description, headquartered in Irvine, California, is a technology-driven company that provides innovative solutions for technology governance, risk management, and compliance. Utilising automation and AI, Cstream's platform helps organisations streamline compliance frameworks like SOC 2, ISO 27001, HIPAA, and PCI-DSS through guided workflows,...


  • Chennai, Tamil Nadu, India Grootan Technologies Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    About the RoleWe are seeking a skilledSite Reliability Engineer (SRE)with 4–5 years of hands-on experience to join our engineering team. In this role, you will be responsible for building and maintaining reliable, scalable, and secure infrastructure to support our applications. You will leverage your expertise in automation, cloud platforms, and monitoring...


  • Chennai, Tamil Nadu, India Cortex Consultants Full time

    Job Title: Site Reliability Engineer (SRE) Experience: 6 to 9 years Location: chennai Job Overview: We are seeking a skilled and proactive Site Reliability Engineer (SRE) to join our growing team. As an SRE, you will be responsible for maintaining the reliability, availability, and performance of our systems. We're looking for someone with solid experience...


  • Chennai, Tamil Nadu, India GSR Business Services Full time

    Dear Aspirants,Urgent HiringSite reliability Engineer3-5 YearsChennaiRole Summary:Supports the reliability and performance of systems and infrastructure. Assists in monitoring, troubleshooting, and automating tasks to maintain high-availability environments.Key Responsibilities:Assist in managing VMware and Linux servers.Monitor system health and respond to...


  • Chennai, Tamil Nadu, India HICS Technologies Pte Ltd Full time ₹ 8,00,000 - ₹ 16,00,000 per year

    Job Title: Site Reliability Engineer (SRE) – Capital Markets / TradingLocation: [Chennai / Onsite / Full Time]Experience: 7 to 15 yearsDomain: IT Operations / Capital Markets / TradingAbout the Role:We are seeking a seasoned Site Reliability Engineer (SRE) to join our dynamic IT team supporting trading and capital markets applications. The ideal...


  • Chennai, Tamil Nadu, India Chasra Solutions Full time

    Position id: 34689 Site Reliability Engineering- Chennai (Onsite)Position Description:Employees in this SRE job function are responsible for ensuring availability, reliability and performance of cloud and network systems and services by AUTOMATING routine manual tasks. (Handson Software Engineer only)Key Responsibilities:Collaborate with Infrastructure teams...


  • Chennai, Tamil Nadu, India Flex Full time

    Flex is the diversified manufacturing partner of choice that helps market-leading brands design, build and deliver innovative products that improve the world.A career at Flex offers the opportunity to make a difference and invest in your growth in a respectful, inclusive, and collaborative environment. If you are excited about a role but don't meet every...


  • Chennai, Tamil Nadu, India Ford Motor Company Full time ₹ 8,00,000 - ₹ 24,00,000 per year

    Job DescriptionJob Description:Ford is seeking an experienced Site Reliability Engineer (SRE) to join our team and lead the development, enhancement, and extension of our global monitoring and observability platform.Enterprise Technology plays a critical part in shaping the future of mobility. If you're looking for the chance to leverage advanced technology...


  • Chennai, Tamil Nadu, India Proglite Full time

    We have the following requirements for the Site Reliability Engineer roleSkill Set:AWS: EC2, Networking, Storage, autoscaling, CloudWatch, SSM, management (patching/upgrades/security) of OS(windows/Linux) in EC2GCP: GKE/Compute, Networking, storage, Cloud Monitoring, management (patching/upgrades/security) of OS(windows/Linux) in computeSRE Practices:...