Senior Site Reliability Engineer

4 weeks ago


India CirrusLabs Full time
We are CirrusLabs. Our vision is to become the world's most sought-after niche digital transformation company that helps customers realize value through innovation. Our mission is to co-create success with our customers, partners and community. Our goal is to enable employees to dream, grow and make things happen. We are committed to excellence. We are a dependable partner organization that delivers on commitments. We strive to maintain integrity with our employees and customers. Every action we take is driven by value. Our well-knit teams and employees are the core of who we are. You are the core of a values-driven organization.

You have an entrepreneurial spirit. You enjoy working as part of well-knit teams. You value the team over the individual. You welcome diversity at work and within the greater community. You aren't afraid to take risks. You appreciate a growth path, charted with your leadership team, that maps how you can grow inside and outside of the organization. You thrive on company-sponsored continuing education programs that strengthen your skills and help you become a thought leader ahead of the industry curve.

You are excited about creating change because your skills can help the greater good of every customer, industry and community. We are hiring a talented Senior Site Reliability Engineer (SRE) to join our team. If you're excited to be part of a winning team, CirrusLabs (http://www.cirruslabs.io) is a great place to grow your career.

Experience - 5 - 8 years

Location - Bengaluru

Senior Site Reliability Engineer (SRE) - Mission-Critical SaaS Cloud Products

Key Responsibilities

2 Reliability and Performance Management

- Design, implement, and maintain highly available, scalable, and resilient cloud-native architectures for mission-critical SaaS products.
- Develop and implement SLOs, SLIs, and SLAs to measure and improve service reliability.
- Continuously optimize system performance and resource utilization across multiple cloud platforms.
- Fine-tune and optimize application performance by analyzing code, traces, and database queries.

3 Incident Management and Troubleshooting

- Lead incident response efforts, effectively troubleshooting complex issues to minimize downtime and impact.
- Reduce Mean Time to Recover (MTTR) through proactive monitoring, automated alerting, and efficient problem-solving techniques.
- Conduct thorough Root Cause Analysis (RCA) for all major incidents and implement preventive measures.

4 Observability and Monitoring

- Design and implement end-to-end observability solutions across our distributed systems.
- Develop and maintain comprehensive monitoring strategies using tools such as the ELK Stack, Prometheus, and Grafana.
- Create and optimize product status dashboards to provide real-time visibility into system health and performance.

5 Automation and Infrastructure as Code (IaC)

- Implement Infrastructure as Code practices using tools like Terraform.
- Develop and maintain automated deployment pipelines and CI/CD workflows.
- Create self-healing systems and automate routine operational tasks to reduce manual intervention.

6 Cloud-Agnostic Architecture

- Design and implement cloud-agnostic solutions that can operate efficiently across multiple cloud providers.
- Develop expertise in event-driven architectures and related technologies (e.g., Apache Kafka/Eventhub, Redis, Mongo Atlas, IoTHub).
- Implement and manage containerized applications using Kubernetes across different cloud environments.

7 Continuous Improvement

- Regularly review and refine operational practices to enhance efficiency and reliability.
- Stay updated with the latest industry trends and technologies in SRE, cloud computing, and DevOps.
- Contribute to the development of internal tools and frameworks to support SRE practices.

Requirements

- Strong knowledge of cloud platforms, particularly Azure, and their associated services.
- Expert in observability tools (ELK Stack, Dynatrace, Prometheus).
- Expertise in containerization technologies such as Docker and Kubernetes.
- Understanding of event-driven architecture and database technologies (Mongo Atlas, Azure SQL, PostgresDB).
- Proficient in IaC tools such as Terraform and GitHub Actions.
- Proficiency in one or more programming languages: Python, .NET, or Java.
- Strong understanding of networking concepts, load balancing, and security practices.

  • India meanSquare.ai Full time

    Immediate Hiring: Senior Site Reliability Engineer (Sr. SRE) – Contract. Location: Hybrid – Hyderabad, India. Job Type: Contract (6–12 months, extendable). We are seeking a Senior Site Reliability Engineer (Sr. SRE) with expertise in Azure, Dynatrace, and Splunk to join our team on an immediate basis. This hybrid contract role requires a proactive...


  • India Cloudologic Full time

    Company Description : Cloudologic is a prominent cloud consulting and IT service provider based in Singapore and rooted in India, focusing on cloud operations, cyber security, and managed services. With a decade of expertise, our dedication to delivering high-quality services has earned the trust of clients worldwide, making us a valued partner in the tech...


  • India SolarWinds Full time

    At SolarWinds, we put people first. Our mission is to enrich the lives of our employees, customers, partners, and communities by delivering simple, powerful, and secure solutions that accelerate business transformation. We thrive on innovation, collaboration, and accountability. If you're a problem solver who enjoys working in a fast-paced, high-impact...


  • India NationsBenefits Full time

    Position Overview: The Site Reliability Engineering (SRE) team plays a critical role in maintaining the health, performance, and availability of our platforms. As an L2 SRE, you will monitor and respond to site performance metrics, manage incidents, and work closely with Development, , and Engineering teams to ensure the continuous reliability of our services....


  • India Microsoft Full time

    Job Description: M365's COSMIC team designs, builds, and operates a global-scale managed-runtime environment based on Azure Kubernetes Service for the benefit of Microsoft Substrate service and developers. COSMIC could be compared to a Kubernetes PaaS. Our charter builds and maintains solutions that enable substrate service teams onboarding to Cosmic Linux...


  • India iVedha Inc. Full time

    Site Reliability Engineer (SRE). Remote in India; must work in the EST (US/Canada) time zone with a 24*7 support model. Position Overview: We are seeking a highly skilled and motivated Site Reliability Engineer (SRE) with strong expertise in Python, advanced proficiency in Azure-based infrastructure, and significant experience in Customer Reliability...


  • India HARP Technologies and Services Full time

    Experience: 8 Years. Location: Mumbai, Chennai (other cities remote). Notice period: Immediate to 30 days max. Responsibilities of Senior SRE: - The Site Reliability Engineering (SRE) team is responsible for the reliability, scalability, stability and performance of systems and services. - They work with cross-functional teams to design, build and maintain...


  • India 10decoders Full time

    JD: Site Reliability Engineer - GCP with Terraform. The Role: We are looking for a Senior SRE with 5+ years of experience to work primarily with our Application development team. An ideal candidate would have extensive experience building cloud infrastructure on Google Cloud with Terraform and have strong experience running workloads that scale on Google’s Kubernetes...


  • India Burgeon It Services Pvt Ltd Full time

    Position : Site Reliability Engineer Location : PAN INDIA Location Duration : C2H Exp : 5 - 8 Years JOB DESCRIPTION : - Proven experience as a Site Reliability Engineer, DevOps Engineer, or similar role. - Experience with cloud platforms (AWS, Google Cloud, Azure) and containerization technologies (Docker, Kubernetes). - Maintain the stability of the...


  • India Infosys Full time

    Position Overview We are seeking a skilled and motivated Site Reliability Engineer with hands-on expertise in application operations, DevOps tools, and SRE principles. The ideal candidate will have experience in supporting production systems, DEVOPS hands-on, a solid understanding of observability, and a foundational grasp of SRE principles. The role also...


  • India Experience.com Full time

    Come Join Us. Experience.com - We make every experience matter more. Position Title: Senior DevOps Engineer / SRE (Site Reliability Engineer). Job Location: Chennai. Base Location: Remote. Shift Time: General and US shift. Employment Type: Full Time. Summary of Position: Are you a talented Senior DevOps Engineer looking for an exciting opportunity to work for a...


  • India CorroHealth Full time

    Hiring Alert!!! We are looking for a highly skilled Site Reliability Engineer (SRE) for our Product Development team based at our Noida location!!! Only immediate joiners preferred!! Only candidates who are available for an F2F round of interview can apply!! Job Description: We are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. The ideal...


  • India BigRio Full time

    Job Title: Site Reliability Engineer Location: Remote with Quarterly visits to Chennai, Tamil Nadu, India Duration: Full-Time About BigRio: BigRio is a remote-based, technology consulting firm headquartered in Boston, MA. We deliver software solutions ranging from custom development and software implementation to data analytics and machine learning/AI...


  • India Coforge Full time

    Job Title: Site Reliability Engineer Skills : SRE, CI/CD, AWS, Python, Terraform & Kubernetes Location: Hyderabad (Work from Office) Experience: 6-14 Years Note: Immediate joiners are preferable Job Description: We at Coforge are hiring a Site Reliability Engineer with the following skillset: Design, implement, and manage scalable and secure cloud-based...


  • India Info Way Solutions Full time

    Position: SRE (Site Reliability Engineer). Experience: Minimum 5+ years of exp. (5 - 7 years of exp). Location: HYD, TVM, BLR, Pune, Chennai and Kolkata. Work Mode: Hybrid (3 days a week). Must-have skill set (real-time, hands-on): CI/CD Tools – Jenkins/Harness; Cloud Infra/deployments – AWS/GCP, Docker & Kubernetes, Ansible; Infra / IaC – Terraform; Application...


  • India Forbes Advisor Full time

    Job Title: SRE (Certification Mandate). Certifications allowed: AWS DevOps Professional, AWS SysOps Admin, AWS Security Specialist, AWS Solution Architect Professional. Experience: 8+ Years. Location: Mumbai, Chennai (remote may be offered to strong candidates in other locations; candidates from Mumbai or Chennai are hybrid only, no remote). Notice period: Immediate to 30 days max...