Senior Site Reliability Engineer

4 weeks ago

India CirrusLabs Full time

We are CirrusLabs. Our vision is to become the world's most sought-after niche digital transformation company that helps customers realize value through innovation. Our mission is to co-create success with our customers, partners and community. Our goal is to enable employees to dream, grow and make things happen. We are committed to excellence. We are a dependable partner organization that delivers on commitments. We strive to maintain integrity with our employees and customers. Every action we take is driven by value. Our well-knit teams and employees are the core of who we are. You are the core of a values-driven organization.

You have an entrepreneurial spirit. You enjoy working as part of well-knit teams. You value the team over the individual. You welcome diversity at work and within the greater community. You aren't afraid to take risks. You appreciate a growth path, charted with your leadership team, that maps how you can grow inside and outside of the organization. You thrive on continuing education programs that your company sponsors to strengthen your skills and help you become a thought leader ahead of the industry curve.

You are excited about creating change because your skills can help the greater good of every customer, industry and community. We are hiring a talented Senior Site Reliability Engineer (SRE) to join our team. If you're excited to be part of a winning team, CirrusLabs (http://www.cirruslabs.io) is a great place to grow your career.

Experience - 5 - 8 years

Location - Bengaluru

Senior Site Reliability Engineer (SRE) - Mission-Critical SaaS Cloud Products

Key Responsibilities

2 Reliability and Performance Management

- Design, implement, and maintain highly available, scalable, and resilient cloud-native architectures for mission-critical SaaS products.
- Develop and implement SLOs, SLIs, and SLAs to measure and improve service reliability.
- Continuously optimize system performance and resource utilization across multiple cloud platforms.
- Fine-tune and optimize application performance by analyzing code, traces, and database queries.

3 Incident Management and Troubleshooting

- Lead incident response efforts, effectively troubleshooting complex issues to minimize downtime and impact.
- Reduce Mean Time to Recover (MTTR) through proactive monitoring, automated alerting, and efficient problem-solving techniques.
- Conduct thorough Root Cause Analysis (RCA) for all major incidents and implement preventive measures.

4 Observability and Monitoring

- Design and implement end-to-end observability solutions across our distributed systems.
- Develop and maintain comprehensive monitoring strategies using tools like ELK Stack, Prometheus, and Grafana.
- Create and optimize product status dashboards to provide real-time visibility into system health and performance.

5 Automation and Infrastructure as Code (IaC)

- Implement Infrastructure as Code practices using tools like Terraform.
- Develop and maintain automated deployment pipelines and CI/CD workflows.
- Create self-healing systems and automate routine operational tasks to reduce manual intervention.

6 Cloud-Agnostic Architecture

- Design and implement cloud-agnostic solutions that can operate efficiently across multiple cloud providers.
- Develop expertise in event-driven architectures and related technologies (e.g., Apache Kafka/Event Hubs, Redis, MongoDB Atlas, IoT Hub).
- Implement and manage containerized applications using Kubernetes across different cloud environments.

7 Continuous Improvement

- Regularly review and refine operational practices to enhance efficiency and reliability.
- Stay updated with the latest industry trends and technologies in SRE, cloud computing, and DevOps.
- Contribute to the development of internal tools and frameworks to support SRE practices.

Requirements

- Strong knowledge of cloud platforms (Azure) and their associated services.
- Expert in observability tools (ELK Stack, Dynatrace, Prometheus).
- Expertise in containerization technologies such as Docker and Kubernetes.
- Understanding of event-driven architecture and database technologies (MongoDB Atlas, Azure SQL, PostgreSQL).
- Proficient in IaC and automation tools such as Terraform and GitHub Actions.
- Proficiency in one or more programming languages: Python, .NET, or Java.
- Strong understanding of networking concepts, load balancing, and security practices.

Senior Site Reliability Engineer

4 weeks ago

India meanSquare.ai Full time

Immediate Hiring: Senior Site Reliability Engineer (Sr. SRE) – Contract. Location: Hybrid – Hyderabad, India. Job Type: Contract (6–12 months, extendable). We are seeking a Senior Site Reliability Engineer (Sr. SRE) with expertise in Azure, Dynatrace, and Splunk to join our team on an immediate basis. This hybrid contract role requires a proactive...
Senior Site Reliability Engineer

4 hours ago

India Cloudologic Full time

Company Description : Cloudologic is a prominent cloud consulting and IT service provider based in Singapore and rooted in India, focusing on cloud operations, cyber security, and managed services. With a decade of expertise, our dedication to delivering high-quality services has earned the trust of clients worldwide, making us a valued partner in the tech...
Senior Site Reliability Engineer

4 weeks ago

India SolarWinds Full time

At SolarWinds, we put people first. Our mission is to enrich the lives of our employees, customers, partners, and communities by delivering simple, powerful, and secure solutions that accelerate business transformation. We thrive on innovation, collaboration, and accountability. If you're a problem solver who enjoys working in a fast-paced, high-impact...
Site Reliability Engineer L2

4 weeks ago

India NationsBenefits Full time

Position Overview: The Site Reliability Engineering (SRE) team plays a critical role in maintaining the health, performance, and availability of our platforms. As an L2 SRE, you will monitor and respond to site performance metrics, manage incidents, and work closely with Development, , and Engineering teams to ensure the continuous reliability of our services....
Senior Site Reliability Engineer

2 weeks ago

India Microsoft Full time

Job Description: M365's COSMIC team designs, builds, and operates a global-scale managed-runtime environment based on Azure Kubernetes Service for the benefit of Microsoft Substrate service and developers. COSMIC could be compared to a Kubernetes PaaS. Our charter builds and maintains solutions that enable substrate service teams onboarding to Cosmic Linux...
Site Reliability Engineer

3 weeks ago

India iVedha Inc. Full time

Site Reliability Engineer (SRE). Remote in India; must work in the EST (US/Canada) time zone with a 24x7 support model. Position Overview: We are seeking a highly skilled and motivated Site Reliability Engineer (SRE) with strong expertise in Python, advanced proficiency in Azure-based infrastructure, and significant experience in Customer Reliability...
Site Reliability Engineer

4 hours ago

India HARP Technologies and Services Full time

Experience: 8 Years. Location: Mumbai, Chennai (other cities remote). Notice period: Immediate to 30 days max. Responsibilities of Senior SRE: - The Site Reliability Engineering (SRE) team is responsible for the reliability, scalability, stability and performance of systems and services. - They work with cross-functional teams to design, build and maintain...
Site Reliability Engineer

4 weeks ago

India 10decoders Full time

JD: Site Reliability Engineer - GCP with Terraform. The Role: We are looking for a Senior SRE with 5+ years of experience to work primarily with our Application development team. An ideal candidate would have extensive experience building cloud infrastructure on Google Cloud with Terraform and have strong experience running workloads that scale on Google’s Kubernetes...
Site Reliability Engineer

22 hours ago

India Burgeon It Services Pvt Ltd Full time

Position : Site Reliability Engineer Location : PAN INDIA Location Duration : C2H Exp : 5 - 8 Years JOB DESCRIPTION : - Proven experience as a Site Reliability Engineer, DevOps Engineer, or similar role. - Experience with cloud platforms (AWS, Google Cloud, Azure) and containerization technologies (Docker, Kubernetes). - Maintain the stability of the...
Senior Site Reliability Engineer

4 weeks ago

India Infosys Full time

Position Overview We are seeking a skilled and motivated Site Reliability Engineer with hands-on expertise in application operations, DevOps tools, and SRE principles. The ideal candidate will have experience in supporting production systems, DEVOPS hands-on, a solid understanding of observability, and a foundational grasp of SRE principles. The role also...
Senior Site Reliability Engineer

4 weeks ago

India Experience.com Full time

Come Join Us. Experience.com - We make every experience matter more. Position Title: Senior DevOps Engineer / SRE (Site Reliability Engineer). Job Location: Chennai. Base Location: Remote. Shift Time: General and US shift. Employment Type: Full Time. Summary of Position: Are you a talented Senior DevOps Engineer looking for an exciting opportunity to work for a...
Site Reliability Engineer

4 weeks ago

India CorroHealth Full time

Hiring Alert! We are looking for a highly skilled Site Reliability Engineer (SRE) for our Product Development team based in Noida. Immediate joiners preferred. Only candidates who are available for an F2F interview round should apply. Job Description: We are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. The ideal...
Site Reliability Engineer

4 weeks ago

India, India BigRio Full time

Job Title: Site Reliability Engineer Location: Remote with Quarterly visits to Chennai, Tamil Nadu, India Duration: Full-Time About BigRio: BigRio is a remote-based, technology consulting firm headquartered in Boston, MA. We deliver software solutions ranging from custom development and software implementation to data analytics and machine learning/AI...
Site Reliability Engineer

4 weeks ago

India Burgeon It Services Pvt Ltd Full time

Position : Site Reliability Engineer Location : PAN INDIA Location Duration : C2H Exp : 5 - 8 Years JOB DESCRIPTION : - Proven experience as a Site Reliability Engineer, DevOps Engineer, or similar role. - Experience with cloud platforms (AWS, Google Cloud, Azure) and containerization technologies (Docker, Kubernetes). - Maintain the stability of the software...
Site Reliability Engineer

4 weeks ago

India Coforge Full time

Job Title: Site Reliability Engineer Skills : SRE, CI/CD, AWS, Python, Terraform & Kubernetes Location: Hyderabad (Work from Office) Experience: 6-14 Years Note: Immediate joiners are preferable Job Description: We at Coforge are hiring a Site Reliability Engineer with the following skillset: Design, implement, and manage scalable and secure cloud-based...
Senior Site Reliability Engineer

4 weeks ago

India Info Way Solutions Full time

Position: SRE (Site Reliability Engineer). Experience: Minimum 5+ years (5 - 7 years). Location: HYD, TVM, BLR, Pune, Chennai and Kolkata. Work Mode: Hybrid (3 days a week). Must-Have Skill Set (real-time, hands-on): CI/CD Tools – Jenkins/Harness; Cloud Infra/deployments – AWS/GCP, Docker & Kubernetes, Ansible; Infra / IaC – Terraform; Application...
Site Reliability Engineer

3 weeks ago

India Forbes Advisor Full time

Job Title: SRE (Certification Mandate). Certifications allowed: AWS DevOps Professional, AWS SysOps Admin, AWS Security Specialist, AWS Solutions Architect Professional. Experience: 8+ Years. Location: Mumbai, Chennai (strong candidates from other locations may be offered remote; candidates from Mumbai or Chennai are hybrid only, no remote). Notice period: Immediate to 30 days max...
