
Site Reliability Architect
2 days ago
As a System Monitoring Engineer, you will play a crucial role in ensuring the smooth operation of our globally deployed web application.
You will be responsible for monitoring Grafana dashboards and observability tools to detect failures and performance issues. Your primary focus will be on incident response, initiating reports from automated alerts or joining active incident channels.
As the main point of contact during incidents, you will deliver frequent updates to customers and incident commanders. You will also interpret operational metrics such as Quantiles, P99, and Prometheus data to assess system health.
Furthermore, you will track and manage permutations of a globally deployed microservices architecture running on Kubernetes. Your collaboration with engineering and support teams is essential to resolve issues quickly and efficiently.
Maintaining strong communication and customer service throughout incident lifecycles is paramount. Additionally, you will utilize foundational knowledge of AWS or other cloud platforms to support infrastructure monitoring.
Ultimately, your ability to ramp up quickly on existing systems and processes is key to success in this role.
Key Responsibilities:- Monitor Grafana dashboards and observability tools to detect failures and performance issues.
- Act as the primary SRE for incident response, initiating reports from automated alerts or joining active incident channels.
- Deliver frequent updates to customers and incident commanders during incidents.
- Interpret operational metrics such as Quantiles, P99, and Prometheus data to assess system health.
- Track and manage permutations of a globally deployed microservices architecture running on Kubernetes.
- Collaborate with engineering and support teams to resolve issues quickly and efficiently.
- Maintain strong communication and customer service throughout incident lifecycles.
- Utilize foundational knowledge of AWS or other cloud platforms to support infrastructure monitoring.
- 3+ years of experience monitoring and responding to incidents in a globally deployed web application.
- Strong experience with microservices architecture on Kubernetes.
- Deep understanding of observability tools and operational metrics (Grafana, Prometheus, P99, etc.).
- Familiarity with AWS services or any major cloud provider.
- Excellent communication and customer service skills – must be able to clearly articulate status and updates to technical and non-technical stakeholders.
- Ability to ramp up quickly, take ownership, and work independently in a fast-paced environment.
-
Site Architect
19 hours ago
Thrissur, Kerala, India STHAPATI Full timeCompany DescriptionSTHAPATI is a multidisciplinary architectural practice headquartered in New Delhi, globally recognized for a diverse portfolio that includes individual dwellings and large-scale urban developments. Founded in 1986 by Vipul, Harsh, and Anuj Varshney, the firm leads in airport terminals and public transportation infrastructure. STHAPATI is...
-
Site Reliability Engineering Expert
22 hours ago
Thrissur, Kerala, India beBeeReliability Full time ₹ 2,00,00,000 - ₹ 2,50,00,000SRE Lead Job DescriptionWe are seeking a seasoned and dynamic Site Reliability Engineering (SRE) expert to oversee the reliability, scalability, and performance of our critical systems. Their primary objective is to establish and implement SRE best practices, lead a team of engineers, and drive automation, monitoring, and incident response strategies.The...
-
Senior Site Reliability Engineer- ELK Expert
6 days ago
Thrissur, Kerala, India iVedha Inc. Full timeSenior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering PracticeLocation: India (Remote) - Must be available to work in the EST (US/Canada) Time Zone.Role Summary:Are you a Senior Site Reliability Engineer (SRE) with deep ELK expertise, ready to take ownership of large-scale observability infrastructure?We're looking for an SRE with 7+...
-
Site Engineer
2 weeks ago
Thrissur, Kerala, India CONNECTING 2 WORK Full timeJob Description Supervision required for various sites.Need to report the daily updates. Training will be provided on:1) Estimation and quantitysurveying.2)Project planning and scheduling3)QA/QC Reports4)Daily, Weekly/monthly, and yearly project report making.5)Bar Bending Schedule preparation6)Handling site workers in terms of timely completion and...
-
Site Reliability Engineer
3 weeks ago
Thrissur, Kerala, India CES Full timeWe are seeking a hands-on SRE with expertise in infrastructure automation, cloud scalability, and performance optimization. You'll design, manage, and monitor large-scale AWS environments, ensuring high availability, security, and reliability for our SaaS platformsKey ResponsibilitiesDevelop and execute UI automation using Cypress with TypeScript.Conduct...
-
Construction Site Manager
2 days ago
Thrissur, Kerala, India beBeeConstruction Full time ₹ 40,00,000 - ₹ 80,00,000Job Summary:We are seeking an experienced Site Manager to oversee and manage our construction sites. As a key member of our team, you will be responsible for ensuring projects are completed on time, within budget, and to the highest quality standards.About the Role:This is a fantastic opportunity for a motivated and organized individual who is looking to...
-
Lead System Reliability Engineer
14 hours ago
Thrissur, Kerala, India beBeeSystemReliability Full time ₹ 1,50,00,000 - ₹ 2,00,00,000**Job Overview:**We are seeking a dynamic individual to lead our Site Reliability Engineering (SRE) team. As an SRE Lead, you will be responsible for overseeing the reliability, scalability, and performance of our critical systems.This role combines software engineering and systems engineering expertise to build and maintain high-performing, reliable...
-
Site Reliability Engineer
1 day ago
Thrissur, Kerala, India beBeeReliability Full time ₹ 1,50,00,000 - ₹ 2,50,00,000Job Description:We are seeking a skilled Reliability Specialist to ensure the reliability and efficiency of our business and web applications.Key Responsibilities:Provide production, operations support, and application administration to business and web applications.Work on all types of production support activities, including help requests, installations,...
-
Sr Architect
2 weeks ago
Thrissur, Kerala, India CONNECTING 2 WORK Full timeJob Description Architect job descriptionAre you an ambitious Architect looking for a new challenge and an opportunity to advance your skills and career?We are looking for a hard working Architect to join our team As an Architect at our company, you will design new buildings and take part in restoring and conserving old buildings and developing new ways...
-
Reliable Systems Specialist
2 days ago
Thrissur, Kerala, India beBeeSite Full time ₹ 18,00,000 - ₹ 26,40,000Job SummaryWe are seeking a skilled Site Reliability Engineer to support our LLM Proxy team.Key ResponsibilitiesMonitor and interpret Grafana dashboards to signal failures and problems, managing incident communication.Act as the primary point of contact, exhibiting excellent communication skills to end customers and incident commanders.Provide frequent...