
Lead Site Reliability Engineer
3 days ago
COMPANY- LANDMARK GROUP
Job Title: SRE Lead (Engineering & Reliability)
Experience: 8-12 years
Job Summary:
We are seeking an experienced and dynamic Site Reliability Engineering (SRE) Lead to
oversee the reliability, scalability, and performance of our critical systems. As an SRE Lead,
you will play a pivotal role in establishing and implementing SRE practices, leading a team
of engineers, and driving automation, monitoring, and incident response strategies. This
position combines software engineering and systems engineering expertise to build and
maintain high-performing, reliable systems.
Key Responsibilities:
Reliability & Performance:
• Lead efforts to maintain high availability and reliability of critical services.
• Define and monitor SLIs, SLOs, and SLAs to ensure business requirements are met.
• Proactively identify and resolve performance bottlenecks and system inefficiencies.
Incident Management & Response:
• Establish and improve incident management processes and on-call rotations.
• Lead incident response and root cause analysis for high-priority outages.
• Drive post-incident reviews and ensure actionable insights are implemented.
Automation & Tooling:
• Develop and implement automated solutions to reduce manual operational tasks.
• Enhance system observability through metrics, logging, and distributed tracing tools
(e.g., Prometheus, Grafana, Elastic APM).
• Optimize CI/CD pipelines for seamless deployments.
Collaboration:
• Partner with software engineering teams to improve the reliability of applications and
infrastructure.
• Work closely with product/ engineering teams to design scalable and robust systems.
• Ensure seamless integration of monitoring and alerting systems across teams.
Leadership & Team Building:
• Manage, mentor, and grow a team of SREs.
• Promote SRE best practices and foster a culture of reliability and performance across
the organization.
• Drive performance reviews, skills development, and career progression for team
members.
Capacity Planning & Cost Optimization:
• Perform capacity planning and implement autoscaling solutions to handle traffic
spikes.
• Optimize infrastructure and cloud costs while maintaining reliability and
performance.
Skills & Qualifications:
• Technical Expertise:
o Experience with cloud platforms (AWS / Azure / GCP) and Kubernetes.
o Hands-on knowledge of infrastructure-as-code tools like Terraform /Helm/Ansible.
o Proficiency in Java
o Expertise in distributed systems, databases, and load balancing.
• Monitoring & Observability:
o Proficient with tools like Prometheus, Grafana,, Elastic APM, or New relic.
o Understanding of metrics-driven approaches for system monitoring and alerting.
• Automation & CI/CD:
o Hands-on experience with CI/CD pipelines (e.g., Jenkins, Azure Pipelines etc).
o Skilled in automation frameworks and tools for infrastructure and application deployments.
• Incident Management:
o Proven track record in handling incidents, post-mortems, and implementing
solutions to prevent recurrence.
Leadership & Communication Skills:
• Strong people management and leadership skills with the ability to inspire and motivate teams.
• Excellent problem-solving and decision-making skills.
• Clear and concise communication, with the ability to translate technical concepts for non-technical stakeholders.
Preferred Qualifications:
• Experience with database optimization, Kafka, or other messaging systems.
• Knowledge of autoscaling techniques
• Previous experience in an SRE, DevOps, or infrastructure engineering leadership role.
• Understanding of compliance and security best practices in distributed systems.
Why Join Us?
• Be a key driver in building and scaling reliable systems in a fast-paced environment.
• Work with cutting-edge technologies and influence the evolution of the infrastructure.
• Lead a high-impact team and foster a culture of reliability and innovation.
-
Site Reliability Engineer
6 days ago
Kollam, Kerala, India Xebia Full timeWe are looking for a highly skilled AWS Engineer with strong Python development and Chaos Engineering expertise to design, build, and validate resilient, scalable, and automated cloud-native environments. The ideal candidate will combine cloud engineering, DevOps, and chaos experimentation to improve reliability, fault tolerance, and operational efficiency...
-
Senior site reliability engineer
3 days ago
Kollam, Kerala, India Cimpress Full timeSenior Site Reliability EngineerWho We Are:Cimpress Technology develops cutting-edge, best-in-world software that our mass customization businesses use to create personalized products for over 17 million global customers. Our Mass Customization Platform consists of modular, multi-tenant services. Our businesses can choose the solutions that work for them, or...
-
Site Civil Engineer
7 days ago
Kollam, Kerala, India SAN BUILDERS Full timeJob description We are looking for a dedicated and detail-oriented Civil Site Engineer to oversee the day-to-day execution of residential construction projects Locations- Alappuzha Bangalore Thiruvanthapuram Ernakulam Languages known- Malayalam English Hindi Accommodation provided Key Responsibilities Supervise and monitor daily...
-
Reliable System Architect
3 days ago
Kollam, Kerala, India beBeesystem Full time ₹ 14,24,100 - ₹ 25,17,700About UsWe are seeking a Site Reliability Engineer to ensure the reliability, scalability, and performance of our critical systems.
-
Site Civil Engineer
2 weeks ago
Kollam, Kerala, India FINELINE INFRA Full timeCivil Site Engineer Job Role DescriptionThis is a full-time on-site role for a Site Civil Engineer located in Bengaluru. The Site Civil Engineer will be responsible for designing, planning, and managing civil engineering projects. Day-to-day tasks will include overseeing stormwater management, ensuring compliance with safety regulations, supervising...
-
Site Engineer
18 hours ago
Kollam, Kerala, India HANUKKAH HOME CONSTRUCTIONS PVT LTD Full time ₹ 1,50,000 - ₹ 2,00,000 per yearFull job descriptionJob Summary:The Site Supervisor is responsible for managing and coordinating construction activities on-site to ensure that projects are completed on time, within budget, and to the required quality standards. The role requires strong leadership, excellent organizational skills, and a keen understanding of construction processes, safety...
-
High Availability Systems Engineer
3 days ago
Kollam, Kerala, India beBeeReliability Full time ₹ 18,00,000 - ₹ 24,00,000Job Title: SRE Lead (Engineering & Reliability)We are seeking an experienced and dynamic Site Reliability Engineering (SRE) professional to oversee the reliability, scalability, and performance of critical systems.This position combines software engineering and systems engineering expertise to build and maintain high-performing, reliable systems.
-
Reliability Engineering Specialist
3 days ago
Kollam, Kerala, India beBeeSre Full time ₹ 1,50,00,000 - ₹ 2,00,00,000SRE Lead PositionWe are seeking a seasoned professional to assume the role of SRE Lead. As a critical member of our team, you will be responsible for overseeing the reliability and scalability of our systems.This is an exceptional opportunity to leverage your expertise in cloud platforms, infrastructure-as-code tools, and distributed systems to drive...
-
Kollam, Kerala, India beBeeSiteEngineering Full time ₹ 9,00,000 - ₹ 12,00,000Job OverviewWe are seeking a highly skilled Site Engineer to oversee daily on-site activities, ensure quality control, and coordinate with civil engineers for road work related tasks in Bengaluru North, Mysuru, Chintamani, and Chikballapur.Main ResponsibilitiesOversee daily on-site activities to ensure project deliverables are met.Ensure quality control by...
-
Site Project Coordinator
1 day ago
Kollam, Kerala, India beBeeCivil Full time ₹ 9,00,000 - ₹ 12,00,000Job OverviewThis is a full-time on-site role for a Site Civil Engineer located in Chennai. The position involves planning and executing civil engineering projects from site marking to completion.",