
Lead Site Reliability Engineer
3 days ago
COMPANY- LANDMARK GROUP
Job Title: SRE Lead (Engineering & Reliability)
Experience: 8-12 years
Job Summary:
We are seeking an experienced and dynamic Site Reliability Engineering (SRE) Lead to
oversee the reliability, scalability, and performance of our critical systems. As an SRE Lead,
you will play a pivotal role in establishing and implementing SRE practices, leading a team
of engineers, and driving automation, monitoring, and incident response strategies. This
position combines software engineering and systems engineering expertise to build and
maintain high-performing, reliable systems.
Key Responsibilities:
Reliability & Performance:
• Lead efforts to maintain high availability and reliability of critical services.
• Define and monitor SLIs, SLOs, and SLAs to ensure business requirements are met.
• Proactively identify and resolve performance bottlenecks and system inefficiencies.
Incident Management & Response:
• Establish and improve incident management processes and on-call rotations.
• Lead incident response and root cause analysis for high-priority outages.
• Drive post-incident reviews and ensure actionable insights are implemented.
Automation & Tooling:
• Develop and implement automated solutions to reduce manual operational tasks.
• Enhance system observability through metrics, logging, and distributed tracing tools
(e.g., Prometheus, Grafana, Elastic APM).
• Optimize CI/CD pipelines for seamless deployments.
Collaboration:
• Partner with software engineering teams to improve the reliability of applications and
infrastructure.
• Work closely with product/ engineering teams to design scalable and robust systems.
• Ensure seamless integration of monitoring and alerting systems across teams.
Leadership & Team Building:
• Manage, mentor, and grow a team of SREs.
• Promote SRE best practices and foster a culture of reliability and performance across
the organization.
• Drive performance reviews, skills development, and career progression for team
members.
Capacity Planning & Cost Optimization:
• Perform capacity planning and implement autoscaling solutions to handle traffic
spikes.
• Optimize infrastructure and cloud costs while maintaining reliability and
performance.
Skills & Qualifications:
• Technical Expertise:
o Experience with cloud platforms (AWS / Azure / GCP) and Kubernetes.
o Hands-on knowledge of infrastructure-as-code tools like Terraform /Helm/Ansible.
o Proficiency in Java
o Expertise in distributed systems, databases, and load balancing.
• Monitoring & Observability:
o Proficient with tools like Prometheus, Grafana,, Elastic APM, or New relic.
o Understanding of metrics-driven approaches for system monitoring and alerting.
• Automation & CI/CD:
o Hands-on experience with CI/CD pipelines (e.g., Jenkins, Azure Pipelines etc).
o Skilled in automation frameworks and tools for infrastructure and application deployments.
• Incident Management:
o Proven track record in handling incidents, post-mortems, and implementing
solutions to prevent recurrence.
Leadership & Communication Skills:
• Strong people management and leadership skills with the ability to inspire and motivate teams.
• Excellent problem-solving and decision-making skills.
• Clear and concise communication, with the ability to translate technical concepts for non-technical stakeholders.
Preferred Qualifications:
• Experience with database optimization, Kafka, or other messaging systems.
• Knowledge of autoscaling techniques
• Previous experience in an SRE, DevOps, or infrastructure engineering leadership role.
• Understanding of compliance and security best practices in distributed systems.
Why Join Us?
• Be a key driver in building and scaling reliable systems in a fast-paced environment.
• Work with cutting-edge technologies and influence the evolution of the infrastructure.
• Lead a high-impact team and foster a culture of reliability and innovation.
-
Site Reliability Engineer
3 days ago
Ajmer, Rajasthan, India beBeeCloud Full time ₹ 12,00,000 - ₹ 24,00,000As a technical expert in Site Reliability Engineering, you will play a pivotal role in ensuring the stability and scalability of software systems. Your responsibilities will include designing, developing, and maintaining complex software infrastructure to meet the needs of high-performance applications.Key Skills and Qualifications:Strong background in...
-
Site Reliability Engineer
2 weeks ago
Ajmer, Rajasthan, India CES Full timeWe are seeking a hands-on SRE with expertise in infrastructure automation, cloud scalability, and performance optimization. You'll design, manage, and monitor large-scale AWS environments, ensuring high availability, security, and reliability for our SaaS platformsKey ResponsibilitiesDevelop and execute UI automation using Cypress with TypeScript.Conduct...
-
Site Engineer
4 days ago
Ajmer, Rajasthan, India Nandi Associates Full timeJob Title: Site Engineer – Civil Construction Company: Nandi Associates Location: Kaiga Karwar ,Mysuru, Davangere, Kolar,(multiple places in Karnataka) Employment Type: Full-time Experience : 4+ years in the same industry. About the Role: Nandi Associates is actively seeking a skilled and motivated Site Engineer to oversee and manage day-to-day...
-
Reliability Expert
4 days ago
Ajmer, Rajasthan, India beBeeCloud Full time ₹ 12,09,600 - ₹ 16,29,600Job Title: Site Reliability Engineer The primary responsibility of a Site Reliability Engineer is to provide production, operations support and application administration to business and web applications, 3rd party applications and related ecosystems. The application environment is based on Microsoft technologies, including intranet and extranet...
-
Reliability Engineering Manager
3 days ago
Ajmer, Rajasthan, India beBeeReliabilityEngineeringManager Full time US$ 1,80,000 - US$ 2,00,000Reliability Engineering ManagerJob Summary:This is a leadership role in which you will oversee the reliability, scalability, and performance of our critical systems.Responsibilities:Reliability & Performance:Lead efforts to maintain high availability and reliability of critical services.Define and monitor SLIs, SLOs, and SLAs to ensure business requirements...
-
Highly Skilled System Reliability Specialist
3 days ago
Ajmer, Rajasthan, India beBeeSystemReliability Full time ₹ 1,50,00,000 - ₹ 2,00,00,000About the Role:Job OverviewWe are seeking a highly skilled System Reliability Specialist to ensure the reliability, scalability, and performance of our critical systems.Key Responsibilities:Infrastructure Design: Design, implement, and maintain scalable and reliable infrastructure for production systems.Automation: Automate repetitive operational tasks,...
-
Ajmer, Rajasthan, India beBeeObservability Full time ₹ 1,75,00,000 - ₹ 2,25,00,000Engineering Opportunity:As a senior site reliability engineer with deep expertise in ELK, you will take ownership of large-scale observability infrastructure design and management.You will lead the development and scaling of ELK clusters ingesting 2–3+ TB/day, enhance system reliability across distributed systems, and drive automation within cloud...
-
Reliability Specialist
4 days ago
Ajmer, Rajasthan, India beBeeConditionMonitoring Full time ₹ 18,00,000 - ₹ 25,00,000About People Prime WorldwideWe are seeking a skilled Condition Monitoring professional to join our team.The client is a leading Indian multinational IT services and consulting firm that provides digital transformation, cloud computing, data analytics, enterprise application integration, infrastructure management, and application development services.They...
-
Reliable Infrastructure Specialist
4 days ago
Ajmer, Rajasthan, India beBeeObservability Full time ₹ 15,84,000 - ₹ 26,64,000Job Title: Site Reliability EngineerJob Summary:We are seeking a skilled professional to monitor and interpret Grafana dashboards to identify failures and problems, manage incident communication, and provide regular updates on the status of incidents to all parties.Key Responsibilities:Monitor and interpret Grafana dashboards to identify failures and...
-
Principal Systems Reliability Specialist
3 days ago
Ajmer, Rajasthan, India beBeeReliability Full time ₹ 15,50,000 - ₹ 28,19,999Reliability Engineer RoleWe are seeking a talented Reliability Engineer to join our team. This individual will be responsible for maintaining plant stability across middle-office and operations applications.Lead incident triage, root cause analysis, and communication, focusing on problem management.Partner with regional teams to drive technical and...