
Site Reliability Leadership Opportunities
2 days ago
We are seeking an experienced and dynamic professional to oversee the reliability, scalability, and performance of our critical systems. This role combines software engineering and systems engineering expertise to build and maintain high-performing, reliable systems.
Key Responsibilities:
- Reliability & Performance:
  - Lead efforts to maintain high availability and reliability of critical services.
  - Define and monitor Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Service Level Agreements (SLAs) to ensure business requirements are met.
  - Proactively identify and resolve performance bottlenecks and system inefficiencies.
- Incident Management & Response:
  - Establish and improve incident management processes and on-call rotations.
  - Lead incident response and root cause analysis for high-priority outages.
  - Drive post-incident reviews and ensure actionable insights are implemented.
- Automation & Tooling:
  - Develop and implement automated solutions to reduce manual operational tasks.
  - Enhance system observability through metrics, logging, and distributed tracing tools.
  - Optimize Continuous Integration/Continuous Deployment (CI/CD) pipelines for seamless deployments.
- Collaboration:
  - Partner with software engineering teams to improve the reliability of applications and infrastructure.
  - Work closely with product/engineering teams to design scalable and robust systems.
  - Ensure seamless integration of monitoring and alerting systems across teams.
- Leadership & Team Building:
  - Manage, mentor, and grow a team of Site Reliability Engineers.
  - Promote SRE best practices and foster a culture of reliability and performance across the organization.
  - Drive performance reviews, skills development, and career progression for team members.
- Capacity Planning & Cost Optimization:
  - Perform capacity planning and implement autoscaling solutions to handle traffic spikes.
  - Optimize infrastructure and cloud costs while maintaining reliability and performance.
- Technical Expertise:
  - Experience with cloud platforms and Kubernetes.
  - Hands-on knowledge of infrastructure-as-code tools like Terraform/Helm/Ansible.
  - Proficiency in Java.
  - Expertise in distributed systems, databases, and load balancing.
- Monitoring & Observability:
  - Proficient with tools like Prometheus/Grafana/Elastic APM or New Relic.
  - Understanding of metrics-driven approaches for system monitoring and alerting.
- Automation & CI/CD:
  - Hands-on experience with CI/CD pipelines (e.g., Jenkins/Azure Pipelines).
  - Skilled in automation frameworks and tools for infrastructure and application deployments.
- Incident Management:
  - Proven track record in handling incidents, post-mortems, and implementing solutions to prevent recurrence.
- Leadership & Communication Skills:
  - Strong people management and leadership skills with the ability to inspire and motivate teams.
  - Excellent problem-solving and decision-making skills.
  - Clear and concise communication, with the ability to translate technical concepts for non-technical stakeholders.
-
Reliability Expert
3 days ago
Erode, Tamil Nadu, India beBeeTechnicalLeader Full time ₹ 9,75,000 - ₹ 10,25,000
As a senior site reliability engineer, you will be responsible for ensuring the uptime and performance of our systems. We are looking for an individual with 10+ years of experience in SRE or DevOps roles, with deep expertise in Kubernetes, Networking, and Relational Databases. Strong scripting skills, such as Python and Bash, are required for tooling and...
-
Senior Site Reliability Engineer
4 days ago
Erode, Tamil Nadu, India Cimpress Full time
Senior Site Reliability Engineer
Who We Are: Cimpress Technology develops cutting-edge, best-in-world software that our mass customization businesses use to create personalized products for over 17 million global customers. Our Mass Customization Platform consists of modular, multi-tenant services. Our businesses can choose the solutions that work for...
-
Leadership Opportunities Executive
2 days ago
Erode, Tamil Nadu, India beBeeProgram Full time ₹ 8,00,000 - ₹ 15,00,000
Job Opportunity
We are seeking a skilled Program Manager to lead cross-functional initiatives across IT and non-IT domains, including supply chain, infrastructure, and marketing. This role requires strong leadership, stakeholder management, and expertise in program management methodologies to deliver high-impact results.
Key Responsibilities: Plan, execute, and...
-
Reliable Systems Expert
4 days ago
Erode, Tamil Nadu, India beBeeSRE Full time ₹ 15,00,000 - ₹ 25,00,000
Job Title: Senior Site Reliability Engineer
The primary objective of this role is to guarantee the reliability, scalability, and performance of critical systems. The ideal candidate will possess a deep understanding of system architecture, programming languages, and cloud platforms.
Key Responsibilities: Implement and maintain monitoring tools and dashboards...
-
Reliable Systems Engineer
3 days ago
Erode, Tamil Nadu, India beBeeSRE Full time ₹ 19,80,000 - ₹ 26,64,000
Job Summary
We are seeking a skilled Site Reliability Engineer to join our team. As an SRE, you will be responsible for monitoring and interpreting Grafana dashboards to identify potential failures and manage incident communication. This role involves proactively detecting and resolving issues before they impact our users. You will work closely with...
-
Leadership Opportunities in IT Services
8 hours ago
Erode, Tamil Nadu, India beBeeManagement Full time ₹ 2,00,00,000 - ₹ 2,50,00,000
Job Overview
We are seeking a seasoned IT Manager to lead our team in the development and optimization of information technology and systems functions supporting business processes and technical information systems platforms. This is an exciting opportunity for a motivated professional to join our team and drive IT service delivery for employees, while also...
-
Construction Site Manager
2 days ago
Erode, Tamil Nadu, India beBeeSiteManager Full time ₹ 12,00,000 - ₹ 20,00,000
Job Title: Site Manager
We are seeking a skilled and experienced Site Manager to oversee the daily operations at our construction sites. The ideal candidate will coordinate with project teams, manage resources, and ensure compliance with project plans and schedules.
Key Responsibilities: Supervise on-site activities to ensure compliance with design...
-
Site Execution Specialist
5 hours ago
Erode, Tamil Nadu, India beBeeSolar Full time ₹ 21,60,000 - ₹ 36,00,000
Job Title: Site Execution Specialist
Job Overview
The Site Execution Specialist is responsible for overseeing the timely and efficient execution of solar installations at project sites, adhering to design specifications, safety standards, and project timelines.
Key Responsibilities: Monitor daily installation activities of solar PV systems. Ensure adherence to...
-
Site Reliability Engineer
3 days ago
Erode, Tamil Nadu, India Birlasoft Full time
SRE Administrator
Experience: 7 to 10 years
Responsibilities: Be primarily responsible for providing production and operations support and application administration for business and web applications, third-party applications, and related ecosystems. The application environment, though mixed, is primarily based on Microsoft technologies. Among the environments which...
-
System Reliability Specialist
2 days ago
Erode, Tamil Nadu, India beBeeSystemReliability Full time ₹ 30,00,000 - ₹ 40,00,000
Key Role: System Reliability Specialist
Achieve high system reliability and scalability by applying software engineering and systems expertise. This position combines technical knowledge to build and maintain high-performing, reliable systems.
Primary Responsibilities: Ensure high availability and reliability of critical services. Develop and monitor service...