Senior Site Reliability Engineer
2 days ago
Job Title: Senior Site Reliability Engineer
Position Type: Contractual (1 year)
We are seeking a talented and motivated Senior Site Reliability Engineer (SRE) to join our organization.
The SRE team at GreyOrange is responsible for monitoring the stability and availability of mission-critical production systems, managing incidents for quicker resolution, and establishing business-as-usual (BAU) operations. The team also manages and maintains internal tools and infrastructure used by other development teams.
The experienced SRE will play a crucial role in ensuring the reliability, scalability, capacity planning, and performance of our infrastructure and applications. The ideal candidate will have a strong background in software engineering, system administration, containerization, and cloud technologies.
Requirements
- Should have 5 to 8 years of experience
- Well-versed in scripting/programming languages (Python, Bash, PowerShell, etc.) to automate manual work, particularly within cloud environments
- Well-versed in observability tools (Grafana, Splunk, Dynatrace) for monitoring, alerting, and logging, used to identify and address potential issues, especially in cloud infrastructure
- Working experience with automation tools (Jenkins, GitLab, Ansible/Chef for configuration management) and processes to streamline deployment, monitoring, and management of systems and applications in the cloud
- Hands-on experience with containerization and orchestration technologies such as Docker, Kubernetes, or similar, particularly in cloud-native environments
- Solid understanding of SLI, SLO, SLA, and error budget concepts and their implementation; willingness to provide on-call support and participate in incident management and response activities as needed
- Expert at troubleshooting production issues and bugs
- Good knowledge of Unix systems, networking, web technologies, and databases
- Incident management experience with production workloads, coupled with effective communication skills
- Working knowledge of at least one cloud platform (AWS or GCP)
What you'll do:
- Lead reliability engineering projects and drive them to closure.
- Ensure system stability and high availability by proactively monitoring performance and troubleshooting issues
- Design, build and maintain efficient, reliable, and scalable cloud-based infrastructure and services
- Automate processes and find opportunities to improve the observability and availability of the Platform to reduce toil.
- Implement and manage observability tools for comprehensive monitoring, alerting, and logging
- Own end-to-end availability and performance of different services & tools.
- Practice sustainable incident response and blameless postmortems.
- Provide on-call support for incident management and participate actively in response activities
-
Site Reliability Engineer
2 weeks ago
Gurgaon, Haryana, India Aerial Telecom Solutions (ATS) Full time ₹ 1,04,000 - ₹ 1,30,878 per year Position Overview: The SRE Lead will be responsible for managing a team of engineers focused on software deployments and site reliability engineering practices. The role will involve overseeing the deployment process of software applications and services, implementing automation, monitoring, and alerting tools, and ensuring the reliability, availability, and...
-
Site Reliability Engineer
6 days ago
Gurgaon, Haryana, India RBS Full time ₹ 12,00,000 - ₹ 36,00,000 per year Join us as a Site Reliability Engineer. In this key role, you'll support the improvement of non-functional and operational characteristics such as availability, performance, efficiency, change management, monitoring, security, incident response, and capacity planning of our products and services. You'll enjoy significant stakeholder interaction, working in...
-
Site Reliability Engineer
3 weeks ago
Gurgaon, Haryana, India Impronics Technologies Full time Job Description Required Skills & Experience: - 8+ years of overall experience in infrastructure engineering or SRE roles, with at least 3 years in the payments/fintech domain. - Strong understanding of payment protocols (UPI, IMPS, RTGS, NEFT, SWIFT, etc.) and transaction processing systems. - Proven expertise in Linux systems administration, cloud platforms (AWS,...
-
Site Reliability Engineer, VP
6 days ago
Gurgaon, Haryana, India RBS Full time ₹ 15,00,000 - ₹ 20,00,000 per year Join us as a Site Reliability Engineer. In this key role, you'll improve, drive, and embed non-functional and operational characteristics such as availability, performance, efficiency, change management, monitoring, security, incident response, and capacity planning of our products and services. You'll enjoy significant stakeholder interaction, working in...
-
Manager, Site Reliability Engineering
2 weeks ago
Gurgaon, Haryana, India Cvent Full time US$ 1,50,000 - US$ 2,00,000 per year Cvent is looking for a Manager, Site Reliability Engineering to help us scale our systems and ensure the stability, reliability, performance, and rapid deployment of our platform. We build teams that are inclusive, collaborative, and have a strong sense of ownership for the things they build. If you have a passion and track record for solving problems;...
-
Site Reliability Engineer
2 weeks ago
Gurgaon, Haryana, India EDGE Executive Search Full time ₹ 1,04,000 - ₹ 1,30,878 per year The Job: The SRE is a global team that provides technical support across the suite of products. The team works closely with a highly competent Technical Operation Centre (TOC), Development and Infrastructure teams to deliver proactive tasks to improve the supportability of our platforms. Our work helps to ensure that the company provides a high-quality...
-
Site Reliability Engineer
2 weeks ago
Gurgaon, Haryana, India LEAPWORK Full time ₹ 1,04,000 - ₹ 1,30,878 per year At Leapwork, our vision is to break down the barriers between humans and computers through the world's most accessible automation platform. We are the leading global AI-powered visual test automation solution, enabling some of the world's largest enterprises to adopt, scale, and maintain automation – in under 30 days. In today's environment, where...
-
Site Reliability Engineer
6 days ago
Gurgaon, Haryana, India GSPANN Full time ₹ 15,00,000 - ₹ 25,00,000 per year Hiring for SRE. Experience: 6+ years. Notice Period: Immediate to 15 days. About the Role: We are seeking a skilled and passionate Observability Engineer (SRE) to join our team and drive reliability, performance, and visibility across our infrastructure and applications. You will play a key role in designing and implementing observability solutions, improving system...
-
Site Reliability Expert
2 weeks ago
Gurgaon, Haryana, India beBeeReliability Full time ₹ 1,04,000 - ₹ 1,30,878 As a Site Reliability Engineer, you will utilize your advanced expertise in both development and operations to identify and prioritize issues. You will work towards finding universal solutions to common problems while mentoring and supporting junior staff. Key responsibilities include: Enlighten, enable and empower a fast-growing set of multi-disciplinary...
-
Site Reliability Engineer II
2 weeks ago
Gurgaon, Haryana, India American Express Full time ₹ 1,50,000 - ₹ 28,00,000 per year At American Express, our culture is built on a 175-year history of innovation, shared values and Leadership Behaviors, and an unwavering commitment to back our customers, communities, and colleagues. As part of Team Amex, you'll experience this powerful backing with comprehensive support for your holistic well-being and many opportunities to learn new...