Site Reliability Engineer
3 months ago
Job Overview
As a Site Reliability (SRE)/DevOps Automation Engineer, you will be responsible for the availability, automation, performance, efficiency, scaling, monitoring, and emergency response for incidents and issues in applications. You will use your deep understanding of platforms, architecture, people, systems, and processes to establish and continuously improve SLIs and SLOs for uptime, performance, deployment, monitoring, and troubleshooting. You are interested in setting direction and leading the day-to-day processes that shape our vision for reliability.
Responsibilities and Duties
Design and implement automation projects according to requirements, and own end-to-end delivery through to the production environment.
Write code hands-on to deliver assigned projects.
Work collaboratively with OEMs, vendors, and partners to deploy IT infrastructure automation and self-service tools for capacity forecasting, failure prediction, zero-touch operation, and auto-healing.
Build standard documentation for automation.
Participate in root cause analysis (RCA) and identify gaps in monitoring automation for operations.
Maintain and support the Product and Data systems: proactively monitor events, investigate issues, analyze solutions, and drive problems through to resolution.
Experience with configuration management tools such as Chef, Puppet, Salt, or equivalent.
Experience administering AWS, Google Cloud, or Azure.
Define requirements and develop tools and reporting as needed by projects and operations.
Participate in a 24x7 on-call rotation for after-hours emergencies.
Use operational tools and monitoring platforms to gain in-depth knowledge, understanding, and ongoing monitoring of system availability, performance, and capacity.
Implement an alerting strategy that makes alerts actionable and unique.
Follow through to ensure issues are resolved to satisfaction.
Drive continuous improvement and innovation within the team.
A sense of ownership, initiative and drive.
Qualifications
Bachelor's degree in Computer Science, or a related technical field involving software or systems engineering, or equivalent practical experience
5+ years of hands-on experience with Linux/UNIX/Windows OS.
Strong Shell/Python/PowerShell skills.
Experience with infrastructure orchestration and automation tools, e.g. Ansible, Terraform.
Good understanding of Git, DevOps methodology, and CI/CD for automation projects.
Hands-on experience managing web servers, application servers, and databases (SQL/NoSQL).
Experience with Docker/Kubernetes.
Knowledge of monitoring tools and strategy.
Experience with incident management and running incident post-mortems.
Solid understanding of automated deployment processes.
Expertise in designing, analyzing, and troubleshooting large-scale distributed systems.
Systematic problem-solving approach, coupled with effective communication skills and a sense of drive.
Experience designing and developing software oriented towards systems or network automation.
-
Site reliability engineer
6 days ago
Mumbai, India SID Global Solutions Full time
Job Description: Site Reliability Engineer (SRE) – Apigee Level 1
Experience: 2 to 6 years
The Site Reliability Engineer (SRE) Level 1 will be responsible for maintaining and improving the reliability, availability, and performance of the systems. This entry-level role is ideal for someone who is passionate about learning and developing their skills in...
-
Site Reliability Engineer
6 months ago
Mumbai, India dentsu Full time
The purpose of this role is to ensure the availability and stability of production and test platforms.
Job Title: Site Reliability Engineer
Job Description:
Key responsibilities:
Troubleshoots and owns issues in our development, test and production environments, including performance optimisation and continuous tuning
Works alongside the DevOps team in...
-
Site Reliability Engineer
3 weeks ago
Mumbai, India CSC Full time
Role: Site Reliability Engineer
Location: Mumbai/Bangalore
Working Model: Hybrid
Shift: 12-9PM
Intro:
Do you want to be noticed for your work? Make a difference every day? Be impactful? Work with cutting-edge technology? If so, you will fit in perfectly at CSC and especially within the Regulatory Technology Team. The world’s leading provider of business,...
-
Site reliability engineer
1 week ago
Navi Mumbai, India SID Global Solutions Full time
Job Description: Site Reliability Engineer (SRE) – Level 1 & 2
Working days: Work from Office (5 days compulsory)
Shift Timings: Rotational Shifts
Looking only for #Male candidates and Immediate Joiners.
Key Responsibilities:
• Monitor system performance and availability across GCP and Anthos environments.
• Respond to incidents,...
-
Site Reliability Engineer
1 month ago
Mumbai, Maharashtra, India Antal International Network Full time
We are seeking a skilled Site Reliability Engineer to join our team at Antal International Network. The successful candidate will be responsible for ensuring the availability, scalability, and efficiency of our software solutions.
Key Responsibilities:
Run the production environment by monitoring availability and taking a holistic view of...
-
Site Reliability Engineering Expert
1 month ago
Mumbai, Maharashtra, India Antal Full time
Job Overview
As a Site Reliability Engineer at Antal, you will be responsible for ensuring the availability, scalability, and performance of our software systems.
Key Responsibilities
* Monitor and maintain production environment, identifying and resolving issues to ensure high uptime
* Improve system reliability, quality, and time-to-market of software...
-
Site Reliability Engineering Manager
4 weeks ago
Mumbai, Maharashtra, India IDFC FIRST Bank Full time
Job Title: Senior Site Reliability Engineering Manager
Function/Department: Information Technology
Job Purpose:
IDFC FIRST Bank is seeking a seasoned Site Reliability Engineering Manager to lead our efforts in ensuring seamless customer experiences. As a key member of our IT team, you will be responsible for defining SRE principles, SLIs, and SLAs, and...
-
Site Reliability Engineer
1 week ago
Mumbai, India Azilen Technologies Full time
Objectives of this Role
Act as the primary point of contact for corporate clients, delivering timely, professional support and ensuring seamless on-site service as needed.
Deployment of large distributed application in Production/Staging environment.
Run the production environment by monitoring availability and taking a holistic view of application and system...