Site Reliability Engineer

2 days ago


Mumbai, Maharashtra, India Oracle Financial Services Software Ltd Full time ₹ 20,00,000 - ₹ 25,00,000 per year

Site Reliability Developer 3

Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Design and develop designs, architectures, standards, and methods for large-scale distributed systems. Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuni Within the Oracle Health (OHAI) organization, the new EHR and Clinical AI Agent cloud services are at the forefront of new generative AI services for healthcare organizations. Building on the success of the established Digital Assistant (ODA) product, EHR and AI Agent enable healthcare providers to leverage advanced AI technologies, together with voice commands, to reduce manual work and enable providers to focus on patient care.

Oracle Health EHR is expanding their OCI Operations team, and looking to bring in new Site Reliability Engineers. As an SRE engineer, you will be engaged in solving technical challenges on an advanced OCI cloud service platform, focusing on areas such as reliability, scalability, resilience, security, and performance.

You will define how to use latest technologies to optimize the operational efficiency of the service. You will gain a deep understanding of ChatBots, cognitive services, machine learning and analytics. You will work with a team pushing the boundaries of a scalable, self-healing, autonomous platform built on Kubernetes, Docker, Prometheus, and Grafana. You will be exposed to a wide range of OCI cloud services and understand how we interact with many dependent services across the organization.

Areas of responsibility
- Service Ownership
As part of the EHR/Clinial Agent team, you will be responsible for all operational aspects of the OCI services included in our portfolio.
Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of the Digital Assistant suite of products.
Own end-to-end availability, reliability, and performance of a Cloud Service
Participate in LiveSite operations, working rapidly to mitigate issues that may arise.

- Service Design
Designing and implement solutions for rolling out software and security updates with zero downtime
Partner with development and product management to build and maintain platform and automation frameworks to ensure maximum up-time and predictability, preventing outages and service interruptions or degradation
Analyze system failures and developing rapid response processes

- Operations engineering
Evaluate the operation of cloud service deployments across commercial and government datacenters
Monitor the degradation of the service and dependencies under load, and implement solutions to ensure high availability to our customers
Analyse resource utilization and scaling requirements in a high-end production system
Resolve security vulnerabilities to conform to corporate and government security standards.

- Automation
Building on your understanding of automation and orchestration principles, you will be identifying opportunities to automate SRE procedures in production environments
The solution implemented will be designed to minimize the possibility of errors being introduced into the system

- Technical expertise
Handle complex, critical issues encountered in production environments, drawing on your accumulated technical knowledge to rapidly identify the issues and apply steps to mitigate.
Develop an understanding of the underlying AI technologies used to implement the Clinical Digital Assistant service
As an SME, you will be called in to handle major incidents, and your understanding of the architecture and dependent services will position you to apply mitigations to resolve the issue quickly, then working with development to assist implementing preventative actions.

Career Level - IC3

Requirements
years of professional experience as a Site Reliability Engineer or equivalent experience.
BS or MS in Information Technology/Computer System Engineering, or equivalent
Excellent team skills, can-do attitude, focus on quality.
Strong trouble shooting capabilities targeting complicated problems in remote systems
Experience with production operations and best practices for deploying quality code in production.
Experience with public cloud (OCI, AWS, GCP, Azure).
Experience and working knowledge in Python, Perl and/or Shell Scripting.
Knowledge of Infrastructure as Code (IaaC) like Shepherd and Terraform.
Experience with public cloud managed Kubernetes.
Experience with cloud-native administration and monitoring/alerting technologies such as Docker, Helm, Prometheus, Grafana, EFK/ELK, Jaeger, or similar technologies.
Knowledge of version control using Git.
Experience in Linux/Unix environment ng

  • Mumbai, Maharashtra, India Oracle Financial Services Software Ltd Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Software Developer 3 We are seeking a mid-career Site Reliability/DevOps Engineer (IC3) to strengthen our infrastructure and operations teams. This role is critical in advancing our organizational goals of operational excellence, cloud migration, and cost optimization. As part of Oracle Health Applications & Infrastructure (OHAI), this engineer will...


  • Mumbai, Maharashtra, India Fynd Full time ₹ 8,00,000 - ₹ 24,00,000 per year

    Fynd is India's largest omnichannel platform and a multi-platform tech company specializing in retail technology and products in AI, ML, big data, image editing, and the learning space. It provides a unified platform for businesses to seamlessly manage online and offline sales, store operations, inventory, and customer engagement. Serving over 2,300 brands,...


  • Mumbai, Maharashtra, India Deqode Full time

    Profile : Site Reliability Engineer (SRE)Experience Required : 6+ YearsLocations : Mumbai, Gurgaon, ChennaiWork Arrangement : HybridKey Responsibilities :- Design and implement scalable, resilient cloud-native infrastructure across AWS/Azure/GCP platforms- Own the SRE function including availability, latency, performance monitoring, emergency response,...

  • Site Engineer

    1 week ago


    Navi Mumbai, Maharashtra, India M L Labade Engineer Contractor Full time ₹ 4,00,000 - ₹ 12,00,000 per year

    Role & responsibilitiesOrganizing materials and ensuring sites are safe and clean.Preparing cost estimates and ensuring appropriate materials and tools are available.Providing technical advice and suggestions for improvement on particular projects.Diagnosing and troubleshooting equipment as required.Negotiating with suppliers and vendors to ensure the best...


  • Mumbai, Maharashtra, India ALIQAN Technologies Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    Position: SITE Reliability EngineerBudget- 2. 4 LPM + GSTExp- 10 yrsDuration- 6 monthsLocation- Andheri MumbaiTechnical Skills:Programming: Proficiency in languages like Python.Operating Systems: Deep understanding of Linux/Windows operating systems and networking concepts.Cloud Technologies: Experience with Azure including services, architecture, and best...


  • Mumbai, Maharashtra, India RELX Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    Would you like to be part of a team that delivers high-quality software to our customers?Are you a visible champion with a 'can do' attitude and enthusiasm that inspires others?About The BusinessLexisNexis Risk Solutions is the essential partner in the assessment of risk. Within our Business Services vertical, we offer a multitude of solutions focused on...


  • Mumbai, Maharashtra, India Natobotics Full time

    Job DescriptionWere on an exciting journey with our client and we want you to join us. With our client, you will beexposed to the latest technologies and work with some of the brightest minds in the industry.Our client is leading Banking company so you will be playing a key role as a VP Site Reliability Engineering (SRE), who can assist with the below:Roles...


  • Mumbai, Maharashtra, India Search Synergy Pvt Ltd Full time ₹ 6,00,000 - ₹ 18,00,000 per year

    Note - Location - Dadar/Kurla (Mumbai)Skill, Knowledge &Trainings : - Own and manage the CI/CD pipelines for automated build, test, and deployment. - Design and implement robust deployment strategies for microservices and web applications. - Set up and maintain monitoring, alerting, and logging frameworks (e.g., Prometheus, Grafana, ELK) - Build...


  • Navi Mumbai, Maharashtra, India Uplers Full time ₹ 8,00,000 - ₹ 25,00,000 per year

    Experience: 4+ yearsSalary: ConfidentialShift: (GMT+05:30) Asia/Kolkata (IST)Opportunity Type: Office (Mumbai)Placement Type: Full time Permanent Position(*Note: This is a requirement for one of Uplers' client--Gofynd)What do you need for this opportunity?Must have skills required: and AWS/Google Cloud and MongoDB/CI/CD/GrafanaJob descriptionFynd is Indias...


  • Mumbai, Maharashtra, India Wipro Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    Role Purpose RequiredSkills:- 5+Years of experience in system administration, application development, infrastructure development or related areas 5+ years of experience with programming in languages like Javascript, Python, PHP, Go, Java or Ruby 3+ years of in reading, understanding and writing code in the same 3+years Mastery of infrastructure...