
Site Reliability Engineer 3
2 days ago
Within the Oracle Health Applications & Infrastructure (OHAI) organization, the Clinical Digital Assistant (CDA) cloud service is at the forefront of new generative AI services for healthcare organizations. Building on the success of the established Digital Assistant (ODA) product, CDA enables healthcare providers to leverage advanced AI technologies, together with voice commands, to reduce manual work and enable providers to focus on patient care.
CDA is expanding its OCI Operations team and is looking to bring in new Site Reliability Engineers. As an SRE, you will solve technical challenges on an advanced OCI cloud service platform, focusing on areas such as reliability, scalability, resilience, security, and performance.
You will define how to use the latest technologies to optimize the operational efficiency of the service. You will gain a deep understanding of chatbots, cognitive services, machine learning, and analytics. You will work with a team pushing the boundaries of a scalable, self-healing, autonomous platform built on Kubernetes, Docker, Prometheus, and Grafana. You will be exposed to a wide range of OCI cloud services and understand how CDA interacts with many dependent services across the organization.
Areas of responsibility
- Service Ownership
As part of the CDA team, you will be responsible for all operational aspects of the OCI services included in our portfolio.
Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of the Digital Assistant suite of products.
Own the end-to-end availability, reliability, and performance of a cloud service.
- Service Design
Design and implement solutions for rolling out software and security updates with zero downtime.
Partner with development and product management to build and maintain platform and automation frameworks that ensure maximum uptime and predictability, preventing outages, service interruptions, and degradation.
Analyze system failures and develop rapid response processes.
- Operations engineering
Evaluate the operation of cloud service deployments across commercial and government datacenters.
Monitor the degradation of the service and its dependencies under load, and implement solutions to ensure high availability for our customers.
Analyze resource utilization and scaling requirements in a high-end production system.
Resolve security vulnerabilities to conform to corporate and government security standards.
- Automation
Building on your understanding of automation and orchestration principles, you will identify opportunities to automate SRE procedures in production environments.
The solutions you implement will be designed to minimize the possibility of introducing errors into the system.
- Technical expertise
Develop an understanding of the underlying AI technologies used to implement the Clinical Digital Assistant service
**Responsibilities**:
The Clinical Digital Assistant team works within the Oracle Health Applications & Infrastructure (OHAI) organization. The underlying ODA product has been in production for 5+ years and has over 40,000 instances deployed across OCI datacenters worldwide. Clinical Digital Assistant is a new product within the OHAI org, and you would be joining at an exciting time as this product is delivered to end users. As a member of this team, you will be surrounded by forward-thinking and innovative minds thriving in a collaborative environment.
Site Reliability Engineer skill requirements
3+ years of professional experience as a Site Reliability Engineer or equivalent experience.
2+ years Linux Experience.
Bachelor’s degree/master’s degree (Information Technology/ Computer System Engineering).
2+ years’ experience and working knowledge in Python, Perl and/or Shell Scripting.
Managing production running on UNIX flavours (RHEL, OEL).
Cloud IaaS experience (infrastructure as code, configuration as code).
Knowledge of Infrastructure as Code (IaC) tools such as Shepherd and Terraform.
Knowledge of CI/CD platforms and components such as OKE, Jenkins, and Splat.
Knowledge of source control systems.