DET-TT-Resilience and Reliability Engineer
2 days ago
At EY, you'll have the chance to build a career as unique as you are, with the global scale, support, inclusive culture and technology to become the best version of you. And we're counting on your unique voice and perspective to help EY become even better, too. Join us and build an exceptional experience for yourself, and a better working world for all.
Senior Reliability Engineer (Senior level)
Description
- Reliability Engineering (SRE) is a modern way of delivering IT solutions that applies software engineering principles to service delivery in order to reduce IT risk to the business, improve business resilience, attain predictability and reliability, and optimize the cost of IT infrastructure and operations.
- A Reliability Engineer typically has deep software engineering experience spanning the design, build, deployment, and management or maintenance of IT solutions, with a focus on resilience, reliability, and performance.
- A Reliability Engineer bridges development and operations by applying a software engineering mindset to the development, deployment, and maintenance of applications, maximizing system reliability and automation while improving efficiency through better use of resources.
Responsibilities
- Defining SLA/SLO/SLI for a product / service
- Engineering resilient design and implementation practices into solutions as they move through the product life cycle
- Engineering out manual effort (Toil) through the development of automated processes and services (e.g., Automated Management of Systems, CI/CD improvements)
- Developing Observability Solutions to track, report, and measure SLA adherence
- Helping optimize the cost of IT infrastructure and operations (FinOps)
- Critical Situation management
- SOP / Runbook automation, Toil reduction
- Data Analytics & System trend analysis
Typical Skills and Background
- 7+ years of experience in software product engineering principles, processes and systems
- Hands-on experience in Java / J2EE, one web server (Apache Tomcat or IBM HTTP Server), one application server (Tomcat/WebSphere), and a major RDBMS such as Oracle
- Hands-on experience with at least one CI/CD tool (Azure DevOps, GitLab CI/CD, Jenkins) and at least one IaC tool (Terraform, AWS CloudFormation, Ansible etc.)
- Experience in at least one cloud technology (AWS/Azure/GCP etc. and Docker, Pivotal, Kubernetes, OpenShift etc.) and its reliability tools (Azure Application Insights, CloudWatch, Azure Monitor etc.)
- Experience with Linux (RHEL) performance monitoring: the key parameters, their interpretation, and the commands used to monitor them
- Experience in Observability: APM tools (Dynatrace, AppDynamics etc.) and metrics / log consolidation (Splunk, ELK Stack)
- Experience defining NFRs and SLA/SLO/SLI agreements for a product / platform / service
- Knowledge of queuing models, thread pools, request-servicing processes etc.
- Knowledge of Web Services, SOA, ESB (DataPower), RESTful services
- Knowledge of application design patterns, J2EE application architectures, microservices, Spring Boot and cloud-native architectures
- Proficiency in Java runtimes, Core Java, garbage collection, and JVM parameter tuning
- Experience in performance tuning of application servers (Tomcat/WAS)
- Experience in troubleshooting performance / scalability / availability issues
- Experience in generating and analyzing thread dumps and heap dumps
- Knowledge of query tuning, database design, and data models
- Knowledge of at least one automation scripting language, such as Python
- Mastery in collaborative software development using Git, Jira, Confluence etc.
- AI/ML and data analytics knowledge and experience is desirable
EY | Building a better working world
EY exists to build a better working world, helping to create long-term value for clients, people and society and build trust in the capital markets.
Enabled by data and technology, diverse EY teams in over 150 countries provide trust through assurance and help clients grow, transform and operate.
Working across assurance, consulting, law, strategy, tax and transactions, EY teams ask better questions to find new answers for the complex issues facing our world today.