Site Reliability Engineer-1
7 months ago
Zelis I&E BU is looking for a DevOps / Site Reliability Engineer to expand our Engineering team. We follow an Agile methodology in small software teams to consistently deliver high-quality software. Our stack includes Ruby, Rails, Angular, TypeScript, Node, Rabbit, Solr, Postgres, Redis, Puppet, and Hubot. Our infrastructure is declared as code and provisioned on AWS. We offer mentorship and career guidance, a competitive salary, remote-friendly workspace, unlimited vacation time and continuing education support (conferences, books, online resources).
We’re looking for someone who’s ready to help us improve our customer experience by building functional systems that bring our business to new heights. In this role you will Gather and analyze metrics from both of our operating systems and applications to assist in performance tuning and fault finding. Partner with development teams to improve services through rigorous testing and release procedures. Participate in system design consulting, platform management, and capacity planning. Create sustainable systems and services through automation. Balance feature development speed and reliability with well-defined service level objectives. Participate in blameless postmortems to identify resilience and reliability improvements.
In this position, you'll be responsible for:
Gathering and analyzing metrics from both operating systems and applications to assist in performance tuning and fault finding.
Partnering with development teams to improve services through rigorous testing and release procedures.
Participating in system design consulting, platform management, and capacity planning.
Creating sustainable systems and services through automation.
Balancing feature development speed and reliability with well-defined service level objectives.
Participating in blameless postmortems to identify resilience and reliability improvements.
Oversight and optimization of AWS infrastructure using configuration management and infrastructure-as-code best practices.
Triaging, routing, and resolution of issues and incidents identified by both internal and external stakeholders.
Advising and guiding other organizational teams with a focus on automation, maintainability, reliability, performance, and security.
Leading, advising, and analyzing load and performance testing exercises to identify performance bottlenecks and breakpoints, and determine infrastructure needs accordingly.
Measurement, monitoring, and reporting of availability, latency, and overall system health based on SLIs/SLOs/SLAs.
Engagement in capacity planning, demand forecasting, software performance analysis, and systems tuning.
Managing the CI/CD pipeline and migration of client software releases through QA, UAT, and production environments to ensure high-quality, on time delivery of all dependencies.
Documentation of tribal knowledge to reduce knowledge silos and reliance on institutional memory to support and maintain reliable systems.
Triaging and troubleshooting production issues related to our S365 products.
Researching and implementing ways to automate the management of our infrastructure and toil.
Supporting deployments across our growing development, UAT, and production environments.
Building out uptime, latency, and error monitoring for the S365 stack.
Participating in blameless postmortems for incidents.
Taking part in on-call rotation for production support.
You might be a good fit if you have:
2-4 years of software development experience.
Experience supporting Linux systems hosted in a cloud environment - We’re using AWS (specifically EC2, CloudFormation, RDS, ElasticCache, and S3, to name a few).
Experience with web programming languages (Ruby on Rails a definite plus).
Familiarity with using Puppet or equivalent infrastructure management & automation tooling.
Excellent communication skills.
A strong desire to understand complex systems and how to make them highly available.
A collaborative spirit and you enjoy working with a team to build things.
A desire to continually improve and you value giving and receiving constant and constructive feedback.
Commitment to Diversity, Equity, Inclusion, and Belonging
At Zelis, we champion diversity, equity, inclusion, and belonging in all aspects of our operations. We embrace the power of diversity and create an environment where people can bring their authentic and best selves to work. We know that a sense of belonging is key not only to your success at Zelis, but also to your ability to bring your best each day.
-
Site Reliability Engineer
3 months ago
hyderabad, India SID Global Solutions Full timeJob Description: Site Reliability Engineer (SRE) – Apigee Level 1Experience: 2 to 10 yearsThe Site Reliability Engineer (SRE) Level 1 will be responsible for maintaining and improving the reliability, availability, and performance of the systems. This entry-level role is ideal for someone who passionate about learning and developing their skills in system...
-
Site Reliability Engineer
4 months ago
Hyderabad, India SID Global Solutions Full timeJob Description: Site Reliability Engineer (SRE) – Apigee Level 1Experience: 2.5 to 6 yearsThe Site Reliability Engineer (SRE) Level 1 will be responsible for maintaining and improving the reliability, availability, and performance of the systems. This entry-level role is ideal for someone who passionate about learning and developing their skills in system...
-
Site Reliability Engineer
4 months ago
Hyderabad, India SID Global Solutions Full timeJob Description: Site Reliability Engineer (SRE) – Apigee Level 1 Experience: 2.5 to 6 years The Site Reliability Engineer (SRE) Level 1 will be responsible for maintaining and improving the reliability, availability, and performance of the systems. This entry-level role is ideal for someone who passionate about learning and developing their skills in...
-
Site Reliability Engineer
4 months ago
Hyderabad, India SID Global Solutions Full timeJob Description: Site Reliability Engineer (SRE) – Apigee Level 1Experience: 2.5 to 6 yearsThe Site Reliability Engineer (SRE) Level 1 will be responsible for maintaining and improving the reliability, availability, and performance of the systems. This entry-level role is ideal for someone who passionate about learning and developing their skills in system...
-
Site Reliability Engineer
3 months ago
Hyderabad, India SID Global Solutions Full timeJob Description: Site Reliability Engineer (SRE) – Apigee Level 1Experience: 2 to 10 yearsThe Site Reliability Engineer (SRE) Level 1 will be responsible for maintaining and improving the reliability, availability, and performance of the systems. This entry-level role is ideal for someone who passionate about learning and developing their skills in system...
-
Site Reliability Engineering Manager
2 weeks ago
Hyderabad, Telangana, India Truetech Full timeSenior Site Reliability EngineerWe are seeking an experienced Senior Site Reliability Engineer to lead and manage a team of engineers, providing guidance and support to ensure the team's success. The ideal candidate will have a strong background in site reliability engineering, cloud computing, and DevSecOps principles.The successful candidate will be...
-
Site Reliability Engineer-1
7 months ago
Hyderabad, Telangana, India Zelis Full time**Job Description**: - Zelis I&E BU is looking for a DevOps / Site Reliability Engineer to expand our Engineering team. We follow an Agile methodology in small software teams to consistently deliver high-quality software. Our stack includes Ruby, Rails, Angular, TypeScript, Node, Rabbit, Solr, Postgres, Redis, Puppet, and Hubot. Our infrastructure is...
-
Site Reliability Engineer
4 weeks ago
Hyderabad, India IDEMIA Full timeWe are hiring for Site Reliability Engineer role at Noida location.Responsibility:Involved in deploy/manage/operate of medium to large scale production systemsUnderstanding of Linux as a runtime environmentFamiliar to Cloud native concepts and virtualisationFamiliar to CI/CD concepts and tools like Jenkins, Gitlab etcPrevious experience of working with...
-
Manager - Site Reliability Engineering
2 months ago
Hyderabad, India Live Connections Full timeWe are looking for Manager Site Reliability Engineer in Hyderabad locationRoles and Responsibilities :Position will manage 5 to 10 engineers both directly and indirectly. The engineers will include Site Reliability Engineers, Observability Engineers, Performance Engineers, DevSecOps Engineers, and others These individuals will vary from entry level to senior...
-
Manager - Site Reliability Engineering
4 weeks ago
Hyderabad, India Live Connections Full timeWe are looking for Manager Site Reliability Engineer in Hyderabad location Roles and Responsibilities : Position will manage 5 to 10 engineers both directly and indirectly. The engineers will include Site Reliability Engineers, Observability Engineers, Performance Engineers, DevSecOps Engineers, and others These individuals will vary from entry level to...
-
Site Reliability Engineering Lead
3 weeks ago
Hyderabad, Telangana, India Live Connections Full timeWe are looking for a highly skilled Site Reliability Engineering Lead to join our team at Live Connections in Hyderabad. As a key member of our organization, you will be responsible for leading and managing a team of engineers to ensure the reliability, scalability, and performance of our systems.**Estimated Salary: ₹25,00,000 - ₹35,00,000 per...
-
Senior Site Reliability Engineer 1
6 months ago
Hyderabad, India ModMed Full timeWe are united in our mission to make a positive impact on healthcare. Join Us! South Florida Business Journal, Best Places to Work 2024 Inc. 5000 Fastest-Growing Private Companies in America 2023 Company of the Year | 2023 BIG Innovation Awards Fastest-Growing Company of the Year – Large (Bronze) | 2022 Best in Biz Awards Who we are: ...
-
Site reliability engineer
2 weeks ago
Hyderabad, India NCR Voyix Full timeJob Title: Site Reliability Engineer (SRE) Company: NCR VOYIX Location: Hyderabad Job Description: For NCR Platform-Engineering team, we are looking for highly experienced Cloud SRE Engineers to join our expanding Cloud Engineering team. Our Cloud Engineering team is responsible for automating and operating NCR Retail and Restaurant cloud-based...
-
Site Reliability Engineer
2 weeks ago
Hyderabad, India NCR Voyix Full timeJob Title: Site Reliability Engineer (SRE)Company: NCR VOYIXLocation: HyderabadJob Description:For NCR Platform-Engineering team, we are looking for highly experienced Cloud SRE Engineers to join our expanding Cloud Engineering team. Our Cloud Engineering team is responsible for automating and operating NCR Retail and Restaurant cloud-based products.Our aim...
-
Site Reliability Engineer
4 hours ago
Hyderabad, India Coforge Full timeJob Title: Site Reliability EngineerSkills: SRE, CI/CD, AWS, Python, Terraform & KubernetesLocation: Hyderabad (Work from Office)Experience: 7-15 YearsNote: Immediate joiners are preferableJob Description:We at Coforge are hiring a Site Reliability Engineer with the following skillset:Design, implement, and manage scalable and secure cloud-based...
-
Site Reliability Engineer
2 weeks ago
Hyderabad, India NCR Voyix Full timeJob Title: Site Reliability Engineer (SRE) Company: NCR VOYIX Location: Hyderabad Job Description: For NCR Platform-Engineering team, we are looking for highly experienced Cloud SRE Engineers to join our expanding Cloud Engineering team. Our Cloud Engineering team is responsible for automating and operating NCR Retail and Restaurant cloud-based products....
-
Site Reliability Engineer
2 weeks ago
Hyderabad, India NCR Voyix Full timeJob Title: Site Reliability Engineer (SRE)Company: NCR VOYIXLocation: HyderabadJob Description:For NCR Platform-Engineering team, we are looking for highly experienced Cloud SRE Engineers to join our expanding Cloud Engineering team. Our Cloud Engineering team is responsible for automating and operating NCR Retail and Restaurant cloud-based products.Our aim...
-
Site Reliability Engineer
2 weeks ago
Hyderabad, India NCR Voyix Full timeJob Title: Site Reliability Engineer (SRE) Company: NCR VOYIX Location: Hyderabad Job Description: For NCR Platform-Engineering team, we are looking for highly experienced Cloud SRE Engineers to join our expanding Cloud Engineering team. Our Cloud Engineering team is responsible for automating and operating NCR Retail and Restaurant cloud-based...
-
Site Reliability Engineer
2 weeks ago
Hyderabad, India NCR Voyix Full timeJob Title: Site Reliability Engineer (SRE)Company: NCR VOYIXLocation: HyderabadJob Description:For NCR Platform-Engineering team, we are looking for highly experienced Cloud SRE Engineers to join our expanding Cloud Engineering team. Our Cloud Engineering team is responsible for automating and operating NCR Retail and Restaurant cloud-based products.Our aim...
-
Site Reliability Engineer
2 months ago
Hyderabad, India 5100 Kyndryl Solutions Private Limited Full timeWho We Are At Kyndryl, we design, build, manage and modernize the mission-critical technology systems that the world depends on every day. So why work at Kyndryl? We are always moving forward – always pushing ourselves to go further in our efforts to build a more equitable, inclusive world for our employees, our customers and our communities. The...