Site Reliability Engineer II

1 week ago


Greater Hyderabad Area, India Candescent Full time ₹ 5,00,000 - ₹ 15,00,000 per year

Candescent is the largest non-core digital banking provider. We bring together the transformative technologies that power and connect account opening, digital banking and branch solutions for banks and credit unions of all sizes on any core. Our Candescent solutions power the top three U.S. mobile banking apps and are trusted by banks and credit unions of all sizes.

We offer an extensive portfolio of industry-leading products and services with an extensible ecosystem of out-of-the-box and integrated partner solutions. In addition, our API-first architecture and developer tools enable financial institutions to optimize and expand upon their existing capabilities by seamlessly integrating custom-built or third-party solutions. And our connected in-person, remote and digital experiences reinvent customer service across all channels.

Self-service configuration and marketing tools give financial institutions greater control of their branding, targeted messaging and overall user experience. And data-driven analytics and reporting tools provide valuable insights to help drive continued growth and profitability. From conversions and implementations to custom development and customer care, our clients get expert, end-to-end support at every step.

TITLE
: Site Reliability Engineer
Exp: 3-6 Years
Job Role
We are looking for a
Site Reliability Engineer
(SRE) who will be part of our SRE team and help build scalable systems, using best practices around automation, that improve reliability, velocity and enable monitoring of the operational health of stacks throughout their life cycle including metrics collection, aggregation, and visualization.

As a member of the SRE team you will support Candescent Financial Services, product and technology teams to improve the design and operation of systems, focusing on making them scalable, reliable, and efficient while ensuring performance and high availability of products/services primarily residing in the cloud. You will influence the development and implementation of reliable production systems and services to address emerging business needs (such as Cloud-based SaaS). SRE's pride themselves on the resiliency and stability of production systems, yet at the same time are committed to innovation and operational improvement through the application of software engineering practices to operations.

The SRE will facilitate innovation and operational improvement through the application of software engineering practices to operations. You will make our products easier to adopt and use by making improvements to the product, tools, processes and documentation. You are someone who strives for six 9's or better for service availability

Job Description

  • You will be responsible for maintaining and scaling production services and servers for complex and high throughput cloud services.
  • You will bridge and own the union between development, quality, security and operations.
  • You will improve scalability, service reliability, capacity, and performance.
  • You will write automation code for provisioning and operating infrastructure at massive scale.
  • You are not an operator, you're an experienced software engineer focused on operations.
  • You will initiate and contribute to continuous improvement of our software delivery processes and practices in a multi-location, multidisciplinary team to empower and accelerate product development.
  • You will use automation extensively to design, configure, manage, and monitor systems in support of our product development teams.
  • You will participate in disaster recovery planning and execution.
  • You will be responsible for maintaining / patching servers supporting SaaS products. This includes Windows Servers, Linux Servers running in in-house Datacenters and/or using cloud PaaS providers (Primarily GCP & Azure).
  • You'll work hand-in-hand with all teams to ship our code to production using Continuous Integration / Continuous Deployment (CI/CD) and AppSec tooling.
  • You will collaborate with development teams and use intuition, experience and understanding to create SLIs, SLOs, and SLAs.
  • You will provide timely assistance and remediation solutions during critical situations and production incidents to help resolve service problems. (You will be on-call for periods of time.)
  • You will develop monitoring architecture, implementing monitoring agents, build dashboards, manage escalations and alerts
  • You will participate in incident management and driving root cause analysis (RCA) and risk management processes.

EEO Statement
Integrated into our shared values is Candescent's commitment to diversity and equal employment opportunity. All qualified applicants will receive consideration for employment without regard to sex, age, race, color, creed, religion, national origin, disability, sexual orientation, gender identity, veteran status, military service, genetic information, or any other characteristic or conduct protected by law. Candescent is committed to being a globally inclusive company where all people are treated fairly, recognized for their individuality, promoted based on performance and encouraged to strive to reach their full potential. We believe in understanding and respecting differences among all people. Every individual at Candescent has an ongoing responsibility to respect and support a globally diverse environment.

Statement to Third Party Agencies
To ALL recruitment agencies: Candescent only accepts resumes from agencies on the preferred supplier list. Please do not forward resumes to our applicant tracking system, Candescent employees, or any Candescent facility. Candescent is not responsible for any fees or charges associated with unsolicited resumes.



  • Hyderabad, India JP Morgan Chase & Co. Full time

    Job Description Play a key role in ensuring system reliability at one of the world's most iconic and largest financial institutions. As a Site Reliability Engineer II at JPMorgan Chase within the Corporate Technology, youwill use technology to solve business problems and leverage software engineering best practices as we strive towards excellence. This...


  • Greater Bengaluru Area, India Ivanti Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Senior Site Reliability Engineer - 6 positionsWhy We Need YouSenior Site Reliability Engineering (SRE) - is a growing team that partners closely with Product Engineering, Security, and Support. We are responsible for the reliability, deployment, and continuous operation of the Ivanti Cloud services. We need your help to take our existing platform to the next...


  • Hyderabad, Telangana, India JPMorgan Chase Full time

    Job Category Software Engineering Play a key role in ensuring system reliability at one of the world s most iconic and largest financial institutions As a Site Reliability Engineer II at JPMorgan Chase within the Corporate Technology you will use technology to solve business problems and leverage software engineering best practices as we strive...


  • Greater Kolkata Area, India Cling Multi Solutions Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Job SummaryWe are seeking an experienced Site Reliability Engineer (SRE) Architect with over 10 years of IT experience, specializing in designing and implementing highly scalable, reliable, and automated systems.The ideal candidate will have strong expertise in cloud-native architectures, automation, monitoring, and SRE practices.This role requires excellent...


  • Hyderabad, Telangana, India COFFEEBEANS CONSULTING LLP Full time

    About the Job :We're looking for a highly skilled and self-driven Site Reliability Engineer (SRE-2) to join our team in Hyderabad. This is a full-time, work-from-office role (5 days a week) perfect for someone with 8-12 years of experience who thrives on challenges and is passionate about building robust, scalable, and highly available systems.You'll play a...


  • Greater Delhi Area, India RELX Full time ₹ 1,04,000 - ₹ 1,30,878 per year

    Join our team in delivering high-quality software to customers worldwideAre you motivated to collaborate, solve problems, and inspire others with your enthusiasm?About The BusinessLexisNexis Risk Solutions is an essential partner in risk assessment. In our Business Services vertical, we offer solutions to help organizations of all sizes drive growth, improve...


  • Hyderabad, Telangana, India Talent Worx Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Site Reliability Engineer (SRE)At Talent Worx, we are looking for a dedicated Site Reliability Engineer (SRE) to join our team. This role involves maintaining high availability and reliability of our services through the application of software engineering practices and systems administration skills. The ideal candidate will bridge the gap between...


  • Hyderabad, India Talent Worx Full time

    Site Reliability Engineer (SRE) At Talent Worx, we are looking for a dedicated Site Reliability Engineer (SRE) to join our team. This role involves maintaining high availability and reliability of our services through the application of software engineering practices and systems administration skills. The ideal candidate will bridge the gap between...


  • Pune/Pimpri-Chinchwad Area, India Accelya Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    For more than 40 years, Accelya has been the industry's partner for change, simplifying airline financial and commercial processes and empowering the air transport community to take better control of the future. Whether partnering with IATA on industry-wide initiatives or enabling digital transformation to simplify airline processes, Accelya drives the...


  • Hyderabad, India Jigya Software Services Full time

    Job Title:Senior Site Reliability Engineer (SRE) - AWS/Kubernetes Location:Hyderabad - Onsite Job Type:Full-Time About the Role: We are looking for a highly skilled and motivated Site Reliability Engineer to design, build, and maintain our high-performance, scalable cloud infrastructure. You will play a critical role in ensuring the reliability, performance,...