Site Reliability Engineer

1 day ago


Bengaluru, Karnataka, India Creencia Technologies Pvt Ltd Full time

We are recruiting an experienced Site Reliability Engineer to join our newly established TechOps division within the Technology department. We maintain the systems that keep our products running smoothly around the world, 24x7 - supporting everything from cloud infrastructure and CI/CD pipelines to observability and incident response.

How you will contribute in this role:

- Define and implement best practices for system reliability, observability, monitoring, and alerting.

- Build and manage automation for our AWS cloud-based services and SaaS stack. Continuously reduce operational toil.

- Drive end-to-end observability across our web and mobile applications, cloud infrastructure, firewalls and CDNs.

- Diagnose infrastructure failures, performance bottlenecks, and production issues through strong debugging skills.

- Work closely with Service Delivery Managers to drive incident management processes, including postmortems and root cause analysis, and with application teams and platform engineers to improve reliability and performance.

- Participate in on-call rotations, ensuring rapid incident response across our stack.

- Take ownership of SLAs/SLOs/SLIs and commit to continuous improvement of service levels across all platforms.

- Improve system resilience and minimize MTTR (mean time to recovery) through incident response automation.

What we're looking for:

- 4+ years of professional experience as a Site Reliability Engineer or in a Cloud Operations/DevOps role.

- 3+ years in a production environment supporting large-scale, mission-critical applications - including web, mobile, and e-commerce/payment applications.

- Proficiency in one or more programming/scripting languages (e.g., Python, Golang, TypeScript).

- In-depth knowledge of observability tools (e.g., New Relic, Prometheus, Grafana).

- Professional experience with cloud platforms (AWS strongly preferred), including services such as serverless functions, API gateways, relational and NoSQL databases, and caching.

- Strong experience with container orchestration (ECS, Kubernetes), CI/CD pipelines, and infrastructure-as-code (AWS CDK, Terraform, Pulumi, etc.).

- An advanced degree in software/data engineering, computer/information science, or a related quantitative field, or equivalent work experience.

- Strong verbal and written communication skills and ability to work well with a wide range of stakeholders.

- Strong sense of ownership; scrappy, with a bias for action.

(ref:hirist.tech)

  • Bengaluru, Karnataka, India Enterprise Minds, Inc Full time

    We're Hiring | Site Reliability Engineer | 8-10 years


  • Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time

    We are looking for an L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high...


  • Bengaluru, Karnataka, India Randstad Full time

    Role: Site Reliability Engineer. Summary: The Network Engineer 2 provides technical design, planning, operation, maintenance, and advanced troubleshooting of the Bread Financials' network infrastructure. This position ensures continuity and alignment of the network administration/engineering direction. This position supports Bread Financials' strategies and...


  • Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time

    We are looking for an L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...


  • Bengaluru, Karnataka, India TRUGlobal Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    Job Title: Site Reliability Engineer (SRE) with Python Development Expertise. Position Overview: We are seeking a skilled Site Reliability Engineer (SRE) with strong Python development experience to join our team. The ideal candidate will be responsible for ensuring the reliability, availability, and performance of our services across both on-premises and...


  • Bengaluru, Karnataka, India beBeeReliability Full time ₹ 2,00,00,000 - ₹ 2,50,00,000

    Role Overview: As a Site Reliability Engineer, you will play a pivotal role in driving innovation and modernizing complex systems by leveraging cutting-edge technologies and collaboration with cross-functional teams.


  • Bengaluru, Karnataka, India IDESLABS PRIVATE LIMITED Full time US$ 90,000 - US$ 1,20,000 per year

    Experience: 5+ Years. Skill: Site reliability engineer. Location: Bangalore. Notice Period: Immediate. Employment Type: Contract. Working Mode: Hybrid. Job Description: Site Reliability Engineer. Tech Stack: Primary: AWS, Terraform, Ansible, Docker. Secondary: Python, Bash, Github, Jenkins.


  • Bengaluru, Karnataka, India Coforge Full time

    Job Description: Design, implement, and maintain scalable infrastructure to ensure high availability and performance of software applications. Collaborate with development teams to identify and resolve issues affecting application performance, stability, and reliability. Develop automated monitoring scripts using tools like Prometheus, Grafana, etc. to...


  • Bengaluru, Karnataka, India Infrasoft Technologies Limited Full time

    Job Description. Job Title: Developer. Work Location: Bangalore, Karnataka. Experience Range: 68 Years. Job Description: We are looking for a skilled Developer with strong hands-on experience in Site Reliability Engineering (SRE), Java, JavaScript, and Production Support. The ideal candidate should have a solid background in application monitoring and troubleshooting...


  • Bengaluru, Karnataka, India Collabera Full time

    Job Description: As a Principal/Chief Site Reliability Engineer, you will play a critical role in designing, developing, and maintaining scalable and highly reliable systems. You'll work closely with development teams to improve system reliability, monitor critical applications, and design fail-proof infrastructure. Responsibilities: Design and implement...