Site Reliability Engineer

5 days ago


Bengaluru, Karnataka, India | Tookitaki Holding | Full time | ₹ 9,00,000 - ₹ 12,00,000 per year

Position Overview

Job Title: Site Reliability Engineer (SRE)
Department: Technology
Location: Bangalore
Reporting To: Head of Infra

Tookitaki is looking for a Site Reliability Engineer (SRE) with 3–6 years of experience to help maintain and scale the infrastructure that powers our flagship products—FinCense and the AFC Ecosystem. As an SRE, you will work at the intersection of software engineering and infrastructure, ensuring high availability, performance, and scalability of our platforms.

You will collaborate with engineering, DevOps, and client success teams to operationalize deployments across on-premise, VPC, and Compliance as a Service (CaaS) environments while improving monitoring, automation, and incident response.

Position Purpose

The SRE role is responsible for ensuring the reliability and efficiency of Tookitaki's production systems and environments. This includes building monitoring systems, improving deployment pipelines, automating routine operations, and responding to production incidents. You'll help build a resilient infrastructure that supports our mission to provide AI-driven solutions that prevent financial crime.

Key Responsibilities

System Monitoring & Incident Management

  • Build and maintain monitoring, alerting, and logging systems using tools like Prometheus, Grafana, and ELK.

  • Respond to incidents and outages, conduct post-mortems, and implement corrective actions.

Infrastructure & Deployment Automation

  • Automate infrastructure provisioning and application deployment using Terraform, Ansible, or Helm.

  • Contribute to CI/CD pipelines (GitLab CI, Jenkins, etc.) and improve the reliability and speed of software delivery.

Container & Orchestration Management

  • Manage and troubleshoot Docker containers and Kubernetes clusters, including workload scaling, resource management, and cluster health.

  • Support application updates, rollbacks, and blue-green or canary deployments.

Cloud & Platform Operations

  • Operate within AWS (preferred; EC2, S3, VPC, IAM) or GCP environments.

  • Monitor system availability and resource usage across environments.

Security & Reliability Enhancements

  • Implement and monitor TLS/SSL, RBAC, SSO, and secure API practices.

  • Support compliance and security audit activities by maintaining logs, access controls, and operational hygiene.

Collaboration & Documentation

  • Work closely with developers, infra engineers, and support teams to ensure production readiness.

  • Maintain playbooks, runbooks, and system documentation for reliability engineering activities.

Qualifications and Skills
Education
  • Bachelor's degree in Computer Science, Engineering, or related technical field.

Experience
  • 3–6 years in Site Reliability Engineering, DevOps, Platform Engineering, or a related role.

  • Experience with production environments and live system debugging.

Technical Skills
  • Kubernetes, Docker, Helm – experience deploying and scaling services.

  • Linux administration and command-line debugging.

  • Hands-on experience with AWS (preferred) or GCP cloud platforms.

  • Scripting in Bash and Python for automation and monitoring tasks.

  • Experience with monitoring and alerting tools like Prometheus, Grafana, ELK, or Datadog.

  • Familiarity with databases (e.g., MariaDB, ScyllaDB) and SQL/CQL querying.

Soft Skills
  • Strong problem-solving and debugging skills.

  • Ability to work in on-call rotations and high-pressure production environments.

  • Excellent communication and documentation abilities.

Key Competencies
  • Operational Reliability: Ensures system uptime and performance through proactive monitoring and maintenance.

  • Automation Mindset: Reduces manual effort through scripting and tooling.

  • Incident Response: Identifies and resolves issues quickly to minimize downtime.

  • Cross-Functional Collaboration: Works effectively with engineering, support, and infra teams.

  • Security Awareness: Applies best practices in infrastructure and platform security.

Success Metrics
  • Maintain 99.9%+ uptime across production environments.

  • Reduce mean time to detect (MTTD) and mean time to resolve (MTTR) for critical incidents.

  • Increase automation coverage and reduce manual deployment steps.

  • Achieve high developer satisfaction with CI/CD and platform reliability.

  • Maintain compliance readiness and security log availability for audits.

Benefits
  • Competitive compensation

  • Work on a globally recognized RegTech platform transforming financial crime prevention.

  • Exposure to cutting-edge AI and big data infrastructure (Spark, Kafka, ScyllaDB, Flink).


