Site Reliability Engineer

5 days ago


Bangalore, India CareerUS Solutions Full time

Position Overview:
The Site Reliability Engineer (SRE) is responsible for ensuring the stability, scalability, performance, and reliability of production systems and services. This role bridges software development and operations, using automation, monitoring, and performance optimization to build resilient systems that can scale efficiently and recover quickly from failures.

Key Responsibilities:
• Design, build, and maintain highly reliable and scalable systems and infrastructure.
• Automate deployment, monitoring, and maintenance processes using DevOps tools and scripts.
• Implement and manage CI/CD pipelines to support continuous delivery.
• Monitor application performance, identify bottlenecks, and improve uptime and reliability.
• Develop and maintain incident response procedures, including root cause analysis and postmortems.
• Collaborate with development teams to design systems for fault tolerance, load balancing, and failover.
• Manage and optimize cloud infrastructure (AWS, Azure, GCP).
• Implement observability solutions: logging, metrics, tracing, and alerting.
• Maintain strong security and compliance standards across infrastructure.
• Participate in on-call rotations and ensure 24/7 system availability.
• Document processes, configurations, and runbooks for operational consistency.

Required Skills & Qualifications:
• Bachelor’s degree in Computer Science, Information Technology, or related field.
• Strong knowledge of Linux/Unix systems administration and shell scripting.
• Proficiency with automation and configuration tools (Ansible, Terraform, Chef, Puppet).
• Experience with cloud platforms (AWS, Azure, or Google Cloud).
• Familiarity with containerization and orchestration tools (Docker, Kubernetes).
• Solid understanding of CI/CD tools (Jenkins, GitLab CI, CircleCI).
• Strong experience with monitoring and observability tools (Prometheus, Grafana, ELK Stack, Datadog).
• Knowledge of networking fundamentals, load balancing, and DNS management.
• Proficiency in at least one programming language (Python, Go, or Bash).
• Excellent analytical, problem-solving, and communication skills.

Preferred Qualifications:
• Experience with infrastructure-as-code (IaC) and serverless architectures.
• Knowledge of reliability metrics such as SLOs, SLIs, and error budgets.
• Exposure to database administration (MySQL, PostgreSQL, MongoDB, Redis).
• Familiarity with security practices for cloud-native systems.
• Certifications such as AWS Certified DevOps Engineer, Google SRE Certification, or CKA (Certified Kubernetes Administrator).



  • Bangalore, India ViewSonic Full time

    Job Requirements: Bachelor's degree in Computer Science, Engineering, or a related field. 3+ years of experience in a relevant role, such as Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory. Basic understanding of AWS solutions including EC2, S3, CloudWatch, Lambda, and RDS. Interest and understanding of Platform Engineering...


  • Bangalore, India Aqilea (formerly Soltia) Full time

    We are a consulting company with a bunch of technology-interested and happy people! We love technology, we love design and we love quality. Our diversity makes us unique and creates an inclusive and welcoming workplace where each individual is highly valued. With us, each individual is her/himself and respects others for who they are and we believe that when a...


  • Bangalore, India WhiteLotus Talent Partners Full time

    We are looking for an L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...


  • Bangalore, India CodeKarma Full time

    Site Reliability Engineer (Multi-Cloud Deployments) Location: Bangalore / Remote Experience: 4–10 years Type: Full-time (6-month probation) About CodeKarma CodeKarma is redefining how engineering teams understand and evolve complex systems — bringing production context directly into the developer’s workflow. Our platform runs both as SaaS and as sub-


  • Bangalore district, India IntraEdge Full time

    Job Title: Site Reliability Engineer (SRE) – Production Support Location: Bengaluru Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong experience in production support, DevOps practices, and cloud infrastructure management. The ideal candidate will be responsible for maintaining the reliability, performance, and...


  • Bangalore, India JRD Systems Full time

    Site Reliability Engineer (Windows / Cloud / Automation) Job Summary: We are seeking an experienced Site Reliability Engineer with a strong background in managing Windows infrastructure and cloud environments. The ideal candidate will be responsible for designing, implementing, automating, and maintaining scalable infrastructure solutions across AWS, Azure,...


  • Bangalore, India Tata Consultancy Services Full time

    Role: Manager, Site Reliability Engineering. Required Technical Skill Set: Manager, Site Reliability Engineering. Desired Experience Range: 12-18 yrs. Notice Period: Immediate to 90 days only. Location of Requirement: Bangalore. We are currently planning to do a Virtual Interview. Job Description: Describe what the person will do in the role - how he/she will impact...

