Site Reliability Engineer

6 days ago


bangalore, India techolution Full time

We are seeking a highly skilled Site Reliability Engineer - AWS to enhance the reliability, scalability, and security of our cloud infrastructure. The ideal candidate will be responsible for designing, implementing, and maintaining high-availability systems, automating processes, and ensuring seamless operations on AWS. This role requires expertise in DevOps, cloud automation, monitoring, and incident response . Title : Site Reliability Engineer - AWS Location : Remote Work Employment Type: Full-time Work timings : 24*7 rotational shifts Responsibilities: Design and maintain highly available, scalable, and fault-tolerant AWS infrastructure to ensure system reliability and performance. Proactively monitor and troubleshoot system issues, minimizing downtime and optimizing system performance. Develop and maintain Infrastructure as Code (IaC) using Terraform, CloudFormation, or AWS CDK to automate deployments and infrastructure management. Implement and optimize continuous integration and deployment (CI/CD) pipelines using tools like Jenkins, GitLab CI/CD, or AWS CodePipeline. Ensure AWS environments meet security best practices, including IAM policies, network security configurations, and compliance requirements. Set up and manage monitoring and logging solutions using tools such as Prometheus, AWS CloudWatch, ELK Stack, and Datadog. Identify and address performance bottlenecks through load balancing, caching strategies, and system optimizations. Work closely with developers, security teams, and product managers to enhance system architecture and operational efficiency. Required Skills & Experience Strong experience in AWS services such as EC2, Lambda, EKS, S3, SageMaker, DynamoDB, and IAM . Expertise in Infrastructure as Code (IaC) tools like Terraform or CloudFormation . Proficiency in CI/CD pipelines using GitHub Actions, Jenkins, or AWS CodePipeline . Experience with containerization and orchestration (Docker, Kubernetes, Helm). Strong knowledge of monitoring, logging, and alerting tools (CloudWatch, Prometheus, ELK, Datadog). Solid Python, Bash, or Golang scripting skills for automation. Experience working with ML models in production environments is a plus. Familiarity with security best practices (IAM, VPC security, encryption, WAF). Strong problem-solving and troubleshooting skills. Preferred Qualifications Experience with MLOps frameworks and AI model deployment. Knowledge of AWS AI/ML services like SageMaker, Bedrock, or AI pipelines. Hands-on experience with Kafka, Spark, or other big data technologies . About Techolution : Techolution is a next gen Consulting firm on track to become one of the most admired brands in the world for "innovation done right". Our purpose is to harness our expertise in novel technologies to deliver more profits for our enterprise clients while helping them deliver a better human experience for the communities they serve. With that, we are now fully committed to helping our clients build the enterprise of tomorrow by making the leap from Lab Grade AI to Real World AI. Other focus areas being Enterprise Cloud, Product Innovation (IoT, 3D printing, Robotics), Real World AI Services (CV, LLM, CNN). We are honored to have recently received the prestigious , a testament to our commitment to excellence. We were also awarded - by The AI Summit 2023, Platinum sponsor at Advantage DoD 2024 Symposium and a lot more exciting stuff While we are big enough to be trusted by some of the greatest brands in the world, we are small enough to care about delivering meaningful ROI-generating innovation at a guaranteed price for each client that we serve. Our thought leader, Luv Tulsidas, wrote and published a book in collaboration with Forbes, “Failing Fast? Secrets to succeed fast with AI”. Refer here for more details on the content - Let's explore further Uncover our unique AI accelerators with us: 1. : Our no-code DIY AI studio for enterprises. Choose an LLM, connect it to your data, and create an expert-level agent in 20 minutes. 2. : Modernizes ancient tech stacks quickly, achieving over 80% autonomy for major brands3. : Our ComputerVision. AI Offers customizable Computer Vision and Audio AI models, plus DIY tools and a Real-Time Co-Pilot for human-AI collaboration4. : Provides comprehensive robotics, hardware fabrication, and AI-integrated edge design services. 5. : Our proven Reinforcement Learning with Expert Feedback (RLEF) approach bridges Lab-Grade AI to Real-World AI. 6. : Establishes an AI Center of Excellence to maximize AI potential and ROI. 7. : AI-powered user identification system using image recognition and deep neural networks, eliminating the need for keys, badges, or fingerprint scanners Some videos you wanna watch Visit us @ : To know more about our revolutionary core practices and getting to know in detail about how we enrich the human experience with technology.



  • bangalore, India super Full time

    Site Reliability Engineer (SRE) Level 3Overview:A Site Reliability Engineer (SRE) Level 3 is a senior technical leadership role focused on designing, implementing, and maintaining large-scale, complex, and highly reliable systems. This role emphasizes a blend of software and systems engineering to ensure the availability, latency, performance, and capacity...


  • Bangalore, India Aqilea (formerly Soltia) Full time

    We are a consulting company with a bunch of technology-interested and happy people!We love technology, we love design and we love quality. Our diversity makes us unique and creates an inclusive and welcoming workplace where each individual is highly valued.With us, each individual is her/himself and respects others for who they are and we believe that when a...


  • Bangalore, India Aqilea (formerly Soltia) Full time

    We are a consulting company with a bunch of technology-interested and happy people! We love technology, we love design and we love quality. Our diversity makes us unique and creates an inclusive and welcoming workplace where each individual is highly valued.With us, each individual is her/himself and respects others for who they are and we believe that when...


  • bangalore, India Progress Full time

    We are Progress (Nasdaq: PRGS) - the trusted provider of software that enables our customers to develop, deploy and manage responsible, AI-powered applications and experience with agility and ease.We're proud to have a diverse, global team where we value the individual and enrich our culture by considering varied perspectives because we believe people power...


  • Bangalore, India CodeKarma Full time

    Site Reliability Engineer (Multi-Cloud Deployments) Location: Bangalore / Remote Experience: 4–10 years Type: Full-time (6-month probation) About CodeKarma CodeKarma is redefining how engineering teams understand and evolve complex systems — bringing production context directly into the developer’s workflow. Our platform runs both as SaaS and as sub-


  • Bangalore, India Flipkart Full time

    Hiring Site Reliability Engineers Exp : 2.5 +years (Excluding internship) Location : Bangalore Apply Here : The engineer will work in the Reliability and Productivity Engineering team and is responsible for building industry standard large scale platforms to be utilised across FK that helps to significantly improve the reliability of systems and bring...


  • bangalore, India Andor Tech Full time

    Hiring!!🏢 About AndorTechAndorTech is a global IT services and consulting firm founded in 2009, headquartered in Bangalore. The company specializes in software engineering, AI-enabled IT services, application support, analytics, and test automation. With a presence across India, the USA, Europe, and the UAE, AndorTech partners with Global Capability...


  • bangalore, India Cyberhaven Full time

    About the roleWe're looking for an experienced Site Reliability engineer for making sure systems are reliable, scalable, and performing well especially in production environments. Our technology is new and rapidly evolving as an early member on the team, you'll play a key role in shaping the reliability architecture, building scalable infrastructure, and...


  • bangalore, India JRD Systems Full time

    Site Reliability Engineer (Windows / Cloud / Automation) Job Summary: We are seeking an experienced Site Reliability Engineer with a strong background in managing Windows infrastructure and cloud environments. The ideal candidate will be responsible for designing, implementing, automating, and maintaining scalable infrastructure solutions across AWS, Azure,...


  • Bangalore, India Aqilea (formerly Soltia) Full time

    We are a consulting company with a bunch of technology-interested and happy people!We love technology, we love design and we love quality. Our diversity makes us unique and creates an inclusive and welcoming workplace where each individual is highly valued.With us, each individual is her/himself and respects others for who they are and we believe that when a...