Cloud Reliability Engineer

5 days ago


Mumbai, Maharashtra, India Session AI Full time
Job Description

We are seeking a skilled Cloud Reliability Engineer to join our Site Reliability Engineering Group at Session AI. As a key member of our team, you will be responsible for ensuring the seamless operation of our Cloud platform, with a focus on availability, performance, and stability.

The ideal candidate will have over five years of experience managing cloud-based Big Data solutions, with a strong commitment to resolving operational challenges through automation and sophisticated software tools.

Key Responsibilities:

  • Design and implement solutions to enhance the availability, performance, and stability of our systems, services, and products.
  • Develop, automate, and maintain infrastructure as code for provisioning environments in AWS, Azure, and GCP.
  • Deploy modern automated solutions to enable automatic scaling of the core platform and features in the cloud.
  • Apply cybersecurity best practices to safeguard our production infrastructure.
  • Collaborate on DevOps automation, continuous integration, test automation, and continuous delivery for the Session AI platform and its new features.
  • Manage data engineering tasks to ensure accurate and efficient data integration into our platform and outbound systems.
  • Utilize expertise in DevOps best practices, shell scripting, Python, Java, and other programming languages, while continually exploring new technologies for automation solutions.
  • Design and implement monitoring tools for service health, including fault detection, alerting, and recovery systems.
  • Oversee business continuity and disaster recovery operations.
  • Create and maintain operational documentation, focusing on reducing operational costs and enhancing procedures.
  • Demonstrate a continuous learning attitude with a commitment to exploring emerging technologies.

Preferred Skills:

  • Experience with cloud platforms like AWS, Azure, and GCP, including their management consoles and CLI.
  • Proficiency in building and maintaining infrastructure on:
    • AWS using services such as EC2, S3, ELB, VPC, CloudFront, Glue, Athena, etc.
    • Azure using services such as Azure VMs, Blob Storage, Azure Functions, Virtual Networks, Azure Active Directory, Azure SQL Database, etc.
    • GCP using services such as Compute Engine, Cloud Storage, Cloud Functions, VPC, Cloud IAM, BigQuery, etc.
  • Expertise in Linux system administration and performance tuning.
  • Strong programming skills in Python, Bash, and NodeJS.
  • In-depth knowledge of container technologies like Docker and Kubernetes.
  • Experience with real-time, big data platforms including architectures like HDFS/Hbase, Zookeeper, and Kafka.
  • Familiarity with central logging systems such as ELK (Elasticsearch, LogStash, Kibana).
  • Competence in implementing monitoring solutions using tools like Grafana, Telegraf, and Influx.

Benefits

  • Comparable salary package and stock options
  • Opportunity for continuous learning
  • Fully sponsored EAP services
  • Excellent work culture
  • Opportunity to be an integral part of our growth story and grow with our company
  • Health insurance for employees and dependents
  • Flexible work hours
  • Remote-friendly company


  • Mumbai, Maharashtra, India M&G Full time

    About the RoleWe are seeking a highly skilled Cloud Reliability Engineer to join our team at M&G Global Services. As a Cloud Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud infrastructure.Key ResponsibilitiesDesign and implement cloud infrastructure solutions using Azure.Develop and maintain scripts to...


  • Navi Mumbai, Maharashtra, India Blazeclan Technologies Full time

    Job Title: AWS Cloud Reliability EngineerBlazeclan Technologies is seeking a highly skilled AWS Cloud Reliability Engineer to join our team. As a Cloud Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud infrastructure.Key Responsibilities:Design and implement monitoring and logging solutions...


  • Mumbai, Maharashtra, India M&G Full time

    About the RoleWe are seeking a highly skilled Cloud Site Reliability Engineer to join our team at M&G Global Services. As a Cloud Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems.Key ResponsibilitiesDesign, implement, and maintain cloud-based systems and infrastructure to...


  • Navi Mumbai, Maharashtra, India Blazeclan Technologies Full time

    Job Title: Junior AWS SREJob Summary: We are seeking a highly skilled Junior AWS SRE to join our team at Blazeclan Technologies. As a Junior AWS SRE, you will be responsible for ensuring the reliability, scalability, and performance of our cloud infrastructure.Key Responsibilities: Design and implement monitoring and logging solutions to ensure real-time...


  • Mumbai, Maharashtra, India Antal Full time

    Job Title: Site Reliability EngineerAbout the Role:We are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure. You will work closely with our engineering teams to design, implement, and operate...

  • Cloud Architect

    4 days ago


    Mumbai, Maharashtra, India Cloud 9 Infosystems Full time

    Cloud Architect Job DescriptionCloud 9 Infosystems, Inc. is a full-service consultancy based in Mumbai, specializing in end-to-end solutions for organizations' digital cloud strategy.We are seeking a highly skilled Cloud Architect to join our team. The successful candidate will be responsible for designing and implementing robust infrastructure components,...

  • Cloud Architect

    6 days ago


    Mumbai, Maharashtra, India Cloud 9 Infosystems Full time

    Cloud Architect Job DescriptionWe are seeking a highly skilled Cloud Architect to join our team at Cloud 9 Infosystems, Inc. The ideal candidate will have a strong background in cloud-based architecture and deployment, with expertise in solution architecture, infrastructure design, software development, and integration.Key Responsibilities:Design and...


  • Mumbai, Maharashtra, India Antal Full time

    Job Description :A major player in the tech industry, which specializes in retail technology, AI, ML, and big data, is seeking new talent. Established by alumni from a top engineering institute, this organization manages a vast network of brands and stores. Headquartered in Mumbai, it is recognized for its innovation and expertise across multiple tech...


  • Mumbai, Maharashtra, India antal international network Full time

    Job Title: Site Reliability EngineerAbout the Role:We are seeking a highly skilled Site Reliability Engineer to join our team at Antal International Network. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and efficiency of our software solutions.Key Responsibilities:Monitor production environment...


  • Mumbai, Maharashtra, India Fynd (Shopsense Retail Technologies Ltd.) Full time

    About FyndFynd is India's largest omnichannel platform and multi-platform tech company with expertise in retail tech and products in AI, ML, big data ops, gaming + crypto, image editing, and the learning space. Founded in 2012 by 3 IIT Bombay alumni, Fynd is headquartered in Mumbai and has 1000+ brands under management, more than 10k stores, and servicing...


  • Mumbai, Maharashtra, India Fynd (Shopsense Retail Technologies Ltd.) Full time

    About FyndFynd is India's largest omnichannel platform and multi-platform tech company with expertise in retail tech and products in AI, ML, big data ops, gaming + crypto, image editing, and the learning space. Founded in 2012 by 3 IIT Bombay alumni: Farooq Adam, Harsh Shah, and Sreeraman MG. We are headquartered in Mumbai and have 1000+ brands under...


  • Mumbai, Maharashtra, India Fynd (Shopsense Retail Technologies Ltd.) Full time

    About FyndFynd is a leading omnichannel platform and tech company specializing in retail tech and innovative products in AI, ML, big data ops, gaming, crypto, image editing, and the learning space. Founded in 2012 by three IIT Bombay alumni, Fynd is headquartered in Mumbai and manages over 1000 brands, 10k stores, and 23k+ pin codes.Role OverviewAs a Site...


  • Navi Mumbai, Maharashtra, India Cyber Sphere LLC Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled and experienced Site Reliability Engineer to join our team at Cyber Sphere LLC.Job Summary:The successful candidate will play a crucial role in ensuring the reliability, scalability, and performance of our Azure AI Services platform.Key Responsibilities:Design, deploy, and maintain a highly...


  • Mumbai, Maharashtra, India antal international network Full time

    Title : Site Reliability EngineerMy client is India's largest omnichannel platform and multi-platform tech company with expertise in retail tech and products in AI, ML, big data ops, gaming crypto, image editing and learning space. Roles & Responsibility : What will you do?- Run the production environment by monitoring availability and taking a holistic...


  • Mumbai, Maharashtra, India Session AI Full time

    Job Title: Site Reliability Engineer IIWe are seeking a highly skilled Site Reliability Engineer II to join our team at Session AI. As a key member of our Site Reliability Engineering Group, you will play a vital role in ensuring the seamless operation of our Cloud platform.Key Responsibilities:Design and implement solutions to enhance the availability,...


  • Mumbai, Maharashtra, India Cloud 9 Infosystems Full time

    Cloud Architect Job DescriptionAt Cloud 9 Infosystems, we are seeking a highly skilled Cloud Architect to join our team. As a Cloud Architect, you will play a key role in designing and implementing cloud-based solutions for our clients.Key Responsibilities:Support the Business Development team in formulating a strategy to adopt Azure within focus industry...


  • Mumbai, Maharashtra, India Cloud 9 Infosystems Full time

    Senior Cloud ConsultantCloud 9 Infosystems is seeking a highly skilled Senior Cloud Consultant to join our team. As a Senior Cloud Consultant, you will be responsible for designing and implementing cloud-based solutions for our clients.Key Responsibilities:Design and build cloud infrastructure (IaaS) and platform as a service (PaaS) offeringsImplement and...

  • Cloud Engineer

    6 days ago


    Mumbai, Maharashtra, India M&G Full time

    Job Title: Lead Cloud Site Reliability EngineerAbout the Role:We are seeking a highly skilled and experienced Lead Cloud Site Reliability Engineer to join our team at M&G Global Services. As a key member of our engineering team, you will be responsible for designing, implementing, and maintaining our cloud infrastructure to ensure high scalability,...


  • Mumbai, Maharashtra, India Neemtree Full time

    Job Title: Cloud Infrastructure EngineerJob Summary:Neemtree is seeking a highly skilled Cloud Infrastructure Engineer to join our team. As a Cloud Infrastructure Engineer, you will be responsible for designing, implementing, and managing cutting-edge cloud resources. You will work closely with our team to ensure the reliability and availability of our...

  • AWS Cloud Engineer

    6 days ago


    Navi Mumbai, Maharashtra, India Blazeclan Technologies Full time

    Junior AWS SRE Job DescriptionAt Blazeclan Technologies, we are seeking a highly skilled Junior AWS SRE to join our team.Key Responsibilities:Design and implement world-class observability platforms for multi-cloud infrastructure services.Analyze, troubleshoot, and design vital services, platforms, and infrastructure with a focus on reliability, scalability,...