Site Reliability Engineer II

3 weeks ago


mumbai, India Session AI Full time

Are you ready to make your mark with a true industry disruptor? ZineOne, a subsidiary of Session AI, the pioneer of in-session marketing, is looking to add talented team members to help us grow into the premier revenue tool for e-commerce. We work with some of the leading brands nationwide and we innovate how brands connect with and convert customers.

Job Description

This position offers a hands-on, technical opportunity as a vital member of the Site Reliability Engineering Group. Our SRE team is dedicated to ensuring that our Cloud platform operates seamlessly, efficiently, and reliably at scale. The ideal candidate will bring over five years of experience managing cloud-based Big Data solutions, with a strong commitment to resolving operational challenges through automation and sophisticated software tools.

Candidates must uphold a high standard of excellence and possess robust communication skills, both written and verbal. A strong customer focus and deep technical expertise in areas such as Linux, automation, application performance, databases, load balancers, networks, and storage systems are essential.

Key Responsibilities:

As a Session AI SRE, you will:

  • Design and implement solutions that enhance the availability, performance, and stability of our systems, services, and products.
  • Develop, automate, and maintain infrastructure as code for provisioning environments in AWS, Azure, and GCP.
  • Deploy modern automated solutions that enable automatic scaling of the core platform and features in the cloud.
  • Apply cybersecurity best practices to safeguard our production infrastructure.
  • Collaborate on DevOps automation, continuous integration, test automation, and continuous delivery for the Session AI platform and its new features.
  • Manage data engineering tasks to ensure accurate and efficient data integration into our platform and outbound systems.
  • Utilize expertise in DevOps best practices, shell scripting, Python, Java, and other programming languages, while continually exploring new technologies for automation solutions.
  • Design and implement monitoring tools for service health, including fault detection, alerting, and recovery systems.
  • Oversee business continuity and disaster recovery operations.
  • Create and maintain operational documentation, focusing on reducing operational costs and enhancing procedures.
  • Demonstrate a continuous learning attitude with a commitment to exploring emerging technologies.
Preferred Skills:
  • Experience with cloud platforms like AWS, Azure, and GCP, including their management consoles and CLI.
  • Proficiency in building and maintaining infrastructure on:
    • AWS using services such as EC2, S3, ELB, VPC, CloudFront, Glue, Athena, etc.
    • Azure using services such as Azure VMs, Blob Storage, Azure Functions, Virtual Networks, Azure Active Directory, Azure SQL Database, etc.
    • GCP using services such as Compute Engine, Cloud Storage, Cloud Functions, VPC, Cloud IAM, BigQuery, etc.
  • Expertise in Linux system administration and performance tuning.
  • Strong programming skills in Python, Bash, and NodeJS.
  • In-depth knowledge of container technologies like Docker and Kubernetes.
  • Experience with real-time, big data platforms including architectures like HDFS/Hbase, Zookeeper, and Kafka.
  • Familiarity with central logging systems such as ELK (Elasticsearch, LogStash, Kibana).
  • Competence in implementing monitoring solutions using tools like Grafana, Telegraf, and Influx.

Benefits

  • Comparable salary package and stock options
  • Opportunity for continuous learning
  • Fully sponsored EAP services
  • Excellent work culture
  • Opportunity to be an integral part of our growth story and grow with our company
  • Health insurance for employees and dependents
  • Flexible work hours
  • Remote-friendly company


  • Mumbai, India Session AI Full time

    Are you ready to make your mark with a true industry disruptor? ZineOne, a subsidiary of Session AI, the pioneer of in-session marketing, is looking to add talented team members to help us grow into the premier revenue tool for e-commerce. We work with some of the leading brands nationwide and we innovate how brands connect with and convert customers.Job...


  • Mumbai, India SID Global Solutions Full time

    Dear Candidates,We are looking for immediate joiners 8 to 9 years for Hyderabad Location for a talented Site Reliability Engineer-Manager to join our dynamic team and contribute to the development of our cutting-edge web applications. If you're passionate about the role and have experience in SRE, GCP and Kubernetes, send me your updated cv :...


  • Mumbai, India SID Global Solutions Full time

    Dear Candidates,We are looking for immediate joiners 8 to 9 years for Hyderabad Location for a talented Site Reliability Engineer-Manager to join our dynamic team and contribute to the development of our cutting-edge web applications. If you're passionate about the role and have experience in SRE, GCP and Kubernetes, send me your updated cv :...


  • Mumbai, India EZINFORMATICS SOLUTIONS PVT LTD Full time

    Company DescriptionEZINFORMATICS SOLUTIONS PVT LTD is a team of designers, developers, authors, thinkers, and visionaries with vast industrial experience and illustrious accomplishments in various IT services. Our focus areas include cyber security, information technology, and consulting services. We strive to provide safe and secure solutions, unify...


  • mumbai, India EZINFORMATICS SOLUTIONS PVT LTD Full time

    Company Description EZINFORMATICS SOLUTIONS PVT LTD is a team of designers, developers, authors, thinkers, and visionaries with vast industrial experience and illustrious accomplishments in various IT services. Our focus areas include cyber security, information technology, and consulting services. We strive to provide safe and secure solutions, unify...


  • mumbai, India Wipro Full time

    Role Purpose Required Skills: � 5+Years of experience in system administration, application development, infrastructure development or related areas � 5+ years of experience with programming in languages like Javascript, Python, PHP, Go, Java or Ruby � 3+ years of in reading, understanding and writing code in the same � 3+years Mastery of...


  • Mumbai, India Wipro Full time

    Role Purpose Required Skills: � 5+Years of experience in system administration, application development, infrastructure development or related areas � 5+ years of experience with programming in languages like Javascript, Python, PHP, Go, Java or Ruby � 3+ years of in reading, understanding and writing code in the same � 3+years Mastery of...


  • Mumbai, India Forcepoint Full time

    Description : Monitor, measure and improve the reliability, availability and scalability of Forcepoint products and infrastructure Partner with Engineering to perform Operations Readiness of our products, ensuring that the products meet architecture & observability design requirements Lead the New Product Introduction Process (NPI) for SRE...


  • mumbai, India Forcepoint Full time

    Description : Monitor, measure and improve the reliability, availability and scalability of Forcepoint products and infrastructure Partner with Engineering to perform Operations Readiness of our products, ensuring that the products meet architecture & observability design requirements Lead the New Product Introduction Process (NPI) for SRE...


  • Mumbai, India SA Technologies Full time

    Description : Role:  Applicationupkeepforuptime&SLAadherence Undertake Applicationmaintenance tasks Troubleshoot deployment issues SkillsRequired: AmazonWebServices Docker,Kubernetesadministration(optionalbutgoodtohave)Python scripting, Shell scripting (optional but good to have) Jenkins - Groovyscripting GIT-Tag,branching,pullrequests,webhooks...


  • Mumbai, India Career Stone Consultant Full time

    PRINCIPAL ACCOUNTABILITIES: 1.AWS Infrastructure Design: o Lead the design and implementation of scalable, reliable, and secure AWS infrastructure. o Provide expertise in architecting solutions that maximize the benefits of AWS services. o Lead the upgrade of Apache web servers for improved performance and security. o Oversee the database (DB) upgrade...


  • Mumbai, India IMC Full time

      As a Site Reliability Engineer at IMC, you'll be an integral member of a highly experienced team, responsible for maintaining a robust, best in class, low latency trading environment. The skills necessary to excel could range from system administration, network troubleshooting, database optimization, software development, release management and...


  • mumbai, India SA Technologies Full time

    Job Description Join SA Technologies Inc. and Make Your Mark on the World SA Technologies Inc. is a global leader in IT Consulting, providing innovative solutions to clients around the world. We are looking for talented and passionate individuals to join our team and help us make a real difference in the world. We are currently hiring for a Sr.Visual...


  • mumbai, India SA Technologies Full time

    Join SA Technologies Inc. and Make Your Mark on the World SA Technologies Inc. is a global leader in IT Consulting, providing innovative solutions to clients around the world. We are looking for talented and passionate individuals to join our team and help us make a real difference in the world. We are currently hiring for a Sr.Visual Designer cum...


  • Mumbai, India Baker Hughes Full time

    Senior Site Reliability Engineer-AWS   Are you an Engineer looking for an interesting and inspiring opportunity?   Are you passionate about being part of a successful team?   Join the Team   Baker Hughes has a new opportunity for Senior Site Reliability Engineer to join the team in India   Partner with the best   As a Senior Site...


  • mumbai, India Baker Hughes Full time

    Senior Site Reliability Engineer-AWS   Are you an Engineer looking for an interesting and inspiring opportunity?   Are you passionate about being part of a successful team?   Join the Team   Baker Hughes has a new opportunity for Senior Site Reliability Engineer to join the team in India  Partner with the best   As a Senior Site...


  • Mumbai, India Career Stone Consultant Full time

    PRINCIPAL ACCOUNTABILITIES:1.AWS Infrastructure Design:o Lead the design and implementation of scalable, reliable, and secure AWS infrastructure.o Provide expertise in architecting solutions that maximize the benefits of AWS services.o Lead the upgrade of Apache web servers for improved performance and security.o Oversee the database (DB) upgrade process,...


  • Mumbai, India Career Stone Consultant Full time

    PRINCIPAL ACCOUNTABILITIES:1.AWS Infrastructure Design:o Lead the design and implementation of scalable, reliable, and secure AWS infrastructure.o Provide expertise in architecting solutions that maximize the benefits of AWS services.o Lead the upgrade of Apache web servers for improved performance and security.o Oversee the database (DB) upgrade process,...


  • Mumbai, India SA Technologies Full time

    Join SA Technologies Inc. and Make Your Mark on the World SA Technologies Inc. is a global leader in IT Consulting, providing innovative solutions to clients around the world. We are looking for talented and passionate individuals to join our team and help us make a real difference in the world. We are currently hiring for a Sr.Visual Designer cum...


  • Mumbai, India Morningstar Full time

    Title: Senior Site Reliability Engineer The Group: At Morningstar, we strive to Empower Investor Success. We tirelessly pursue new ways to combine our data and research with design and technology to help solve investors’ needs. Our solutions pave the way for investors to achieve their goals with confidence. The Morningstar Wealth Platform allows...