Site Reliability Engineer – 2

4 weeks ago


bangalore, India [24]7.ai Full time
Job Role: Site Reliability Engineer – 2
Location: Bangalore
Working Hours : Permanent Night Shifts ( PST Working Hours )
Job Description
At (247).ai, we’re passionate about building software that solves problems. We count on our site reliability engineers (SREs) to empower our users with a rich feature set, high availability, and stellar performance level to pursue their missions. As we expand our customer deployments, we are currently seeking an experienced SRE to deliver insights from massive scale data in real time. Specifically, we are searching for someone who brings fresh ideas, demonstrates a unique and informed viewpoint, and enjoys collaborating with a cross-functional team to develop real-world solutions and positive user experiences at every interaction.
Objectives of this Role: -
Run the production environment by monitoring availability and taking a holistic view of system health.
Build software and systems to manage platform infrastructure and applications.
Improve reliability, quality, and time-to-market of our suite of software solutions.
Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve.
Provide primary operational support and engineering for multiple large, distributed software applications.
Required Skills: -
Strong working knowledge of Red Hat Linux environments.
Good knowledge of Python, Bash shell script development.
Ability to program with one or more high level languages, such as Python, Perl, etc...
Experience with logging, monitoring, alerting and CICD & Big data platform tools.
Strong communication and analytical/problem-solving skills.
Good understanding of Cloud Technologies like GCP, Azure.
Preferred Qualifications
Bachelor’s degree in computer science or other highly technical, scientific discipline.
Previous success in technical engineering.
Coding experience beyond simple scripts.
Good understanding of networking concepts (load balancers, TCP/IP, Firewalls).
Logging and Monitoring tools like Logstash, Kibana, Grafana, etc...
Strong debugging skills.
Responsibilities:
Should flexible to work in PST working hours
Perform Incident Management and Change Management to maintain the continuous availability of all Cloud Infrastructure services.
Ensure all SRE and operating procedures are maintained and executed.
Work in partnership with stakeholders to design, implement, manage, and support a highly available and secure infrastructure.
Maintain 24x7 production environment with a high level of service availability and Perform quality reviews, manage operational issues.
Partner with development teams in defining and implementing improvements in service architecture.
Interface with Dev, QA, OPS teams to identify root cause analysis and re-instrument triggers to prevent future network degradation and outages.
Explore and innovate new cloud technologies, features, and tools to improve the platform and automate using Bash, Python or Perl, etc...
Implement automation and orchestration for manual processes required to operate and deploy cloud services, be at the heart of developing new ideas into internal tools by working closely with teams.
Partner with development teams to improve services through rigorous testing and release procedures.
Analyze alarms and dashboards to identify problem areas, report incidents, troubleshoot, and escalate as required.
Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding.
Perform ticket review and updates through JIRA ticketing tool.
Manage, coordinate, and document all type maintenances / events.
Must take initiative and be proactive.
Must take on the responsibility to learn new products and procedures.
Participate in system design consulting, platform management, and capacity planning.
Create sustainable systems and services through automation and uplifts.
Conducting post-incident reviews and creating actional reports and coming up with the application optimization recommendations for engineering teams.
Implementation of proactive monitoring, alerting, trend analysis and self-healing systems.
Understand the existing architecture and work with various Engineering teams to develop and execute strategies to provide a high-quality Global production service.
About (24)7.ai
(24)7.ai is a leader in the Conversational AI market, with over 250+ Fortune 500/1000 customers. We continue to transform our business to drive greater value to our team, shareholders, customers through new product development and market growth.
(24)7.ai is redefining the way companies interact with consumers. Using Artificial Intelligence and Machine Learning to understand consumer intent, (24)7.ai’s technology helps companies create a personalized, predictive and effortless customer experience across all channels. The world’s largest and most recognizable brands are using intent-driven engagement from (24)7.ai to assist several hundred million visitors annually, through more than 1.5 billion conversations, most of which are automated and learn from each consumer experience.
For more information, visit:

  • bangalore, India Flipkart Full time

    About Us Flipkart is the leading ecommerce company of India, committed to transforming the future through our innovative solutions. FCC is a startup within Flipkart, which focuses on externalizing our tech stack globally in the form of B2B SaaS offerings. We are a fast growing team with breadth spanning across the entire eCommerce stack and larger plays like...


  • bangalore, India Cyitechsearch Full time

    We are hiring for Site Reliability Engineer Skills : - Develop and provide operational support for full-stack software applications.- Relevant industry certifications, such as through the Site Reliability Engineering (SRE) Foundation.- Five years' experience as a site reliability engineer or similar role.- Collaborate with development operations staff to...


  • Bangalore, Karnataka, India Cyitechsearch Full time

    We are hiring for Site Reliability Engineer Skills : - Develop and provide operational support for full-stack software applications.- Relevant industry certifications, such as through the Site Reliability Engineering (SRE) Foundation.- Five years' experience as a site reliability engineer or similar role.- Collaborate with development operations staff to...


  • Bangalore, Karnataka, India Cyitechsearch Full time

    We are hiring for Site Reliability Engineer Skills : - Develop and provide operational support for full-stack software applications.- Relevant industry certifications, such as through the Site Reliability Engineering (SRE) Foundation.- Five years' experience as a site reliability engineer or similar role.- Collaborate with development operations staff to...


  • Bangalore, India Cyitechsearch Full time

    We are hiring for Site Reliability Engineer Skills : - Develop and provide operational support for full-stack software applications.- Relevant industry certifications, such as through the Site Reliability Engineering (SRE) Foundation.- Five years' experience as a site reliability engineer or similar role.- Collaborate with development operations staff...


  • bangalore, India Cricbuzz.com Full time

    Site Reliability EngineerWe are looking for a highly skilled and motivated Web Server Site Reliability Engineer to join our team. As a Web Server Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our web server infrastructure and CDN services.Experience - 3 - 5 yearsResponsibilities:● Design,...


  • bangalore, India NetApp Full time

    Title: Site Reliability Engineer Location: Bangalore, Karnataka, IN, 560071 Requisition ID: 126661 Job Summary As a Keystone Site Reliability Engineer, you will be responsible for managing the various and monitor environments for Keystone. Your role will involve engaging various aspects in the lifecycle of Keystone services - from working on...


  • Bangalore, India Cyitechsearch Full time

    About the job :We are hiring for Site Reliability EngineerExperience : 5+ Years Work Model : Remote / Contract 3 years Skills : Develop and provide operational support for fullstack software applications. Relevant industry certifications, such as through the Site Reliability Engineering (SRE) Foundation. Five years' experience as a site reliability engineer...


  • bangalore, India Tyson Foods India Full time

    Job Description – Site Reliability Engineer (Cloud Engineering) The role as Site Reliability Engineer in the Data & Analytics organization, is to ensure the reliability, scalability, and performance of our cloud-based systems in like GCP/AWS. The role will play a crucial part in designing and implementing robust, scalable solutions while working with the...


  • bangalore, India Kunato Full time

    Site Reliability Engineer (SRE) - Python/GolangJob Description:We are seeking a highly skilled and passionate Site Reliability Engineer (SRE) to join our technology team. The ideal candidate will possess strong programming skills with expertise in Python, Golang, or both. This role is pivotal in ensuring the high availability, performance, and security of...


  • bangalore, India Kunato Full time

    Site Reliability Engineer (SRE) - Python/GolangJob Description:We are seeking a highly skilled and passionate Site Reliability Engineer (SRE) to join our technology team. The ideal candidate will possess strong programming skills with expertise in Python, Golang, or both. This role is pivotal in ensuring the high availability, performance, and security of...


  • bangalore, India Kunato Full time

    Site Reliability Engineer (SRE) - Python/Golang Job Description: We are seeking a highly skilled and passionate Site Reliability Engineer (SRE) to join our technology team. The ideal candidate will possess strong programming skills with expertise in Python, Golang, or both. This role is pivotal in ensuring the high availability, performance, and security...


  • bangalore, India Indusface Full time

    Careers » Current Openings » Site Reliability Engineer Role: Indusface is hiring for a talented, enthusiastic individual passionate about all aspects of IT infrastructure operations to join us as a Site Reliability Engineer (SRE) Job Description: Monitor and maintain availability of cloud infrastructure, troubleshoot, identify, and...


  • bangalore, India Indusface Full time

    Careers » Current Openings » Site Reliability Engineer Role: Indusface is hiring for a talented, enthusiastic individual passionate about all aspects of IT infrastructure operations to join us as a Site Reliability Engineer (SRE) Job Description: Monitor and maintain availability of cloud infrastructure, troubleshoot, identify, and...


  • bangalore, India Microsoft Full time

    Overview Microsoft is a company where passionate innovators come to collaborate, envision what can be and take their careers further. This is a world of more possibilities, more innovation, more openness, and the sky is the limit thinking in a cloud-enabled world.Microsoft’s Azure Data engineering team is leading the transformation of analytics...


  • bangalore, India Microsoft Full time

    Overview Microsoft is a company where passionate innovators come to collaborate, envision what can be and take their careers further. This is a world of more possibilities, more innovation, more openness, and the sky is the limit thinking in a cloud-enabled world.Microsoft’s Azure Data engineering team is leading the transformation of analytics...


  • bangalore, India Vistex Full time

    Vistex is currently hiring a Site Reliability Engineer. The Vistex Site Reliability Engineer will be primarily responsible for service availability, performance, monitoring, incident response, and capacity planning. This is a highly technical, hands-on role with a strong focus on automation, accurate monitoring, actionable alerting, resilient design,...


  • bangalore, India Vistex Full time

    Vistex is currently hiring a Site Reliability Engineer. The Vistex Site Reliability Engineer will be primarily responsible for service availability, performance, monitoring, incident response, and capacity planning. This is a highly technical, hands-on role with a strong focus on automation, accurate monitoring, actionable alerting, resilient design,...


  • bangalore, India slice Full time

    Job DescriptionAbout the team Everything that you see on the internet - developers made it. Even the page that you’ve opened right now and are reading this very line from - a developer. At slice, we’re trying to build a world class product and that takes some crazy, world class engineers. A team so supportive - even if you miss a ‘;’ in your code,...


  • bangalore, India Integra Connect Full time

    About IntegraConnectIntegra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud platform, the company’s core applications span population health including care...