Site Reliability Engineer L1

1 month ago


India Sigmaways Inc Full time

Excellent Verbal & Non-verbal communication

Experience / knowledge on Infrastructure Monitoring & support (observability)

Experience / knowledge on Application Support

  1. Must have worked with large enterprise customers/applications.
  2. Understand the support process.

Experience working with large enterprise applications and aware of the L1 support process for

the enterprise and large applications

Experience with Kubernetes administration.

  1. This is from the support perspective. Candidate should be able to perform basic.
  2. operations on k8s while supporting the application.
  3. Candidates must be aware of the k8s concepts and hands on to manage basic k8s.


Experience working with with event driven applications.

  1. Understanding on Kafla
  2. Understanding on Redis
  3. Understanding of MongoDB
  4. Should understand from the perspective of supporting the application.
  5. Should be aware of how queues works and know about basic building blocks of the
  6. event driven application


Exposure to one of the cloud technologies (Amazon/Google Cloud/Azure)

  1. GCP required others good to have


Excellent and MUST have good troubleshooting & Problem-solving skills.

Experience in Linux System Administration

Experience in Bash / Python Scripting

  1. Should be able to run and do the updates to existing automations


Experience with DevOps tools (Jenkins, SumoLogic, Github, Opsgenie, Box, DropBox, Cisco Spark,Rancher)

Experience with analyze and visualize tools (Grafana/Prometheus/ELK), must be aware of

observability concepts and should have practiced

Experience with creating the dashboards and alerts using above observability tools


MUST available for regular weekly support (24*7 environment)

o This is L1 only position and engineer will work in shifts to support the application


Good Understanding of Networking concepts & N/W commands

  1. This is from L1 perspective to troubleshoot the issues using existing observability and the

run-books


Roles & Responsibilities:

  • Monitoring Critical & Non-Critical applications
  • Acknowledge, Triage & troubleshoot alerts within scope adhering to the set SLA
  • Providing On-call Support for mission critical issues, investigate, troubleshoot & drive towards
  • resolution
  • Follow escalation procedures as per RB and escalate alerts
  • Ensuring web-scale systems are highly available & fault-tolerant
  • Improve the performance of micro-services and solve scaling/performance issues
  • Capacity management and planning
  • Strong interpersonal communication skills (including listening, speaking, and writing)
  • Ability to work well in a diverse, team-focused environment with other SREs & developers
  • Knowledgebase engineering – developing / updating Runbooks Preferred
  • Patch deployment as per schedule
  • Backup/Clean file storage activities on servers on a schedule basis
  • Schedule Job’s Manual/Automation
  • Jenkins automation and maintenance on a schedule basis
  • Investigate monitoring alerts/Logs/Grafana Patterns take a proactive approach to address false
  • positives, forecast potential threats via data analyzing, Submit Bug/Enhancement to
  • Development team on demand
  • Various Reports/Automation Scripts creation as per request from various teams , Maintenance &update of existing reports & Automation Scripts
  • Communicate ith various other departments on day-to-day operations and needs using internaltools and emails
  • Need to very creative and propose solution on gap findings, Responsible for capacity planning, shift management and people management.


  • india Sigmaways Inc Full time

    Excellent Verbal & Non-verbal communication Experience / knowledge on Infrastructure Monitoring & support (observability) Experience / knowledge on Application Support Must have worked with large enterprise customers/applications. Understand the support process. Experience working with large enterprise applications and aware of the L1 support process for...


  • India Sigmaways Inc Full time

    Required Skill Sets: Excellent Verbal & Non-verbal communication Experience / knowledge on Infrastructure Monitoring & support (observability) Experience / knowledge on Application Support , Must have worked with large enterprise customers/applications Understand the support process. Experience working with large enterprise applications and aware of the...


  • India Sigmaways Inc Full time

    Required Skill Sets:Excellent Verbal & Non-verbal communication Experience / knowledge on Infrastructure Monitoring & support (observability)Experience / knowledge on Application Support , Must have worked with large enterprise customers/applications Understand the support process.Experience working with large enterprise applications and aware of the L1...


  • india Mirketa Software Pvt. Ltd. Full time

    Job Description : Company Name : Mirketa Software Job Title : Senior Site Reliability Engineer L1 Location : Remote (PAN India) Full time Roles and Responsibilities : - Monitoring Critical & Non-Critical applications- Acknowledge, Triage & troubleshoot alerts within scope adhering to the set SLA- Providing On-call Support for mission critical issues ,...


  • india Cricbuzz.com Full time

    Site Reliability Engineer We are looking for a highly skilled and motivated Web Server Site Reliability Engineer to join our team. As a Web Server Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our web server infrastructure and CDN services. Experience - 3 - 5 years Responsibilities: ●...


  • Anywhere in India/Multiple Locations Mirketa Software Pvt. Ltd. Full time

    Job Description : Company Name : Mirketa Software Job Title : Senior Site Reliability Engineer L1 Location : Remote (PAN India) Full time Roles and Responsibilities : - Monitoring Critical & Non-Critical applications- Acknowledge, Triage & troubleshoot alerts within scope adhering to the set SLA- Providing On-call Support for mission critical issues ,...


  • india Korn Ferry Full time

    Role - Site Reliability Engineer Exp - 5+ years Required Location - Hyderabad ( Work from Office-Hybrid) Shift Timings - 5AM -1 PM IST We are looking for a Site Reliability Engineer with strong development background to join our team. In this role, you will be responsible for ensuring the reliability and performance of our systems. You will work closely...


  • india ViewSonic Full time

    Job Requirements: Bachelor’s degree in computer science, Engineering, or a related field. 3+ years of experience as a Site Reliability Engineer, DevOps Engineer, or similar role. Proficient in AWS solutions including but not limited to EC2, S3, CloudWatch, Lambda, and RDS. Strong understanding of Platform Engineering concepts and principles. Experience...


  • india SID Global Solutions Full time

    Dear Candidates, We are looking for immediate joiners 8 to 9 years for Hyderabad Location for a talented Site Reliability Engineer-Manager to join our dynamic team and contribute to the development of our cutting-edge web applications. If you're passionate about the role and have experience in SRE, GCP and Kubernetes , send me your updated cv : Please...


  • india Quiktrak, LLC Full time

    Job Title: Azure Site Reliability Engineer (SRE) / DevOps Engineer Job Description: Summary: As an Azure Site Reliability Engineer (SRE) / DevOps Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud infrastructure on the Azure platform. This role involves managing deployments, implementing continuous...


  • india First American (India) Full time

    The Role: A SRE Manager is ultimately responsible for system reliability, developer productivity and reducing time to market by striving to reduce technical debt of the services your SRE team supports. We seek managers who are passionate about site reliability to influence and drive the strategic SRE mission. As a Site Reliability Engineering Manager...


  • India System Soft Technologies Full time

    Title: Site Reliability Engineer100% REMOTEThe Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and...


  • india System Soft Technologies Full time

    Title: Site Reliability Engineer 100% REMOTE The Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and...


  • india Thoucentric Full time

    Job Description Job Description:We are seeking a skilled and dedicated Site Reliability Engineer (SRE) to join our team. The SRE will be responsible for ensuring the reliability, performance, and scalability of our systems and applications. This role combines software development and systems engineering to build and run large-scale, distributed,...


  • india Encora Inc. Full time

    Description Sr. Software Engineer (Site Reliability Engineer) Important Information Location: Ahmedabad Experience: 5+ years Job Mode: Full-time Work Mode: Remote Job Summary Working with DevOps SRE with good experience in Site Reliability Engineer. Responsibilities and Duties Design, implement, and maintain highly...


  • india WaferWire Cloud Technologies Full time

    Role: SRE (Site Reliability Engineer) Experience: 4+ Years About WaferWire Cloud Technologies: WaferWire Cloud Technologies is a leading provider of innovative cloud solutions aimed at transforming businesses and driving digital growth. With a focus on cutting-edge technology and customer-centric approaches, we empower organizations to thrive in the...


  • India System Soft Technologies Full time

    Job Summary The Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and engaging with infrastructure teams....


  • India System Soft Technologies Full time

    Job SummaryThe Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and engaging with infrastructure teams....


  • india Greenway Health Full time

    Job Description Job Summary The Manager is responsible for implementing the development process and site reliability engineering practices to resolve issues and identify opportunity areas. This role will lead development and site reliability engineering teams and establish and implement best practices and standards related to engineering...


  • india Next-Link Full time

    Job Description Senior Site Reliability Engineer Desirable Skills:Experience with additional programming languages and technologies beyond Python and Ruby.Familiarity with cloud platforms such as AWS, Azure, or GCP.Proficiency in additional logging and monitoring tools.Experience with other Infrastructure as Code (IaC) tools and practices.Knowledge of...