Site Reliability Engineer

3 weeks ago


India Thoucentric Full time
Job Description

We are seeking a skilled and dedicated Site Reliability Engineer (SRE) to join our team. The SRE will be responsible for ensuring the reliability, performance, and scalability of our systems and applications. This role combines software development and systems engineering to build and run large-scale, distributed, fault-tolerant systems.

RESPONSIBILITIES:

  • Infrastructure Management: Design, build, and maintain the infrastructure required to support a high-volume, high-availability environment.
  • Monitoring and Incident Response: Develop and implement monitoring strategies to detect and resolve system issues before they impact users. Participate in on-call rotation to manage and mitigate incidents.
  • Automation: Automate repetitive tasks to improve efficiency and reliability of the system. Implement CI/CD pipelines to ensure smooth deployments.
  • Performance Tuning: Analyze and optimize system performance, including troubleshooting latency issues and enhancing system throughput.
  • Capacity Planning: Forecast system capacity and plan for future scaling needs. Ensure systems are resilient enough to handle increased load.
  • Collaboration: Work closely with software engineers, QA, product managers, and other stakeholders to ensure the delivery of reliable and performant services.
  • Documentation: Create and maintain detailed documentation of system architecture, processes, and procedures.

QUALIFICATIONS:

  • Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent practical experience).
  • Minimum 3 years of experience in a Site Reliability Engineer, DevOps, or similar role.
  • Experience with cloud platforms (AWS, GCP, Azure), containers (Docker), and container orchestration (Kubernetes).
  • Proficient in scripting and automation using languages like Python, Bash, or Ruby.
  • Strong understanding of networking, security, and system administration.

Requirements

SKILLS:

  • Familiarity with configuration management tools (Ansible, Chef, Puppet).
  • Experience with monitoring tools (Prometheus, Grafana, Nagios).
  • Strong analytical and problem-solving skills.
  • Excellent communication and collaboration skills.
  • Experience with database management (SQL, NoSQL).
  • Knowledge of Infrastructure as Code (IaC) using tools like Terraform or Pulumi.
  • Familiarity with Agile/Scrum methodologies.
  • Certification in relevant technologies (e.g., AWS Certified DevOps Engineer) is a plus.

Benefits

  • Be part of Thoucentric's exciting growth story.
  • Work on projects that help you stay ahead of the curve. Beyond exciting projects, self-starters also get multiple opportunities to design, drive, and contribute to organizational and practice initiatives.
  • A constant learning curve with a very approachable and intellectual group of consultants.
  • Be part of One Extended Family. We bond beyond work through sports, get-togethers, common interests, and more.
  • Work in an enriching environment with an Open Culture, a Flat Organization, and an Excellent Peer Group.
