Site Reliability Engineer

3 weeks ago


India Sigmaways Inc Full time

Background


  • As a developer, you will work with a team of skilled Site Reliability Engineers and help them to improve the application reliability. You will play a critical role in working with the reliability of the massive scale application that processes billions of events every day. You will collaborate with multiple stakeholders and help the team write useful automation that will reduce the toil and make the support process efficient.
  • Relevant past hands-on experience for building the tools and automations working with enterprise level products / services are essential.
  • Engineers who are curious, quick learners and willing to learn new things and demonstrate their excellence are required.


Responsibilities


  • Collaborate with SRE and Dev team and understand the requirements.
  • Understand the application architecture and learn about the services.
  • Implement the in-house product development & the automations to improve the overall reliability efforts.
  • Propose solution architecture, technical specification for the solution.
  • Great at troubleshooting and finding-out the root cause of incidents and give the solutions.
  • Analyze the production metrics and trends, generate the reliability report and share with the stakeholders.
  • Perform peer reviews, peer programming, code reviews to understand the feature.
  • Test solution before releasing to the PROD, write the automated tests, ensure code security and best coding practices are followed, work on code scan reports and fix the issues.
  • Pair with the on-call support for critical incidents, performing troubleshooting and give the resolution, as needed.
  • Participate in high volume production environment to learn about the issues and propose solutions to address them.
  • Collaborate with development and operations teams to ensure seamless integration and deployment of tools and automation scripts.
  • Manage and maintain cloud infrastructure on platforms such as GCP.
  • Deploy application to production using CD pipelines, manage, and scale applications using Kubernetes, as needed.
  • Implement and maintain monitoring solutions using Grafana, including creating and managing dashboards and the alerts.
  • Have an ability to update configuration or recommended code changes to help scale the system.


Qualifications


  • Bachelor’s degree in computer science, Information Technology, or a related field, or equivalent practical experience.
  • Excellent verbal and written communication and collaboration skills.
  • Strong problem-solving skills and the ability to learn new technologies quickly.
  • Prior relevant experience working with enterprise applications and managing the automations.
  • Experience with supporting Kubernetes and container orchestration.
  • Knowledge of cloud, preferably GCP.
  • Hands on experience with coding/scripting and building automations.
  • Experience with monitoring and logging tools, with a strong emphasis on Grafana & Prometheus to create dashboards and alerts.
  • Motivated to work in a fast-paced, innovative environment.



  • india Trigent Software Private Limited Full time

    We are seeking an experienced Site Reliability Engineer (SRE) at the Senior Analyst level. The ideal candidate will have significant experience in maintaining and improving the reliability, scalability, and performance of complex systems.


  • India Cricbuzz.com Full time

    Site Reliability Engineer We are looking for a highly skilled and motivated Web Server Site Reliability Engineer to join our team. As a Web Server Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our web server infrastructure and CDN services. Experience - 4 - 5 years Responsibilities:...


  • India Stealth Startup Full time

    Location: India (Remote) Company Overview: We are an innovative tech company dedicated to advancing the future of digital infrastructure with our cutting-edge multi-cloud SaaS products. We are seeking a talented Site Reliability Engineer (SRE) to join our team and take ownership of building, tuning, and operating the infrastructure that powers our advanced...


  • India Stealth Startup Full time

    Location: India (Remote)Company Overview:We are an innovative tech company dedicated to advancing the future of digital infrastructure with our cutting-edge multi-cloud SaaS products. We are seeking a talented Site Reliability Engineer (SRE) to join our team and take ownership of building, tuning, and operating the infrastructure that powers our advanced...


  • India Persistent Systems Full time

    About Position: We are looking for Site Reliability Engineers who are proficient with monitoring tools, preferably New Relic. The person should have experience with Terraform, Docker, Kubernetes, and any cloud. Python coding experience is very much preferred. Role: Site Reliability Engineer Location: Hyderabad Experience: 8+ Yrs. Job Type:...


  • india UBS Full time

    Your role We're looking for a Site Reliability Engineer to:• work as a part of an agile pod (team)• determine the reliability of our digital products, technology services, and the infrastructure that underpins them• minimize the risk and impact of failures by engineering operational improvements, such as predictive monitoring, auto scaling or...


  • India Sigmaways Inc Full time

    BackgroundAs a developer, you will work with a team of skilled Site Reliability Engineers and help them to improve the application reliability. You will play a critical role in working with the reliability of the massive scale application that processes billions of events every day. You will collaborate with multiple stakeholders and help the team write...


  • india Hansen Technologies Full time

    About The Role If you are an experienced Site Reliability Engineer join our team in Pune location to become a driving force in ensuring the reliability, performance, and scalability of our systems. As an SRE, you'll be more than just a technical expert, you’ll be a creative problem solver with exceptional customer relationship skills. Your primary...


  • India BayOne Solutions Full time

    Position: Site Reliability Engineer Location: REMOTE RESPONSIBILITIES: 3+ years of experience working as a mobile SRE Experience on atleast one or more mobile development technologies Objective C, iOS, swift, Xcode, Google Android, Android studio Hands on Experience in developing/debugging mobile native apps 3+ years of experience working with either...


  • India BayOne Solutions Full time

    Position: Site Reliability EngineerLocation: REMOTE RESPONSIBILITIES:3+ years of experience working as a mobile SREExperience on atleast one or more mobile development technologies Objective C, iOS, swift, Xcode, Google Android, Android studioHands on Experience in developing/debugging mobile native apps3+ years of experience working with either mongo db or...


  • India noon Full time

    Job Description- Site Reliability Engineer About noon noon.com is a technology leader with a simple mission: to be the best place to buy and sell things. In doing this we hope to accelerate the digital economy of the Middle East, empowering regional talent and businesses to meet the full range of consumers' online needs. noon operates without...


  • India noon Full time

    Job Description- Site Reliability EngineerAbout noon noon.com is a technology leader with a simple mission: to be the best place to buy and sell things. In doing this we hope to accelerate the digital economy of the Middle East, empowering regional talent and businesses to meet the full range of consumers' online needs.noon operates without boundaries; we...


  • India noon Full time

    Job Description- Site Reliability Engineer About noon noon.com is a technology leader with a simple mission: to be the best place to buy and sell things. In doing this we hope to accelerate the digital economy of the Middle East, empowering regional talent and businesses to meet the full range of consumers' online needs. noon operates without boundaries;...


  • india Intel Full time

    Job Description Do you want to innovate an industry leading developer cloud? Join SATG as a Sr. Engineer, Site Reliability.The cloud development division within Software and Advanced Technology Group (SATG) is developing and shaping the way people think about computing by focusing on developers, ecosystem partners, academia etc. We are redefining the...


  • india IDFC FIRST Bank Full time

    Role/Job Title: Site Reliability Engineering Lead Function/Department: Information Technology Job Purpose: Site Reliability Engineering (SRE) department plays a pivotal role in providing seamless experience for our customers. With state-of-the-art technology and tools, we are transforming the overall application development and maintenance...


  • India Stealth Full time

    Company Overview: We are a leading technology company specializing in delivering robust and scalable solutions to our clients worldwide. We are committed to innovation, quality, and customer satisfaction. We are seeking a highly skilled Site Reliability Engineer (SRE) to join our team in India.Job Description:Key Responsibilities:Design, develop, and...


  • India Stealth Full time

    Company Overview: We are a leading technology company specializing in delivering robust and scalable solutions to our clients worldwide. We are committed to innovation, quality, and customer satisfaction. We are seeking a highly skilled Site Reliability Engineer (SRE) to join our team in India. Job Description: Key Responsibilities: Design, develop, and...


  • India Infogain Full time

    You can share your cv on Job Description :- 1. 6 to 12 years of experience in DevOps role with focus on GCP Cloud Infrastructure 2. Strong background of DevOps practices, Cloud Technologies in ensuring reliability and security of Cloud infrastructure 3. Strong proficiency in Infrastructure as Code (IaC) tools like Terraform and Code Configuration...


  • India Infogain Full time

    You can share your cv on sneha.chhabria@infogain.comJob Description :- 1. 6 to 12 years of experience in DevOps role with focus on GCP Cloud Infrastructure 2. Strong background of DevOps practices, Cloud Technologies in ensuring reliability and security of Cloud infrastructure3. Strong proficiency in Infrastructure as Code (IaC) tools like Terraform and Code...


  • india Cortex Consulting Pvt. Ltd. Full time

    We are seeking a highly motivated and results-oriented Site Reliability Engineer (SRE) to join our growing team. You will play a critical role in designing, building, and maintaining our scalable and reliable cloud infrastructure. With your expertise in automation and infrastructure management tools, you will be responsible for ensuring the high...