Staff Engineer SRE

2 days ago


Remote, India Domnic Lewis Full time ₹ 15,00,000 - ₹ 20,00,000 per year
Responsibilities:

The Site Reliability Engineering (SRE) team is responsible for the reliability, scalability, stability and performance of systems and services. They work with cross-functional teams to design, build and maintain systems, and they troubleshoot issues when they arise. They bridge the gap between development and operations teams. They work closely with business teams to define Service Level Objectives (SLOs) and Service Level Agreements (SLAs) for critical systems. They also monitor and maintain the uptime of these systems in line with the defined SLOs and SLAs. They deploy and manage monitoring tools to gain insight into system health and performance. They analyze performance, identify bottlenecks and implement solutions to improve a system's scalability and latency. They develop scripts and implement tools and automation frameworks to reduce the manual effort involved in deployment, monitoring and scaling. They work with development teams to design and develop observability practices such as logging, metrics and tracing. They aim to diagnose and troubleshoot issues proactively.

They create actionable alerts on monitoring systems to ensure a rapid response to potential production incidents. They forecast resource needs and provision adequately for current and future demand. They design and execute "chaos experiments" to test a system's resilience to failure. They own, define and implement the Disaster Recovery (DR) processes for systems. They also conduct planned and unplanned mock DR drills to test response preparedness for production incidents. They ensure that security best practices are followed and implemented during the design and operation of systems. They also own and maintain documentation of processes, playbooks, and systems. They publish KPI reports and other system health updates to the business on a regular basis.

Requirements:

  • Must-have - Bachelor's degree, preferably in CS or a related field, or equivalent experience
  • Must-have - 10+ years of overall IT experience
  • Must-have - 7+ years of proven work experience as a Senior Site Reliability Engineer or in a similar position
  • Must-have - 4+ years of AWS Cloud experience, with an AWS certification such as AWS Certified DevOps Engineer, SysOps or Security
  • Must-have - AWS experience - 3+ years' experience using a broad range of AWS technologies (e.g. EC2, RDS, ELB, S3, VPC, CloudWatch & Monitoring Tools) to develop and maintain an AWS-based cloud solution, with an emphasis on best-practice cloud security
  • Must-have - 2+ years of experience with CDN and/or cache systems like Fastly, Akamai, CloudFront, etc.
  • Proven understanding of and strong experience with cloud deployments (AWS/Docker/Kubernetes)
  • Knowledge of provisioning with IaC tools like Terraform, Chef, Ansible, Shell, Groovy, Python, etc.
  • Experience with monitoring systems such as CloudWatch, New Relic, Datadog/Splunk, ELK stack
  • Experience managing cloud network resources (AWS preferred) such as CloudWatch, VPC, URL proxies, private link, DNS, ACLs, firewalls, and C2S access points
  • Platform or application engineering and operational knowledge of CI/CD tooling like GitHub Actions, Jenkins, etc.
  • Experience with other tooling technologies like JIRA, Bitbucket, Jenkins, Fortify, SonarQube, Nexus, Nexus IQ
  • Experience with configuration automation tools like Puppet/Ansible/Chef/Salt
  • Scripting Skills: Strong scripting (e.g. Bash & Python) and automation skills
  • Operating Systems: Windows and Linux system administration
  • Problem Solving: Ability to analyze and resolve complex infrastructure resource and application deployment issues
  • Strong attention to detail
  • Excellent verbal and written communication skills
  • Strong documentation skills

Good To Have:

  • Experience with Terraform/Ansible/Chef/Puppet
  • Experience with GitHub Actions
  • Experience with CloudFront, Fastly
  • Oversees team members performing these functions
  • Anticipates problems and future technical needs and takes the necessary steps to address issues
  • Works primarily in server-side technologies and is comfortable with client-side work whenever required
  • Enthusiastically follows technology trends, software engineering best practices and technologies



  • Remote, India TriDevSofts Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    Senior Staff Engineer – Risk Platform. Job Type: General Recruiting (FTE) Locations: Remote Time: 9:00PM or 10:00PM to 5:00AM or 6:00AM (PST time) Role Overview: Lead the design, development, and execution of WEX's Risk Platform—ensuring it's scalable, secure, and aligned with business needs. Collaborate across engineering, product, and compliance teams to...

  • Cloud SRE

    2 days ago


    Remote, India ixceed solutions Full time

    **Role: Cloud SRE (Site Reliability Engineer)** **Location: Remote, India** **Type: Permanent** **Qualifications**: - Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field. - Minimum of 6-8 years of IT experience for an engineer-level position. **Basic Skills**: - Cloud Build - Cloud Functions - GKE (Google...

  • SRE Developer

    1 week ago


    Remote, India provaantech Full time

    Position - SRE Developer Exp - 10+ yrs Location - Remote Duration - 6 months (C2C) JD: Technical Skills: Programming: Proficiency in languages like Python, Bash, or Java is essential. Operating Systems: Deep understanding of Linux/Windows operating systems and networking concepts. Cloud Technologies: Experience with AWS & Azure including services,...

  • Infrastructure SRE

    5 days ago


    Remote, India Forcepoint Full time

    Who is Forcepoint? Forcepoint simplifies security for global businesses and governments. Forcepoint’s all-in-one, truly cloud-native platform makes it easy to adopt Zero Trust and prevent the theft or loss of sensitive data and intellectual property no matter where people are working. 20+ years in business. 2.7k employees. 150 countries. 11k+ customers....


  • Remote - India Oportun Full time US$ 1,50,000 - US$ 2,00,000 per year

    POSITION OVERVIEW The mission for the Engineering Ecosystem Org at Oportun is to be the force-multiplicative Org that empowers engineers to deliver member value with high-speed and high-quality. Teams in this Org play a vital role in designing, developing, and maintaining cutting-edge software solutions that power our mission and advance our business....


  • Remote, India beBeeSoftwareEngineer Full time ₹ 1,04,000 - ₹ 1,30,878

    Role Overview: We are seeking an experienced professional to fill a key position in our team. The Associate Staff Engineer will play a crucial role in the design, development, and implementation of application systems. Develop and maintain high-quality software solutions using Microsoft Dynamics CRM and Power platform. Design and implement custom user...


  • Remote, India vlitechnologies Full time ₹ 50,000 - ₹ 1,00,000 per year

    Locations: Remote Time: 9:00PM or 10:00PM to 5:00AM or 6:00AM (PST time) Interview rounds: 5 Budget: 1LPM (No GST) EXP: 10+ years Note: Don't look for a candidate based on specific requirements. It is better to look for a strong Python backend engineer who has a good understanding of databases, Docker, K8s, and experience with messaging queues, Elastic, helm...


  • Remote, India Boost-IT Full time

    Boost IT is a Portuguese technology consultancy company. We are integrated into one of the most entrepreneurial groups in Portugal, with investment in more than 30 companies. We want to be known for being the most dynamic, energetic and reliable company to operate in the market and, for that, we want to count on you. If you're passionate about technology and...


  • Remote, India Ipeople Infosysteams LLC Full time ₹ 25,00,000 - ₹ 60,00,000 per year

    Hi, Hope you are doing well. I am hiring for a position with client Intellify Solutions. Please go through the JD below and let me know your interest. Role: Staff Engineer (Order or Finance domain) Location: Remote Type: Full time Experience: 8+ years in relevant technologies Job Description: ● Architectural Mastery: Deep expertise in Microservices, Event-Driven...


  • Remote, India OutSystems Full time US$ 1,20,000 - US$ 2,00,000 per year

    There are NO limits to your career: come shape the future and be part of a truly unique global culture at OutSystems. About the role: Site Reliability Engineering (SRE) is a discipline that incorporates aspects of software engineering and applies them to infrastructure and operations problems. The main goals of SRE are to create scalable and highly reliable...