Site Reliability Engineer

1 month ago


delhi, India SkySys Full time
Role: Site Reliability Engineer (SRE)
Position Type: Full-Time Contract (40hrs/week)
Contract Duration: Long Term
Work Time zone: IST
Work Schedule: 8 hours/day (Mon-Fri)
Location: 100% remote (candidate can work from anywhere in India)
Must haves: Monitoring and deploying .net applications Maintaining code, writing scripts Monitor application performance Skilled with monitoring tools such as Splunk Client has a 24x7 environment, someone coming from this would be ideal 1 weekend on call a month Technologies in client's environment: .Net, AWS Splunk Terraform, Ansible Kubernetes Job Summary:
The Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and engaging with infrastructure teams. The SRE designs and configures systems to monitor and alert on critical applications and automate issue resolution. This role includes a focus on providing solutions that are robust, scalable, and highly available. For internal processes and technologies, the SRE will build systems to streamline operations and reduce friction. The SRE will be part of an on-call rotation to troubleshoot production issues with the specific goal of building resilient mitigation processes
Essential Functions: Leverage monitoring tools and custom automation to build and continually refine systems that highlight critical application issues. Evaluate application architecture and reliability and design improvements. Respond to and resolve production incidents as part of an on-call rotation. Build automated solutions to alert, monitor, mitigate issues and system recovery. Communicate and collaborate with business leaders and stakeholders on incident resolution status and RCAs. Collaborate with DevOps Engineers, Performance Engineers and Developers to define and establish reliability practices. Build applications that streamline internal operational procedures Qualification: BA or BS degree in Computer Science or related field required. Master's Degree in Technology or related field desired. Certification(s) specific to Architecture discipline 5+ years of experience working with technical teams. Strong emphasis on SRE as an engineering discipline with a focus on automation. Experience supporting infrastructure and services in public cloud environments (AWS, GCP, etc.). Experience building and supporting containerized application technologies, including Docker, Kubernetes. Experience with public cloud cost management. Experience in performance engineering and capacity planning. Prior success in automating a real-world production environment. Knowledge of IP networking, VPN's, DNS, load balancing and firewall. Expertise in any monitoring tools like Splunk, AppDynamics, Nagios, New Relic. Experience with software development and testing process in an agile environment Excellent problem solving, analytical, and decision-making skills. Ability to work in a collaborative environment. Must be an excellent communicator (verbal and written) Experience with deployments and operations of 24x7 high volume, highly available systems. Cloud scaling and Ability to drive automation/modernization initiatives. Enjoy working with a large variety of services and new technologies. Demonstrate a solid understanding of development, debugging, administration, and automation frameworks: C#/.NET, PowerShell, Python, Ansible, etc. Experience with logging platforms and application performance metrics: DataDog, NewRelic, Splunk, ELK, Dyantrace, App Insights Analytics, etc. In addition to other duties/functions, this position requires full commitment and support for promoting ethical and compliant culture. More specifically, this position requires integrity, honesty, and respectful treatment of others, as well as a willingness to speak up when they see misconduct or have concerns. Decision Making Tactical Decisions focus on intermediate-term issues. The purpose of decisions made at this level are to help move clientcloser to reaching strategic goals. Outcomes are predictable. After a decision is made by Top Executive Leadership, the next phase is to take the needed steps to implement it. Examples are: The amount of money required to implement, which advertising agency to promote a new service or to provide an incentive plan to employees to encourage increased revenue. Operational Decisions focus on day-to-day activities within the company. Decisions made at this level help to ensure that daily activities proceed smoothly and therefore help to move the company toward reaching a strategic goal. They have short term consequences. Examples are: Handling employee conflicts, purchasing materials needed for operations.

  • delhi, India Cricbuzz.com Full time

    Site Reliability EngineerWe are looking for a highly skilled and motivated Web Server Site Reliability Engineer to join our team. As a Web Server Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our web server infrastructure and CDN services.Experience - 3 - 5 yearsResponsibilities:● Design,...


  • Delhi, India ViewSonic Full time

    Job Requirements:Bachelor’s degree in computer science, Engineering, or a related field.3+ years of experience as a Site Reliability Engineer, DevOps Engineer, or similar role.Proficient in AWS solutions including but not limited to EC2, S3, CloudWatch, Lambda, and RDS.Strong understanding of Platform Engineering concepts and principles.Experience with...


  • delhi, India Korn Ferry Full time

    Role - Site Reliability EngineerExp - 5+ years RequiredLocation - Hyderabad ( Work from Office-Hybrid)Shift Timings - 5AM -1 PM ISTWe are looking for a Site Reliability Engineer with strong development background to join our team. In this role, you will be responsible for ensuring the reliability and performance of our systems. You will work closely to our...


  • delhi, India SID Global Solutions Full time

    Dear Candidates,We are looking for immediate joiners 8 to 9 years for Hyderabad Location for a talented Site Reliability Engineer-Manager to join our dynamic team and contribute to the development of our cutting-edge web applications. If you're passionate about the role and have experience in SRE, GCP and Kubernetes , send me your updated cv : Please...


  • Delhi, India Daxko Full time

    Company DescriptionDaxko powers health & wellness throughout the world. Every day our team members focus their passion and expertise in helping health & wellness facilities operate efficiently and engage their members.Whether a neighborhood yoga studio, a national franchise with locations in every city, a YMCA or JCC--and every type of organization in...


  • Delhi, India SID Global Solutions Full time

    Dear Candidates,We are looking for immediate joiners8 to 9 years for Hyderabad Locationfor a talented Site Reliability Engineer-Manager to join our dynamic team and contribute to the development of our cutting-edge web applications. If you're passionate about the role and have experience inSRE, GCP and Kubernetes , send me your updated cv : find below the...


  • Delhi, India Quiktrak, LLC Full time

    Job Title: Azure Site Reliability Engineer (SRE) / DevOps EngineerJob Description:Summary:As an Azure Site Reliability Engineer (SRE) / DevOps Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud infrastructure on the Azure platform. This role involves managing deployments, implementing continuous...


  • Delhi, India System Soft Technologies Full time

    Title: Site Reliability Engineer 100% REMOTE The Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and...


  • delhi, India System Soft Technologies Full time

    Title: Site Reliability Engineer100% REMOTEThe Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and...


  • delhi, India WaferWire Cloud Technologies Full time

    Role: SRE (Site Reliability Engineer)Experience: 4+ YearsAbout WaferWire Cloud Technologies:WaferWire Cloud Technologies is a leading provider of innovative cloud solutions aimed at transforming businesses and driving digital growth. With a focus on cutting-edge technology and customer-centric approaches, we empower organizations to thrive in the digital...


  • new delhi, India dentsu Full time

    The purpose of this role is to ensure the availability and stability of production and test platforms. Job Title: Site Reliability Engineer Job Description: Key responsibilities:Troubleshoots and owns issues in our development, test and production environments. Including performance optimisation and continuous tuningWorks alongside the DevOps team in...


  • Delhi, India System Soft Technologies Full time

    Job SummaryThe Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and engaging with infrastructure teams....


  • Delhi, India Next-Link Full time

    Job DescriptionSenior Site Reliability EngineerDesirable Skills:Experience with additional programming languages and technologies beyond Python and Ruby.Familiarity with cloud platforms such as AWS, Azure, or GCP.Proficiency in additional logging and monitoring tools.Experience with other Infrastructure as Code (IaC) tools and practices.Knowledge of...


  • Delhi, India UBS Full time

    Your roleWe're looking for a Site Reliability Engineer to:• work as a part of an agile pod (team)• determine the reliability of our digital products, technology services, and the infrastructure that underpins them• minimize the risk and impact of failures by engineering operational improvements, such as predictive monitoring, auto scaling or...


  • new delhi, India System Soft Technologies Full time

    Job SummaryThe Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and engaging with infrastructure teams....


  • delhi, India SLK Full time

    **Immediate Joiners only **We are hiring an Senior Engineer with expertise in Observability and Site Reliability Engineer (SRE) , emphasizing strong Experience with Prometheus, Grafana, ELK Stack, Jaeger, and OpenTelemetry.. (3+ years of experience)If you meet these qualifications and are passionate about driving innovative solutions, we encourage you to...


  • Delhi, India SLK Full time

    **Immediate Joiners only **We are hiring an Senior Engineer with expertise in Observability andSite Reliability Engineer (SRE) ,emphasizing strong Experience with Prometheus, Grafana, ELK Stack, Jaeger, and OpenTelemetry.. (3+ years of experience)If you meet these qualifications and are passionate about driving innovative solutions, we encourage you to...


  • delhi, India CloudBees Full time

    J ob Title - Manager, Site Reliability EngineerLocation - Bangalore and ChennaiYear of Experience - 10+ YearsAbout CloudBeesCloudBees is the leading software delivery platform that enables enterprises to deliver scalable, compliant, and secure software, empowering developers to do their best work.Seamlessly integrating into any hybrid and heterogeneous...


  • delhi, India NorthStar HR Consultants Full time

    Job Title - Site Reliability EngineerJob Location - Pune, MaharashtraAbout Client -Our client is an independent technology company maximizing customer value by delivering digital advertising’s supply chain of the future. They sell-side platform empowers the world’s leading digital content creators across the open internet to control access to their...


  • Delhi, India Tech Mahindra Full time

    Site Reliability EngineerNature of Project24*7 support project involving production and non-production workRotational/Night shifts even on weekends would be applicableNo remote work/Hybrid mode, Work from customer office in BengaluruJD#Mandatory Skills:K8s ,K8s certification, Linux Admin, SREExp:5+y rsCloud:Ø Lead the design, build, and operational...