Site Reliability Engineer
1 month ago
Position Type: Full-Time Contract (40hrs/week)
Contract Duration: Long Term
Work Time zone: IST
Work Schedule: 8 hours/day (Mon-Fri)
Location: 100% remote (candidate can work from anywhere in India)
Must haves: Monitoring and deploying .net applications Maintaining code, writing scripts Monitor application performance Skilled with monitoring tools such as Splunk Client has a 24x7 environment, someone coming from this would be ideal 1 weekend on call a month Technologies in client's environment: .Net, AWS Splunk Terraform, Ansible Kubernetes Job Summary:
The Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and engaging with infrastructure teams. The SRE designs and configures systems to monitor and alert on critical applications and automate issue resolution. This role includes a focus on providing solutions that are robust, scalable, and highly available. For internal processes and technologies, the SRE will build systems to streamline operations and reduce friction. The SRE will be part of an on-call rotation to troubleshoot production issues with the specific goal of building resilient mitigation processes
Essential Functions: Leverage monitoring tools and custom automation to build and continually refine systems that highlight critical application issues. Evaluate application architecture and reliability and design improvements. Respond to and resolve production incidents as part of an on-call rotation. Build automated solutions to alert, monitor, mitigate issues and system recovery. Communicate and collaborate with business leaders and stakeholders on incident resolution status and RCAs. Collaborate with DevOps Engineers, Performance Engineers and Developers to define and establish reliability practices. Build applications that streamline internal operational procedures Qualification: BA or BS degree in Computer Science or related field required. Master's Degree in Technology or related field desired. Certification(s) specific to Architecture discipline 5+ years of experience working with technical teams. Strong emphasis on SRE as an engineering discipline with a focus on automation. Experience supporting infrastructure and services in public cloud environments (AWS, GCP, etc.). Experience building and supporting containerized application technologies, including Docker, Kubernetes. Experience with public cloud cost management. Experience in performance engineering and capacity planning. Prior success in automating a real-world production environment. Knowledge of IP networking, VPN's, DNS, load balancing and firewall. Expertise in any monitoring tools like Splunk, AppDynamics, Nagios, New Relic. Experience with software development and testing process in an agile environment Excellent problem solving, analytical, and decision-making skills. Ability to work in a collaborative environment. Must be an excellent communicator (verbal and written) Experience with deployments and operations of 24x7 high volume, highly available systems. Cloud scaling and Ability to drive automation/modernization initiatives. Enjoy working with a large variety of services and new technologies. Demonstrate a solid understanding of development, debugging, administration, and automation frameworks: C#/.NET, PowerShell, Python, Ansible, etc. Experience with logging platforms and application performance metrics: DataDog, NewRelic, Splunk, ELK, Dyantrace, App Insights Analytics, etc. In addition to other duties/functions, this position requires full commitment and support for promoting ethical and compliant culture. More specifically, this position requires integrity, honesty, and respectful treatment of others, as well as a willingness to speak up when they see misconduct or have concerns. Decision Making Tactical Decisions focus on intermediate-term issues. The purpose of decisions made at this level are to help move clientcloser to reaching strategic goals. Outcomes are predictable. After a decision is made by Top Executive Leadership, the next phase is to take the needed steps to implement it. Examples are: The amount of money required to implement, which advertising agency to promote a new service or to provide an incentive plan to employees to encourage increased revenue. Operational Decisions focus on day-to-day activities within the company. Decisions made at this level help to ensure that daily activities proceed smoothly and therefore help to move the company toward reaching a strategic goal. They have short term consequences. Examples are: Handling employee conflicts, purchasing materials needed for operations.
-
Site Reliability Engineer
1 month ago
delhi, India Cricbuzz.com Full timeSite Reliability EngineerWe are looking for a highly skilled and motivated Web Server Site Reliability Engineer to join our team. As a Web Server Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our web server infrastructure and CDN services.Experience - 3 - 5 yearsResponsibilities:● Design,...
-
Site Reliability Engineer
1 month ago
Delhi, India ViewSonic Full timeJob Requirements:Bachelor’s degree in computer science, Engineering, or a related field.3+ years of experience as a Site Reliability Engineer, DevOps Engineer, or similar role.Proficient in AWS solutions including but not limited to EC2, S3, CloudWatch, Lambda, and RDS.Strong understanding of Platform Engineering concepts and principles.Experience with...
-
Site Reliability Engineer
4 weeks ago
delhi, India Korn Ferry Full timeRole - Site Reliability EngineerExp - 5+ years RequiredLocation - Hyderabad ( Work from Office-Hybrid)Shift Timings - 5AM -1 PM ISTWe are looking for a Site Reliability Engineer with strong development background to join our team. In this role, you will be responsible for ensuring the reliability and performance of our systems. You will work closely to our...
-
Site Reliability Engineer
4 weeks ago
delhi, India SID Global Solutions Full timeDear Candidates,We are looking for immediate joiners 8 to 9 years for Hyderabad Location for a talented Site Reliability Engineer-Manager to join our dynamic team and contribute to the development of our cutting-edge web applications. If you're passionate about the role and have experience in SRE, GCP and Kubernetes , send me your updated cv : Please...
-
Site Reliability Engineer
2 months ago
Delhi, India Daxko Full timeCompany DescriptionDaxko powers health & wellness throughout the world. Every day our team members focus their passion and expertise in helping health & wellness facilities operate efficiently and engage their members.Whether a neighborhood yoga studio, a national franchise with locations in every city, a YMCA or JCC--and every type of organization in...
-
Site Reliability Engineer
4 weeks ago
Delhi, India SID Global Solutions Full timeDear Candidates,We are looking for immediate joiners8 to 9 years for Hyderabad Locationfor a talented Site Reliability Engineer-Manager to join our dynamic team and contribute to the development of our cutting-edge web applications. If you're passionate about the role and have experience inSRE, GCP and Kubernetes , send me your updated cv : find below the...
-
Site Reliability Engineer
2 months ago
Delhi, India Quiktrak, LLC Full timeJob Title: Azure Site Reliability Engineer (SRE) / DevOps EngineerJob Description:Summary:As an Azure Site Reliability Engineer (SRE) / DevOps Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud infrastructure on the Azure platform. This role involves managing deployments, implementing continuous...
-
Site Reliability Engineer
4 weeks ago
Delhi, India System Soft Technologies Full timeTitle: Site Reliability Engineer 100% REMOTE The Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and...
-
Site Reliability Engineer
1 month ago
delhi, India System Soft Technologies Full timeTitle: Site Reliability Engineer100% REMOTEThe Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and...
-
Site Reliability Engineer
3 weeks ago
delhi, India WaferWire Cloud Technologies Full timeRole: SRE (Site Reliability Engineer)Experience: 4+ YearsAbout WaferWire Cloud Technologies:WaferWire Cloud Technologies is a leading provider of innovative cloud solutions aimed at transforming businesses and driving digital growth. With a focus on cutting-edge technology and customer-centric approaches, we empower organizations to thrive in the digital...
-
Site Reliability Engineer
2 weeks ago
new delhi, India dentsu Full timeThe purpose of this role is to ensure the availability and stability of production and test platforms. Job Title: Site Reliability Engineer Job Description: Key responsibilities:Troubleshoots and owns issues in our development, test and production environments. Including performance optimisation and continuous tuningWorks alongside the DevOps team in...
-
Site Reliability Engineer
1 week ago
Delhi, India System Soft Technologies Full timeJob SummaryThe Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and engaging with infrastructure teams....
-
Senior Site Reliability Engineer
5 days ago
Delhi, India Next-Link Full timeJob DescriptionSenior Site Reliability EngineerDesirable Skills:Experience with additional programming languages and technologies beyond Python and Ruby.Familiarity with cloud platforms such as AWS, Azure, or GCP.Proficiency in additional logging and monitoring tools.Experience with other Infrastructure as Code (IaC) tools and practices.Knowledge of...
-
Site Reliability Engineer
3 days ago
Delhi, India UBS Full timeYour roleWe're looking for a Site Reliability Engineer to:• work as a part of an agile pod (team)• determine the reliability of our digital products, technology services, and the infrastructure that underpins them• minimize the risk and impact of failures by engineering operational improvements, such as predictive monitoring, auto scaling or...
-
Site Reliability Engineer
4 weeks ago
new delhi, India System Soft Technologies Full timeJob SummaryThe Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and engaging with infrastructure teams....
-
Senior Site Reliability Engineer
1 week ago
delhi, India SLK Full time**Immediate Joiners only **We are hiring an Senior Engineer with expertise in Observability and Site Reliability Engineer (SRE) , emphasizing strong Experience with Prometheus, Grafana, ELK Stack, Jaeger, and OpenTelemetry.. (3+ years of experience)If you meet these qualifications and are passionate about driving innovative solutions, we encourage you to...
-
Senior Site Reliability Engineer
1 week ago
Delhi, India SLK Full time**Immediate Joiners only **We are hiring an Senior Engineer with expertise in Observability andSite Reliability Engineer (SRE) ,emphasizing strong Experience with Prometheus, Grafana, ELK Stack, Jaeger, and OpenTelemetry.. (3+ years of experience)If you meet these qualifications and are passionate about driving innovative solutions, we encourage you to...
-
Site Reliability Engineering Manager
1 month ago
delhi, India CloudBees Full timeJ ob Title - Manager, Site Reliability EngineerLocation - Bangalore and ChennaiYear of Experience - 10+ YearsAbout CloudBeesCloudBees is the leading software delivery platform that enables enterprises to deliver scalable, compliant, and secure software, empowering developers to do their best work.Seamlessly integrating into any hybrid and heterogeneous...
-
Site Reliability Engineer
4 weeks ago
delhi, India NorthStar HR Consultants Full timeJob Title - Site Reliability EngineerJob Location - Pune, MaharashtraAbout Client -Our client is an independent technology company maximizing customer value by delivering digital advertising’s supply chain of the future. They sell-side platform empowers the world’s leading digital content creators across the open internet to control access to their...
-
Site Reliability Engineer
3 weeks ago
Delhi, India Tech Mahindra Full timeSite Reliability EngineerNature of Project24*7 support project involving production and non-production workRotational/Night shifts even on weekends would be applicableNo remote work/Hybrid mode, Work from customer office in BengaluruJD#Mandatory Skills:K8s ,K8s certification, Linux Admin, SREExp:5+y rsCloud:Ø Lead the design, build, and operational...