Site Reliability Engineer
1 month ago
Position Overview:
We are seeking a highly skilled and motivated Site Reliability Engineer (SRE) to join our dynamic team. As an SRE, you will play a critical role in ensuring the reliability, availability, and performance of our systems. The ideal candidate is passionate about building scalable and resilient infrastructure, automating operational tasks, and collaborating with cross-functional teams.
Qualifications :
- Bachelor's degree in Computer Science, Information Technology, or related field.
- Proven experience as a Site Reliability Engineer or similar role.
- Strong programming and scripting skills (e.g., Python, Shell, Go).
- Experience with cloud platforms (e.g., AWS, Azure, GCP).
- Proficiency in configuration management tools (e.g., Ansible, Puppet, Chef).
- Solid understanding of networking, databases, and distributed systems.
- Experience with containerization and orchestration (e.g., Docker, Kubernetes).
- Familiarity with monitoring tools (e.g., Prometheus, Grafana, ELK stack).
- Excellent problem-solving and troubleshooting skills.
- Strong communication and collaboration skills.
Responsibilities :
System Architecture and Design :
- Collaborate with development and operations teams to design, implement, and maintain scalable and reliable systems.
- Participate in architecture reviews and provide recommendations for improving system performance and reliability.
Infrastructure Automation :
- Develop and maintain infrastructure as code (IaC) for automating deployment, scaling, and management of applications and services.
- Implement and improve CI/CD pipelines to streamline the release process.
Monitoring and Alerting :
- Design and implement monitoring solutions to proactively identify and address performance issues.
- Establish and maintain alerting mechanisms to ensure rapid response to system incidents.
Capacity Planning and Performance Optimization :
- Conduct capacity planning to ensure systems can handle current and future loads.
- Optimize system performance through analysis and tuning of various components.
Incident Response and Root Cause Analysis :
- Respond to incidents and outages, ensuring timely resolution and minimizing impact.
- Conduct thorough root cause analysis to identify and address underlying issues.
Security and Compliance :
- Work closely with security teams to implement and maintain security best practices.
- Ensure compliance with industry regulations and internal policies.
Collaboration and Communication :
- Collaborate with cross-functional teams to address system-related challenges and improvements.
- Communicate effectively with team members, providing documentation and knowledge sharing.
-
WovV Technologies
2 hours ago
Anywhere in India/Multiple Locations, IN Wovv Technology Full timeSRE EngineerJob Description :- Owning Infra architecture and non-functional requirements- Apply automation and software to any manual tasks- Roll back a software push as needed- Able to troubleshoot cross-platform issues in a cloud-based/SaaS environment- Bring up additional serving capacity- Use the monitoring systems (for ing and dashboards)- Proven work...
-
Site Reliability Engineer
2 weeks ago
Anywhere in India/Multiple Locations Innoquest Consulting Full timeMANDATORY ASK : 5-8 YEARS RELEVANT EXPERIENCE / STRONG HANDS-ON EXPERIENCE IN ANSIBLE & TERRAFORM / EXPERTISE IN AUTOMATION, DEBUGGING, SCRIPTING TOOLS, APM OR MONITORING TOOLS / EXPERIENCE IN SITE RELIABILITY & CLOUD (PREFERABLY AZURE) / EXPERIENCE IN CONTAINERIZATION USING DOCKER & KUBERNETES. JOB OVERVIEW : As a member of the Platform Engineering...
-
Site Reliability Engineer
2 weeks ago
Bangalore/Anywhere in India/Multiple Locations One of the Consulting Firms Full timeJob Description : - Collaborate with Site Reliability Engineering teammates and Software Delivery teams to determine and implement cloud networking, monitoring, and infrastructure requirements- Ensure that networks and infrastructure are highly available- Develop methodologies to safely deploy and test network and infrastructure changes, including...
-
Site Reliability Engineer
1 month ago
Anywhere in India,Multiple Locations Travash Software SolutionsRisk Resources Full timeJob Description: - 10+ years of experience in SRE or a related field. - Proven experience in designing, developing, and implementing monitoring solutions.- Deep understanding of monitoring technologies and tools, including Prometheus, Grafana, Loki, and Tempo- Experience with cloud-based monitoring systems, such as New Relic, Datadog, and Grafana Cloud-...
-
Site Reliability Engineer
3 weeks ago
Anywhere in India/Multiple Locations, IN Travash Software SolutionsRisk Resources Full timeJob Description:- 10+ years of experience in SRE or a related field.- Proven experience in designing, developing, and implementing monitoring solutions.- Deep understanding of monitoring technologies and tools, including Prometheus, Grafana, Loki, and Tempo- Experience with cloud-based monitoring systems, such as New Relic, Datadog, and Grafana Cloud-...
-
Site Reliability Engineer
7 days ago
india Cricbuzz.com Full timeSite Reliability Engineer We are looking for a highly skilled and motivated Web Server Site Reliability Engineer to join our team. As a Web Server Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our web server infrastructure and CDN services. Experience - 3 - 5 years Responsibilities: ●...
-
DevOps/Site Reliability Engineer
3 weeks ago
Bangalore,Anywhere in India,Multiple Locations Rrootshell Technologiiss Pvt Ltd Full timeJob Description:- Minimum 8+ years of experience as a Site Reliability Engineer or DevOps Cloud Engineer role.- Strong understanding of cloud computing platforms like AWS EC2.- Migrate existing data and configurations from the old Ubuntu VMs to the new ones across all service types.- Leading a major infrastructure project involving upgrading the operating...
-
DevOps/Site Reliability Engineer
3 weeks ago
Bangalore/Anywhere in India/Multiple Locations, IN Rrootshell Technologiiss Pvt Ltd Full timeJob Description:- Minimum 8+ years of experience as a Site Reliability Engineer or DevOps Cloud Engineer role.- Strong understanding of cloud computing platforms like AWS EC2.- Migrate existing data and configurations from the old Ubuntu VMs to the new ones across all service types.- Leading a major infrastructure project involving upgrading the operating...
-
Site Reliability Engineer
1 month ago
Anywhere in India,Multiple Locations,Bangalore iMind Your Business Solutions Private Limited Full timeJob Description :Our Client is a Product company, headquartered in London UK and developing an Ai powered enterprise platform that enables insurers the flexibility to scale capacity as needed, safely, securely, efficiently and at speed. Intelligent and flexible tool understands global insurance processing challenges, and seamlessly solves them. A smart and...
-
Site Reliability Engineer
2 weeks ago
india ViewSonic Full timeJob Requirements: Bachelor’s degree in computer science, Engineering, or a related field. 3+ years of experience as a Site Reliability Engineer, DevOps Engineer, or similar role. Proficient in AWS solutions including but not limited to EC2, S3, CloudWatch, Lambda, and RDS. Strong understanding of Platform Engineering concepts and principles. Experience...
-
Site Reliability Engineer
1 day ago
india SID Global Solutions Full timeDear Candidates, We are looking for immediate joiners 8 to 9 years for Hyderabad Location for a talented Site Reliability Engineer-Manager to join our dynamic team and contribute to the development of our cutting-edge web applications. If you're passionate about the role and have experience in SRE, GCP and Kubernetes , send me your updated cv : Please...
-
Site Reliability Engineer
1 week ago
india iScale Solutions Full timeJob Description This is a remote position. Key Responsibilities: Design, implement, and maintain highly available and scalable infrastructure on AWS cloud platform. Develop and manage Infrastructure as Code (IaC) using Terraform for provisioning and managing cloud resources. Implement containerization strategies using Docker for packaging and deploying...
-
Site Reliability Engineer
4 weeks ago
india Quiktrak, LLC Full timeJob Title: Azure Site Reliability Engineer (SRE) / DevOps Engineer Job Description: Summary: As an Azure Site Reliability Engineer (SRE) / DevOps Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud infrastructure on the Azure platform. This role involves managing deployments, implementing continuous...
-
Site Reliability Engineer
2 weeks ago
India System Soft Technologies Full timeTitle: Site Reliability Engineer100% REMOTEThe Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and...
-
Site Reliability Engineer
2 weeks ago
india System Soft Technologies Full timeTitle: Site Reliability Engineer 100% REMOTE The Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and...
-
Site Reliability Engineer
7 days ago
india EZINFORMATICS SOLUTIONS PVT LTD Full timeCompany Description EZINFORMATICS SOLUTIONS PVT LTD is a team of professionals with vast industrial experience and accomplishments in various IT services. They focus on three different spheres: Cyber Security, Information Technology, and Consulting Services. Their goal is to provide safe and secure solutions, unify customer data, and deliver exceptional...
-
Lead Site Reliability Engineer
1 day ago
india JPMorgan Chase & Co. Full timeAssume a critical role in defining the future of a globally recognized firm and have a direct and significant effect in a realm tailored for top achievers in site reliability. As a Lead Site Reliability Engineer at JPMorgan Chase within the Consumer & Community Banking division, you hold a leadership role in your team, demonstrate strong knowledge across...
-
Sr. Site Reliability Engineer
7 days ago
india Encora Inc. Full timeDescription Sr. Software Engineer (Site Reliability Engineer) Important Information Location: Ahmedabad Experience: 5+ years Job Mode: Full-time Work Mode: Remote Job Summary Working with DevOps SRE with good experience in Site Reliability Engineer. Responsibilities and Duties Design, implement, and maintain highly...
-
Site Reliability Engineer
22 hours ago
India System Soft Technologies Full timeJob SummaryThe Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and engaging with infrastructure teams....
-
Manager, Site Reliability Engineering
7 days ago
india Greenway Health Full timeJob Description Job Summary The Manager is responsible for implementing the development process and site reliability engineering practices to resolve issues and identify opportunity areas. This role will lead development and site reliability engineering teams and establish and implement best practices and standards related to engineering...