Site Reliability Engineer
3 days ago
Description: We have an urgent need for a strong Python + SRE engineer offshore. Kindly share profiles with hands-on experience in AWS, Kubernetes, Python, Splunk, Prometheus & Grafana. Please share only immediate joiners, and mention each candidate's availability for this opportunity when sharing profiles.

Key Responsibilities:
- Design, implement, and manage scalable and highly available cloud infrastructure on AWS or GCP.
- Containerize applications using Docker, and manage orchestration with Kubernetes.
- Collaborate with developers and QA teams to integrate CI/CD pipelines and automate deployment processes.
- Ensure system reliability, uptime, and performance by leveraging industry-leading monitoring tools such as Grafana, Dynatrace, etc.
- Troubleshoot system failures, conduct root cause analysis, and provide long-term solutions to prevent recurrence.
- Script and automate operational tasks using Python or Java to improve system efficiency.
- Maintain documentation of system architecture, procedures, and configurations.
- Participate in incident response and on-call support rotation if

Skills & Qualifications:
- Minimum 5 years of hands-on experience in a DevOps/SRE role.
- Strong expertise in AWS or Google Cloud Platform (GCP).
- Deep understanding and practical experience with Docker and Kubernetes in production environments.
- Proficient in Java or Python for scripting, automation, and integrations.
- Experience with monitoring tools such as Grafana, Dynatrace, Prometheus, etc.
- Strong problem-solving skills and ability to work in a fast-paced environment.
- Excellent communication and documentation skills.

Must Have Skills:
- AWS, DevOps, Prometheus, Grafana, Splunk, Python scripting.
- Experience in dashboard configuration/setup for monitoring using Splunk, Grafana, etc.

Preferred Attributes:
- Prior experience in large-scale enterprise systems.
- Ability to work independently and take ownership of DevOps processes.
- Exposure to Agile/Scrum methodologies.
(ref:hirist.tech)
-
Site Reliability Engineer
2 weeks ago
Thiruvananthapuram, Kerala, India Equifax Full time ₹ 1,04,000 - ₹ 1,30,878 per year
Site Reliability Engineering (SRE) at Equifax is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems. SRE ensures that internal and external services meet or exceed reliability and performance expectations while adhering to Equifax engineering principles. SRE is also an...
-
Site Reliability Engineer – Technical Architect
3 weeks ago
Thiruvananthapuram, India Tata Elxsi Full time
Site Reliability Engineer – Technical Architect
We are looking for experienced professionals to join us as a Site Reliability Engineer. If you know someone who fits the bill, refer them to join our growing team.
Key Skills & Responsibilities: Proficiency in one or more high-level programming languages: Python, Java, C/C++, Ruby, JavaScript. Experience...
-
Site Reliability Engineer II
2 weeks ago
Thiruvananthapuram, India Zafin Full time
Senior Site Reliability Engineer (SRE II)
Own availability, latency, performance, and efficiency for Zafin’s SaaS on Azure. You’ll define and enforce reliability standards, lead high-impact projects, mentor engineers, and eliminate toil at scale. Reports to the Director of SRE.
What you’ll do
SLIs/SLOs & contracts: Define customer-centric SLIs/SLOs for...
-
Site Reliability Engineer II
4 weeks ago
Thiruvananthapuram, India Zafin Full time
Senior Site Reliability Engineer (SRE II)
Own availability, latency, performance, and efficiency for Zafin’s SaaS on Azure. You’ll define and enforce reliability standards, lead high-impact projects, mentor engineers, and eliminate toil at scale. Reports to the Director of SRE.
What you’ll do
SLIs/SLOs & contracts: Define customer-centric SLIs/SLOs for...
-
Site Reliability Engineer II
3 weeks ago
Thiruvananthapuram, India Zafin Full time
Senior Site Reliability Engineer (SRE II)
Own availability, latency, performance, and efficiency for Zafin’s SaaS on Azure. You’ll define and enforce reliability standards, lead high-impact projects, mentor engineers, and eliminate toil at scale. Reports to the Director of SRE.
What you’ll do
SLIs/SLOs & contracts: Define customer-centric SLIs/SLOs for...
-
Site Reliability Engineer II
2 weeks ago
Thiruvananthapuram, India Zafin Full time
Senior Site Reliability Engineer (SRE II)
Own availability, latency, performance, and efficiency for Zafin’s SaaS on Azure. You’ll define and enforce reliability standards, lead high-impact projects, mentor engineers, and eliminate toil at scale. Reports to the Director of SRE.
What you’ll do
SLIs/SLOs & contracts: Define customer-centric SLIs/SLOs for...
-
Site Reliability Engineer II
3 weeks ago
Thiruvananthapuram, India Zafin Full time
Senior Site Reliability Engineer (SRE II)
Own availability, latency, performance, and efficiency for Zafin’s SaaS on Azure. You’ll define and enforce reliability standards, lead high-impact projects, mentor engineers, and eliminate toil at scale. Reports to the Director of SRE.
What you’ll do
SLIs/SLOs & contracts: Define customer-centric...
-
Site Reliability Engineer II
3 weeks ago
Thiruvananthapuram, India Zafin Full time
Senior Site Reliability Engineer (SRE II)
Own availability, latency, performance, and efficiency for Zafin’s SaaS on Azure. You’ll define and enforce reliability standards, lead high-impact projects, mentor engineers, and eliminate toil at scale. Reports to the Director of SRE.
What you’ll do
SLIs/SLOs & contracts: Define customer-centric...
-
Senior Site Reliability Engineer
2 weeks ago
Thiruvananthapuram, Kerala, India Equifax Full time ₹ 5,00,000 - ₹ 15,00,000 per year
Site Reliability Engineering (SRE) at Equifax is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems. SRE ensures that internal and external services meet or exceed reliability and performance expectations while adhering to Equifax engineering principles. SRE is also an...
-
Site Reliability Engineer
4 weeks ago
Thiruvananthapuram / Trivandrum, India Reflections Info Systems Full time
Job Description
As a Site Reliability Engineer (SRE), you will be responsible for improving the overall reliability of applications by ensuring their availability, performance, and scalability. You should be able to gather the technical requirements from the DevOps team and the operational requirements from the Application Support team. With the Site Reliability...