SRE
5 days ago
JD
**Responsibilities**:
Work with the customer Development, DevSecOps, and IT teams to ensure operational excellence and maximize the reliability and availability of client systems.
Collaborate with cross-functional teams (DevSecOps, Development, IT) to implement SRE principles throughout the software development life cycle.
Establish and manage Service Level Objectives (SLOs) and Service Level Indicators (SLIs) for critical services, monitoring and maintaining performance against defined targets.
Implement and enhance observability, alerting, and incident response processes to proactively address issues and minimize downtime.
Architect and design highly scalable and available infrastructure solutions, integrating best practices in reliability engineering and automation.
Develop and maintain documentation related to system architecture, configuration, and procedures.
Stay current with industry trends, recommending and adopting new tools and practices to enhance system reliability.
**Qualifications**:
Must Have Skills
Solid understanding of SRE principles and practices.
Strong understanding of full-stack observability, with hands-on experience using Datadog.
Strong background in managing highly available and scalable infrastructure.
Experience with container orchestration platforms, serverless architectures, CI/CD pipelines (Azure DevOps, GitHub Actions), and Infrastructure as Code (IaC) implementations (Ansible and Terraform/Pulumi).
Hands-on experience working with EKS.
Good to Have Skills
Hands-on experience working with Amazon Web Services (ECS, Lambda, EC2, API Gateway, CloudFront, SQS, SNS, etc.).
Excellent problem-solving skills with the ability to troubleshoot complex issues in production environments.
Proficiency in scripting and automation using Python, TypeScript, or Shell.
Relevant certifications in SRE, DevOps, Cloud, etc., are a plus.
Strong communication and leadership skills, fostering effective collaboration with cross-functional teams.
**About Virtusa**
Teamwork, quality of life, professional and personal development: values that Virtusa is proud to embody. When you join us, you join a team of 27,000 people globally that cares about your growth, one that seeks to provide you with exciting projects, opportunities, and work with state-of-the-art technologies throughout your career with us.
Great minds, great potential: it all comes together at Virtusa. We value collaboration and the team environment of our company, and seek to provide great minds with a dynamic place to nurture new ideas and foster excellence.
Virtusa was founded on principles of equal opportunity for all, and so does not discriminate on the basis of race, religion, color, sex, gender identity, sexual orientation, age, non-disqualifying physical or mental disability, national origin, veteran status or any other basis covered by appropriate law. All employment is decided on the basis of qualifications, merit, and business need.
-
SRE Support
7 days ago
Andhra Pradesh, India Virtusa Full time
We are seeking a skilled and proactive Site Reliability Engineer (SRE) to join our growing engineering team. The SRE will be responsible for ensuring the availability, performance, scalability, and reliability of our production systems. You will work at the intersection of software development and operations, driving best practices in observability,...
-
SRE GCP
7 days ago
Andhra Pradesh, India Virtusa Full time
Site Reliability Engineering
In this role, you will: Manage the availability, latency, scalability, and efficiency of GFiber services running on Google infrastructure by engineering reliability into software and systems. Design, implement, and maintain highly available and scalable infrastructure on GCP. Work closely with development teams to integrate...
-
AI Authoring/Monitoring SRE Eng
2 weeks ago
Andhra Pradesh, India Virtusa Full time
**Job Description**: Typical responsibilities: ensuring the stability and reliability of AI systems by monitoring model performance, troubleshooting issues, and optimizing infrastructure; working closely with development teams to deploy updates, maintain continuous monitoring frameworks, and ensure that AI systems scale effectively while meeting operational and...
-
SRE Application Support
2 weeks ago
Andhra Pradesh, India Virtusa Full time
Key Responsibilities: Incident management and root cause analysis: Respond to production incidents, conduct root cause analysis, and work with teams to implement long-term fixes. Infrastructure as Code (IaC): Manage cloud infrastructure (AWS, GCP, Azure) through tools such as Terraform, CloudFormation, or equivalent. Build and maintain CI/CD pipelines to enable...
-
Production Support
2 weeks ago
Andhra Pradesh, India Virtusa Full time
P1,C3,STS Implement multi-agent systems, orchestration flows, and autonomous task execution using MCP and A2A frameworks. Perform gap analysis, optimization, and troubleshooting of AI-driven workflows and pipelines. Build modular, reusable, and scalable AI components integrated into enterprise ecosystems. Collaborate with product, data, and engineering teams...
-
Computer Scientist
2 weeks ago
Noida, Uttar Pradesh, India Adobe Full time
Our Company
Changing the world through digital experiences is what Adobe’s all about. We give everyone, from emerging artists to global brands, everything they need to design and deliver exceptional digital experiences! We’re passionate about empowering people to create beautiful and powerful images, videos, and apps, and transform how companies...
-
Senior Site Reliability Engineer
1 week ago
Andhra Pradesh, India Virtusa Full time ₹ 15,00,000 - ₹ 25,00,000 per year
Below are the job requirements:
Hands-on experience with Ansible, Puppet, and Jenkins to complete day-to-day SRE work.
Ability to develop automation scripts in Bash, shell, PowerShell, or Python for day-to-day work.
Ability to develop and debug applications in Java, Python, or Ruby.
Understanding of messaging-related tools, preferably Confluent Kafka, and log collection...
-
Systems Admin
2 weeks ago
Andhra Pradesh, India Virtusa Full time
**JOB DESCRIPTION**
**Skill: System Admin**
**Role / Tier**: System Admin / GCP SRE Engineer, Tier 1
Design, build, and scale systems sustainably through mechanisms like automation, and evolve systems by driving changes that improve reliability and velocity. Participate in and improve the lifecycle of services from inception and design,...
-
Principal SRE, Hospitality Cloud
5 days ago
Noida, Uttar Pradesh, India Oracle Full time
The Hospitality Cloud SRE team is focused on maximizing service reliability for our Cloud Native hotel product service offerings across global Oracle data centres. Our team runs with a start-up-like approach, leaving room for creative freedom. We have worked to assemble the smartest people in the industry to build and grow this revolutionary and disruptive...
-
Site Reliability Engineer/Lead
2 days ago
Uttar Pradesh, India Coforge Full time
Role: SRE Lead Engineer
Skills: Docker, Prometheus, Grafana, ELK, Datadog
Location: Noida
Experience: 8+ Years
Mode: Work from office
We at Coforge are hiring a highly skilled and experienced SRE Lead Engineer to drive reliability, scalability, and performance across our infrastructure and applications. You will lead a team of SREs, collaborate with development...