Principal Site Reliability Engineer

2 months ago


Hyderabad, India Vitech Systems Group Full time

About Vitech


V3locity, Vitech’s cloud-native administration, engagement, and analytics platform, is a transformative suite of complementary applications that offers full life cycle business functionality and robust enterprise capabilities. It marries core administration with superior digital experience and augmented analytics. Its modular design enables flexible, agile deployment strategies. V3locity employs an advanced, cloud-native architecture that leverages the unique capabilities of AWS to deliver a solution with unparalleled security, scalability, and resiliency.


Principal SRE – Join Our Global Engineering Team

We believe that excellence in production systems starts with engineering-driven solutions to operational challenges. Our Site Reliability Engineering (SRE) team is at the heart of ensuring seamless performance for our clients, preventing potential outages, and proactively identifying and resolving issues before they escalate.

Our SRE team is a diverse group of talented engineers across India, the US, and Canada. We have T-shaped expertise spanning application development, database management, networking, and system administration across both on-premises environments and the AWS cloud. Together, we support mission-critical client environments and drive automation to reduce manual toil, freeing our team to focus on innovation.


About the Role: Principal SRE

As a Principal SRE, you’ll be a key player in revolutionizing how we operate production systems for single- and multi-tenant environments. You’ll lead technology initiatives, streamline production processes, and drive infrastructure automation. Working in an Agile team environment, you’ll have the opportunity to explore and implement the latest technologies, engage in on-call duties, and contribute to continuous learning as part of an ever-evolving tech landscape.

If you’re passionate about staying ahead of the curve and inspiring others to lead the charge in SRE, this role is for you.


What You’ll Do:

  • Own and manage our AWS cloud-based technology stack, using native AWS services and top-tier SRE tools to support multiple client environments with Java-based applications and microservices architecture.
  • Design, implement, and monitor HA/DR strategies, reviewing and testing live applications for reliability.
  • Introduce best practices from the industry to enhance our production support, resiliency, and automation.
  • Develop and refine SLIs and SLOs focused on availability, performance, and error budgeting.
  • Create meaningful alerts and dashboards to support SRE operations.
  • Enhance infrastructure as code (IaC) patterns using technologies like Terraform, CloudFormation, Python, and SDKs.
  • Lead incident management, drive blameless postmortems, and take ownership of follow-up actions.


The Skills and Experience You Bring:

  • Proven hands-on experience as an SRE for critical, client-facing applications, with the ability to dive deep into daily SRE tasks, manage incidents, and oversee operational tools.
  • 4+ years of experience developing and/or managing software in a public cloud environment.
  • 3+ years of experience hosting enterprise applications in AWS (EC2, EBS, ECS/EKS, Elastic Beanstalk, RDS, CloudWatch).
  • Strong understanding of AWS networking concepts (VPC, VPN/DX/Endpoints, Route 53, CloudFront, Load Balancers, WAF).
  • Expertise in AWS security and IAM management (Security Groups, KMS Keys, SCPs).
  • In-depth experience with observability platforms (New Relic, Dynatrace, Honeycomb, Grafana) and OpenTelemetry-based monitoring.
  • Experience managing relational databases (Oracle and/or PostgreSQL) in both cloud and on-prem environments, including SRE tasks like backup/restore and replication.
  • Hands-on experience with web/application layers (Oracle WebLogic, Apache Tomcat, AWS Elastic Beanstalk, SSL certificates, S3 buckets).
  • Experience with containerized applications (Docker, Kubernetes, ECS).
  • Proficiency in analyzing application logs and garbage collection (GC) behavior, and in conducting root cause analysis for production issues.
  • Automation experience with Infrastructure as Code (Terraform, CloudFormation, Python, Jenkins, GitHub/Actions).
  • Experience designing and implementing SLIs/SLOs.
  • Programming skills in Python, Bash, Java, JavaScript, Node.js.
  • Strong system administration skills in both Linux and Windows environments.
  • Excellent written/verbal communication, critical thinking, and leadership abilities.
  • Willingness to work in shifts and lead your team to resolve issues efficiently.


Why Join Us?

If you thrive in a dynamic environment and are eager to drive innovation in SRE practices, we want to hear from you.

  • At Vitech, you’ll be part of a forward-thinking team that values collaboration, innovation, and continuous improvement. We provide a supportive and inclusive environment where you can grow as a leader while helping shape the future of our organization.


Vitech Inc. is an equal opportunity employer. We celebrate diversity and are committed to creating an equitable and inclusive environment for all employees.