Site Reliability Engineer

4 weeks ago


Hyderabad, India Quiktrak, LLC Full time

Job Title: Azure Site Reliability Engineer (SRE) / DevOps Engineer

Job Description:

Summary:

As an Azure Site Reliability Engineer (SRE) / DevOps Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud infrastructure on the Azure platform. This role involves managing deployments, implementing continuous integration and continuous deployment (CI/CD) pipelines, setting up monitoring and alerting systems, and automating infrastructure as code. The ideal candidate will have strong experience in Azure cloud services, DevOps practices, and a proactive approach to ensuring system stability and efficiency.

Responsibilities:

1. Infrastructure Deployment and Management:

- Design, implement, and manage cloud infrastructure solutions on the Azure platform.

- Automate infrastructure provisioning, configuration, and deployment using tools such as Azure Resource Manager (ARM) templates, Terraform, or Azure DevOps.

2. Continuous Integration and Continuous Deployment (CI/CD):

- Set up and maintain CI/CD pipelines for automating software builds, testing, and deployment processes.

- Implement best practices for version control, branching strategies, and release management using tools like Azure DevOps, GitHub Actions, or Jenkins.

3. Monitoring and Alerting:

- Configure monitoring and alerting systems to track the health, performance, and availability of Azure services and applications.

- Implement proactive monitoring solutions using Azure Monitor, Application Insights, or similar tools to detect and mitigate issues before they impact users.

4. Incident Management and Response:

- Develop incident response plans and procedures for addressing system outages, performance degradation, and other operational issues.

- Lead incident response efforts, troubleshoot issues, and coordinate with cross-functional teams to resolve critical incidents in a timely manner.

5. Infrastructure as Code (IaC):

- Implement infrastructure as code (IaC) principles to automate the provisioning and configuration of cloud resources.

- Maintain version-controlled repositories for infrastructure code and ensure consistency and reproducibility across environments.

6. Security and Compliance:

- Implement security best practices and compliance standards for Azure infrastructure, including identity and access management, network security, and data protection.

- Conduct regular security assessments and audits to identify and remediate potential vulnerabilities.

7. Performance Optimization:

- Monitor system performance metrics and conduct performance tuning to optimize resource utilization and improve application performance.

- Identify opportunities for cost optimization and implement strategies to reduce cloud spending without sacrificing performance or reliability.

Qualifications:

- Bachelor's degree in Computer Science, Engineering, or related field.

- Proven experience as a Site Reliability Engineer (SRE) or DevOps Engineer, with a focus on Azure cloud services.

- Strong understanding of cloud computing concepts and Azure services, including compute, storage, networking, and security.

- Experience with infrastructure as code (IaC) tools such as ARM templates, Terraform, or Azure DevOps.

- Proficiency in setting up and managing CI/CD pipelines using tools like Azure DevOps, GitHub Actions, or Jenkins.

- Hands-on experience with monitoring and alerting tools such as Azure Monitor, Application Insights, or Prometheus.

- Knowledge of incident management, troubleshooting, and root cause analysis techniques.

- Familiarity with Agile methodologies and DevOps practices for collaboration and automation.

- Excellent communication and collaboration skills, with the ability to work effectively in a team environment.

Preferred Skills:

- Certification in Microsoft Azure (e.g., Azure Administrator, Azure DevOps Engineer).

- Experience with containerization technologies such as Docker and Kubernetes.

- Knowledge of scripting languages such as PowerShell, Python, or Bash.

- Familiarity with serverless computing and microservices architectures.

- Understanding of networking concepts and security best practices in cloud environments.



  • hyderabad, India Insight Global Full time

    Required Skills and Experience *- Bachelor's or master's degree in computer science, Software Engineering, or a related field.- Proven experience (7+ years) in SRE, automation testing- Strong skills in developing and implementing automation testing strategies and frameworks.- Solid understanding of site reliability principles and best practices.- Leadership...


  • Hyderabad, India Insight Global Full time

    Required Skills and Experience *- Bachelor's or master's degree in computer science, Software Engineering, or a related field.- Proven experience (7+ years) in SRE, automation testing- Strong skills in developing and implementing automation testing strategies and frameworks.- Solid understanding of site reliability principles and best practices.- Leadership...


  • Hyderabad, India Insight Global Full time

    Required Skills and Experience *- Bachelor's or master's degree in computer science, Software Engineering, or a related field.- Proven experience (7+ years) in SRE, automation testing- Strong skills in developing and implementing automation testing strategies and frameworks.- Solid understanding of site reliability principles and best practices.- Leadership...


  • Hyderabad, India Insight Global Full time

    Required Skills and Experience * - Bachelor's or master's degree in computer science, Software Engineering, or a related field. - Proven experience (7+ years) in SRE, automation testing - Strong skills in developing and implementing automation testing strategies and frameworks. - Solid understanding of site reliability principles and best practices. -...


  • Hyderabad, India Snaphunt Full time

    The OfferWork within a company with a solid track record of successGreat work environmentAttractive salary & benefitsThe JobYou will be responsible for :Gathering and evaluating user feedback.Providing code documentation and other inputs to technical documents.Supporting continuous improvement by investigating alternatives and new technologies and presenting...


  • Hyderabad, India Virtusa Full time

    Site Reliability engineer - CREQ188641 DescriptionPosition : SREPrimary skills: devops CI/CD pipelineLocation: HyderabadShould have proficiency in understanding of application monitoring stack(Logs, Events, Metrics and Alerts) and ability to visualize and setup end-to-end observability.Should have proficiency in industry standard monitoring tools (Dynatrace,...


  • Hyderabad, India Virtusa Full time

    Site Reliability engineer - CREQ188641 Description Position : SRE Primary skills: devops CI/CD pipeline Location: Hyderabad Should have proficiency in understanding of application monitoring stack(Logs, Events, Metrics and Alerts) and ability to visualize and setup end-to-end observability. Should have proficiency in industry standard monitoring tools...


  • hyderabad, India Virtusa Full time

    Site Reliability engineer - CREQ188641 Description Position : SRE Primary skills: devops CI/CD pipeline Location: Hyderabad Should have proficiency in understanding of application monitoring stack(Logs, Events, Metrics and Alerts) and ability to visualize and setup end-to-end observability.Should have proficiency in industry standard monitoring...


  • Hyderabad, India Snaphunt Full time

    The OfferWork within a company with a solid track record of successGreat work environmentAttractive salary & benefitsThe Job You will be responsible for : Gathering and evaluating user feedback.Providing code documentation and other inputs to technical documents.Supporting continuous improvement by investigating alternatives and new technologies and...


  • hyderabad, India Snaphunt Full time

    The Offer Work within a company with a solid track record of success Great work environment Attractive salary & benefits The Job You will be responsible for : Gathering and evaluating user feedback. Providing code documentation and other inputs to technical documents. Supporting continuous improvement by investigating alternatives and new technologies...


  • Hyderabad, India Microsoft Full time

    OverviewAre you interested in working for one of the most exciting products at Microsoft, passionate about exceeding customer expectations and advancing Microsoft's cloud first strategy? Are you interested in a start-up like the environment, passionate about cloud computing technology and driving growth in one of Microsoft's core businesses? If so, then look...


  • Hyderabad, India Microsoft Full time

    Overview Are you interested in working for one of the most exciting products at Microsoft, passionate about exceeding customer expectations and advancing Microsoft's cloud first strategy? Are you interested in a start-up like the environment, passionate about cloud computing technology and driving growth in one of Microsoft's core businesses? If so,...


  • hyderabad, India Microsoft Full time

    Overview Are you interested in working for one of the most exciting products at Microsoft, passionate about exceeding customer expectations and advancing Microsoft's cloud first strategy? Are you interested in a start-up like the environment, passionate about cloud computing technology and driving growth in one of Microsoft's core businesses? If...


  • hyderabad, India Anicalls (Pty) Ltd Full time

    The RoleMentor teammates on SRE best practices and guide technical direction Work closely with the product engineering team to rapidly deliver capabilitiesAutomate and optimize developer pipelinesBuild monitoring to assess system and pipeline healthQualifications:Proficiency in Python, Go, Ruby, or Java is a plusExpertise in Linux administration,...


  • hyderabad, India ValueLabs Full time

    Experienced in SRE or Site Reliability EngineerDesign, implement, and maintain automated processes for deploying, monitoring, and managing applications on Azure DevOps.Collaborate with cross-functional teams to optimize system performance, reliability, and scalability.Develop and maintain tools for continuous integration, continuous deployment (CI/CD), and...


  • Hyderabad, India Oriontek INC Full time

    RoleYou will be responsible for :Gathering and evaluating user feedback.Providing code documentation and other inputs to technical documents.Supporting continuous improvement by investigating alternatives and new technologies and presenting these for architectural review.Troubleshooting and debugging to optimise performance.Ideal ProfileYou possess a...


  • Hyderabad, India Quiktrak, LLC Full time

    Job Title: Azure Site Reliability Engineer (SRE) / DevOps EngineerJob Description:Summary:As an Azure Site Reliability Engineer (SRE) / DevOps Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud infrastructure on the Azure platform. This role involves managing deployments, implementing continuous...


  • Hyderabad, India Quiktrak, LLC Full time

    Job Title: Azure Site Reliability Engineer (SRE) / DevOps Engineer Job Description: Summary: As an Azure Site Reliability Engineer (SRE) / DevOps Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud infrastructure on the Azure platform. This role involves managing deployments, implementing continuous...


  • Hyderabad, India ValueLabs Full time

    Experienced in SRE or Site Reliability Engineer Design, implement, and maintain automated processes for deploying, monitoring, and managing applications on Azure DevOps.Collaborate with cross-functional teams to optimize system performance, reliability, and scalability.Develop and maintain tools for continuous integration, continuous deployment (CI/CD), and...


  • Hyderabad, India ValueLabs Full time

    Experienced in SRE or Site Reliability Engineer Design, implement, and maintain automated processes for deploying, monitoring, and managing applications on Azure DevOps.Collaborate with cross-functional teams to optimize system performance, reliability, and scalability.Develop and maintain tools for continuous integration, continuous deployment (CI/CD), and...