Immediate Start Site Reliability Engineer

23 hours ago


Hyderabad, Telangana, India Inspire Brands Hyderabad Support Center Full time

About Inspire Brands Hyderabad Support Center

Inspire Brands is disrupting the restaurant industry through digital transformation and operational efficiencies. The company's technology hub, Inspire Brands Hyderabad Support Center, India, will lead technology innovation and product development for the organization and its portfolio of distinct brands. The Inspire Brands Hyderabad Support Center will focus on developing new capabilities in data science, data analytics, eCommerce, automation, cloud computing, and information security to accelerate the company's business strategy. Inspire Brands Hyderabad Support Center will also host an innovation lab and collaborate with start-ups to develop solutions for productivity optimization, workforce management, loyalty management, payments systems, and more.

Job Title

Site Reliability Engineer

Position Summary

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, distributed, fault-tolerant systems enabling online ordering for thousands of restaurants across multiple brands. SRE ensures that Inspire Digital Platform (IDP) services have reliability and uptime appropriate to users' needs and a fast rate of improvement. Additionally, SREs will keep an ever-watchful eye on our systems' capacity and performance. SRE is also responsible for performing regular capacity planning exercises. Much of our software development focuses on optimizing existing systems, building infrastructure, and eliminating toil through automation.

Essential Job Responsibilities

Note: These statements are not intended to be an exhaustive list of all responsibilities and duties.

Technical

- Review current workload patterns, understand the business case, and prioritize areas of weakness within the platform through log and metric investigation as well as application profiling.
- Work with senior engineering and testing team members to build tools and recommend testing strategies for problem prevention and detection.
- Employ deep troubleshooting skills to improve availability, performance, and security, ensuring services are designed with 24/7 availability and operational readiness and rigor.
- Perform in-depth postmortems on production incidents to assess effective business impact and to let Engineering learn from them.
- Create dashboards and alerts for monitoring the IDP platform; define key metrics and service level indicators, and ensure relevant metric data is collected to create actionable alerts for SRE and the Network Operation Center.
- Participate in the 24/7 on-call rotation.
- Automate toil by building software and automation for seamless application deployment and third-party tool integration.
- Ensure the platform holds a high degree of reliability (at least three 9s).
- Define non-functional requirements as part of the product lifecycle to influence new designs, standards, and methods for scalable, highly available distributed systems.
- Own technically intricate issues that cross between DevOps, databases, networking, code, infrastructure, and people, and drive them to satisfactory completion.
- Provide recommendations and feedback in design reviews and review sessions.

Knowledge, Skills and Abilities

Education: 4-year degree in Computer Science, Information Technology, or a related field.

Experience: Minimum 5 years of experience as a Software Engineer, Platform, SRE, or DevOps engineer supporting large-scale SaaS production B2C or B2B cloud platforms. Hands-on problem-solving and troubleshooting.

Knowledge and skills (general and technical):

- Development skills: Java, TypeScript, Python; OOP expertise is a must.
- Hands-on Azure cloud experience, particularly with AKS, API Management, Azure Cache for Redis, Azure Blob Storage, Cosmos DB, Service Bus, and Azure Functions.
- Proficiency in monitoring, APM, and profiling tools: New Relic, Splunk, Prometheus, Grafana.
- Working experience with containers, Kubernetes, and Helm.
- Functional knowledge of cloud networking, firewalls, ingress and egress controllers, and service mesh.
- Experience with Auth0, secret management, and Cloudflare (CDN, load balancer, cache, firewall, and Workers features).
- Experience with ArgoCD, GitLab CI/CD, and Terraform (Infrastructure as Code).
- Strong communication skills and the ability to explain technical concepts clearly.
- A willingness to dive into understanding, debugging, and improving any layer of the stack.

Technical Skills (level of competency: 3 on a scale of 5 for the skills mentioned below)

- Cloud Provider: Azure
- Core Services: Elastic Pool SQL, Application Gateway, API Management (APIM), Key Vaults, AKS (Azure Kubernetes Service), VMSS (Virtual Machine Scale Sets), VM
- Networking: NSG (Network Security Groups), Private Endpoints, Private Linked Service, VNet, Subnets, WAF (Web Application Firewall), GeoReplication
- Storage: Storage Accounts
- Messaging and Events: EventHub, EventGrid, Azure Service Bus (Namespaces, Queues, Topics)
- Identity and Security: Managed Identities, Workload Identities, Private DNS, Auth0
- Containerization and Orchestration: Kubernetes (K8s) for container orchestration, Helm for Kubernetes package management, Docker for containerization
- Monitoring and Observability: New Relic, Splunk
- Automation and Scripting: PowerShell, Python

Other requirements (licenses, certifications, specialized training)

Good-to-have certifications:

- Certified Kubernetes Administrator / Developer
- AZ-104: Microsoft Certified Azure Administrator Associate
- AZ-305: Designing Microsoft Azure Infrastructure Solutions



  • Hyderabad, India Sonata Software Full time

    Hello Connections, greetings of the day! We have immediate openings for an SRE. Role - Site Reliability Engineer Experience - 7 to 12 years Work Location - Hyderabad Notice Period - Immediate Interested candidates can share their CVs to -


  • Hyderabad, India Atyeti Inc Full time

    Job Description: We are seeking a highly skilled and motivated Site Reliability Engineer (SRE) to join our growing team. - Bachelor’s degree in Computer Science, Engineering, or equivalent practical experience. - 7+ years’ experience in site reliability, successfully deploying and managing large-scale distributed systems. - Understanding of SRE concepts...


  • Hyderabad, Telangana, India Sonata Software Full time

    Role: Site Reliability Engineer Location: Hyderabad Notice Period: Immediate to 20 Days Employment Type: Full Time Experience 7–12 years in site reliability, cloud-based data infrastructure, data pipeline observability, automation, and high-availability engineering within EdTech platforms (2U) Primary Skills (Must-Have) AWS, CI/CD, Jenkins, IAAC, Terraform,...


  • Hyderabad, India Chase Bank Full time

    Job Description Guide and shape the future of technology at a globally recognized firm, driven by pride in ownership. As a Senior Manager of Site Reliability Engineering at JPMorgan Chase within Consumer & Community Banking, you are the non-functional requirement owner and champion for the applications in your remit. You are a key influencer in your...


  • Hyderabad, India Sonata Software Full time

    Role: Site Reliability Engineer Location: Hyderabad Notice Period: Immediate to 20 Days Employment Type: Full Time Experience 7–12 years in site reliability, cloud-based data infrastructure, data pipeline observability, automation, and high-availability engineering within EdTech platforms (2U) Primary Skills (Must-Have) AWS, CI/CD, Jenkins, IAAC,...


  • Hyderabad, India Sonata Software Full time

    Role: Site Reliability Engineer Location: Hyderabad Notice Period: Immediate to 20 Days Employment Type: Full Time Experience 7–12 years in site reliability, cloud-based data infrastructure, data pipeline observability, automation, and high-availability engineering within EdTech platforms (2U) Primary Skills (Must-Have) AWS, CI/CD, Jenkins, IAAC, Terraform,...


  • Hyderabad, India Sonata Software Full time

    Role: Site Reliability Engineer Location: Hyderabad Notice Period: Immediate to 20 Days Employment Type: Full Time Experience 7–12 years in site reliability, cloud-based data infrastructure, data pipeline observability, automation, and high-availability engineering within EdTech platforms (2U) Primary Skills (Must-Have) AWS, CI/CD, Jenkins, IAAC,...

