Only 24h Left: Lead Site Reliability Engineer

6 days ago


Hyderabad, India AutoRABIT Full time
AutoRABIT Profile

AutoRABIT is the leader in DevOps and CI/CD for SaaS platforms such as Salesforce. Its unique metadata-aware capability makes Release Management, Version Control, and Backup & Recovery complete, reliable, and effective. AutoRABIT’ s highly scalable framework covers the entire DevOps cycle, which makes it the favorite platform for companies, especially large ones who require industrial strength and robustness in their deployment environment. AutoRABIT increases the productivity and the velocity of developers which makes it a critical tool for development teams, especially large ones with complex applications. AutoRABIT recently received some institutional funding and is well-positioned for growth. The company is headquartered in CA, USA.

Key Responsibilities

- Architect, implement, and maintain scalable, resilient, and secure infrastructure using AWS.- Develop and manage infrastructure as code (IaC) using Terraform to automate deployments and streamline infrastructure management.- Design and implement CI/CD pipelines for automated deployments and smooth application delivery.- Contribute to and maintain monitoring, logging, and alerting systems for comprehensive visibility into infrastructure health.- Troubleshoot system performance issues, identify bottlenecks, and implement solutions to enhance reliability and scalability.- Participate in sustainable incident response, perform root cause analyses (RCAs), and ensure prompt resolution of incidents with minimal disruption.- Work with development teams to ensure applications are designed for reliability, scalability, and performance.- Assist internal and customer-facing teams with deployments, including VPN and other security-related infrastructure.- Support AutoRABIT services through on-call rotations, ensuring timely resolution of critical issues.- Automate manual tasks, such as user provisioning in production and test environments.- Drive automation initiatives to improve efficiency, reliability, and deployment speed.- Mentor peers and team members through knowledge sharing, training, and collaboration.- Foster a culture of continuous improvement and blameless postmortems to learn from incidents.- Ensure security best practices are followed across infrastructure and deployments.- Adhere to internal controls and compliance requirements, ensuring all infrastructure aligns with security and regulatory standards.- Responsibility to adhere to set internal controls.

Required Skills and Experience

Technical Expertise:

- Proven experience designing and managing AWS-based infrastructure.- Strong hands-on experience with Terraform for infrastructure as code.- Working knowledge of CI/CD pipelines and related tools like Jenkins, AWS CodePipeline, or equivalent.- Proficiency in scripting languages such as Bash or Python for automation tasks.- Knowledge of programming languages like Python.- Experience with configuration management tools like Ansible or AWS SSM.- Expertise in monitoring tools like Grafana, or Elasticsearch.

Soft Skills:

- Strong problem-solving and troubleshooting abilities.- Excellent written and verbal communication skills, particularly in working with global teams.- Leadership qualities with the ability to challenge the status quo and drive innovation.- A collaborative mindset with a focus on mentoring and knowledge sharing.

Security and Compliance:

- Knowledge of security best practices for infrastructure and application deployments.- Experience ensuring compliance with regulatory requirements.

Preferred Qualifications:

- Previous experience in a Lead SRE, DevOps, or DevSecOps role.- Experience in performing RCAs and conducting blameless postmortems.- Familiarity with containerization and orchestration tools like Docker and Kubernetes.- Experience with on-call rotations and supporting mission-critical applications.- Exposure to SOC2, PCI-DSS, or similar compliance frameworks.

Education and Qualification

- Bachelor's or Master’s degree in Computer Science, Information Technology, or a related field.

Location: Hyderabad

Work Mode: Hybrid – 3 days a week at Office

Experience: 8-10 years

Website:

  • Hyderabad, India Arrise Solutions (India) Pvt. Ltd Full time

    Site Reliability EngineerLocation: HyderabadReports to: Head – DevOps & TechOpsType of Position: Full TimeIntroduction:PragmaticPlay is one of the world’s leading suppliers of online slots, casinos, live dealers and bingo games with new and exciting products and verticals added on a continuous basis. Pragmatic Play currently employs over 2,000 people in...


  • Hyderabad, Telangana, India Live Connections Full time

    We are looking for a highly skilled Site Reliability Engineering Lead to join our team at Live Connections in Hyderabad. As a key member of our organization, you will be responsible for leading and managing a team of engineers to ensure the reliability, scalability, and performance of our systems.**Estimated Salary: ₹25,00,000 - ₹35,00,000 per...


  • Hyderabad, India GeekBull Consulting Full time

    Job Code: GBC-2411129 Job Role: Senior Site Reliability EngineerJob Type : Contract - to - Hire ( C2H )Duration : 6 MonthsExperience: 7 - 10 YearsLocation: HyderabadWork Location : Hyderabad/ RemoteShift Timings : 6 PM to 3 AM ISTAbout Company:We collaborate with a wide range of clients, from startups to industry giants in sectors like...


  • Hyderabad, India GeekBull Consulting Full time

    Job Code: GBC-2411129 Job Role: Senior Site Reliability Engineer Job Type : Contract - to - Hire ( C2 H ) Duration : 6 Months Experience: 7 - 10 Years Location: Hyderabad Work Location : Hyderabad/ Remote Shift Timings : 6 PM to 3 AM IST About Company: We collaborate with a wide range of clients, from startups to industry...


  • Hyderabad, India GeekBull Consulting Full time

    Job Code: GBC-2411129Job Role: Senior Site Reliability EngineerJob Type : Contract - to - Hire ( C2H )Duration : 6 MonthsExperience: 7 - 10 YearsLocation: HyderabadWork Location : Hyderabad/ RemoteShift Timings : 6 PM to 3 AM ISTAbout Company:We collaborate with a wide range of clients, from startups to industry giants in sectors like Healthcare,...


  • Hyderabad, India GeekBull Consulting Full time

    Job Code: GBC-2411129Job Role: Senior Site Reliability EngineerJob Type : Contract - to - Hire ( C2 H )Duration : 6 MonthsExperience: 7 - 10 YearsLocation: HyderabadWork Location : Hyderabad/ RemoteShift Timings : 6 PM to 3 AM ISTAbout Company:We collaborate with a wide range of clients, from startups to industry giants in sectors like Healthcare,...


  • Hyderabad, India SID Global Solutions Full time

    Job Role: Site Reliability Engineer (SRE) – GCPLocation: Hyderabad (Work from Office only)Job Type: Full TimeAbout SIDGS:SIDGS is a premium global systems integrator and global implementation partner of Google corporation, providing Digital Solutions & Services to Fortune 500 companies. Our Digital solutions go across following domains: User Experience,...


  • Hyderabad, Telangana, India Live Connections Full time

    We are seeking an experienced Site Reliability Engineering Team Lead to join our team at Live Connections in Hyderabad.About the RoleThis is a leadership position that requires a strong technical background in site reliability engineering and experience in managing teams. The ideal candidate will have a proven track record of driving projects to successful...


  • Hyderabad, Telangana, India Live Connections Full time

    About Live ConnectionsWe're a cutting-edge technology firm dedicated to delivering innovative solutions. Our team is passionate about crafting exceptional products that drive business success.Job Description:System Reliability Engineer ManagerThis role offers an exciting opportunity to lead our site reliability engineering team, driving strategies for...


  • Hyderabad, India GeekBull Consulting Full time

    Job Code: GBC-2411129 Job Role: Senior Site Reliability Engineer Job Type : Contract - to - Hire ( C2H ) Duration : 6 Months Experience: 7 - 10 Years Location: Hyderabad Work Location : Hyderabad/ Remote Shift Timings : 6 PM to 3 AM IST About Company: We collaborate with a wide range of clients, from startups to industry...


  • Hyderabad, India GeekBull Consulting Full time

    Job Code: GBC-2411129Job Role: Senior Site Reliability EngineerJob Type: Contract - to - Hire ( C2H )Duration: 6 Months Experience: 7 - 10 YearsLocation: HyderabadWork Location: Hyderabad/ RemoteShift Timings : 6 PM to 3 AM ISTAbout Company:We collaborate with a wide range of clients, from startups to industry giants in sectors like Healthcare, Education,...


  • Hyderabad, India GeekBull Consulting Full time

    Job Code: GBC-2411129 Job Role: Senior Site Reliability Engineer Job Type : Contract - to - Hire ( C2H ) Duration : 6 Months Experience: 7 - 10 Years Location: Hyderabad Work Location : Hyderabad/ Remote Shift Timings : 6 PM to 3 AM IST About Company: We collaborate with a wide range of clients, from startups to industry giants in sectors like...


  • Hyderabad, India GeekBull Consulting Full time

    Job Code: GBC-2411129Job Role: Senior Site Reliability EngineerJob Type: Contract - to - Hire ( C2H )Duration: 6 Months Experience: 7 - 10 YearsLocation: HyderabadWork Location: Hyderabad/ RemoteShift Timings : 6 PM to 3 AM ISTAbout Company:We collaborate with a wide range of clients, from startups to industry giants in sectors like Healthcare, Education,...


  • Hyderabad, India Live Connections Full time

    We are looking for Manager Site Reliability Engineer in Hyderabad locationRoles and Responsibilities :Position will manage 5 to 10 engineers both directly and indirectly. The engineers will include Site Reliability Engineers, Observability Engineers, Performance Engineers, DevSecOps Engineers, and others These individuals will vary from entry level to senior...


  • Hyderabad, India Live Connections Full time

    We are looking for Manager Site Reliability Engineer in Hyderabad location Roles and Responsibilities : Position will manage 5 to 10 engineers both directly and indirectly. The engineers will include Site Reliability Engineers, Observability Engineers, Performance Engineers, DevSecOps Engineers, and others These individuals will vary from entry level to...


  • hyderabad, India GeekBull Consulting Full time

    Job Code: GBC-2411129 Job Role: Senior Site Reliability Engineer Job Type : Contract - to - Hire ( C2H ) Duration : 6 Months Experience: 7 - 10 Years Location: Hyderabad Work Location : Hyderabad/ Remote Shift Timings : 6 PM to 3 AM IST About Company: We collaborate with a wide range of clients, from startups to industry giants in sectors like...


  • hyderabad, India SID Global Solutions Full time

    Job Description: Site Reliability Engineer (SRE) – Apigee Level 1Experience: 2 to 10 yearsThe Site Reliability Engineer (SRE) Level 1 will be responsible for maintaining and improving the reliability, availability, and performance of the systems. This entry-level role is ideal for someone who passionate about learning and developing their skills in system...


  • Hyderabad, India SID Global Solutions Full time

    Job Description: Site Reliability Engineer (SRE) – Apigee Level 1 GCP EXPERINCE IS MUST Experience: 2 to 6 years The Site Reliability Engineer (SRE) Level 1 will be responsible for maintaining and improving the reliability, availability, and performance of the systems. This entry-level role is ideal for someone who passionate about learning and...


  • hyderabad, India SID Global Solutions Full time

    Job Description: Site Reliability Engineer (SRE) – Apigee Level 1 GCP EXPERINCE IS MUST Experience: 2 to 6 years The Site Reliability Engineer (SRE) Level 1 will be responsible for maintaining and improving the reliability, availability, and performance of the systems. This entry-level role is ideal for someone who passionate about learning and developing...


  • Hyderabad, India SID Global Solutions Full time

    Job Description: Site Reliability Engineer (SRE) – Apigee Level 1GCP EXPERINCE IS MUSTExperience: 2 to 6 yearsThe Site Reliability Engineer (SRE) Level 1 will be responsible for maintaining and improving the reliability, availability, and performance of the systems. This entry-level role is ideal for someone who passionate about learning and developing...