Site Reliability Engineer/Architect

1 month ago


BangaloreAny Location, IN Grizmo Labs Full time

Responsibilities :

- Own the Infrastructure, and APM and work with Developers and Systems engineers to Build, Release, Monitor, and run the reliability of the service exceeding the agreed SLAs.

- Write software to automate API-driven tasks at scale and contribute to the product codebase in Java, JS, React, Node, Go, and Python.

- Write automation to reduce toil and eliminate manual, repeatable tasks.

- Work with Ansible, Puppet, Chef, Terraform, or another config management/orchestration suite, know where it's broken, work toward fixing them, and explore new alternatives.

- Define and accelerate the implementation of support processes, tools, and best practices Maintain services once they are live by measuring and monitoring availability, latency, and overall system reliability.

- Handle cross-team performance issues from identification of the cause, to determining the areas of improvement and driving those actions to closure.

- Performance and maturity baselining of Systems, tools maturity, coverage, metrics, technology, and engineering practices.

- Define, Measure, and Improve Reliability Metrics (SLO/SLI), Observability (Monitoring, Logging-Tracing solutions), Ops process (Incident, Problem Mgmt) and streamline - automate release management.

- Build dashboards to provide visibility into the performance of the applications.

- Create chaos in the production environment purposefully in a controlled manager to validate the reliability of systems.

- Mentor and coach other SREs in the organization.

- Provide written and verbal updates to executives and the stakeholders of the application in the organization.

- Understand the current process, and system setup and propose the improvements needed in the processes, and technology so that the application exceeds the desired Service Level Objective.

- Troubleshoot, debug and diagnose operational issues and drive them to closure.

- Understanding of software delivery life cycles, particularly Agile/Lean, and DevOps.

Requirements :

- A strong believer in automation to bring in sustained continuous improvement by automating Toil, and Runbooks, improving the ability of the applications to auto-heal leading to improved reliability.

- 15+ years of experience in the Development and Operations of applications/services in production that have uptime over 99.9%.

- 8+ years of experience as a SRE in handling web-scale applications.

- Strong hands-on coding experience in one or more programming languages such as Python, Golang, Java, Bash, etc.

- Good understanding of Observability (monitoring, logging, tracing, metrics) and chaos engineering concepts.

- Proficiency in using Observability tools (for example : New Relic, Datadog, etc) for monitoring, logging, and tracing.

- Expert level hands-on knowledge in public cloud platform AWS and/or Google Cloud Platform.

- A professional-level certificate in one of the public clouds is highly desirable.

- Must have hands-on experience in using configuration management systems such as Ansible or SaltStack and infrastructure automation tools like Terraform or CloudFormation.

- Should have used altering systems such as Pager Duty.

- Should have implemented solutions around Service Level Indicators (SLIs) and Service Level Objectives (SLOs) for services.

- Measurement should have been within a system and across systems in distributed systems.

- Should have supported Production Incidents (PIs) on critical applications of a company.

- Proven experience in handling large-scale and growing infrastructure across Data Centers and heterogeneous Cloud platforms.

- Experience as a service owner in managing large - geographically diverse stakeholders.

- Ability to work with creative - fast-growing engineering teams and motivate them to deliver their best work.

- History of driving innovation.

(ref:hirist.tech)

  • Any Location, IN RapidBraiins Full time

    Job Description :We are seeking a highly skilled and experienced Senior DevOps Site Reliability Engineer to join our dynamic team. The ideal candidate will have a proven track record of success in DevOps, Site Reliability Engineering (SRE), or development roles within SaaS-based or enterprise applications. As a Senior DevOps SRE Engineer, you will play a...

  • Solution Architect

    1 month ago


    Bangalore/Any Location, IN Repletio Full time

    Hasura is looking for an experienced Solutions Architect to work directly with Hasura customers to facilitate the growth and adoption of the product within the organization.GraphQL is changing the way developers and teams build software today. The Hasura GraphQL Engine is an open-source tool that makes it fast and easy to compose a GraphQL API for secure...

  • WovV Technologies

    1 month ago


    Any Location/Bangalore, IN WovV Technologies Full time

    As Senior DevOps you are expected to :- Continuously improving reliability, scalability and performance of all our systems and platforms.- Monitor, prevent and resolve incidents across the systems and assets.- Automate workflows to speed up project development.- Work closely with Dev the development teams supporting and to optimizing deployments,...


  • Any Location, IN FINCENT SOFTWARE SERVICES PRIVATE LIMITED Full time

    Responsibilities :Help to eliminate operational toil - seek to automate repetitive operations work.Work with product development teams to ensure that our new features are able to meet SLAs.Help mature the delivery process for teams; defining/managing automated deployment pipelines such as Jenkins pipelines, designing canary release deploys, building in...

  • Salesforce Architect

    2 weeks ago


    Any Location, IN Swift Strategic Staff Solutions INC Full time

    Job Description:We are seeking a talented Salesforce Architect with expertise in Zuora integration to join our team. The ideal candidate will have 4 to 7 years of experience in Salesforce architecture and design, with a strong focus on Zuora integration for subscription management. The primary responsibility will be to design and implement scalable and...

  • Enterprise Architect

    4 hours ago


    Any Location, IN InOrg Global Full time

    Responsibilities :- Architectural Design : Design and architect robust, scalable, and secure technical solutions that align with business requirements and best practices.- Cloud Integration: Lead the integration of Java/Python/Any programming language applications with cloud services, ensuring optimal utilization of cloud resources and adherence to cloud...

  • Solution Architect

    4 weeks ago


    Any Location/Bangalore, IN ADVANSOFT Full time

    Responsibilities :- Architect and design scalable, reliable, and high-performance Java-based solutions to address business requirements.- Collaborate with stakeholders to understand business goals, technical requirements, and constraints.Lead the technical design and development of enterprise-level Java applications, ensuring adherence to best practices and...


  • Bangalore/Gurgaon/Gurugram, IN Codersbrain technology pvt ltd Full time

    Key Responsibilities :- Provide expert production support for application teams utilizing our platform, ensuring high availability, reliability, and performance.- Diagnose and resolve complex issues in production environments, collaborating closely with development teams and stakeholders.- Implement and maintain monitoring, alerting, and logging solutions to...

  • Principal Architect

    1 month ago


    Any Location, IN Enhanceplus Full time

    Primary Responsibilities :- Define software architecture and provide clear communication with the team and outside stakeholders to ensure the design is being followed- Design and build an architecture that is scalable, reliable, highly available and performant.- Assist with tracking and managing product risks and identify ways to reduce or eliminate risk.-...


  • Hyderabad/Bangalore/Mumbai, IN Genesis HR Services Full time

    Role/ Job Title: Site Reliability EngineerFunction/ Department: Information TechnologyJob Purpose:The role entails the responsibility to provide technical leadership for projects throughout the project delivery lifecycle. It will include the creation of technical documentation, ensuring proper resourcing and skills on the project, efficient code management,...


  • Any Location, IN CareerNet Technologies Full time

    Requirements :11+ years of experience in engineering, with a preference for candidates with a mix of start-up and large-company experience.eComm domain expertise is a bonus.Hands-on experience driving software transformations within high-growth environments at scale.Mastery in cross-functional consensus building and influencing without given...

  • Cloud Architect

    4 hours ago


    Any Location, IN InOrg Global Full time

    Responsibilities :- Design, implement, and maintain cloud-based infrastructure solutions using best practices for scalability, reliability, and security.- Lead the implementation of DevOps practices including CI/CD pipelines, infrastructure as code, automated testing, and deployment automation.- Collaborate with development and operations teams to streamline...


  • Any Location, IN Harness.io Full time

    Job Description :This is an amazing opportunity to be a Senior Software Engineer in a high-growth, high-potential startup and to work on redefining the Software Engineering Insights product module within the Harness developer platform. In this role, you will be responsible for architecting, designing, developing, and delivering high-quality software that...

  • Software Architect

    4 weeks ago


    Any Location, IN TalentXo Full time

    About the role - Software ArchitectSoftware Architect creates the technological vision, drives technology strategy and is responsible for ensuring the technical design of the platform fulfills the business requirements. He/she works with engineering leaders on the definition and delivery of highly scalable and secure SaaS solutions.This position requires...

  • Software Engineer

    5 hours ago


    Any Location, IN Kjbnlabs Full time

    Senior Software Engineer - Backend MicroservicesDo you have a passion for building robust and scalable backend systems? Are you an expert in crafting clean APIs and leveraging the power of cloud events? If so, we want to hear from you!About the Role:We are searching for a talented and experienced Senior Software Engineer to join our growing backend...

  • Senior Data Engineer

    4 hours ago


    Any Location, IN Coders Brain Technology Pvt. Ltd. Full time

    Position Name : Sr Data EngineerExperience Required : 8+yearsSalary : As per the Market Standard or Else whatever budget you haveNotice period : Immediate joiner Job Type : Full timeLocation : RemoteJob Description :Solution Architect (Databricks) :Candidate having 8+ years of total experience in Data engineering (development/design), in which at least 5+...


  • Any Location, IN Troy Consultancy Full time

    Job Summary :- We are seeking a dynamic Apriso Solution Architect with 2 to 6 years of experience to join our team.- The ideal candidate will have a solid understanding of manufacturing operations and expertise in Apriso FlexNet platform implementation and customization.- The role requires proficiency in designing and architecting solutions using Apriso...


  • Any Location, IN CallTek Full time

    Wireless Support Engineer About the job :Staff4Me is currently seeking a knowledgeable and dedicated Wireless Support Engineer to join our team. As a Wireless Support Engineer, you will be responsible for providing technical support and troubleshooting assistance for wireless networks, ensuring the stable and optimal performance of wireless networks, and...

  • MDM Architect

    2 weeks ago


    Any Location, IN Swift Strategic Staff Solutions INC Full time

    About the Role :We are seeking a highly experienced MDM Architect with a strong background in banking to join our team and play a critical role in designing and implementing our Master Data Management (MDM) solution. You will leverage your expertise in data governance, data security, and banking regulations to architect a robust and scalable platform that...

  • Grafana Architect

    3 weeks ago


    Any Location, IN CoralRidge Management Consultant Full time

    Location : RemoteGrafana Architect Total exp : 17 + yearsNotice Period : Max 30Days.Interview Process : 2 Virtual Rounds of Interview Job Description :Accountabilities :- Craft Observability Strategy: Design and execute the SRE Observability program, aligned with leadership goals and roadmap.- Mastermind Grafana Solutions: Architect a scalable and...