Staff IT SRE Engineer

2 months ago


Bangalore, India NVIDIA Full time

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing, an era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work.

NVIDIA is looking for an IT SW SRE Engineer to join the IT team and support R&D and Manufacturing activities. In this role, you will design, build, maintain, supervise, and lead large-scale production systems with high efficiency and availability, combining solutions, software, and systems engineering practices, with an emphasis on Manufacturing sites. This is a highly specialized area that demands knowledge across systems, networking, coding, databases, capacity management, continuous delivery and deployment, and open-source cloud-enabling technologies like Kubernetes. The role also requires a tech-savvy engineer who is highly experienced with the Linux platform and all the components that come with it.

We are looking for an IT SRE Engineer, SW, for our IT Engineering team, working out of India or Vietnam. As part of this team, you will take on exciting technical challenges by analyzing, solving, and designing vital services, platforms, and automations while always thinking about reliability, scalability, resilience, security, and performance. You will also be part of the team responsible for supporting the uptime and availability of critically important production on-premises and cloud services distributed across multiple regions. You'll help to create more consistent, automated push-button environments across all tiers, proactively test and tune all aspects of the infrastructure, streamline CI/CD processes, supervise and respond to system notifications and alerts, and continually work to optimize and improve the performance, security, and reliability of our systems. Are you ready for this challenge?

What you'll be doing:

  • Work closely with other software engineers within the organization to identify and implement build and packaging infrastructure requirements and automated tests to accelerate development for Autonomous Vehicles.

  • Design and build high-end architectures and sophisticated solutions, with an emphasis on Manufacturing sites.

  • Maintain services once they are live by measuring and monitoring availability, latency, and overall system health, including proactive actions based on reports and deep data analysis.

  • Address complicated, cross-platform issues spanning OS, storage, networking, and databases, on-premises or in cloud-based IaaS/PaaS/SaaS environments; handle live production incidents; debug and solve application and infrastructure issues; and follow and implement SRE standard methodologies.

  • Keep up to date with security and proactively identify, diagnose, and solve sophisticated security issues.

What we need to see:
  • Proven experience, mainly in automating infrastructure configuration and management as a DevOps Engineer.

  • Required: demonstrable experience with containerization (Docker) and orchestration (Kubernetes).

  • Expertise in large-scale project management and matrix management, with the ability to handle several projects in parallel effectively and to prioritize and implement in a fast-paced environment.

  • Experience designing and building automations with standard-methodology tools available on the market.

  • Background with Infrastructure as Code (SaltStack, Puppet, Terraform, Ansible).

  • Experience with cloud-based platforms (AWS, Google Cloud, Azure).

  • Proficiency in building and managing highly available and scalable IT infrastructure, with knowledge of Docker/virtualization, Git, Gerrit, Perforce, Continuous Delivery, Continuous Monitoring, etc.

  • Ability to communicate, both verbally and in writing, with users, vendors, and management.

  • B.Sc. in an engineering field and 7+ years of experience.

  • Extensive experience with the Linux platform.

Ways to stand out from the crowd:
  • Systematic problem-solving approach, coupled with strong interpersonal skills and a sense of ownership and drive.

  • Ability to debug and optimize code and automate routine tasks.

  • Good familiarity with the ITIL framework.

  • Experience with problem solving and decision making.

  • Deep understanding of Software Configuration Management (SCM) processes and tools such as Perforce, Git, and Subversion, including multi-site development.

Widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer to you and your family.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.


