SRE and DevOps ML Framework

3 days ago


Bangalore, India ITC Infotech Full time

🔍 We're Hiring! I'm excited to share that we're looking for an SRE and DevOps - ML Framework engineer to join our team at ITC Infotech. Below is the JD for your reference.

Job Functions:
● You will be a member of our AI Platform Team, supporting the next generation AI architecture for various research and engineering teams within the organization.
● You'll partner with vendors and the infrastructure engineering team on security and service availability.
● You'll fix production issues, including performance and functional issues, together with engineering teams, researchers, and data scientists.
● Diagnose and solve customer technical problems.
● Participate in training customers and prepare reports on customer issues.
● Be responsible for customer service improvements and recommend product improvements.
● Write support documentation.
● You'll design and implement zero-downtime operations and monitoring to achieve a highly available service (99.999%).
● As a support engineer, find opportunities to automate as part of the problem management process, creating automation to avoid issues.
● Define engineering excellence for operational maturity.
● You'll work together with AI platform developers to provide the CI/CD model to deploy and configure the production system automatically.
● Develop and follow operational standard processes for tools and automation development, including style guides, versioning practices, source control, branching and merging patterns, and advising other engineers on development standards.
● Deliver solutions that accelerate the activities phenomenal engineers would perform, through automation, deep domain expertise, and knowledge sharing.

Required Skills:
● Demonstrated ability in designing, building, refactoring and releasing software written in Python.
● Hands-on experience with ML frameworks such as PyTorch, TensorFlow, Triton.
● Ability to handle framework-related issues, version upgrades, and compatibility with data processing / model training environments.
● Experience with AI/ML model training and inferencing platforms is a big plus.
● Experience with LLM fine-tuning systems is a big plus.
● Debugging and triaging skills.
● Cloud technologies like Kubernetes and Docker, and Linux fundamentals.
● Familiarity with DevOps practices and continuous testing.
● DevOps pipelines and automation: app deployment/configuration and performance monitoring.
● Test automation, Jenkins CI/CD.
● Excellent communication, presentation, and leadership skills to work and collaborate with partners, customers and engineering teams.
● Well organized and able to manage multiple projects in a fast-paced and demanding environment.
● Good spoken, reading, and written English.

Job Location: Bangalore

If you're interested or know someone who might be a great fit, please reach out or apply.



  • Bangalore, India ACL Digital Full time

    ACL Digital (An Alten Group Company) is hiring for SRE / DevOps (ML Framework). Interested candidates can reach out at dineshkumar.s@acldigital.com. Experience: 5+ Years. Location: Bellandur, Bengaluru. Notice Period: Less than 2 Weeks. Key Responsibilities: Demonstrated ability in designing, building, refactoring and releasing software written in Python. Hands-on...


  • Bangalore, India ITC Infotech Full time

    We're Hiring! I'm excited to share that we're looking for SRE and DevOps - ML Framework to join our team at ITC Infotech. Below is the JD for your reference. Job Functions: ● You will be a member of our AI Platform Team, supporting the next generation AI architecture for various research and engineering teams within the organization. ● You'll partner...


  • Bangalore, Karnataka, India ITC Infotech Full time

    AI Platform - SRE DevOps ML Framework. Work timings: 2:30 PM to 11:00 PM. Required Skills: Demonstrated ability in designing, building, refactoring and releasing software written in Python. Hands-on experience with ML frameworks such as PyTorch, TensorFlow, Triton. Ability to handle framework-related issues, version upgrades, and compatibility with data processing / model...


  • Bangalore, India Prospance Inc Full time

    SRE & DevOps Engineer (ML/AI Platform) Contract Position | Global E-Commerce Leader | Hybrid About the Opportunity We're partnering with a leading global e-commerce company to find an exceptional SRE & DevOps Engineer to join their AI Platform Team. This is your chance to shape the future of machine learning infrastructure that powers innovation for millions...


  • Bangalore, India Prospance Inc Full time

    SRE & DevOps Engineer (ML/AI Platform) Contract Position | Global E-Commerce Leader | Hybrid We're partnering with a leading global e-commerce company to find an exceptional SRE & DevOps Engineer to join their AI Platform Team. This is your chance to shape the future of machine learning infrastructure that powers innovation for millions of users worldwide....