Python Systems Engineer – LLM Evaluation

13 hours ago


Delhi, India Supercoder Full time

Greetings and thank you for visiting our job post.Supercoder is an AI-powered career development platform connecting developers worldwide to remote job opportunities with competitive payment.Type of work: 100% RemoteOverview:The client is hiring Python/Linux Engineers to design complex system-level evaluation tasks for LLMs. Design advanced benchmark tasks that evaluate the capabilities of modern Large Language Models (LLMs) such as ChatGPT, Claude, and other AI systems.This role focuses on building realistic, technically challenging engineering scenarios that test model reasoning, debugging, and problem-solving abilities.What You Will DoDesign complex, realistic engineering tasks to evaluate LLM reasoning, coding, debugging, and system understanding.Build Python- and Linux-based workflows, pipelines, and multi-step scenarios.Create reproducible environments using Python, Shell, and CLI tools.Develop tasks that measure code comprehension, debugging, refactoring, and optimization.Write clear technical documentation: problem statements, constraints, expected outputs, and detailed edge cases.Use LLM tools (ChatGPT, Claude, etc.) to validate tasks and analyze model performance.Must-Have Qualifications5+ years of professional software development experience.Strong Python: modular code design, debugging complex programs, structured codebases.Proficiency with Linux, Shell scripting, Bash, and command-line tools.Solid technical English writing ability.Strong reasoning, analytical thinking, and problem-solving skills.Ability to design logical multi-step engineering scenarios.Nice-to-Have SkillsExperience creating benchmark datasets, online judge problems, coding tests, or technical challenges.Background with ICPC, Codeforces, Kaggle, or competitive programming.Familiarity with Docker, Git, and CI/CD pipelines.Experience with ML/AI or data-intensive engineering environments.Who Will Excel in This RoleEngineers who enjoy designing difficult problems rather than simple feature development.Developers who are strong at debugging, identifying subtle issues, and understanding complex system interactions.Engineers who work well independently and can define their own approach.Individuals interested in LLM evaluation, AI reliability, and technical task design.



  • New Delhi, India Supercoder Full time

    Greetings and thank you for visiting our job post.Supercoder is an AI-powered career development platform connecting developers worldwide to remote job opportunities with competitive payment.- Type of work: 100% RemoteOverview:The client is hiring Python/Linux Engineers to design complex system-level evaluation tasks for LLMs. Design advanced benchmark tasks...

  • LLM Python

    1 week ago


    Delhi, NCR, India Codefeast Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Role Overview:This position is within a project with one of the foundational LLM companies. The goal is to assist these foundational LLM companies in enhancing their Large Language Models.One way we help these companies improve their models is by providing them with high-quality proprietary data. This data serves two main purposes: first, as a basis for...

  • LLM Engineer

    1 week ago


    Delhi, India Insight Global Full time

    Position : GenAI LLM Engineer Location : Remote in IndiaWorking hours: 11am-7pm ISTPay range: $18-$20 USD per hourJOB DESCRIPTIONInsight Global is sourcing for an AI/LLM Engineer to sit remotely in India, joining a global consulting firm. This position will support various Digital Products Teams in the AI Center of Excellence, to support building key...

  • LLM Engineer

    1 week ago


    Delhi, India Insight Global Full time

    Position : GenAI LLM Engineer Location : Remote in India Working hours: 11am-7pm IST Pay range: $18-$20 USD per hour JOB DESCRIPTION Insight Global is sourcing for an AI/LLM Engineer to sit remotely in India, joining a global consulting firm. This position will support various Digital Products Teams in the AI Center of Excellence, to support building key...

  • LLM Engineer

    1 week ago


    Delhi, India Insight Global Full time

    Position: GenAI LLM EngineerLocation: Remote in IndiaWorking hours: 11am-7pm ISTPay range: $18-$20 USD per hourJOB DESCRIPTIONInsight Global is sourcing for an AI/LLM Engineer to sit remotely in India, joining a global consulting firm. This position will support various Digital Products Teams in the AI Center of Excellence, to support building key...

  • LLM Engineer

    1 week ago


    Delhi, India Insight Global Full time

    Position: GenAI LLM EngineerLocation: Remote in IndiaWorking hours: 11am-7pm ISTPay range: $18-$20 USD per hourJOB DESCRIPTIONInsight Global is sourcing for an AI/LLM Engineer to sit remotely in India, joining a global consulting firm. This position will support various Digital Products Teams in the AI Center of Excellence, to support building key...

  • Senior LLM Engineer

    4 weeks ago


    New Delhi, India RingCentral Full time

    Job Description:We are seeking an experienced AI Engineer with a strong background in Natural Language Understanding (NLU) who is passionate about pushing the boundaries of Conversational AI. In this role, you will design, develop, and deploy scalable AI solutions leveraging LLMs, Retrieval-Augmented Generation (RAG), and prompt engineering techniques to...

  • LLM - Python

    7 days ago


    Delhi, Delhi, India ProEchoes Technology Full time ₹ 5,00,000 - ₹ 15,00,000 per year

    Company DescriptionProEchoes Technology is a dynamic and innovative team driven to deliver exceptional value by leveraging a blend of technologists, domain experts, architects, quality assurers, and managers. Our commitment to excellence ensures successful deliverables and fosters long-term client relationships. With a strong focus on disruptive technologies...


  • New Delhi, India Artemis ABA Inc. Full time

    About Us Artemis ABA Inc. is a leading provider of behavioral health solutions built on the Salesforce enterprise cloud. Our mission is to simplify ABA therapy operations through secure, scalable, and intuitive mobile technology.Position Overview We are seeking a highly skilled AI/ML Software Engineer with hands-on experience in designing and deploying...


  • Delhi, India MillionLogics Full time

    Company Description As a trusted Oracle Partner, MillionLogics stands as a global powerhouse in IT solutions, merging innovation, expertise, and strategic vision. With offices in London, UK, and a development hub in Hyderabad, India, MillionLogics bridges sharp business foresight with world-class technical talent. Our mission is to transform enterprises...