Python Systems Engineer – LLM Evaluation
8 hours ago
Greetings and thank you for visiting our job post.

Supercoder is an AI-powered career development platform connecting developers worldwide to remote job opportunities with competitive payment.

- Type of work: 100% Remote

Overview:
The client is hiring Python/Linux Engineers to design complex system-level evaluation tasks for LLMs. Design advanced benchmark tasks that evaluate the capabilities of modern Large Language Models (LLMs) such as ChatGPT, Claude, and other AI systems. This role focuses on building realistic, technically challenging engineering scenarios that test model reasoning, debugging, and problem-solving abilities.

What You Will Do
- Design complex, realistic engineering tasks to evaluate LLM reasoning, coding, debugging, and system understanding.
- Build Python- and Linux-based workflows, pipelines, and multi-step scenarios.
- Create reproducible environments using Python, Shell, and CLI tools.
- Develop tasks that measure code comprehension, debugging, refactoring, and optimization.
- Write clear technical documentation: problem statements, constraints, expected outputs, and detailed edge cases.
- Use LLM tools (ChatGPT, Claude, etc.) to validate tasks and analyze model performance.

Must-Have Qualifications
- 5+ years of professional software development experience.
- Strong Python: modular code design, debugging complex programs, structured codebases.
- Proficiency with Linux, Shell scripting, Bash, and command-line tools.
- Solid technical English writing ability.
- Strong reasoning, analytical thinking, and problem-solving skills.
- Ability to design logical multi-step engineering scenarios.

Nice-to-Have Skills
- Experience creating benchmark datasets, online judge problems, coding tests, or technical challenges.
- Background with ICPC, Codeforces, Kaggle, or competitive programming.
- Familiarity with Docker, Git, and CI/CD pipelines.
- Experience with ML/AI or data-intensive engineering environments.

Who Will Excel in This Role
- Engineers who enjoy designing difficult problems rather than simple feature development.
- Developers who are strong at debugging, identifying subtle issues, and understanding complex system interactions.
- Engineers who work well independently and can define their own approach.
- Individuals interested in LLM evaluation, AI reliability, and technical task design.
-
Python Systems Engineer – LLM Evaluation
6 hours ago
Delhi, India Supercoder Full time
Greetings and thank you for visiting our job post. Supercoder is an AI-powered career development platform connecting developers worldwide to remote job opportunities with competitive payment. Type of work: 100% Remote. Overview: The client is hiring Python/Linux Engineers to design complex system-level evaluation tasks for LLMs. Design advanced benchmark tasks...
-
Senior LLM Engineer
3 weeks ago
New Delhi, India RingCentral Full time
Job Description: We are seeking an experienced AI Engineer with a strong background in Natural Language Understanding (NLU) who is passionate about pushing the boundaries of Conversational AI. In this role, you will design, develop, and deploy scalable AI solutions leveraging LLMs, Retrieval-Augmented Generation (RAG), and prompt engineering techniques to...
-
Software Engineer – AI/ML, LLMs
6 days ago
New Delhi, India Artemis ABA Inc. Full time
About Us: Artemis ABA Inc. is a leading provider of behavioral health solutions built on the Salesforce enterprise cloud. Our mission is to simplify ABA therapy operations through secure, scalable, and intuitive mobile technology.
Position Overview: We are seeking a highly skilled AI/ML Software Engineer with hands-on experience in designing and deploying...
-
Software Engineer – AI/ML, LLMs
2 days ago
New Delhi, India Artemis ABA Inc. Full time
About Us: Artemis ABA Inc. is a leading provider of behavioral health solutions built on the Salesforce enterprise cloud. Our mission is to simplify ABA therapy operations through secure, scalable, and intuitive mobile technology.
Position Overview: We are seeking a highly skilled AI/ML Software Engineer with hands-on experience in designing and deploying...
-
LLM Python
1 week ago
Delhi, NCR, India Codefeast Full time ₹ 15,00,000 - ₹ 25,00,000 per year
Role Overview: This position is within a project with one of the foundational LLM companies. The goal is to assist these foundational LLM companies in enhancing their Large Language Models. One way we help these companies improve their models is by providing them with high-quality proprietary data. This data serves two main purposes: first, as a basis for...
-
LLM Engineer
1 week ago
Delhi, India Insight Global Full time
Position: GenAI LLM Engineer
Location: Remote in India
Working hours: 11am-7pm IST
Pay range: $18-$20 USD per hour
JOB DESCRIPTION
Insight Global is sourcing for an AI/LLM Engineer to sit remotely in India, joining a global consulting firm. This position will support various Digital Products Teams in the AI Center of Excellence, to support building key...
-
Agentic AI Developer – LLM Systems
3 weeks ago
New Delhi, India AIMLEAP Full time
Agentic AI Developer – LLM Systems & Automation
Experience: 3–5 Years
Location: Remote (WFH)
Mode of Engagement: Full-time
No of Positions: 4
Educational Qualification: B.E./B.Tech/M.E./M.Tech in Computer Science, AI/ML, or related
Industry: IT – AI/ML & Automation Services
Notice Period: Immediate Joiner
What We Are Looking For
- AI & LLM Development: Strong...
-
Python with AI Engineer
1 week ago
New Delhi, India Infomatics Corp Full time
We are seeking a talented and experienced AI and Python Engineer to design, develop, and deploy cutting-edge phone integration solutions for our Agentic AI platform within the health insurance sector. This pivotal role involves building intelligent, autonomous agents on an Interactive Voice Response (IVR) system to streamline member interactions, automate...
-
Senior LLM Engineers
1 week ago
New Delhi, India MillionLogics Full time
Company Description: MillionLogics, a trusted Oracle Partner, is an IT solutions provider with a global presence, blending innovation and strategic vision. With offices in London, UK, and a development hub in Hyderabad, India, we offer a diverse range of services, including Data & AI, Cloud Solutions, and IT Consulting. Our mission is to transform enterprises...