Staff Software Engineer AI

6 days ago


Chennai, Tamil Nadu, India FourKites Full time
About the Role

We are seeking an experienced Staff AI Engineer to join our AI and Data Platform team, where you'll play a pivotal role in building and scaling our next-generation AI workforce platform. You'll work on cutting-edge agent-based systems that are transforming supply chain operations for Fortune 500 companies, delivering real business value through intelligent automation.

Key Responsibilities
Technical Leadership
  • Design and implement production-scale AI agent systems and orchestration frameworks (LangGraph, LangChain, similar architectures)
  • Lead architecture for multi-agent systems handling complex business workflows
  • Optimize deployment strategies using both LLMs and SLMs based on use case requirements
  • Build natural language-configurable business process automation frameworks
  • Implement multi-modal AI systems for document understanding (tables, charts, layouts)
AI/ML Implementation & Optimization
  • Deploy and optimize LLMs/SLMs in production with fine-tuning techniques (LoRA, QLoRA, DPO)
  • Implement quantization strategies (INT8, INT4) and model distillation for edge deployment
  • Build evaluation frameworks including LLM-as-judge systems and regression testing
  • Design streaming architectures for real-time LLM responses (SSE, WebSockets)
  • Create semantic caching and embedding-based retrieval systems
  • Develop GraphRAG and long-context handling strategies (100k+ tokens)
System Architecture & Engineering
  • Design scalable microservices with comprehensive observability (LangSmith, Arize, custom telemetry)
  • Build secure multi-tenant systems with prompt injection prevention and output validation
  • Implement cost optimization through intelligent model routing and fallback strategies
  • Develop document processing pipelines with OCR and layout understanding
  • Create event-driven architectures for real-time shipment tracking and exception handling
Data & Infrastructure
  • Build data pipelines for training data curation, synthetic generation, and PII masking
  • Implement RLHF/RLAIF feedback loops for continuous improvement
  • Design experiment tracking and model registry systems (MLflow, DVC)
  • Optimize inference costs through batch processing and spot instance utilization
  • Establish model governance, audit trails, and compliance frameworks
Required Qualifications
Technical Skills
  • 8+ years software engineering, 3+ years in production AI/ML systems
  • Expertise in Python, PyTorch/JAX, and AI frameworks (LangChain, Transformers, PEFT)
  • Experience with LLMs (GPT-4, Claude, Gemini) and SLMs (Phi, Llama, Mistral)
  • Hands-on experience with:
    • Fine-tuning techniques (LoRA, QLoRA, DPO, RLHF)
    • Model optimization (quantization, distillation, pruning)
    • Vector databases and RAG architectures
    • Streaming systems and real-time processing
    • Security measures (prompt injection prevention, jailbreak detection)
  • Strong background in distributed systems, Kubernetes, and cloud platforms
Domain Knowledge(nice to have)
  • Experience with document intelligence and multi-modal AI systems
  • Understanding of supply chain operations, EDI/API integrations
  • Knowledge of token economics and consumption-based pricing models
  • Familiarity with enterprise compliance requirements (GDPR, CCPA, SOC2)
Professional Skills
  • Track record of delivering complex projects with measurable business impact
  • Experience with technical sales support, POCs, and customer success
  • Strong communication for technical and non-technical audiences
  • Data-driven decision making for model selection and cost optimization
Preferred Qualifications
  • Supply chain, logistics, or transportation management experience
  • Experience with OCR pipelines and document extraction at scale
  • Knowledge of GraphRAG and knowledge graph integration
  • Contributions to open-source AI projects (Hugging Face, Ollama)
  • Experience reducing inference costs by 50%+ through optimization
  • Familiarity with MoE architectures and constitutional AI approaches
  • Background in building usage-based billing and margin optimization
  • Experience with specialized tools (vLLM, TGI, Triton, ONNX, TensorRT)
What You'll Work On
  • Building specialized AI agents solving supply chain problems
  • Fine-tuning domain-specific models for supply chain terminology
  • Implementing hybrid architectures combining cloud LLMs with edge SLMs
  • Creating secure document intelligence systems for Fortune 500 clients
  • Developing real-time exception handling for shipment tracking
  • Building observability and evaluation frameworks for agent performance
  • Designing fallback strategies and multi-provider redundancy
Technical Environment

Models: GPT-4, Claude, Gemini, Llama 3, Mistral, Phi-3, custom fine-tuned models
Fine-tuning: LoRA/QLoRA, PEFT, DeepSpeed, bitsandbytes, Axolotl
Infrastructure: Kubernetes, AWS SageMaker/Bedrock, GPU clusters, edge devices
Frameworks: LangChain, LangGraph, vLLM, FastAPI, Transformers
Observability: LangSmith, Weights & Biases, custom telemetry
Data: PostgreSQL, Redis, Vector DBs, Kafka, feature stores

Impact & Growth

You'll directly contribute to AI initiatives generating millions in revenue while shaping systems processing millions of transactions daily. Lead technical decisions affecting 25+ engineers while mentoring the next generation of AI engineers. Be at the forefront of production AI optimization, balancing performance, cost, and latency for enterprise customers.

Who we are:

FourKites, the leader in AI-driven supply chain transformation for global enterprises and pioneer of advanced real-time visibility, turns supply chain data into automated action. FourKites' Intelligent Control Tower breaks down enterprise silos by creating a real-time digital twin of orders, shipments, inventory and assets. This comprehensive view, combined with AI-powered digital workers, enables companies to prevent disruptions, automate routine tasks, and optimize performance across their supply chain. FourKites processes over 3.2 million supply chain events daily — from purchase orders to final delivery — helping 1,600+ global brands prevent disruptions, make faster decisions and move from reactive tracking to proactive supply chain orchestration.

Working at FourKites

We provide competitive compensation with stock options, outstanding benefits and a collaborative culture for all employees around the globe, including:

  • 5 global recharge days, in addition to standard holidays, and a hybrid, flexible approach to work.
  • Parental leave for all parents, an annual wellness stipend and volunteer days also provide you with time and resources for self care and to care for others.
  • Opportunities throughout the year to learn and celebrate diversity.
  • Access to leading AI tools and foundation models, with the freedom to experiment and find creative ways to be more effective in your role
And we're always listening for new ways to support everyone in and out of the office.

  • Chennai, Tamil Nadu, India FourKites, Inc. Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    At FourKites we have the opportunity to tackle complex challenges with real-world impacts. Whether it's medical supplies from Cardinal Health or groceries for Walmart, the FourKites platform helps customers operate global supply chains that are efficient, agile and sustainable.Join a team of curious problem solvers that celebrates differences, leads with...


  • Chennai, Tamil Nadu, India Genesys Full time ₹ 12,00,000 - ₹ 24,00,000 per year

    locationsChennai (Flexible)time typeFull timeposted onPosted Todayjob requisition idJR109140Genesys empowers organizations of all sizes to improve loyalty and business outcomes by creating the best experiences for their customers and employees. Through Genesys Cloud, the AI-powered Experience Orchestration platform, organizations can accelerate growth by...

  • AI Engineer Intern

    2 days ago


    Chennai, Tamil Nadu, India DevAssure - AI Test Agent Full time

    Company DescriptionDevAssure is a game-changing AI powered Platform designed to help software development and QA teams expedite their shift-left strategy. The platform enables teams to develop, generate functional test cases, and automate them 10x faster in parallel, resulting in a faster feedback loop. Developers and testers can execute tests and assess...


  • Chennai, Tamil Nadu, India Banyan Software Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Banyan Software provides the best permanent home for successful enterprise software companies, their employees, and customers. We are on a mission to acquire, build and grow great enterprise software businesses all over the world that have dominant positions in niche vertical markets. In recent years, Banyan was named the #1 fastest-growing private software...


  • Chennai, Tamil Nadu, India Banyan Software Full time ₹ 8,00,000 - ₹ 16,00,000 per year

    Banyan Software provides the best permanent home for successful enterprise software companies, their employees, and customers. We are on a mission to acquire, build and grow great enterprise software businesses all over the world that have dominant positions in niche vertical markets. In recent years, Banyan was named the #1 fastest-growing private software...


  • Chennai, Tamil Nadu, India PayPal Full time

    The CompanyPayPal has been revolutionizing commerce globally for more than 25 years. Creating innovative experiences that make moving money, selling, and shopping simple, personalized, and secure, PayPal empowers consumers and businesses in approximately 200 markets to join and thrive in the global economy.We operate a global, two-sided network at scale that...


  • Chennai, Tamil Nadu, India Trimble Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Staff Software EngineerWe are seeking an experienced and technically proficient Staff Software Engineer to lead our team in building cutting-edge, enterprise-level backend services. This role is perfect for a passionate engineer who thrives on solving complex problems and is dedicated to creating highly scalable, cloud-native solutions that drive digital...

  • Software Specialist

    1 week ago


    Chennai, Tamil Nadu, India Scalingwolf AI Full time ₹ 5,00,000 - ₹ 15,00,000 per year

    Company DescriptionScalingwolf AI is an innovative company offering the first AI-powered Business Health Tracker designed to help entrepreneurs identify and address hidden losses. By utilizing advanced artificial intelligence, the platform analyzes business data to uncover profit leaks, growth barriers, and actionable strategies for accelerated and...


  • Chennai, Tamil Nadu, India PayPal Full time ₹ 8,00,000 - ₹ 16,00,000 per year

    The CompanyPayPal has been revolutionizing commerce globally for more than 25 years. Creating innovative experiences that make moving money, selling, and shopping simple, personalized, and secure, PayPal empowers consumers and businesses in approximately 200 markets to join and thrive in the global economy. We operate a global, two-sided network at scale...


  • Chennai, Tamil Nadu, India Bolt Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Bolt is on a mission to democratize commerce. We relentlessly prioritize our retailers—putting their brands front and center while enabling frictionless shopping at any touchpoint in the customer journey. At the center of it all is our rapidly growing universal shopper network—Bolt merchants such as Revolve, Luisa via Roma, Benefit Cosmetics, Kendra...