Generative AI Engineer

4 weeks ago


Eluru, India BigRio Full time

Job Title: Generative AI Engineer (LLM Expert – AWS Focus)Location: Remote Employment Type: Ongoing ContractAbout BigRioBigRio is a Boston-based, remote-first technology consulting firm specializing in advanced data, cloud, and software engineering solutions. We partner with forward-thinking organizations to deliver scalable, secure, and high-performance technologies, with deep expertise in AI/ML, data engineering, and AWS-native architectures.Our clients span healthcare, life sciences, government, and enterprise sectors, and we’re known for tackling complex, high-impact challenges with cutting-edge innovation and measurable results.About the RoleWe’re seeking a hands-on Generative AI Engineer (LLM Expert) who combines strong AWS development experience (70%) with deep expertise in applied LLM engineering (30%).This role is ideal for an engineer who has built real-world applications using OpenAI APIs and retrieval-augmented generation (RAG) — not someone focused on traditional ML or model training. You’ll work with BigRio’s internal AI team and client partners to design, build, and optimize LLM-powered features, integrating them into cloud-native, production-ready systems.This is a senior technical role, not a research or experimental position. The focus is on building, shipping, and scaling LLM applications using OpenAI models, LangChain, and AWS infrastructure.Key ResponsibilitiesDesign, develop, and deploy AWS-based applications (Lambda, API Gateway, ECS, RDS, S3, Secrets Manager) that integrate LLM-powered features.Implement OpenAI-driven workflows, leveraging reasoning and non-reasoning models, temperature settings, and model versioning best practices.Apply prompt engineering and prompt chaining techniques to improve LLM accuracy and performance for production workloads.Build retrieval-augmented generation (RAG) pipelines using LangChain, ChromaDB, or similar frameworks.Develop FastAPI or Flask-based backends that connect to OpenAI APIs and vector databases.Build interactive front-ends and tools using Gradio or Streamlit for rapid prototyping and testing.Ensure secure, containerized deployments using Docker and integrate SSO and role-based access controls.Automate data pipelines and document workflows via Google Drive, AWS SDKs, or REST APIs.Write production-grade Python code, following clean architecture, documentation, and CI/CD best practices.Collaborate closely with AI engineers, DevOps teams, and clients to deliver enterprise-ready LLM applications.Required Qualifications5+ years of experience in professional software development, with a strong focus on AWS cloud and backend systems.3+ years of direct experience working with OpenAI APIs, GPT models, and LLM application development.Proven ability to build and deploy LLM-powered applications, not just experiment with models.Expertise in Python, FastAPI, and API-driven architecture.Strong practical experience with LangChain, ChromaDB, RAG, and prompt engineering.Proficiency in Docker, AWS IAM, and secure deployment practices.Excellent communication skills — ability to explain LLM behavior, tradeoffs, and reasoning clearly to both technical and non-technical teams.Comfortable working independently in a fast-paced, client-facing environment across time zones.Nice to HaveExperience with LangGraph or other LLM orchestration frameworks.Knowledge of vector databases like Pinecone or FAISS.Familiarity with MLOps, CI/CD pipelines, and observability for LLM workloads.Exposure to healthcare, biotech, or regulated data environments.Demonstrated experience explaining and documenting AI system design and decision-making for non-AI stakeholders.What This Role is NotTo set clear expectations, this is not a role focused on:Classical machine learning or model training (e.g., TensorFlow, PyTorch-based model design).Research, experimentation, or theoretical AI.Low-code or no-code chatbot builders.This is a pure LLM engineering and AWS application development role — building scalable, production-quality AI systems using OpenAI and related frameworks.Equal Opportunity Statement:BigRio is an equal-opportunity employer committed to creating a diverse and inclusive workplace. We value and promote diversity and prohibit discrimination based on various factors outlined by federal, state, or local laws. All qualified applicants will receive equal consideration for employment.


  • Ai engineer

    4 days ago


    Eluru, India Sutra.AI Full time

    Role: AI Engineer About Sutra. AI Sutra. AI is a rapidly growing AI Enterprise Saa S Platform company focused on building data-to-decision automation at scale . Our mission is to help enterprises transform raw data into intelligent, actionable insights through AI, automation, and decision intelligence. Role Summary We’re seeking an AI Engineer who is...


  • Eluru, India S2T AI - AI-Powered Investigations Full time

    We are seeking a skilled and resourceful developer with expertise in reverse engineering mobile applications and their network traffic. The ideal candidate will analyze undocumented APIs, implement secure bypasses, and develop robust data extraction solutions. You'll have the autonomy to select your preferred tools and programming languages based on your...


  • Eluru, India Vectorial AI Full time

    Company Description Vectorial is a simulation engine platform powered by millions of synthetic users—state-of-the-art models that capture real human behavior—to deliver instant, nuanced validation across the entire product lifecycle. Our founding team has deep academic roots from Carnegie Mellon and Stanford and we are push state-of-the-art human...


  • Eluru, India Infomatics Corp Full time

    We are seeking a talented and experienced AI and Python Engineer to design, develop, and deploy cutting-edge phone integration solutions for our Agentic AI platform within the health insurance sector. This pivotal role involves building intelligent, autonomous agents on an Interactive Voice Response (IVR) system to streamline member interactions, automate...


  • Eluru, India People Prime Worldwide Full time

    About Client:Our Client is a global IT services company headquartered in Southborough, Massachusetts, USA. Founded in 1996, with a revenue of $1.8B, with 35,000+ associates worldwide, specializes in digital engineering, and IT services company helping clients modernize their technology infrastructure, adopt cloud and AI solutions, and accelerate innovation....


  • Eluru, India Scry AI Full time

    Position: Insurance Specialist (SME – Claims, Underwriting & Compliance)Location: India (Remote)Employment Type: Full-TimeSchedule: Monday to Friday, Day ShiftExperience: 2+ Years in Insurance Operations, Claims Management, or Underwriting (experience with AI-driven insurance or digital tools preferred)Company DescriptionScry AI is a research-led...


  • Eluru, India Scry AI Full time

    Position: Insurance Specialist (SME – Claims, Underwriting & Compliance)Location: India (Remote)Employment Type: Full-TimeSchedule: Monday to Friday, Day ShiftExperience: 2+ Years in Insurance Operations, Claims Management, or Underwriting (experience with AI-driven insurance or digital tools preferred)Company DescriptionScry AI is a research-led...


  • Eluru, India Prospire Technology Services Pvt Ltd Full time

    For the qualifications section: Group the skills listed below based on their domain and intended purposes. List of skills: Cloud Platforms (AWS, Azure, GCP), Configuration Management (Ansible, Chef, Puppet), Infrastructure as Code Tools (Terraform, CloudFormation), Scripting Languages (Python, Shell), CI/CD Tools (Jenkins, GitLab CI, CircleCI),...

  • AI Graphic Designer

    2 weeks ago


    Eluru, India Parman Exclusive Full time

    AI Graphic Designer – Fashion & Jewellery📍 Remote | 💼 Part-Time / Full-TimeWe’re looking for an AI-driven Graphic Designer to bring imagination and precision together for our silver jewellery brand, Parman Exclusive.Your work will define how our products are presented across catalogues, campaigns, and digital platforms — combining technical skill...


  • Eluru, India Rivi Full time

    About Rivi We build AI-first products across travel and beyond. We’re looking for a backend-builder passionate about scalable APIs, microservices, databases, and LLM integrations to power seamless, high-performance AI tools for our customers. What you’ll do Design, build, and version REST + g RPC microservices in Python / Node.js / Type Script. Model and...