AI Data Engineer

1 week ago


Narela, India Peak Trust Global Real Estate Full time

Location: Remote
Type: Full-time
Experience: 3+ Years
Salary: up to 60K/Month

Role Summary

We are looking for a hands-on AI Data Engineer who can independently manage end-to-end data workflows, including data collection, document processing, dataset preparation, retrieval pipelines, model fine-tuning, and data visualization. This role requires strong technical skills across Python, automation, ML tooling, and analytical reporting.

Key Responsibilities (Technical)

1. Data Acquisition & Automation

  • Build automated data collection workflows using tools such as Firecrawl, Playwright, Scrapy, or similar frameworks
  • Extract multi-format documents (PDFs, HTML, text, images)
  • Handle large-scale crawling, rate limits, error handling, and scheduling

2. Document Processing & Transformation

  • Clean and process unstructured documents
  • Apply OCR (Tesseract, PaddleOCR) for scanned files
  • Convert and structure data using PyPDF2, pymupdf, BeautifulSoup, etc.
  • Prepare data in formats such as JSON, JSONL, or CSV

3. Dataset Preparation

  • Segment and structure text for ML training
  • Create Q&A datasets, summaries, instruction-response pairs, and labeled text
  • Build high-quality datasets compatible with fine-tuning frameworks

4. Retrieval & Indexing Pipelines

  • Implement document chunking strategies
  • Generate embeddings and manage vector databases (Qdrant, Pinecone, Weaviate)
  • Build retrieval workflows using LangChain or LlamaIndex
  • Optimize retrieval accuracy and latency

5. Model Training & Fine-Tuning

  • Run fine-tuning jobs using HuggingFace Transformers, LoRA/QLoRA, or similar methods
  • Monitor training performance and refine datasets
  • Package and deploy fine-tuned models

6. Data Visualization & Analytics

  • Create analytical charts, trends, and insights using Pandas, Matplotlib, Seaborn, and Plotly
  • Build simple internal dashboards or visual summaries for reports
  • Transform raw datasets into meaningful visual insights

7. Automation & Infrastructure

  • Write modular, maintainable Python scripts
  • Containerize workflows with Docker
  • Maintain version control with Git
  • Ensure reproducibility and pipeline stability

Required Technical Skills

  • Strong proficiency in Python
  • Experience with Firecrawl, Playwright, Scrapy, or similar tools
  • Strong background in document parsing, text processing, and OCR
  • Familiarity with LangChain or LlamaIndex
  • Experience with vector databases
  • Hands-on experience with HuggingFace, Transformer models, and fine-tuning
  • Ability to write clean, efficient data pipelines
  • Experience with Matplotlib, Seaborn, Plotly, or other visualization tools
  • Comfort using Docker and Git

Nice to Have

  • Experience serving models or building small APIs (FastAPI)
  • Exposure to GPU training environments
  • Background in large-scale unstructured data work
  • Ability to create lightweight dashboards (Plotly Dash, Streamlit)

Ideal Candidate

  • Comfortable owning full pipelines independently
  • Detail-oriented and analytical
  • Strong problem-solving ability
  • Can work with minimal supervision
  • Enjoys building structured systems from scratch


  • Software Engineer

    7 days ago


    Narela, India NextDimension AI Full time

    Location: Gurgaon. Compensation: INR 12-24 LPA base salary + bonus + equity. Level: Senior/Lead. NextDimension AI is a US-based technology startup building AI Agents in Healthcare and Finance, established by a team of distinguished AI/ML Scientists and Engineers from Google, Amazon, and Snowflake. We're empowering Enterprises by building sophisticated,...


  • Narela, India Zelar Full time

    The Mission: We are looking for a Technical Storyteller. You are an engineer at heart but a consultant in practice. Your mission is to accompany our Field Sales Representatives to client meetings and turn their business problems into technical solutions on the Google Cloud Platform. You don't just write code; you design the future of our clients' data...


  • Narela, India Tumeryk Full time

    Company Description: Tumeryk is a security and governance platform tailored for Agentic AI infrastructure. We assist enterprises in discovering, securing, and governing AI agentic applications, chatbots, and large language models across their cloud and internal environments. Our offerings include AI Trust Score™ Guardrails for enforcing real-time controls,...


  • Narela, India NextDimension AI Full time

    Company: NextDimension AI. Location: Delhi, India. Job Type: Full-time. Compensation: Cash INR 9-14 LPA + Equity. About Us: NextDimension AI is a fast-growing healthcare AI startup dedicated to revolutionizing the patient experience. We leverage cutting-edge AI and automation to streamline administrative processes for healthcare providers in the United States. We are...

  • AI/ML Architect

    2 weeks ago


    Narela, India Whatjobs IN C2 Full time

    About the Company: Transnational AI Private Limited is a deep-tech organization building intelligent digital platforms that combine modern event-driven architecture, cloud-native systems, and AI/ML-powered intelligence. Job Role: We are hiring a senior Technical Architect to design and lead the backend development and system design for real-time, event-driven...

  • AI Infra Intern

    3 weeks ago


    Narela, India Bharat.Law Full time

    Company Description: Bharat.Law (Bharat Technologies, Inc.) is democratising Indian legal expertise by deploying NLP and AI technologies throughout the legal lifecycle. Role Description: This is a contract on-site role for an AI Infra Intern located in Noida/Delhi. The AI Infra Intern will be responsible for assisting in the development and maintenance of AI...


  • Narela, India Whatjobs IN C2 Full time

    About Intellectyx: Intellectyx is an AI-native, digital innovation company transforming enterprises through agentic, autonomous, and data-driven platforms. We enable organizations to evolve from traditional digital systems to self-learning ecosystems powered by advanced engineering, contextual observability, and AI-driven decision intelligence. Role Overview...

  • Manager, Operations

    3 weeks ago


    Narela, India NextDimension AI Full time

    Company: NextDimension AI. Location: Delhi, India. Job Type: Full-time. Compensation: Cash INR 10-18 LPA + Equity. About Us: NextDimension AI is a fast-growing healthcare AI startup dedicated to revolutionizing the patient experience. We leverage cutting-edge AI and automation to streamline administrative processes for healthcare providers in the United States. We...

  • Data Engineer

    3 weeks ago


    Narela, India Coforge Full time

    Coforge is Hiring: Azure Data Engineer. Location: Greater Noida / Pune / Hyderabad. Experience Domain: Insurance domain experience is mandatory. Joining: Immediate joiners preferred. Mandatory Skills: Azure Data Bricks (ADB), Python, SQL. Good to Have: Snowflake. This role involves working closely with our clients to deliver high-quality data solutions. Experience: 6+ years of...