Data Engineer

4 hours ago


Ajmer, India OWOW Full time

What You'll Build Core Responsibilities Data Architecture & Infrastructure (40%) ● Design and implement a multi-database architecture (MongoDB, Redis, Milvus, Neo4j, BigQuery) ● Build scalable data pipelines for real-time conversation processing and personalization● Architect ETL/ELT workflows for data migration from legacy systems● Implement data partitioning, sharding, and optimization strategies for high-throughput systems ● Create data governance frameworks ensuring quality, security, and compliance Vector & Graph Database Systems (25%)● Design and optimize Milvus vector collections for semantic search (1024-dim embeddings) ● Build graph schemas in Neo4j for customer journey mapping and persona relationships● Implement HNSW indexing strategies and similarity search optimization● Create hybrid search systems combining vector, full-text, and graph queries● Monitor and tune database performance (query latency, throughput, resource utilization) ML Data Infrastructure (20%) ● Build data collection pipelines for LLM fine-tuning (conversation logs, tool executions)● Create feature stores for GNN training (customer interactions, engagement signals)● Implement data versioning and lineage tracking for ML experiments ● Design A/B testing data infrastructure with CUPED variance reduction● Build real-time feature computation pipelines for contextual bandits Analytics & Monitoring (15%) ● Design BigQuery schemas for marketing analytics and performance tracking● Create materialized views and aggregation pipelines for real-time dashboards● Implement data quality monitoring and anomaly detection ● Build observability infrastructure (Prometheus metrics, Grafana dashboards)● Develop cost optimization strategies for cloud data warehousing Technical Stack You'll Work With Databases & Storage ● MongoDB (conversation state, active sessions) ● Redis (caching, rate limiting, real-time data) ● Milvus (vector embeddings, semantic search) ● Neo4j (customer journey graphs, persona networks) ● BigQuery (analytics warehouse, historical data) Data Processing & Orchestration ● Apache Airflow or Prefect (workflow orchestration) ● Pandas, Polars (data transformation) ● Apache Spark (optional - for large-scale processing) ● dbt (data transformation and modeling) ML/AI Data Pipeline ● vLLM (LLM inference serving) ● MLflow (model registry, experiment tracking)● Sentence Transformers (embedding generation) ● PyTorch, TensorFlow (ML model training) Cloud & Infrastructure ● Google Cloud Platform (BigQuery, Cloud Storage, Compute) ● Docker & Kubernetes (containerization, orchestration) ● Terraform (infrastructure as code) ● GitHub Actions or GitLab CI (CI/CD pipelines) Programming & Tools ● Python 3.10+ (primary language) ● SQL (complex queries, query optimization) ● Shell scripting (Bash/Zsh) ● Git (version control) Requirements Must-Have Skills ● 5+ years of data engineering experience with production systems● Expert-level SQL and database design skills ● Strong Python programming (async/await, type hints, testing) ● Experience with at least 3 different database technologies (SQL, NoSQL, Vector, Graph) ● Proven track record building high-scale data pipelines (>1M records/day)● Deep understanding of data modeling (dimensional, normalized, denormalized)● Experience with cloud data warehouses (BigQuery, Redshift, or Snowflake)● Strong knowledge of data quality, validation, and governance ● Excellent debugging and optimization skills Highly Desirable ● Experience with vector databases (Milvus, Pinecone, Weaviate, Qdrant)● Experience with graph databases (Neo4j, ArangoDB, Neptune) ● Knowledge of embedding models and semantic search ● Experience with ML data pipelines (feature stores, model training data)● Understanding of A/B testing and experimental design ● Experience with real-time streaming (Kafka, Pub/Sub, Kinesis) ● Knowledge of LLMs and conversational AI systems ● Experience with data migration projects (especially large-scale) ● Background in marketing technology or customer data platformsNice-to-Have ● Experience with PyTorch Geometric or graph neural networks ● Knowledge of marketing analytics (attribution, segmentation, personalization)● Familiarity with LangChain, LangGraph, or agent frameworks ● Experience with cost optimization in cloud environments ● Contributions to open-source data engineering projects ● Experience with data compliance (GDPR, CCPA) Key Projects You'll Own Phase 1: Foundation ● Migrate 10M+ conversation vectors from Pinecone to Milvus ● Design and implement MongoDB schemas for real-time agent state● Set up Neo4j graph database with customer journey models ● Create BigQuery data warehouse with partitioned tables Phase 2: Optimization ● Build automated data quality monitoring system ● Implement caching strategies (Redis) for 10x latency reduction ● Optimize vector search queries (target:


  • Data engineer

    3 days ago


    Ajmer, India Forage AI Full time

    Experience Level: Data Engineer- 3- 7 years of relevant experience in data engineering.About Forage AI: Forage AI is a pioneering AI-powered data extraction and automation company that transforms complex, unstructured web and document data into clean, structured intelligence. Our platform combines web crawling, NLP, LLMs, and agentic AI to deliver highly...


  • Ajmer, India LTIMindtree Full time

    Let's Connect!!!We are hiring for ADB +ADF +Py spark +SQL +SynapseRoles:Specialist - Data Engineering 5 to 8 YrsSenior Specialist - Data Engineering 8 to 12 yrsLocation: Coimbatore & IndorePlease apply in below link Ga WD

  • Senior Data Engineer

    3 weeks ago


    Ajmer, India CES Full time

    We are looking for an enthusiastic and highly skilled Senior Data Engineer to join our growing team and play a key role in shaping complex data-centric solutions that power smarter decisions for our clients and internal teams.As part of our data engineering team, you’ll build and maintain scalable data systems and pipelines managing acquisition, storage,...


  • Ajmer, India EXTRAGIG Full time

    🚀 Contract Assistant – Data Engineer Support (Remote, EST Hours) 🚀📅 Start Date: Sept 10, 2025⏳ Duration: 6 months (extendable)💰 Pay: $1,000/month🕗 Work Hours: 8:00 AM – 5:30 PM ESTWe’re looking for a Contract Assistant to support a PySpark Data Engineer with daily activities. This is a remote contract role (not formal employment).What...

  • Senior Data Engineer

    4 weeks ago


    Ajmer, India SAIVA AI Full time

    We are building the future of healthcare analytics. Join us to design, build, and scale robust data pipelines that power nationwide analytics and support our machine learning systems. Our goal: pipelines that are reliable, observable, and continuously improving in production.This is a fully remote role, open to candidates based in Europe or India, with...


  • Ajmer, India Mitra AI Full time

    We are seeking a skilled UniVerse Data Administrator to ensure the structure, quality, performance, and accessibility of data within the UniVerse environment. This role will focus on database administration, performance tuning, data integrity, and operational support, while also managing batch jobs and collaborating with stakeholders across technology and...


  • Ajmer, India People Prime Worldwide Full time

    About Company: Our Client Corporation provides digital engineering and technology services to Forbes Global 2000 companies worldwide. Our Engineering First approach ensures we can execute all ideas and creatively solve pressing business challenges. With industry expertise and empowered agile teams, we prioritize execution early in the process for impactful...


  • Ajmer, India SaShr Consultants Full time

    About the companyOur client is a global, AI-powered EdFinTech company, dedicated to simplifying and democratizing access to education financing for students pursuing studies abroad. By leveraging technology and strategic partnerships, our client aims to provide seamless, transparent, and affordable financial solutions to empower the next generation of global...

  • Senior data scientist

    3 weeks ago


    Ajmer, India Delivery Hero Full time

    About Delivery Hero: As the world’s leading local delivery platform, our mission is to deliver an amazing experience, fast, easy, and to your door. We operate in over 70+ countries worldwide, powered by tech but driven by people. As one of Europe’s largest tech platforms, we enable ambitious talent to deliver solutions that create impact within our...


  • Ajmer, India KPG99 INC Full time

    HiHope you are doing wellPlease look at below mentioned Job Description and share your updated resume and mention your work authorization & Current Location.If the JD match with your skills.Role: SAP Data EngineerLocation: - India Remote Duration: Offshore 12+ Months ContractJob Description:Core Skillset Must Be:Data SphereS4 HanaSACBW Data...