Data Engineer

1 day ago


Pune, Maharashtra, India Karini Ai Full time

Role :- Data Engineer

Experience: 3 to 5 Years

Location: Pune (Baner)

Type: Full-time

Role Overview

We are looking for a skilled Data Engineer with deep AWS expertise to help operationalize and scale our SaaS platform. The ideal candidate should have strong hands-on experience in AWS analytics services (EMR, Glue, Athena, S3, Postgres/RDS), data pipeline development, and cloud-native architectures.

You will design, build, and maintain scalable data pipelines, implement secure cloud data architectures, optimize storage and compute performance, and ensure end-to-end operational reliability of our SaaS platform.

While the role is primarily AWS-focused, some exposure to AI, vector search, or modern data processing techniques is an added advantage.

Key Responsibilities:-

AWS-Centric Data Engineering

  • Design and implement ETL/ELT pipelines using AWS Glue, EMR, Lambda, and Step Functions
  • Develop optimized Athena queries, partitions, data catalog management, and metadata workflows
  • Manage and optimize Amazon S3 data lakes (performance tuning, lifecycle policies, cost optimization)
  • Set up and maintain PostgreSQL/RDS for operational and analytical workloads
  • Implement VPC, subnets, security groups, NAT gateways, and private networking for secure data operations

Data Pipeline & Platform Operations

  • Build highly scalable batch and real-time data workflows
  • Develop connectors for diverse data sources (databases, cloud storage, SaaS APIs)
  • Implement data quality validation, monitoring, and alerting across pipelines
  • Ensure high availability, reliability, and performance of data systems for our SaaS platform
  • Set up CI/CD pipelines, infrastructure automation using Terraform, and containerized workflows using Docker/Kubernetes

AI & Advanced Processing (Good to Have)

  • Basic knowledge of embeddings, tokenization, and vector DBs (OpenSearch, Pinecone, Weaviate, Chroma)
  • Experience with document processing, OCR, semantic chunking, or RAG pipelines
  • Integration exposure with LLM services (OpenAI, Bedrock, Databricks)

Required Technical Skills

  • AWS Services (Primary Focus):

    EMR, Glue, Athena, S3, RDS/Postgres, Lambda, IAM, VPC, CloudWatch
  • Data Engineering:

    Python (pandas, NumPy), SQL, ETL/ELT design, data warehousing, data modeling
  • Distributed Processing:

    Spark on EMR (mandatory), Ray (optional)
  • Streaming Systems:

    Kafka or Kinesis (good to have)
  • DevOps & Infra:

    Docker, Kubernetes, Terraform, GitHub Actions, CI/CD pipelines
  • AI/Data Tools (Advantage):

    Vector databases, MLflow/DVC, embedding generation, RAG pipelines

Required Experience

  • 3-5 years of experience as a Data Engineer
  • Strong hands-on experience with AWS analytics & data services
  • Experience managing and optimizing S3-based data lakes
  • Strong experience with PostgreSQL/RDS
  • Prior work on secure cloud architectures (IAM, VPC, subnets)
  • Experience with ETL/ELT pipeline design for high-volume workloads
  • Exposure to SaaS platform operations or multi-tenant architectures is a plus

Key Skills for Success

  • Strong analytical and problem-solving mindset
  • Deep understanding of AWS data ecosystem
  • Ability to work in a fast-paced startup environment
  • Ownership mentality and willingness to build from the ground up
  • Collaborative approach with cross-functional teams

What We Offer

  • Work on cutting-edge Gen AI products solving real-world problems
  • Collaborate with AWS on innovative AI initiatives
  • Competitive salary + equity in growing AI company
  • Professional growth in rapidly evolving AI landscape
  • Flexible, innovative culture valuing creativity and ownership

  • Data Engineer

    5 days ago


    Pune, Maharashtra, India Jash Data Sciences Full time

    Do you love solving real-world data problems with the latest and best techniques? And having fun while solving them in a team Then come join our high-energy team of passionate data people. Jash Data Sciences is the right place for you.We are a cutting-edge Data Sciences and Data Engineering startup based in Pune, India.We believe in continuous learning and...


  • Pune, Maharashtra, India Data Axle Full time ₹ 15,00,000 - ₹ 30,00,000 per year

    About Data Axle:Data Axle Inc.  has been an industry leader in data, marketing solutions, sales, and research for over 50 years in the USA. Data Axle now as an established strategic global centre of excellence in Pune. This centre delivers mission critical data services to its global customers powered by its proprietary cloud-based technology platform and...


  • Pune, Maharashtra, India NTT DATA Full time

    Req ID: 345385NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now. We are currently seeking a Business Consulting-Data Engineer with Power BI to join our team in Pune, Mahārāshtra (IN-MH), India (IN).Job...

  • Data Scientist

    2 weeks ago


    Pune, Maharashtra, India Data Axle Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    About Data Axle:Data Axle Inc.  has been an industry leader in data, marketing solutions, sales and research for over 45 years in the USA. Data Axle has set up a strategic global centre of excellence in Pune. This centre delivers mission critical data services to its global customers powered by its proprietary cloud-based technology platform and by...

  • Data Scientist

    3 days ago


    Pune, Maharashtra, India Jash Data Sciences Full time

    We are a fast-growing startup based in Pune, India, specializing in cutting-edge Data Science and Data Engineering solutions. Our team of dedicated professionals is committed to solving complex data challenges for companies worldwide.Our CultureWe foster a vibrant startup culture that values:Intellectual curiosityContinuous learningPositive work...

  • Software Engineer

    2 weeks ago


    Pune, Maharashtra, India Data Axle Full time ₹ 5,00,000 - ₹ 15,00,000 per year

    About Data Axle:Data Axle Inc.  has been an industry leader in data, marketing solutions, sales and research for over 50 years in the US. Data Axle has set up a strategic global centre of excellence in Pune. This centre delivers mission critical data services to its global customers powered by its proprietary cloud-based technology platform and leveraging...

  • Software Engineer

    2 weeks ago


    Pune, Maharashtra, India Data Axle Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Data Axle Inc. has been an industry leader in data, marketing solutions, sales and research for 50 years in the US. Data Axle has set up a strategic global centre of excellence in Pune. This centre delivers mission critical data services to its global customers powered by its proprietary cloud-based technology platform and leveraging proprietary business &...

  • Sr Software Engineer

    2 weeks ago


    Pune, Maharashtra, India Data Axle Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    About Data Axle:Data Axle Inc.  has been an industry leader in data, marketing solutions, sales and research for 50 years in the US. Data Axle has set up a strategic global centre of excellence in Pune. This centre delivers mission critical data services to its global customers powered by its proprietary cloud-based technology platform and leveraging...


  • Pune, Maharashtra, India, Maharashtra Data > Nuance. Full time

    About Data>Nuance Data>Nuance is a global privacy, data protection, and AI governance consultancy, trusted by 1,000+ organizations worldwide.We operate across Barcelona, Bangalore, and Dubai (expanding soon), with a dedicated team of data protection, governance, and regulatory specialists.We specialize in:Outsourced DPO ServicesData Protection Consultancy &...


  • Pune, Maharashtra, India Hevo Data Full time

    About HevoHevom) is a simple, intuitive, and powerful No-code Data Pipeline platform that enables companies to consolidate data from multiple software for faster analytics. Hevo powers data analytics for 2000+ data-driven companies across multiple industry verticals, includingt, Postman, ThoughtSpot, Jawa Motorcycles. By automating complex data integration...