Data Engineer II

1 day ago


Prayagraj, India ClearDemand Full time

Job Summary:Building on the foundation of the SDE-I role, the DE- II position takes on a greater level of responsibility and leadership. You'll play a crucial role in driving the evolution and efficiency of our data collection and analytics platform, capable of handling terabyte-scale data and billions of data points. Key ResponsibilitiesLead the design, development, and optimization of large-scale data pipelines and infrastructures using technologies like Apache Airflow, Spark, Kafka, and more. Architect and implement distributed data processing solutions to handle terabyte-scale datasets and billions of records efficiently across multi-region cloud infrastructure (AWS, GCP, DO). Develop and maintain real-time data processing solutions for high-volume data collection operations using technologies like Spark Streaming and Kafka. Optimize data storage strategies using technologies such as Amazon S3, HDFS, and Parquet/Avro file formats for efficient querying and cost management. Build and maintain high-quality ETL pipelines, ensuring robust data collection and transformation processes with a focus on scalability and fault tolerance. Collaborate with data analysts, researchers, and cross-functional teams to define and maintain data quality metrics, implement robust data validation, and enforce security best practices. Mentor junior engineers (SDE-I) and foster a collaborative, growth-oriented environment. Participate in technical discussions, contributing to architectural decisions, and proactively identifying improvements for scalability, performance, and cost-efficiency. Ensure application performance monitoring (APM) is in place, utilizing tools like Datadog, New Relic, or similar to proactively monitor and optimize system performance, detect bottlenecks, and ensure system health. Implement effective data partitioning strategies and indexing for performance optimization in distributed databases such as DynamoDB, Cassandra, or HBase. Stay current with advancements in data engineering, orchestration tools, and emerging cloud technologies, continually enhancing the platform’s capabilitiesQualifications & Experience:4-5+ years of hands-on experience with Apache Airflow and other orchestration tools for managing large-scale workflows and data pipelines. Expertise in AWS technologies, Athena, AWS Glue, DynamoDB, Apache Spark, PySpark, SQL, and NoSQL databases. Experience in designing and managing distributed data processing systems that scale to terabyte and billion-scale datasets using cloud platforms like AWS, GCP, or Digital Ocean. Proficiency in web crawling frameworks, including Node.js, HTTP protocols, Puppeteer, Playwright, and Chromium for large-scale data extraction. Experience with monitoring and observability tools such as Grafana, Prometheus, Elasticsearch, and familiarity with monitoring and optimizing resource utilization in distributed systems. Strong understanding of infrastructure as code using Terraform, automated CI/CD pipelines with Jenkins, and event-driven architecture with Kafka. Experience with data lake architectures and optimizing storage using formats such as Parquet, Avro, or ORC. Strong background in optimizing query performance and data processing frameworks (Spark, Flink, or Hadoop) for efficient data processing at scale. Knowledge of containerization (Docker, Kubernetes) and orchestration for distributed system deployments. Deep experience in designing resilient data systems with a focus on fault tolerance, data replication, and disaster recovery strategies in distributed environments. Strong data engineering skills, including ETL pipeline development, stream processing, and distributed systems. Excellent problem-solving abilities, with a collaborative mindset and strong communication skills.


  • Senior Data Engineer

    2 weeks ago


    Prayagraj, India SAIVA AI Full time

    We are building the future of healthcare analytics. Join us to design, build, and scale robust data pipelines that power nationwide analytics and support our machine learning systems. Our goal: pipelines that are reliable, observable, and continuously improving in production.This is a fully remote role, open to candidates based in Europe or India, with...

  • Data Engineer

    2 weeks ago


    Prayagraj, India Alternative Path Full time

    Alternative Path is seeking skilled software developers to collaborate on client projects with an asset management firm. In this role, you will collaborate with individuals across various company departments to shape and innovate new products and features for our platform, enhancing existing ones. You will have a large degree of independence and trust, but...

  • Data Engineer

    2 weeks ago


    Prayagraj, India Alternative Path Full time

    Alternative Path is seeking skilled software developers to collaborate on client projects with an asset management firm. In this role, you will collaborate with individuals across various company departments to shape and innovate new products and features for our platform, enhancing existing ones. You will have a large degree of independence and trust, but...

  • Data & AI Architect

    4 weeks ago


    Prayagraj, India Delphi Consulting Middle East Full time

    Ready to embark on a journey where your growth is intertwined with our commitment to making a positive impact? Join the Delphi family - where Growth Meets Values.At Delphi Consulting Pvt. Ltd., we foster a thriving environment with a hybrid work model that lets you prioritize what matters most. Interviews and onboarding are conducted virtually, reflecting...

  • Data Modeler

    2 weeks ago


    Prayagraj, India Tredence Inc. Full time

    Job Location- Bangalore, Chennai, Pune, Kolkata, Gurugram Exp Level: 5+ YrsRole & responsibilitiesMust have total 5+ yrs. of IT experience Mandatory experience in Consumer Packaged Goods (CPG) or Retail industriesSolution Architect for Data modelling Understanding of Enterprise datasets Sales, Procurement, Finance, Supply Chain, Logistics, R&D, Advanced...

  • Data Modeler

    2 weeks ago


    Prayagraj, India Tredence Inc. Full time

    Job Location- Bangalore, Chennai, Pune, Kolkata, Gurugram Exp Level: 5+ YrsRole & responsibilitiesMust have total 5+ yrs. of IT experience Mandatory experience in Consumer Packaged Goods (CPG) or Retail industriesSolution Architect for Data modelling Understanding of Enterprise datasets Sales, Procurement, Finance, Supply Chain, Logistics, R&D, Advanced...

  • Senior data scientist

    3 weeks ago


    Prayagraj, India Analyttica Datalab Full time

    Senior Data ScientistLocation: BangaloreAbout the RoleWe are seeking a Senior Data Scientist with deep expertise in data science methodologies and exceptional proficiency in Python. You will lead end-to-end analytical initiatives — from problem framing and data preparation to model development, validation, deployment, and monitoring. This role requires...

  • Senior Data Scientist

    2 weeks ago


    Prayagraj, India Tonik Full time

    Job Title: Senior Data Scientist Location: DLF Cybercity, Porur, Chennai- India Experience: 8 - 12 YearsAbout the Job:We are looking for a highly skilled and impact-oriented Senior Data Scientist to design, develop, and scale machine learning solutions that support critical decision-making across Credit Risk, Customer Growth, Marketing, and Operations in...

  • Senior Data Scientist

    2 weeks ago


    Prayagraj, India Tonik Full time

    Job Title: Senior Data Scientist Location: DLF Cybercity, Porur, Chennai- India Experience: 8 - 12 YearsAbout the Job:We are looking for a highly skilled and impact-oriented Senior Data Scientist to design, develop, and scale machine learning solutions that support critical decision-making across Credit Risk, Customer Growth, Marketing, and Operations in...


  • Prayagraj, India OptiLnX Full time

    Remote 1-2 days in a month the candidate should visit the office - any location of client (The location would be Chennai, Pune, Noida, Gurgaon, Indore, Bangalore or Hyderabad) Shift Timings - 11am to 8:30 pm  Budget - 70K Position Overview:  We are seeking a highly skilled and experienced Senior Cloud Data Engineer to join our team for a Cloud Data...