Data Engineer II

18 hours ago


Aligarh, India ClearDemand Full time

Job Summary:

Building on the foundation of the SDE-I role, the DE-II position takes on a greater level of responsibility and leadership. You'll play a crucial role in driving the evolution and efficiency of our data collection and analytics platform, which handles terabyte-scale data and billions of data points.

Key Responsibilities:

  • Lead the design, development, and optimization of large-scale data pipelines and infrastructure using technologies like Apache Airflow, Spark, Kafka, and more.
  • Architect and implement distributed data processing solutions that handle terabyte-scale datasets and billions of records efficiently across multi-region cloud infrastructure (AWS, GCP, DO).
  • Develop and maintain real-time data processing solutions for high-volume data collection operations using technologies like Spark Streaming and Kafka.
  • Optimize data storage strategies using technologies such as Amazon S3, HDFS, and Parquet/Avro file formats for efficient querying and cost management.
  • Build and maintain high-quality ETL pipelines, ensuring robust data collection and transformation processes with a focus on scalability and fault tolerance.
  • Collaborate with data analysts, researchers, and cross-functional teams to define and maintain data quality metrics, implement robust data validation, and enforce security best practices.
  • Mentor junior engineers (SDE-I) and foster a collaborative, growth-oriented environment.
  • Participate in technical discussions, contribute to architectural decisions, and proactively identify improvements for scalability, performance, and cost-efficiency.
  • Ensure application performance monitoring (APM) is in place, using tools like Datadog, New Relic, or similar to proactively monitor and optimize system performance, detect bottlenecks, and ensure system health.
  • Implement effective data partitioning and indexing strategies for performance optimization in distributed databases such as DynamoDB, Cassandra, or HBase.
  • Stay current with advancements in data engineering, orchestration tools, and emerging cloud technologies, continually enhancing the platform’s capabilities.

Qualifications & Experience:

  • 4-5+ years of hands-on experience with Apache Airflow and other orchestration tools for managing large-scale workflows and data pipelines.
  • Expertise in AWS technologies, Athena, AWS Glue, DynamoDB, Apache Spark, PySpark, SQL, and NoSQL databases.
  • Experience in designing and managing distributed data processing systems that scale to terabyte-scale, billion-record datasets using cloud platforms like AWS, GCP, or DigitalOcean.
  • Proficiency in web crawling frameworks, including Node.js, HTTP protocols, Puppeteer, Playwright, and Chromium for large-scale data extraction.
  • Experience with monitoring and observability tools such as Grafana, Prometheus, and Elasticsearch, and familiarity with monitoring and optimizing resource utilization in distributed systems.
  • Strong understanding of infrastructure as code using Terraform, automated CI/CD pipelines with Jenkins, and event-driven architecture with Kafka.
  • Experience with data lake architectures and optimizing storage using formats such as Parquet, Avro, or ORC.
  • Strong background in optimizing query performance and data processing frameworks (Spark, Flink, or Hadoop) for efficient data processing at scale.
  • Knowledge of containerization (Docker, Kubernetes) and orchestration for distributed system deployments.
  • Deep experience in designing resilient data systems with a focus on fault tolerance, data replication, and disaster recovery strategies in distributed environments.
  • Strong data engineering skills, including ETL pipeline development, stream processing, and distributed systems.
  • Excellent problem-solving abilities, with a collaborative mindset and strong communication skills.



  • Aligarh, India INSPYR Solutions Full time

    Position 1: MLOps Engineer II (Mid-Senior-Level) | Location: Remote (Night Shift – 10 PM to 7 AM CST) | Contract: 12 months (renewable) | Start Date: December 1, 2025 | MLOps Engineer II (Mid-Level) Overview: Proficient MLOps engineer capable of independently managing production model deployments, pipelines, and infrastructure operations. Responsibilities:...


  • Aligarh, India Quantiphi Full time

    Company Profile: Quantiphi is an award-winning Data Science and Machine Learning Software and Services Company focused on helping organizations translate the big promise of Machine Learning technologies into quantifiable business impact. We were founded on the belief that machine learning and artificial intelligence are transformative technologies that will...


  • Aligarh, India Aspect Software Full time

    Job Title: Technical Support Engineer II | Location: India, Remote | Shift Timing: Shift 1: 4:00 PM to 1:00 AM IST; Shift 2: 10:30 PM to 7:30 AM IST | About Aspect Software: Aspect Software develops world-class Workforce Engagement Management software that empowers businesses to achieve operational excellence. We are committed to fostering a collaborative and dynamic...


  • Aligarh, India Innova ESI Full time

    Role: Big Data Engineer – Azure | Experience: 10+ Years | Location: Bangalore | Immediate Joiners Only | Big Data: Mandatory Skills: Azure Databricks, Azure Data Factory, Azure Function apps, Apache Spark, Scala, Java, Apache Kafka, event stream and Big Data | Optional Skills: Airflow, Python | Roles & Responsibilities: Overall, experience in IT industry, including 5 years in...


  • Aligarh, India BioSales Full time

    Data Platform Engineer – B2B Intelligence Systems (Life Sciences) | Location: Remote | Type: Full-Time | About BioSales: BioSales partners with contract research organizations (CROs) and life sciences companies to provide comprehensive sales and go-to-market services. We lead the entire sales process from prospecting new clients and closing deals to account...


  • Aligarh, India canadainternationalprojects Full time

    Contract Type: Contract/Consultant | Location: Remote | Client: Leading Canadian Enterprise | Work Schedule: Eastern Time Zone (5:00 PM - 1:00 AM IST) | About the Opportunity: We are seeking an experienced Senior Quality Engineer for an exciting contract opportunity with a premier Canadian client. This role focuses on ensuring the quality of enterprise-scale data...

  • Senior Data Engineer

    3 weeks ago


    Aligarh, India Amicon Hub Services Full time

    Must Have Skills: 1. Strong expertise in SQL (complex queries, optimization, data modeling). 2. Hands-on experience with SSAS (SQL Server Analysis Services). 3. Deep understanding of OLAP cubes and multidimensional data models. 4. Experience working with BI tools (e.g., Power BI, Tableau, or equivalent).


  • Aligarh, India PASO Co Full time

    Job description | We’re Hiring: Data Operations Lead | Shift: 10:00 AM – 07:00 PM IST | Positions: 1 | Type: Full-Time | What You’ll Be Doing: ● Lead and manage the Data Operations team responsible for data collection, enrichment, validation, and delivery. ● Implement and monitor data quality metrics, identify discrepancies, and drive process improvements. ●...


  • Aligarh, India Integrated Wireless Solutions Full time

    Job Summary: We are seeking a highly experienced Senior Data Scientist with 7-8 years of hands-on expertise in Python, Advanced Analytics, Generative AI (GenAI) technologies, and AWS storage solutions. The ideal candidate should also have knowledge of BI tool implementation, along with a good understanding of CSS for data presentation. Data visualization...

  • Senior Data

    3 weeks ago


    Aligarh, India Kooner Transport Group - KTG Full time

    Location: Remote / Uttarakhand - Ramnagar Office | Type: 3-Month Contract (extendable based on performance) | Work Hours: Must overlap 3–4 hours/day with Eastern Time (Canada) | Compensation: ₹100,000 – ₹150,000 INR per month (depending on skill & experience) | About the Role: Kooner Transport Group (Canada) operates a large cross-border trucking fleet with over...