Data Quality Engineer

1 week ago


Hyderabad, Telangana, India Algoleap Technologies Pvt Ltd Full time ₹ 15,00,000 - ₹ 20,00,000 per year

Job Description: Quality Engineer (Data)

JOB SUMMARY

We are seeking a highly skilled Quality Engineer with 5-10 years of professional experience to ensure the integrity, reliability, and performance of our data pipelines and AI/ML solutions within the SmartFM platform. The ideal candidate will be responsible for defining and implementing comprehensive quality assurance strategies for data ingestion, transformation, storage, and the machine learning models that generate insights from alarms and notifications received from various building devices. This role is crucial in delivering high-quality, trustworthy data and intelligent recommendations to optimize facility operations.

ROLES AND RESPONSIBILITIES

  • Develop and implement end-to-end quality assurance strategies and test plans for data pipelines, data transformations, and machine learning models within the SmartFM platform.
  • Design, develop, and execute test cases for data ingestion processes, ensuring data completeness, consistency, and accuracy from various sources, especially those flowing through IBM StreamSets and Kafka.
  • Perform rigorous data validation and quality checks on data stored in MongoDB, including schema validation, data integrity checks, and performance testing of data retrieval.
  • Collaborate closely with Data Engineers to ensure the robustness and scalability of data pipelines and to identify and resolve data quality issues at their source.
  • Work with Data Scientists to validate the performance, accuracy, fairness, and robustness of Machine Learning, Deep Learning, Agentic Workflows, and LLM-based models. This includes testing model predictions, evaluating metrics, and identifying potential biases.
  • Implement automated testing frameworks for data quality, pipeline validation, and model performance monitoring.
  • Monitor production data pipelines and deployed models for data drift, concept drift, and performance degradation, setting up appropriate alerts and reporting mechanisms.
  • Participate in code reviews for data engineering and data science components, ensuring adherence to quality standards and best practices.
  • Document testing procedures, test results, and data quality metrics, providing clear and actionable insights to cross-functional teams.
  • Stay updated with the latest trends and tools in data quality assurance, big data testing, and MLOps, advocating for continuous improvement in our quality processes.

REQUIRED TECHNICAL SKILLS AND EXPERIENCE

  • 5-10 years of professional experience in Quality Assurance, with a significant focus on data quality, big data testing, or ML model testing.
  • Strong proficiency in SQL for complex data validation, querying, and analysis across large datasets.
  • Hands-on experience with data pipeline technologies like IBM StreamSets and Apache Kafka.
  • Proven experience in testing and validating data stored in MongoDB or similar NoSQL databases.
  • Proficiency in Python for scripting, test automation, and data validation.
  • Familiarity with Machine Learning and Deep Learning concepts, including model evaluation metrics, bias detection, and performance testing.
  • Understanding of Agentic Workflows and LLMs from a testing perspective, including prompt validation and output quality assessment.
  • Experience with cloud platforms (Azure, AWS, or GCP) and their data/ML services.
  • Knowledge of automated testing frameworks and tools relevant to data and ML (e.g., Pytest, Great Expectations, Deepchecks).
  • Familiarity with and React environments to understand system integration points.

ADDITIONAL QUALIFICATIONS

  • Demonstrated expertise in written and verbal communication, adept at simplifying complex technical concepts related to data quality and model performance for diverse audiences.
  • Exceptional problem-solving and analytical skills with a keen eye for detail in data.
  • Experienced in collaborating seamlessly with Data Engineers, Data Scientists, Software Engineers, and Product Managers.
  • Highly motivated to acquire new skills, explore emerging technologies in data quality and AI/ML testing, and stay updated on the latest industry best practices.
  • Domain knowledge in facility management, IoT, or building automation is a plus.

EDUCATION REQUIREMENTS / EXPERIENCE

  • Bachelor's (BE / BTech) / Master's degree (MS/MTech) in Computer Science, Information Systems, Engineering, Statistics, or a related field.



  • Hyderabad, Telangana, India Enable Data Full time ₹ 15,00,000 - ₹ 20,00,000 per year

    Experience Required: 8+YearsMode of work: RemoteSkills Required: Azure DataBricks, Eventhub, Kafka, Architecture,Azure Data Factory, Pyspark, Python, SQL, SparkNotice Period : Immediate Joiners/ Permanent/Contract role (Can join within September 15th 2025)Design, develop, and maintain scalable and robust data solutions in the cloud using Apache Spark and...


  • Hyderabad, Telangana, India Enable Data Full time US$ 90,000 - US$ 1,20,000 per year

    Enable Data Incorporated is currently seeking a skilled and experienced Azure Data Engineer to join our dynamic team. As a leading provider of advanced application, data, and cloud engineering services, Enable Data has developed deep expertise across various industries. We work closely with our customers to leverage modern solutions and technologies to drive...


  • Hyderabad, Telangana, India ICE Data Services Full time ₹ 15,00,000 - ₹ 28,00,000 per year

    Job PurposeThe Property Data Engineeris responsible for developing and maintaining data conversion programs that transform raw property assessment data into standardized formats based on specifications by Property Data Analyst and Senior Analysts. This role requires not only advanced programming and ETL skills but also a deep understanding of the structure,...


  • Hyderabad, Telangana, India Enable Data Incorporated Full time ₹ 15,00,000 - ₹ 20,00,000 per year

    Design, develop, and maintain scalable and robust data solutions in the cloud using Apache Spark and Databricks. Gather and analyse data requirements from business stakeholders and identify opportunities for data-driven insights. Build and optimize data pipelines for data ingestion, processing, and integration using Spark and Databricks. Ensure data...


  • Hyderabad, Telangana, India Enable Data Incorporated Full time ₹ 15,00,000 - ₹ 20,00,000 per year

    Experience Required: 8+YearsMode of work: RemoteSkills Required: Azure DataBricks, Eventhub, Kafka, Architecture,Azure Data Factory, Pyspark, Python, SQL, SparkNotice Period : Immediate Joiners/ Permanent/Contract role (Can join within September 15th 2025)Design, develop, and maintain scalable and robust data solutions in the cloud using Apache Spark and...


  • Hyderabad, Telangana, India Enable Data Incorporated Full time

    Experience Required: 8+YearsMode of work: RemoteSkills Required: Azure DataBricks, Eventhub, Kafka, Architecture, Azure Data Factory, Pyspark, Python, SQL, SparkNotice Period : Immediate Joiners/ Permanent/Contract role (Can join within September 29th 2025)- Translate business rules into technical specifications and implement scalable data solutions.- Manage...

  • Data Quality Engineer

    2 weeks ago


    Hyderabad, Telangana, India beBeeDataQuality Full time ₹ 90,00,000 - ₹ 1,20,00,000

    About Data Quality Engineering SupportThe ideal candidate will be responsible for implementing and supporting enterprise-wide data quality frameworks across cloud-native data platforms.Key responsibilities include driving hands-on initiatives to monitor, validate, and reconcile data across ingestion, processing, and consumption layers enabling trusted data...


  • Hyderabad, Telangana, India Data Economy Full time US$ 1,50,000 - US$ 2,00,000 per year

    We are seeking a Lead/Senior Data Engineer with 7-12 years of experience to architect, develop, and optimize data solutions in a cloud-native environment. The role requires strong expertise in AWS Glue, PySpark, and Python with a proven ability to design scalable data pipelines and frameworks for large-scale enterprise systems. Prior exposure to financial...


  • Hyderabad, Telangana, India Enable Data Incorporated Full time ₹ 15,00,000 - ₹ 28,00,000 per year

    Experience Required: 8+YearsMode of work: RemoteSkills Required: Azure DataBricks, Eventhub, Kafka, Architecture, Azure Data Factory, Pyspark, Python, SQL, SparkNotice Period : Immediate Joiners/ Permanent/Contract role (Can join within September 29th 2025)Translate business rules into technical specifications and implement scalable data solutions. Manage a...


  • Hyderabad, Telangana, India Data Unveil Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    Job Description. About us :At Data Unveil, we believe in delivering the best for our clients (Pharma Companies). We use the latest technology and tools to aggregate and analyze specialty healthcare data received from various data partners. We provide clear and hassle-free business insights to enhance the client's vision and drive business success. Position...