Dockerfile Data Validation Engineer
8 hours ago
About Turing: Based in San Francisco, California, Turing is the world’s leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing supports customers in two ways: first, by accelerating frontier research with high-quality data, advanced training pipelines, plus top AI researchers who specialize in software engineering, logical reasoning, STEM, multilinguality, multimodality, and agents; and second, by applying that expertise to help enterprises transform AI from proof of concept into proprietary intelligence with systems that perform reliably, deliver measurable impact, and drive lasting results on the P&L. About the Role: We are seeking an engineer responsible for designing, implementing, and maintaining data-validation workflows inside Docker-based build pipelines . This role involves creating and managing Dockerfile labels, metadata standards, and validation scripts that ensure datasets, schemas, and model artifacts meet quality and compliance requirements before deployment. You will work closely with data engineering, machine learning, and DevOps teams to build reliable, reproducible, and fully validated containerized data pipelines . What does day-to-day look like : Develop and optimize Dockerfiles with built-in data-validation steps. Implement LABEL metadata for dataset versions, schemas, and lineage. Create validation scripts (Python/Bash) for schema checks, data integrity, and quality control. Integrate validation steps into CI/CD pipelines and enforce fail-on-bad-data checks. Document standards for Dockerfile labeling, validation logic, and data governance . Required Skills: Experienced DevOps engineers. Strong experience with Docker & Dockerfiles. Proficiency in Python or Bash for validation scripting. Knowledge of data formats, schemas, and validation tools. Familiarity with CI/CD systems and container registries. Nice to Have: Previous participation in LLM research or evaluation projects. Experience building or testing developer tools or automation agents. Experience with MLOps workflows, data versioning, or Great Expectations. Knowledge of Kubernetes or container security tools. Perks of Freelancing With Turing: Work in a fully remote environment. Opportunity to work on cutting-edge AI projects with leading LLM companies. Offer Details: Commitments Required : At least 4 hours per day and minimum 20 hours per week with overlap of 4 hours with PST. (We have 3 options of time commitment: 20 hrs/week, 30 hrs/week or 40 hrs/week) Employment type : Contractor assignment (no medical/paid leave) Duration of contract : 2-4 weeks; [expected start date is next week] Evaluation Process (approximately 75 mins) : Interviews (30-60 min technical discussion in QODE) Know amazing talent? Refer them at turing.com/referrals, and earn money from your network.
-
Data Quality Engineer
7 days ago
Vijayawada, India CodeVyasa Full timeJole OverviewWe are seeking a highly skilled and motivated Data Quality Engineer || Remote with 8+ years of experience. If you're passionate about coding, problem-solving, and innovation, we'd love to hear from you!About UsCodeVyasa is a mid-sized product engineering company that works with top-tier product/solutions companies such as McKinsey, Walmart,...
-
Data engineer
1 week ago
Vijayawada, India BayOne Solutions Full timeThe Opportunity: We are seeking a highly experienced Data Engineer to join our Mar Tech team and play a pivotal role in driving innovation within our microservices architecture, with a strong emphasis on data engineering and back-end systems. You will be responsible for leading the development, optimization, and maintenance of both data pipelines and...
-
Data engineer
1 week ago
Vijayawada, India Ubique Systems Full timePrimary skills: Python, SQL, data lakes, azure Experience: 4+ years Immediate Joiner Key Responsibilities Pipeline Development & Automation · Design, build, and maintain CI/CD pipelines to automate deployment of DQ rules and data services across environments. · Optimize data pipelines and jobs for efficiency, scalability, and enterprise-grade reliability...
-
Senior DevOps Engineer
3 weeks ago
Vijayawada, India Crest Data Full timeCompany Overview: Crest Data is the global leading provider of Data Analytics, Security, DevOps, Cloud Solutions, Software integrations, Analytics, and security-based technological services. With a clientele that includes several Fortune 500 corporations and some of the innovative Silicon Valley Startups.Company URL : http://www.crestdata.ai Job Location-...
-
Data engineer
3 weeks ago
Vijayawada, India Insight Global Full timeGCP DATA ENGINEER - Contract (Long term) 4–5+ years of experience as a Data Engineer with hands-on support for Google Looker. Strong experience in data modeling and building data marts. Proficiency in ETL/ELT pipeline development and SQL performance tuning. Experience with Look ML and semantic layer design in Looker. Familiarity with healthcare data and...
-
Chief Manager
2 days ago
Vijayawada, India Xpert Conexions Full timeJob DescriptionRole & Responsibilities- Analyzing large datasets to identify trends, patterns, and insights.- Using statistical methods to validate findings and ensure they are reliable.- Performing exploratory data analysis (EDA) to understand the data's structure and characteristics.- Developing machine learning models to make predictions or...
-
Data Analytics Test Lead
1 week ago
Vijayawada, India Birlasoft Full timeAbout the CompanyBirlasoft, a global leader at the forefront of Cloud, AI, and Digital technologies, seamlessly blends domain expertise with enterprise solutions. The company’s consultative and design-thinking approach empowers societies worldwide, enhancing the efficiency and productivity of businesses. As part of the multibillion-dollar diversified CKA...
-
Data Analytics Test Lead
1 week ago
Vijayawada, India Birlasoft Full timeAbout the CompanyBirlasoft, a global leader at the forefront of Cloud, AI, and Digital technologies, seamlessly blends domain expertise with enterprise solutions. The company’s consultative and design-thinking approach empowers societies worldwide, enhancing the efficiency and productivity of businesses. As part of the multibillion-dollar diversified CKA...
-
Data Engineer Lead
2 weeks ago
Vijayawada, India JRD Systems Full timeAbout the RoleWe are seeking an experienced Data Engineer Lead to design, develop, and maintain scalable data solutions on Azure and Databricks as part of our enterprise data modernization initiatives. The ideal candidate will have a strong background in data pipeline development, data integration frameworks , and cloud-based data engineering , with...
-
Data Engineer Lead
1 week ago
Vijayawada, India JRD Systems Full timeAbout the RoleWe are seeking an experienced Data Engineer Lead to design, develop, and maintain scalable data solutions on Azure and Databricks as part of our enterprise data modernization initiatives. The ideal candidate will have a strong background in data pipeline development, data integration frameworks , and cloud-based data engineering , with...