Data Engineer II
3 weeks ago
Role Purpose:
The Data Engineer II will play a crucial role in enhancing the data infrastructure and analytics capabilities. This role is vital for building and maintaining data transformation pipelines, monitoring data distribution shifts, managing dataset versions, and analyzing system performance. Contributions in this position will support the continuous improvement of machine learning models, ensuring they operate with high accuracy and efficiency in various production environments.
Role Value:
As a Data Engineer II, you will be instrumental in advancing data engineering processes and infrastructure. Your expertise in building data pipelines, managing datasets, and analyzing system performance will directly impact the efficiency and reliability of machine learning models. By automating data transformation processes and monitoring data distribution shifts, you will help ensure that models are frequently updated and operate effectively in production.
T-Shaped Engineering Expectation:
Beyond deep expertise in data engineering, you will bring a broad understanding of software development, testing, and data science. As a T-shaped Data Engineer, you will take full ownership of the data infrastructure you build, ensuring it is reliable, thoroughly tested, and capable of supporting machine learning and analytics efforts. Your skills in Python, SQL, and cloud environments will enable you to tackle complex data challenges, provide valuable insights into system behavior, and contribute to delivering top-tier identity verification solutions. Your work will play a critical role in maintaining leadership in the online identity verification, eKYC, and AML solutions market.
Example Responsibilities:
- Building data transformation pipelines with humans in the loop: Automate and expand a semi-manual datasets generation pipeline, including tagging jobs preparation, scheduling, and post-processing to increase the frequency of ML model updates.
- Data distribution shift monitoring: Design a system capable of detecting changes in data distribution and unknown data types by monitoring ML models in production.
- Dataset growth strategy: Leverage your expertise in business data and monitoring tools to help identify valuable zones of expansion for automated solutions.
- Dataset versioning management: Manage and document dataset versions in an ecosystem of highly dependent and evolving datasets.
- Data volatility management: Develop solutions to stabilize datasets in environments where data retention is time-limited.
- System performance analysis: Dive into data and models to analyze system behavior on specific transactions and data buckets, producing targeted performance metrics using advanced Python skills.
Experience and Qualifications:
- Experience building data pipelines in dynamic and evolving environments.
- Proficiency in Python (pandas, numpy) and SQL for data wrangling.
- Skilled in data analysis and deep dives using Jupyter notebooks or other notebook tools.
- Interest or experience in Machine Learning and Data Science.
- Proficient in leveraging cloud environments.
Great-to-Have Experience and Qualifications:
- Familiarity with privacy by design.
- Knowledge of serverless data engineering.
- Experience with Java, Apache Spark, or Flink.
Work Environment:
Located in a hub of technical excellence with a strong focus on Machine Learning enablement, the team is committed to high standards, innovation, and continuous learning.
Values:
- IDEAL: Integrity, Diversity, Empowerment, Accountability, Leading Innovation
Equal Opportunities:
We foster a collaborative environment of diverse perspectives and backgrounds, welcoming applications and colleagues from all walks of life.
About Us:
We are a B2B technology company dedicated to eradicating online identity fraud, money laundering, and other financial crimes, working to make the internet a safer space. Through AI, biometrics, machine learning, liveness detection, and automation, we create solutions trusted by global brands across Financial Services, Travel, Sharing Economy, Fintech, Gaming, and other industries.
-
Data Engineer II
3 weeks ago
India Talentoj Full timeRole Purpose:The Data Engineer II will play a crucial role in enhancing the data infrastructure and analytics capabilities. This role is vital for building and maintaining data transformation pipelines, monitoring data distribution shifts, managing dataset versions, and analyzing system performance. Contributions in this position will support the continuous...
-
Data Engineer II
3 weeks ago
India Talentoj Full timeRole Purpose: The Data Engineer II will play a crucial role in enhancing the data infrastructure and analytics capabilities. This role is vital for building and maintaining data transformation pipelines, monitoring data distribution shifts, managing dataset versions, and analyzing system performance. Contributions in this position will support the...
-
Data Engineer II
3 weeks ago
india Talentoj Full timeRole Purpose: The Data Engineer II will play a crucial role in enhancing the data infrastructure and analytics capabilities. This role is vital for building and maintaining data transformation pipelines, monitoring data distribution shifts, managing dataset versions, and analyzing system performance. Contributions in this position will support the...
-
Clinical Data Programmer Ii
5 months ago
India Novotech Full time**Brief Position Description**: The core responsibility for this position is as a member of the Data Management department at Novotech. The Clinical Data Programmer-II (CDP-II) will be responsible for programming activities on clinical trial projects and to ensure compliance with Good Clinical Data Management Practices (GCDMP). **Minimum Qualifications &...
-
Data Engineer II
3 weeks ago
Anywhere in India/Multiple Locations Jumio Full timeJob Role Summary As a Data Engineer II at Jumio, you will be instrumental in advancing our data engineering processes and infrastructure. Your expertise in building data pipelines, managing datasets, and analyzing system performance will directly impact the efficiency and reliability of our machine learning models. By automating data transformation processes...
-
Software Engineer II
3 weeks ago
India Info Services Full timeJob Title: Software Engineer IIAbout the Role:We are seeking a highly skilled Software Engineer II to join our growing team and contribute to the development and maintenance of our SMARTdiagnostics machine health platform. This platform stores and processes industrial IoT sensor data to provide analytics and insights to our users, helping us achieve our goal...
-
Software Development Engineer II
1 month ago
india SuperAGI Full timeAbout UsSuperAGI is pioneering the future of Artificial General Intelligence with groundbreaking research and innovative AI products. Our mission is to transform the future of applications through intelligent, autonomous solutions that drive unparalleled efficiency and growth. We are building a world where AI and human intelligence collaborate seamlessly to...
-
Data Scientist-II
1 month ago
india Rebel Foods Full timeAbout the job Company Description: Rebel Foods, formerly known as Faasos, is the world's largest chain of Online Restaurants. Today we operate 25+ own brands such as Behrouz Biryani, Ovenstory Pizza, Faasos, Sweet Truth on our proprietary operating system, a mix of culinary craft and technology infrastructure. We are now a network of 4000+ Restaurants...
-
Data Scientist-II
1 month ago
india Rebel Foods Full timeAbout the jobCompany Description:Rebel Foods, formerly known as Faasos, is the world's largest chain of Online Restaurants. Today we operate 25+ own brands such as Behrouz Biryani, Ovenstory Pizza, Faasos, Sweet Truth on our proprietary operating system, a mix of culinary craft and technology infrastructure. We are now a network of 4000+ Restaurants across...
-
Software Development Engineer II
1 month ago
India SuperAGI Full timeAbout Us SuperAGI is pioneering the future of Artificial General Intelligence with groundbreaking research and innovative AI products. Our mission is to transform the future of applications through intelligent, autonomous solutions that drive unparalleled efficiency and growth. We are building a world where AI and human intelligence collaborate seamlessly...
-
Software development engineer ii
1 month ago
India SuperAGI Full timeAbout Us Super AGI is pioneering the future of Artificial General Intelligence with groundbreaking research and innovative AI products. Our mission is to transform the future of applications through intelligent, autonomous solutions that drive unparalleled efficiency and growth. We are building a world where AI and human intelligence collaborate...
-
Data engineer ii
1 month ago
India Tsworks Full timeWho We Are tsworks Technologies India Private Limited (subsidiary of The Software Works, Inc, USA) is a technology product and services company. Our mission is to provide domain expertise, innovative solutions and thought leadership to empower businesses to thrive in a digital world. We value our employees, take pride in providing best value in customer...
-
Data Engineer II
1 month ago
India tsworks Full timeWho We Are tsworks Technologies India Private Limited (subsidiary of The Software Works, Inc, USA) is a technology product and services company. Our mission is to provide domain expertise, innovative solutions and thought leadership to empower businesses to thrive in a digital world. We value our employees, take pride in providing best value in customer...
-
Data Engineer II
4 weeks ago
india tsworks Full timeWho We Are tsworks Technologies India Private Limited (subsidiary of The Software Works, Inc, USA) is a technology product and services company. Our mission is to provide domain expertise, innovative solutions and thought leadership to empower businesses to thrive in a digital world. We value our employees, take pride in providing best value in customer...
-
Data Engineer II
6 months ago
India tsworks Full timeWho We Aretsworks Technologies India Private Limited (subsidiary of The Software Works, Inc, USA) is a technology product and services company. Our mission is to provide domain expertise, innovative solutions and thought leadership to empower businesses to thrive in a digital world. We value our employees, take pride in providing best value in customer...
-
Data Engineer II
2 months ago
india tsworks Full timeWho We Are tsworks Technologies India Private Limited (subsidiary of The Software Works, Inc, USA) is a technology product and services company. Our mission is to provide domain expertise, innovative solutions and thought leadership to empower businesses to thrive in a digital world. We value our employees, take pride in providing best value in customer...
-
Data Engineer II
6 months ago
India tsworks Full timeWho We Are tsworks Technologies India Private Limited (subsidiary of The Software Works, Inc, USA) is a technology product and services company. Our mission is to provide domain expertise, innovative solutions and thought leadership to empower businesses to thrive in a digital world. We value our employees, take pride in providing best value in customer...
-
Data Engineer II
3 months ago
India tsworks Full timeWho We Are tsworks Technologies India Private Limited (subsidiary of The Software Works, Inc, USA) is a technology product and services company. Our mission is to provide domain expertise, innovative solutions and thought leadership to empower businesses to thrive in a digital world. We value our employees, take pride in providing best value in customer...
-
Lead data engineer
3 days ago
India Wavicle Data Solutions Full timeJob Description: We are seeking a highly experienced Lead Data Engineer with over 8 years of expertise in data engineering. As a Lead Data Engineer, you will play a pivotal role in architecting and implementing data solutions. Your proficiency in Python, Py Spark, AWS, Databricks, SQL, and leadership skills will be crucial for success. Key...
-
Jumio - Data Engineer II - Python/SQL
2 months ago
Anywhere in India/Multiple Locations Jumio.com Full timeRole Purpose : The Data Engineer II will play a crucial role in enhancing Jumio's data infrastructure and analytics capabilities. This role is vital for building and maintaining data transformation pipelines, monitoring data distribution shifts, managing dataset versions, and analyzing system performance. Your contributions will support the...