Skip to content

Gauxi/data-knoller

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

76 Commits
 
 
 
 
 
 
 
 

Repository files navigation

An under construction data preparation project.

Data preparation is the process transforming data before serving them to downstream tasks, such as data analysis, duplication detection, and machine learning. Much data do not meet the requirements of the following tasks, leading users, including both expert data scientists and novice data users, to frequently conduct ad-hoc data preparation. It is often reported that preparing data is a labour-intensive yet tedious work, which accounts for 50%-80% of the time spent in the whole data lifecycle.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Java 57.6%
  • Scala 42.4%