In the good old days of machine learning and data mining, in the era of nearest neighbours, decision trees, linear regression and naive Bayes, the limitations of these models were clear. They worked surprisingly well in many cases, especially if the underlying data was rich enough.
In his recent work, Krisztian Buza challenged the aforementioned “widely acknowledged truth” in the context of data augmentation. His observations show that rich training data may…
DiscoverR tools are the components of the enRichMyData toolbox that help users find and understand data that they can use in their data enrichment processes.…