The PAPIs conference is a prestigious platform for practical aspects of emerging applications, tools and methods for Predictive Analytics and Machine Learning. At the PAPIs Europe conference, held in London, RapidMiner researcher, David Arnu presented the DS4DM project work. His presentation entitled Smart Data through Automatic Data Search and Extraction [1]. In his talk, David emphasized on leveraging smart data from the ever-increasing volumes of data. In this context, he presented the Correspondence Search, which harnesses large tabular corpus and can be controlled by the data engineer using the front end (Data Search extension).
In addition, David demonstrated [2] various data extraction extensions that help to consolidate data from a variety of sources, for example to find all pubs in London. This leads to an enriched and concomitant form of smart data, which generates higher quality results in machine learning and business intelligence scenarios.
Reference
[1] Smart Data through Automatic Data Search and Extraction, David Arnu.
[2] Recording of the talk, weblink: