Abstract
In this paper We follow a simple approach which allows the implementation of machine learning (ML for short) techniques to large data sets. More specifically, we study the case of on-demand dynamic creation of a local model in the neighborhood of a target datum instead of creating a global one on the whole training data set. This approach exploits the advanced data structures and algorithms, embedded in modern relational databases, to identify the neighborhood of a target datum, rapidly. Preliminary experimental results from a large scale classification problem (HIGGS dataset) show that the typical machine learning techniques are applicable to large data sets through this approach, under particular conditions. We highlight some restrictions of the method and some issues arising by implementing it.
Original language | English |
---|---|
Title of host publication | 2014 International Conference on Telecommunications and Multimedia, TEMU 2014 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 69-73 |
Number of pages | 5 |
ISBN (Electronic) | 9781479932009 |
DOIs | |
Publication status | Published - 7 Oct 2014 |
Event | 2014 International Conference on Telecommunications and Multimedia, TEMU 2014 - Heraklion, Greece Duration: 28 Jul 2014 → 30 Jul 2014 |
Other
Other | 2014 International Conference on Telecommunications and Multimedia, TEMU 2014 |
---|---|
Country/Territory | Greece |
City | Heraklion |
Period | 28/07/14 → 30/07/14 |
Keywords
- Big Data
- Higgs
- Hyper-box
- Machine Learning
- Support Vector Machines