Abstract
Self-organizing maps (SOMs) are popular tools for grouping and visualizing data in many areas of science. This paper describes recent changes in the package kohonen, which implements several different forms of SOMs. These changes are primarily focused on making the package more usable for large data sets. Memory consumption has decreased dramatically, among other things by replacing the old interface to the underlying compiled code with a new one relying on Rcpp. The batch SOM algorithm for training has been added in both sequential and parallel forms. A final important extension of the package's repertoire is the possibility to define and use data-dependent distance functions, which is extremely useful in cases where standard distances like the Euclidean distance are not appropriate. Several examples of possible applications are presented.
Original language | English |
---|---|
Number of pages | 18 |
Journal | Journal of Statistical Software |
Volume | 87 |
Issue number | 7 |
DOIs | |
Publication status | Published - 12 Nov 2018 |
Keywords
- Distance functions
- Parallelization
- R
- Self-organizing maps