SOMbrero
(‘Self Organizing Maps Bound to Realize
Euclidean and Relational Outputs’) implements several variants of the
stochastic Self-Organising Map algorithm and is able to handle numeric
and non numeric data sets (contingency tables, graphs or any
‘relational’ data described by a dissimilarity matrix).
See help(SOMbrero)
for further details.
Information on grids in SOMbrero
The numeric SOM is illustrated on the well-known iris
data set. This data describe iris flowers with 4 numeric variables
(Sepal.Length
, Sepal.Width
,
Petal.Length
and Petal.Width
) and a fifth
variable (not used to train the SOM) is the flower species. This example
is processed in the numeric
SOM guide.
The SOM algorithm provided by the package SOMbrero
can
also handle some non-numeric data. First, data described by contingency
tables, which can be processed using the ‘korresp’ algorithm (see
Cottrell et al., 2004, 2005). This case is illustrated on the
presidentielles2002
dataset, which contains the number of
votes in the first round of the French 2002 presidential election, for
each of the French administrative departments (row variables) and each
of the candidates (column variables). This example is used in the korresp
user guide.
Data described by a dissimilarity matrix can also be processed by
SOMbrero
as described in Olteanu et al., 2015a. This case
is illustrated on a data set extracted from the novel
Les Miserables
, written by the French author Victor Hugo
and published during the XIXth century. This dataset provides a
dissimilarity matrix between the characters of the novel, based on the
length of shortest paths in a network defined from the novel. This
example is provided in the relational
user guide.
For those who have an R developer soul, and who want to help improve this package, the following picture provides an overview the current function dependencies of the package:
This vignette has been computed with the following environment:
sessionInfo()
## R version 4.3.2 (2023-10-31)
## Platform: x86_64-pc-linux-gnu (64-bit)
## Running under: Ubuntu 22.04.3 LTS
##
## Matrix products: default
## BLAS: /usr/lib/x86_64-linux-gnu/openblas-pthread/libblas.so.3
## LAPACK: /usr/lib/x86_64-linux-gnu/openblas-pthread/libopenblasp-r0.3.20.so; LAPACK version 3.10.0
##
## locale:
## [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
## [3] LC_TIME=fr_FR.UTF-8 LC_COLLATE=C
## [5] LC_MONETARY=fr_FR.UTF-8 LC_MESSAGES=en_US.UTF-8
## [7] LC_PAPER=fr_FR.UTF-8 LC_NAME=C
## [9] LC_ADDRESS=C LC_TELEPHONE=C
## [11] LC_MEASUREMENT=fr_FR.UTF-8 LC_IDENTIFICATION=C
##
## time zone: Europe/Paris
## tzcode source: system (glibc)
##
## attached base packages:
## [1] stats graphics grDevices utils datasets methods base
##
## loaded via a namespace (and not attached):
## [1] digest_0.6.34 R6_2.5.1 fastmap_1.1.1 xfun_0.41
## [5] cachem_1.0.8 knitr_1.45 htmltools_0.5.7 rmarkdown_2.25
## [9] lifecycle_1.0.4 cli_3.6.2 sass_0.4.8 jquerylib_0.1.4
## [13] compiler_4.3.2 rstudioapi_0.15.0 tools_4.3.2 evaluate_0.23
## [17] bslib_0.6.1 yaml_2.3.8 rlang_1.1.3 jsonlite_1.8.8