A PROBABILISTIC SELF-ORGANIZING MAP FOR BINARY DATA TOPOGRAPHIC CLUSTERING
Abstract
This paper introduces a probabilistic self-organizing map for topographic clustering, analysis and visualization of multivariate binary data or categorical data using binary coding. We propose a probabilistic formalism dedicated to binary data in which cells are represented by a Bernoulli distribution. Each cell is characterized by a prototype with the same binary coding as used in the data space and the probability of being different from this prototype. The learning algorithm, Bernoulli on self-organizing map, that we propose is an application of the EM standard algorithm. We illustrate the power of this method with six data sets taken from a public data set repository. The results show a good quality of the topological ordering and homogenous clustering.
This is the extended version of the paper: M. Lebbah, N. Rogovschi and Y. Bennani, BeSOM: Bernoulli on Self-Organizing Map, International Joint Conferences on Neural Networks, IJCNN 2007, Orlando, Florida (12–17 August, 2007).
Remember to check out the Most Cited Articles! |
---|
Check out these titles in artificial intelligence! |