Please use this identifier to cite or link to this item: http://hdl.handle.net/2122/14408
Authors: Langer, Horst* 
Falsaperla, Susanna* 
Hammer, Conny* 
Editors: Spichak, Viacheslav 
Title: Unsupervised learning
Publisher: Elsevier B.V.
Issue Date: Jan-2020
URL: https://www.sciencedirect.com/science/article/pii/B9780128118429000030?via%3Dihub
https://doi.org/10.1016/B978-0-12-811842-9.00003-0
ISBN: 9780128118429
Keywords: pattern recognition
unsupervised learning
cluster analysis
Density-based clustering
Self-Organizing Maps
Subject Classification04.04. Geology 
04.06. Seismology 
04.07. Tectonophysics 
04.08. Volcanology 
05.04. Instrumentation and techniques of general interest 
Abstract: Unsupervised learning is based on the definition of an appropriate metrics defining the similarity of patterns. On the basis of the metrics, we form groups or clusters of patterns following various strategies. In partitioning cluster analysis, we form disjoint clusters. Being faced with data, where clusters still exhibit heterogeneities or subclusters, we may adopt the strategy of hierarchical clustering, which leads to the generation of the so-called dendrograms. In the partitioning strategy, we choose a priori the number of clusters we wish to form, whereas in the hierarchical strategy, the number of clusters depends on the resolution we want to have. Density-based clustering considers local structures of a data set. We consider a unit volume in our data space and derive the density of samples within this volume. Moving toward neighboring volumes, we verify whether the number of samples has dropped below a threshold. If this is the case, we identify a heterogeneity, otherwise we join the neighboring volumes to a common cluster. Self-Organizing Maps (SOMs) provide a way of representing multidimensional data in much lower dimensional spaces than the original data set. The process of reducing the dimensionality of vectors is essentially a data compression technique known as vector quantization. The SOM technique creates a network that stores information in a way that it maintains the topological relationships within the patterns of the data set. Each node of the network represents a number of patterns. Assigning a color code to the nodes, the representation of pattern characteristics with high-dimensional feature vectors becomes extremely effective.
Appears in Collections:Book chapters

Files in This Item:
File Description SizeFormat
Chapter 3.pdfabstract195.52 kBAdobe PDFView/Open
Show full item record

Page view(s)

31
checked on Apr 24, 2024

Download(s)

4
checked on Apr 24, 2024

Google ScholarTM

Check

Altmetric