TY - JOUR
T1 - VATdt: Visual Assessment of Cluster Tendency using Diagonal Tracing
AU - Hu (1949-2016), Yingkang
N1 - Yingkang Hu. "VATdt: Visual Assessment of Cluster Tendency using Diagonal Tracing" American Journal of Computational Mathematics 2.1 (2012): 27-41.
doi:10.4236/ajcm.2012.21004
source:http://www.scirp.org/journal/ajcm/
Available at: http://works.bepress.com/yingkang_hu/8
PY - 2012/1/1
Y1 - 2012/1/1
N2 - The visual assessment of tendency (VAT) technique, for visually finding the number of meaningful clusters in data, developed by J. C. Bezdek, R. J. Hathaway and J. M. Huband, is very useful, but there is room for improvements. Instead of displaying the ordered dissimilarity matrix (ODM) as a 2D gray-level image for human interpretation as is done by VAT, we trace the changes in dissimilarities along the diagonal of the ODM. This changes the 2D data structure (matrices) into 1D arrays, displayed as what we call the tendency curves, which enables one to concentrate only on one variable, namely the height. One of these curves, called the d-curve, clearly shows the existence of cluster structure as patterns in peaks and valleys, which can be caught not only by human eyes but also by the computer. Our numerical experiments showed that the computer can catch cluster structures from the d-curve even in some cases where the human eyes see no structure from the visual outputs of VAT. And success on all numerical experiments was obtained us- ing the same (fixed) set of program parameter values.
AB - The visual assessment of tendency (VAT) technique, for visually finding the number of meaningful clusters in data, developed by J. C. Bezdek, R. J. Hathaway and J. M. Huband, is very useful, but there is room for improvements. Instead of displaying the ordered dissimilarity matrix (ODM) as a 2D gray-level image for human interpretation as is done by VAT, we trace the changes in dissimilarities along the diagonal of the ODM. This changes the 2D data structure (matrices) into 1D arrays, displayed as what we call the tendency curves, which enables one to concentrate only on one variable, namely the height. One of these curves, called the d-curve, clearly shows the existence of cluster structure as patterns in peaks and valleys, which can be caught not only by human eyes but also by the computer. Our numerical experiments showed that the computer can catch cluster structures from the d-curve even in some cases where the human eyes see no structure from the visual outputs of VAT. And success on all numerical experiments was obtained us- ing the same (fixed) set of program parameter values.
KW - Clustering
KW - Clustering tendency
KW - Data visualization
KW - Dissimilarity measures
M3 - Article
VL - 2
JO - American Journal of Computational Mathematics
JF - American Journal of Computational Mathematics
ER -