
papers in PDF format
... relocation continues until the criterion function, e.g. square-error converges. Despite its wide popularity, k-means is very sensitive to noise and outliers since a small number of such data can substantially influence the centroids. Other weaknesses are sensitivity to initialization, entrapments in ...
... relocation continues until the criterion function, e.g. square-error converges. Despite its wide popularity, k-means is very sensitive to noise and outliers since a small number of such data can substantially influence the centroids. Other weaknesses are sensitivity to initialization, entrapments in ...
A Rough Set based Gene Expression Clustering Algorithm
... Clustering gene expression data: Clustering is one of the first steps in gene expression analysis. One of the important characteristics of gene expression data is that it is meaningful to cluster both genes and samples. During cluster analysis, genes are clustered based on similarity. Proximity meas ...
... Clustering gene expression data: Clustering is one of the first steps in gene expression analysis. One of the important characteristics of gene expression data is that it is meaningful to cluster both genes and samples. During cluster analysis, genes are clustered based on similarity. Proximity meas ...
Nearest-neighbor chain algorithm

In the theory of cluster analysis, the nearest-neighbor chain algorithm is a method that can be used to perform several types of agglomerative hierarchical clustering, using an amount of memory that is linear in the number of points to be clustered and an amount of time linear in the number of distinct distances between pairs of points. The main idea of the algorithm is to find pairs of clusters to merge by following paths in the nearest neighbor graph of the clusters until the paths terminate in pairs of mutual nearest neighbors. The algorithm was developed and implemented in 1982 by J. P. Benzécri and J. Juan, based on earlier methods that constructed hierarchical clusterings using mutual nearest neighbor pairs without taking advantage of nearest neighbor chains.