Iterative Discovery of Multiple Alternative Clustering Views

Characterizing Pattern Preserving Clustering - Hui Xiong

synopsis text mining for information retrieval

A Comparison of Clustering Techniques for Malware Analysis

Association Rule Mining Using Firefly Algorithm

Computational Intelligence and Data Mining

On Approximate Solutions to Support Vector Machines∗

Mining Useful Patterns from Text using Apriori_AMLMS

Grid-Based Mode Seeking Procedure

... Broad category of feature space analysis techniques relay on the density estimation, the construction of the unknown density function from the observed data. Estimated density reveals statistical trends and hidden patterns in data distribution where dense regions correspond to clusters of the data s ...

IOSR Journal of Computer Engineering (IOSR-JCE)

... Heart Attack Prediction System Using Fuzzy C Means Classifier risk factors were identified. K.Srinivas et al [5] applied data mining techniques to predict heart attack. Using medical profiles such as age, sex, blood pressure and blood sugar it can predict the likelihood of patients getting a heart ...

Mining Trajectory Data

Document

... • Introduced in Kaufmann and Rousseeuw (1990) • Implemented in statistical analysis packages, e.g., Splus • Use the Single-Link method and the dissimilarity matrix. • Merge nodes that have the least dissimilarity • Go on in a non-descending fashion • Eventually all nodes belong to the same cluster ...

Dirichlet Enhanced Latent Semantic Analysis

... of latent topics. Latent Dirichlet allocation (LDA) [3] generalizes PLSI by treating the topic mixture parameters (i.e. a multinomial over topics) as variables drawn from a Dirichlet distribution. Its Bayesian treatment avoids overfitting and the model is generalizable to new data (the latter is pro ...

Traffic Anomaly Detection Using K-Means Clustering

CSE 592: Data Mining

... Optimization Problem ...

Semi-Supervised Clustering I - Network Protocols Lab

Revealing structure in visualizations of dense 2D and 3D parallel

Identification of Business Travelers through Clustering Algorithms

Mining Hierarchies of Correlation Clusters

... between P and Q using the local correlation similarity matrix M̂P as weight. The motivation for the adaptation of M̂P is that the original local covariance matrix ΣP has two undesirable properties: (1) It corresponds to a similarity measure and to an ellipsoid which is oriented perpendicularly to th ...

d(j, i)

A query language for constraint-based clustering

... is the input table (data) where a column cluster has been added that contains for each instance its cluster assignement as a strictly positive integer. Figure 2 shows the different elements of the query CLUSTER x, y FROM (SELECT * FROM points). This is a very simple way of giving the result of a clu ...

COMP 790-090 Data Mining: Concepts

Presenting a Novel Method for Mining Association Rules Using

... a derivation of Apriori algorithm with a series of additional controls. On this basis, DHP uses Hash table, which helps candidates to be limited. DHP includes two important characteristics: efficient construction of strong item sets and effective reduction in database size by discarding its characte ...

IOSR Journal of Computer Engineering (IOSR-JCE)

Zodiac: Organizing Large Deployment of Sensors to Create

... any description, and we use point type abbreviation to label these points. These abbreviations can sometimes reveal the point type more accurately, as they do not necessarily vary due to changes in description. “ZN-T” is an example abbreviation of the point type “zone temperature”. However, these po ...

< 1 ... 13 14 15 16 17 18 19 20 21 ... 88 >

Nearest-neighbor chain algorithm

In the theory of cluster analysis, the nearest-neighbor chain algorithm is a method that can be used to perform several types of agglomerative hierarchical clustering, using an amount of memory that is linear in the number of points to be clustered and an amount of time linear in the number of distinct distances between pairs of points. The main idea of the algorithm is to find pairs of clusters to merge by following paths in the nearest neighbor graph of the clusters until the paths terminate in pairs of mutual nearest neighbors. The algorithm was developed and implemented in 1982 by J. P. Benzécri and J. Juan, based on earlier methods that constructed hierarchical clusterings using mutual nearest neighbor pairs without taking advantage of nearest neighbor chains.

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Nearest-neighbor chain algorithm