Download logic systems

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Nonlinear dimensionality reduction wikipedia , lookup

Expectation–maximization algorithm wikipedia , lookup

K-nearest neighbors algorithm wikipedia , lookup

K-means clustering wikipedia , lookup

Nearest-neighbor chain algorithm wikipedia , lookup

Cluster analysis wikipedia , lookup

Transcript
Scalable Constrained Spectral Clustering
Scalable Constrained Spectral Clustering
ABSTRACT:
Constrained spectral clustering (CSC) algorithms have shown great promise in
significantly improving clustering accuracy by encoding side information into
spectral clustering algorithms. However, existing CSC algorithms are inefficient in
handling moderate and large datasets. In this paper, we aim to develop a scalable
and efficient CSC algorithm by integrating sparse coding based graph construction
into a framework called constrained normalized cuts. To this end, we formulate a
scalable constrained normalized-cuts problem and solve it based on a closed-form
mathematical analysis. We demonstrate that this problem can be reduced to a
generalized eigen value problem that can be solved very efficiently. We also
describe a principled k-way CSC algorithm for handling moderate and large
datasets. Experimental results over benchmark datasets demonstrate that the
proposed algorithm is greatly cost-effective, in the sense that (1) with less side
information, it can obtain significant improvements in accuracy compared to the
unsupervised baseline; (2) with less computational time, it can achieve high
clustering accuracies close to those of the state-of-the-art.
EXISTING SYSTEM:
 Data in a wide variety of areas tend to large scales. For many traditional
learning based data mining algorithms, it is a big challenge to efficiently
mine knowledge from the fast increasing data such as information streams,
images and even videos.
Scalable Constrained Spectral Clustering
 To over-come the challenge, it is important to develop scalable learning
algorithms. Constrained clustering is an important area in the research
communities of machine learning. Researchers proposed many new
algorithms.
DISADVANTAGES OF EXISTING SYSTEM:
 Straightforward integration of the constrained normalized cuts and the sparse
coding based graph construction, and the formulated scalable constrained
normalized-cuts problem.
PROPOSED SYSTEM:
 In this paper, we develop an efficient and scalable CSC algorithm that can
well handle moderate and large datasets. The SCACS algorithm can be
understood as a scalable version of the well-designed but less efficient
algorithm known as Flexible Con-strained Spectral Clustering (FCSC).
 To our best knowledge, our algorithm is the first efficient and scalable
version in this area, which is derived by an integration of two recent studies,
the constrained normalized cuts and the graph construction method based on
sparse coding. However, it is by no means straight forward to integrate the
two existing methods.
ADVANTAGES OF PROPOSED SYSTEM:
Scalable Constrained Spectral Clustering
 We randomly sample clabelled instances from a given input dataset, and
then obtain based on the rules of The clustering accuracy is evaluated by the
best matching rate(ACC).
 Let h be the resulting label vector obtained from a clustering algorithm. Let
g be the ground truth label vector. Then, the best matching rate is defined as
where the delta function that returns 1 if a¼b and returns 0 otherwise, and
map(hi) is the permutation mapping function that maps each cluster label hi
to the equivalent label from the data corpus.
SYSTEM ARCHITECTURE:
SYSTEM REQUIREMENTS:
Scalable Constrained Spectral Clustering
HARDWARE REQUIREMENTS:
 System
:
Pentium IV 2.4 GHz.
 Hard Disk
:
40 GB.
 Floppy Drive
:
1.44 Mb.
 Monitor
:
15 VGA Colour.
 Mouse
:
Logitech.
 Ram
:
512 Mb.
SOFTWARE REQUIREMENTS:
 Operating system :
Windows XP/7.
 Coding Language :
JAVA/J2EE
 IDE
:
Netbeans 7.4
 Database
:
MYSQL
REFERENCE:
Jianyuan Li, Yingjie Xia, Zhenyu Shan, and Yuncai Liu, “Scalable Constrained
Spectral Clustering”, IEEE TRANSACTIONS ON KNOWLEDGE AND
DATA ENGINEERING, VOL. 27, NO. 2, FEBRUARY 2015