Download Title - K.f.u.p.m. ISI

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Nonlinear dimensionality reduction wikipedia , lookup

Cluster analysis wikipedia , lookup

Transcript
'Intelligent Yardstick', An Approach Of Ranking To Filter NonPromising
Attributes From Schema In Data Mining Process
Hassan, MM
SPRINGER-VERLAG BERLIN, INTELLIGENT CONTROL AND AUTOMATION; pp:
623-632; Vol: 344
King Fahd University of Petroleum & Minerals
http://www.kfupm.edu.sa
Summary
Knowledge searching and representation process needs to go through filtering to
make it concise and precise. The data mining process (DM) as a knowledge extraction
method also needs filtering to remove unnecessary or erroneous data. Sometimes
filtering also needs to cover processing or representational limits, especially in visual
data mining. Common approaches for filtering or shrinking the database are - discard
records, prune attributes from the database or reduce dimensionality. In this paper we
propose an approach where we have applied single dimension DM process on each
individual attribute to select only those attributes which are good for future full-scale
DM and prune the non prospective attributes. In this process we check several
properties of a single attribute like data distance, clustering tendency and cluster
density. According to the result, we formulate a scale that will rank all attributes to
indicate their importance for future knowledge extraction schema. The main
observation that helps us to build this approach is -'An attribute that has no data
pattern itself will not help to find a pattern in a multidimensional environment'.
References:
1.
2.
3.
4.
5.
6.
7.
©
ABRAHAM B, 2005, IEEE T KNOWLEDGE DAT, V17
EAMONN K, 2002, ACM T DATABASE SYST, V27, P188
FAYYAD UM, 1996, DATA MIN KNOWL DISC, P1
HINNEBURG A, 2003, COMPUT SCI ENG, V5, P14
MARK AH, 2005, IEEE T KNOWLEDGE DAT, V17
MOHAMMAD MH, P INT C INT SYST KUA
PAUL A, 2002, IEEE T KNOWLEDGE DAT, V14
Copyright: King Fahd University of Petroleum & Minerals; http://www.kfupm.edu.sa
8.
9.
10.
11.
12.
RAKESH A, 1993, IEEE T KNOWL DATA EN, V5, P914
SAMUEL HH, 2003, IEEE T KNOWLEDGE DAT, V15
STUART R, 2003, STAT LEARNING METHOD, P736
WYNNE H, 2000, P 6 ACM SIGKDD INT C, P430
XINGQUAN Z, 2005, IEEE T KNOWLEDGE DAT, V17
For pre-prints please write to: [email protected]
©
Copyright: King Fahd University of Petroleum & Minerals; http://www.kfupm.edu.sa