Download Fuzzy data mining fo..

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Chaoyang University of Technology
Fuzzy data mining for interesting
generalized association rules
Source : Fuzzy Sets and Systems ; Vol.138, No. 2, 2003, pp.255-269
Author : Tzung-Pei, Kuei-Ying Lin, Shyue-Liang Wang
Instructor Professor :Rong-Chung Chen
Present : Ya-Hui Chin (金雅慧)
Chaoyang University of Technology
Outline
• Introduction
– Overview of Data Mining
– What is Association Mining
– Mining Procedures
•
•
•
•
Fuzzy Generalized Mining Algorithm
Experimental results
Conclusion
Comments
2005/12/06
Fuzzy data mining for interesting generalized association rules
2
Chaoyang University of Technology
Introduction(1/3)
Overview of Data Mining
Association Mining
AB equal to BA
Sequential Mining
ACDE
Classification
Clustering
2005/12/06
Fuzzy data mining for interesting generalized association rules
3
Chaoyang University of Technology
Introduction(2/3)
What is Association Mining
• Finding frequent patterns, correlations, or
causal etc.
• Application
– Market basket analysis, cross-marketing, catalog
design, clustering, classification, etc.
2005/12/06
Fuzzy data mining for interesting generalized association rules
4
Chaoyang University of Technology
Introduction(3/3)
min. support = 2
Database D
TID
100
200
300
400
L2
Items
134
235
1235
25
itemset sup.
2
C1 {1}
{2}
3
Scan D {3}
3
{4}
1
{5}
3
itemset sup
{1 3}
2
{2 3}
2
{2 5}
3
{3 5}
2
C2
itemset sup
{1 2}
1
{1 3}
2
{1 5}
1
{2 3}
2
{2 5}
3
{3 5}
2
L1
itemset sup.
{1}
2
{2}
3
{3}
3
{5}
3
Scan D
C2
itemset
{1 2}
natural{1 3}
join
{1 5}
{2 3}
{2 5}
{3 5}
Self-join
C3
2005/12/06
itemset
{2 3 5}
Scan D
L3 itemset sup
{2 3 5}
2
Fuzzy data mining for interesting generalized association rules
5
Chaoyang University of Technology
Fuzzy Generalized Mining Algorithm(1/12)
.Single concept level
TID
1
2
3
Items
Milk Cake
Juice T-shirt
Cake Milk
T-shirt
T-shirt
.Integrate fuzzy-set
TID
1
2
3
2005/12/06
Items
(Milk,3) (Cake,1)
(T-shirt,2)
(Juice,2) (T-shirt,1)
(Cake,1) (Milk,2) (T-shirt,1)
Fuzzy data mining for interesting generalized association rules
6
Chaoyang University of Technology
Fuzzy Generalized Mining Algorithm(2/12)
.Multiple-level association rules
2005/12/06
Fuzzy data mining for interesting generalized association rules
7
Chaoyang University of Technology
Fuzzy Generalized Mining Algorithm(3/12)
2005/12/06
Fuzzy data mining for interesting generalized association rules
8
Chaoyang University of Technology
Fuzzy Generalized Mining Algorithm(4/12)
(C,9)(E,10)(T2,9)(T3,10)
Ex. (Bread,9) (T-shirt,10)
9
Food
( C , 9 ) => ( T2 , 9 )
10
Clothes
( E , 10 ) => ( T3 , 10 )
Drink
Bread
9
Jackets
Milk
2005/12/06
T-shirts
10
Juice
Fuzzy data mining for interesting generalized association rules
9
Chaoyang University of Technology
Fuzzy Generalized Mining Algorithm(5/12)
0.8
0.6
0.5
0.4
0.2
9 10
(C,9) => C ( 0/Low , 0.4/Middle , 0.6/High )
(E,10) => E ( 0/Low , 0.2/Middle , 0.8/High )
2005/12/06
Fuzzy data mining for interesting generalized association rules
10
Chaoyang University of Technology
Fuzzy Generalized Mining Algorithm(6/12)
n
count jl   f ijl
i 1
,
count jl 為Membership的總和
C.High => 0.0 + 0.2 + 0.8 + 0.6 +0.0 + 0.4 = 2.0
2005/12/06
Fuzzy data mining for interesting generalized association rules
11
Chaoyang University of Technology
Fuzzy Generalized Mining Algorithm(7/12)
Minimum support value α = 1.5
2005/12/06
Fuzzy data mining for interesting generalized association rules
12
Chaoyang University of Technology
Fuzzy Generalized Mining Algorithm(8/12)
(T1.Low , T2.High)
ˇ (B.Low , C.Middle) ˇ (C.Middle , D.Middle) ˇ (D.Middle , T1.Low)
ˇ (B.Low , D.Middle) ˇ (C.Middle , T1.Low) ˇ (D.Middle , T2.High) ˇ (T1.Low , T3.Middle)
(B.Low , T1.Low)
(C.Middle , T2.High)
(B.Low , T2.High) ˇ (C.Middle , T3.Middle)
ˇ (B.Low , T3.Middle)
2005/12/06
(D.Middle , T3.Middle)
ˇ (T2.High , T3.Middle)
Fuzzy data mining for interesting generalized association rules
13
Chaoyang University of Technology
Fuzzy Generalized Mining Algorithm(9/12)
Minimum support value α = 1.5
1.4
2005/12/06
Fuzzy data mining for interesting generalized association rules
14
Chaoyang University of Technology
Fuzzy Generalized Mining Algorithm(10/12)
6
Confidence threshold 0.7
 (T .Middle  B.Low)
i 1
3
6
 (T .Middle )
i 1

1.6
 0.57
2.8
3
If B = Low , then T3 =Middle, with a confidence value of 0.73.
2005/12/06
Fuzzy data mining for interesting generalized association rules
15
Chaoyang University of Technology
Fuzzy Generalized Mining Algorithm(11/12)
Interest threshold 1.5
2005/12/06
Fuzzy data mining for interesting generalized association rules
16
Chaoyang University of Technology
Fuzzy Generalized Mining Algorithm(12/12)

If T1.Low , then T3.Middle , confidence factor of 0.8

If T3.Middle , then T2.High , confidence factor of 0.86
2005/12/06
Fuzzy data mining for interesting generalized association rules
17
Chaoyang University of Technology
Experimental results(1/3)
2005/12/06
Fuzzy data mining for interesting generalized association rules
18
Chaoyang University of Technology
Experimental results(2/3)
2005/12/06
Fuzzy data mining for interesting generalized association rules
19
Chaoyang University of Technology
Experimental results(3/3)
2005/12/06
Fuzzy data mining for interesting generalized association rules
20
Chaoyang University of Technology
Conclusion & Comments
• Conclusion
– Discovering interesting patterns
– Getting smoother mining rules
• Comments
– Without explaining how to classify the items
– Without comparing with other method in
experiment
2005/12/06
Fuzzy data mining for interesting generalized association rules
21
Related documents