* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Mining Fuzzy Multiple-Level Association Rules from Quantitative Data
Survey
Document related concepts
Transcript
Mining Fuzzy Multiple-Level Association Rules from Quantitative Data Author: TZUNG-PEI HONG KUEI-YING LIN BEEN-CHIAN CHIEN Advisor: Dr. Hsu Graduate: Yan Pin Huang ADSL Wednesday, October 01, 2003 Content Motivation Objective Introduction Notation The Multiple-Level Fuzzy Data-Mining Algorithm Experimental Results Conclusion Personal opinion Motivation Machine-learning and data-mining techniques have been developed to turn data into useful taskoriented knowledge. Most algorithms for mining association rules identify relationships among transactions using binary values and find rules at a single-concept level. Objective This paper proposes a fuzzy multiple-level mining algorithm for extracting knowledge implicit in transactions stored as quantitative values. The proposed algorithm adopts a top-down progressively deepening approach to finding large itemsets. It integrates fuzzy-set concepts, data-mining technologies and multiple-level taxonomy to find fuzzy association rules from transaction data sets. Introduction Proposed a method for mining association rules from data sets using quantita-tive and categorical attributes. R. Srikant and R. Agrawal, “Mining quantitative association rules in large relational tables,” in The 1996 ACM SIGMOD International Conference on Management of Data. Introduction(cont.) Fuzzy set theory is being used more and more fre-quently in intelligent systems because of its simplicity and similarity to human reasoning [15]. Mining at Multiple Concept Levels They divided the min-ing process into two phases. Candidate itemsets were generated and counted by scanning the transaction data. Association rules were induced from the large itemsets found in the first phase Mining at Multiple Concept Levels Notation(cont.) Notation(cont.) The Multiple-Level Fuzzy Data-Mining Algorithm The Multiple-Level Fuzzy Data-Mining Algorithm(cont) Step 1. Each item name is first encoded using the predefined taxonomy. Results are shown in Table 2. The Multiple-Level Fuzzy Data-Mining Algorithm(cont) 1** 11* 111 The Multiple-Level Fuzzy Data-Mining Algorithm(cont) Step 2. All transactions shown in Table 1 are then encoded using the above coding scheme Step 3. k is initially set at 1, where k is used to store the level number being processed. Step 4. All the items in the transactions are first grouped on level one. The Multiple-Level Fuzzy Data-Mining Algorithm(cont) The Multiple-Level Fuzzy Data-Mining Algorithm(cont) Step 5. The quantitative values of the items on level 1 are represented using fuzzy sets. The Multiple-Level Fuzzy Data-Mining Algorithm(cont) (1,1)(6,0) 帶入 y=ax+b 求解a,b (a=-1/5 b=6/5) Function: y= -1/5x+6/5 (1**,5)帶入member function求得 y=0.2 The Multiple-Level Fuzzy Data-Mining Algorithm(cont) Step 6. The scalar cardinality of each fuzzy region in the transactions is calculated as the count value. Its scalar cardinality=(0.8+0.8+ 0.0+0.2+0.0+0.0) =1.8 The Multiple-Level Fuzzy Data-Mining Algorithm(cont) Step 7. The fuzzy region with the highest count among the three possible regions for each item is found. The Multiple-Level Fuzzy Data-Mining Algorithm(cont) Step 8. The count of any region selected in Step 7 is checked against the predefined minimum support value α. (α =2.1) The Multiple-Level Fuzzy Data-Mining Algorithm(cont) 1.2 The Multiple-Level Fuzzy Data-Mining Algorithm(cont) Step 12. r is set at 2, where r is used to store the number of items kept in the current itemsets. Step 13. Since is null, k =k + 1=2 and Step 4 is done. The results for level 2 are shown in Table 10.The results for level 3 are shown in Table 11.Since there are no items on level 4, Step 17 is done. The Multiple-Level Fuzzy Data-Mining Algorithm(cont) Step 17. The association rules are constructed for each large itemset using the following substeps. The Multiple-Level Fuzzy Data-Mining Algorithm(cont) The Multiple-Level Fuzzy Data-Mining Algorithm(cont) Step 18. The confidence values of the possible association rules are checked against the predefined confidence threshold λ. (λ =0.7) 6. Experimental Results They were implemented in C on a Pentium-III 700 Personal Computer. The number of levels was set at 3. 64 purchased items (terminal nodes) on level 3, 16generalized items on level 2, and 4 generalized items on level 1. Experimental Results(cont) Experimental Results(cont) Discussion and Conclusions Proposed a fuzzy multiplelevel data-mining algorithm that can process transaction data with quantitative values and discover interesting patterns among them. This method achieves better time complexity since only the most important fuzzy term is used for each item. This proposed algorithm does not find association rules for items on the same paths in given hierarchy trees. Discussion and Conclusions We will therefore attempt to dynamically adjust the membership functions in the proposed mining algorithm We will also attempt to design specific datamining models for various problem domains. Personal opinion Find association rules for items on the same paths in given hierarchy trees. Find a method that can dynamically adjust the membership functions. Fuzzy SOM. Fuzzy clustering. The strategy of using fuzzy set in Ant Colony algorithm. Personal opinion(cont.) Personal opinion(cont.)