Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Preprocessing with WEKA 5/7/2017 1 WEKA: the software Machine learning/data mining software written in Java (distributed under the GNU Public License) Used for research, education, and applications Complements “Data Mining” by Witten & Frank Main features: Comprehensive set of data pre-processing tools, learning algorithms and evaluation methods Graphical user interfaces (incl. data visualization) Environment for comparing learning algorithms 5/7/2017 2 WEKA only deals with “flat” files @relation iris @attribute sepallength real @attribute sepalwidth real @attribute petallength real @attribute petallength real @attribute petalwidth real @attribute class { Iris-setosa, Iris-versicolor, Iris-virginica} @data 4.6,3.2,1.4,0.2,Iris-setosa 5.3,3.7,1.5,0.2,Iris-setosa 7.0,3.2,4.7,1.4,Iris-versicolor 6.3,3.3,6.0,2.5,Iris-virginica 6.3,2.9,5.6,1.8,Iris-virginica ... 5/7/2017 3 WEKA only deals with “flat” files @relation iris @attribute sepallength real @attribute sepalwidth real @attribute petallength real @attribute petallength real @attribute petalwidth real @attribute class { Iris-setosa, Iris-versicolor, Iris-virginica} @data 4.6,3.2,1.4,0.2,Iris-setosa 5.3,3.7,1.5,0.2,Iris-setosa 7.0,3.2,4.7,1.4,Iris-versicolor 6.3,3.3,6.0,2.5,Iris-virginica 6.3,2.9,5.6,1.8,Iris-virginica... 5/7/2017 4 5/7/2017 5 5/7/2017 6 Explorer: pre-processing the data Data can be imported from a file in various formats: ARFF, CSV, C4.5, binary Data can also be read from a URL or from an SQL database (using JDBC) Pre-processing tools in WEKA are called “filters” WEKA contains filters for: 5/7/2017 Discretization, normalization, resampling, attribute selection, transforming and combining attributes, … 7 5/7/2017 University of Waikato 8 5/7/2017 University of Waikato 9 5/7/2017 University of Waikato 10 5/7/2017 University of Waikato 11 5/7/2017 University of Waikato 12 5/7/2017 University of Waikato 13 5/7/2017 University of Waikato 14 5/7/2017 University of Waikato 15 5/7/2017 University of Waikato 16 5/7/2017 University of Waikato 17 5/7/2017 University of Waikato 18 5/7/2017 University of Waikato 19 5/7/2017 University of Waikato 20 5/7/2017 University of Waikato 21 5/7/2017 University of Waikato 22 5/7/2017 University of Waikato 23 5/7/2017 University of Waikato 24 5/7/2017 University of Waikato 25 5/7/2017 University of Waikato 26 5/7/2017 University of Waikato 27 5/7/2017 University of Waikato 28 Conclusion: try it yourself! WEKA is available at http://www.cs.waikato.ac.nz/ml/weka 5/7/2017 29