Download Appendix: The WEKA Data Mining Software

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Data Mining Workbenches: a overview
&comparison focusing on open-source
packages
CS240A notes by C. Zaniolo
1
Most Popular Data Mining Software
Rexer Analytics Survey (Early 2007) asked
about the tools used often and occasionally.
Clearly more popular than the rest were:
 SPSS or SPSS Clementine
 "Own Code"
 SAS or SAS Enterprise Miner
Followed by
 R
 Weka
 C4.5 / C5.0
2
Critical Mass and Popularity
Top ten most used packages by KDD Nuggets Survey (May 2007):










SPSS/ SPSS Clementine
Salford Systems CART/MARS/TreeNet/RF
Yale (now Rapid Miner)
SAS / SAS Enterprise Miner
Angoss Knowledge Studio / Knowledge Seeker
KXEN
Weka
R
Microsoft SQL Server?
MATLAB?
Note: Microsoft Excel omitted as it's not really "data mining" software, and
I've merged the tools offered by a single vendor (SPSS and SAS)
You can see the full survey results
3
Comments Gregory Piatetsky-Shapiro,
KDnuggets Editor:
Votes from tool vendors were removed..
Comparing with 2008 KDnuggets Poll on
data mining tools/software used,
the big changes are growth in SPSS,
RapidMiner, and R.
4
Popular Data Mining Software (cont.)
Rexer Analytics Survey is taken every year and the
summary report can be obtained free.

2009 SURVEY HIGHLIGHTS:



Open-source tools Weka and R made substantial movement up
data miner’s tool rankings this year, and are now used by large
numbers of both academic and for-profit data miners.
SAS Enterprise Miner dropped in data miner’s tool rankings
2010 SURVEY HIGHLIGHTS:
 R: After a steady rise across the past few years, R overtook other tools
to become the tool used by more data miners (43%)
 STATISTICA has also been climbing in the rankings.
STATISTICA, IBM SPSS Modeler, and R received the
strongest satisfaction ratings in both 2010 and 2009.
5
6
7
Related documents