Download Names of student and superviser

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Geographic information system wikipedia , lookup

Neuroinformatics wikipedia , lookup

Theoretical computer science wikipedia , lookup

Computational phylogenetics wikipedia , lookup

K-nearest neighbors algorithm wikipedia , lookup

Multidimensional empirical mode decomposition wikipedia , lookup

Inverse problem wikipedia , lookup

Data analysis wikipedia , lookup

Pattern recognition wikipedia , lookup

Corecursion wikipedia , lookup

Data assimilation wikipedia , lookup

Transcript
TEMPLATE FOR SEMINAR PAPERS FOR THE DATA
MINING AND KNOWLEDGE DISCOVERY COURSE
Names of student and superviser
Institutuins
Jožef Stefan International Postgraduate School
Jamova 39, 1000 Ljubljana, Slovenia
e-mail: student’s e-mail
ABSTRACT
The summary should be two hundred words or less. An
abstract is a concise single paragraph summary of
completed work or work in progress. In a minute or less a
reader can learn about the aim of the study, general
approach to the problem, pertinent results, and important
conclusions or open questions.
1 INTRODUCTION
Provide the description of the problem addressed (the
context of your data) and the project objectives
Optional, but recommended:
 What other methods have already been used on
your data, if any?
 What other techniques, besides data mining, could
be used to analyze your data?
 Very briefly describe the experimental design and
how it accomplished the stated objectives.
What will the results be used for?
2 DATA
Provide a few general sentences about your dataset.
Describe the data acquisition process: where, when, who …
Provide a sentence for each attribute describing its meaning,
the relation to the target variable, the distribution and
values…
In the case of a descriptive induction problem: Provide the
descriptions of items (e.g. articles in a market basket, …)
2.3 Data preprocessing
What did you need to do?
Noise handling, discretization, data transformation, …
Did you do it manually or with the help of WEKA or
another tool?
3. MACHINE LEARNING METHODS USED
Which method did you choose and why?
3.1 Brief description of the methods used
•
Describe the method
•
Describe the parameters and how do they influence
the results of the method
3.2 Brief description of the evaluation criteria
Which criteria were chosen and why do they fit your
problem?
4. EVALUATION
2.1 Data description
•
number of instances
•
number of attributes
o
numerical
o
nominal
•
target variable nominal/numerical
o
distribution of target variable
•
missing values
Provide a screenshot of your data.
2.2 Data understanding
In the case of a classification problem: Provide a detailed
description of your target variable. What benefits would
who have by being able to predict it? What is the state of
the art in the area of your data?
Describe the experimental setup and the results of the
experiments
•
Which algorithm, which implementation
•
Parameters’ values
•
Results, possibly including the screenshot (e.g. of a
decision tree with a certain parameter setting, and the tree
obtained by another parameters setting)
•
Evaluation method, evaluation parameters and
results (e.g., 10-fold cross validation, accuracy, and result)
5. VISUALIZATION
How would you present your results to somebody who is
not familiar with data mining?
All the images and the tables should be readable.
6. CONCLUSION
Provide a brief summary of your work.
Emphasize the interesting outcomes and the potential
novelty of the results compared to your previous
expectations/knowledge. What possible implication could
your results have in the domain?
Discuss the advantages and disadvantages data mining
provides compared to other methods you know.
Discuss possible further work.
References
[1] Authors, Title, Publication, Volume, Year, Pages,
Publisher
Appendix
For large pictures and tables that don’t fit in the text.