Download 1 - Middle East Technical University

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Nonlinear dimensionality reduction wikipedia , lookup

Transcript
MIDDLE EAST TECHNICAL UNIVERSITY
STAT 356 – STATISTICAL DATA ANALYSIS (3-2) 4
COURSE OUTLINE
Spring, 2013
Instructor: Ceylan YOZGATLIGİL, Ph.D.
Office: Dept. of Statistic Room 136
Phone: 210 2963
Email: [email protected]
Teaching Assistant: Ozan ÇINAR
Office: Dept. of Statistic Room 231
Phone: 210 5309
Email: [email protected]
Course Schedule
Tuesday
1:40 p.m.- 4:30 p.m. (Z-22)
Thursday (R1) 10:40 a.m.- 12:30p.m. (Comp Lab.)
Friday (R2) 8:40 a.m.- 10:30a.m. (Comp. Lab.)
Office Hours: WBA
Course Web page: http://www.metu.edu.tr/~ceylan/STAT356.htm
Course description and objectives: This is an applied course — applications of statistics in many
different fields will be covered. The objectives of this course are to enable the students to understand
how to think about data, and to be able to handle graphical and methodological ways to highlight what is
going on in data, summarize relationships in data using statistical models, and demonstrate the ability to
highlight structure in data by doing so. Students will see many analyses of real data, and will spend lots
of time doing their own statistical analyses of real data using the computer and learning to interpret the
results of those analyses.
I strongly urge you to bring in to me examples of statistics, probabilistic reasoning, and so on, that you
see in newspapers or magazines, whether they are directly relevant to material being discussed in class
or not. Such material might very well then be incorporated into class discussion.
Background: The course is required for all undergraduate statistics majors. Prerequisite courses are
STAT 156, STAT 291.
Learning goals:
 To develop quantitative reasoning
 To develop statistical reasoning and methodology provide the tools to become numerate.
 How to think about randomness, and about data
 Explain the role of data and models in the decision making process and how they support (rather
than determine) decisions
 Decide what tools are appropriate in different data and decision making settings


Decide how to structure model so that the essential elements are included and that the model can
be analyzed in a timely fashion
Translate the results of the model into a statement about the data relationships and of the actions
that could be taken with the new understanding.
Textbooks: No specific textbook.
Reference Books & Papers:
 Agresti, A., (2002) Categorical Data Analysis, 2nd edition, Wiley, New York.
 Chatfield, C., (1980) Problem Solving: A Statistician’s Guide, 2nd edition, Chapman & Hall.
 Dunham, M.H., (2002) Data mining: Introductory and advanced topics, Prentice Hall.
 Han, J. And Kamber,. M., (2000) Data mining: Concepts and techniques, Morgan Kaufmann
Pub.
 Hoaglin, D.C., Mosteller, F. and Tukey, J.W., (1983) Understanding Robust and Exploratory
Data Analysis, Wiley.
 Ilk, O. (2011) R Yazılımına Giriş, ODTÜ.
 Little, R.J.A and Rubin, D.B., (2002) Statistical Analysis with Missing Data, 2nd edition,
Chichester:Wiley .
 Neter, J., Kutner, M.H., Nachtsheim, C.J. and Wasserman, W., (1996) Applied Linear Statistical
Models, 4th edition, Irwin.
 Tukey, J.W., (1977) Exploratory Data Analysis, Addison-Wesley.
 Data Analysis and Graphics Using R: An Example-based Approach, by John Maindonald and
John Braun. Cambridge Series in Statistical and Probabilistic Mathematics, 2003. Has some nice
things, but not quite right as the sole textbook for this course
 Ramsey and Schafer (2002), The Statistical Sleuth: A Course in Methods of Data Analysis, 2nd
edition
Course Content:
 Introduction, Motivation. Types of data.
 Graphical and tabular representation of data, exploratory data analysis
 Unexpected structures in data – multicollinearity, confounding, interaction effects ...
 Data transformation
 Handling missing data
 Introduction to the analysis of categorical data
 Introduction to Robust estimation
 Smoothing methods – short review since it is already covered in S361
 Introduction to data mining
 Case Studies
Computing: We will use R and GGobi.
Calculator: You are expected to have a calculator, Casio FX-82ES. Advanced calculators will not be
allowed in the exams.
Attendance: Mandatory, though I will not take roll. You are responsible for everything we do in class,
even on days you do not attend.
Exams and Grading: There will be one in-class midterm, one project work and a final exam.
Midterm exam (25%)
Homework (15%)
Project: 25% (Report 18%, Presentation 7%)
Final (35%)
For the project, you will be provided with a dataset, asked to analyze it and present your results in a
written report and in an in-class presentation. You can work in groups of at MOST 3 or alone. You will
be required to submit your homework assignments and projects to turn-it in program. If the similarity
index of your work is greater than or equal to 50%, you will receive 0 points from that work. If index is
between 30% and 50%, you will receive only half of the points that you deserve.
Make-Up Work: Make-up exams will only be given in very unusual circumstances, with one week
prior notification (or, in the event of an emergency, *very* strong documentation of that emergency).
If you have this kind situation and don’t contact with me one week before or after the exam, you
cannot take the make-up exam. Make-up exam will be given at the end of the semester and it will be
similar to the final exam (cover all the topics).
Students who miss one of the exams and make-up will receive an NA grade. Students who do not return
a project will also receive an NA grade. Students who receive an NA grade are not allowed to take resit
exams.
Homework: Assignments are due at the beginning of class on Fridays. Please clearly write
down your name on top of the first page. And if your assignment contains multiple pages,
please staple them!
Please write down your solutions legibly and neatly. Don't skimp on margins! Remember to
properly define your variables, label your diagrams, etc., so that we know what you're trying to
communicate.
In general, you should show enough work/reasoning in deducing the final answer. If you need
to refer to a specific result mentioned in class or in the textbook, please indicate so as well.
Insufficient or uncited work will receive reduced or no credit.
Homework assignments form an integral component of the course. You should make every
effort to solve the assigned problems, using the concepts learned from the lectures and readings.
If you have any question, come see me during office hours, and/or talk to your teaching
assistant. Help each other out! Collobration is allowed as idea-sharing. However, you should
write up your work on your own and in your own words. Exact duplication of others' work is
considered plagiarism.
Your homework will not be graded, if you bring it after the recitation hour.
Academic Integrity: All assignments, quizzes, and exams must be done on your own. Note that
academic dishonesty includes not only cheating, fabrication, and plagiarism, but also includes helping
other students commit acts of academic dishonesty by allowing them to obtain copies of your work. You
are allowed to use the Web for reference purposes, but you may not copy code from any website or any
other source. In short, all submitted work must be your own. Should a student be caught cheating during
an examination or be involved in plagiarism, a zero (0) will be assigned for the exam, quiz or writing
assignment.
Please look at the following page for further information:
http://www.ueam.metu.edu.tr/TURKCE/ueam/ueam_ilkeler/ueam_ilkeler_honor_code_tab.htm
Important Dates:
 Week of the ADD-DROP for the classes: February 24-28, 2014
 National Holiday (National Sovereignty and Children's Day, Wednesday) April 23, 214







Labor and Solidarity Day (Thursday) May 1, 2014
National Holiday (Commemoration of Atatürk & Youth and Sports Festival, Monday) May 19,
2014
Last date for the WITHDRAWAL: April 25, 2014
End of lectures: May 23, 2014
Final exams: May 26 – June 7, 2014
Announcement of final grades: June 16, 2014
Resit Examinations: June 18 – 20, 2014