AP/ITEC 4305M
Web mining
Winter 2013
Course Outline
Time and Location
Monday 11:30-14:30
TEL 1016
Instructor
Mariam Daoud
TEL 3052
Office hours: Monday 3pm-4pm
[email protected]
Important Dates
• First day of class: Monday, January 7, 2013 (TEL 1016, 11:30-14:30)
• Last day of class: Monday, April 1, 2013
• Reading week: Saturday, February 16 to Friday, February 22, 2013
• Winter 2013 final exam period: Wednesday, April 10 to Friday, April 26, 2013
• Midterm exams: TBA
• Final exam: TBA
Course Description
The World Wide Web (or the Web for short) is officially defined as a "wide area hypermedia
information retrieval initiative aiming to give universal access to a large universe of documents".
The rapid growth of the Web in the last decade has made it the largest publicly accessible data
source in the world. The Web has many unique characteristics, which make mining useful
information and knowledge from the Web a fascinating and challenging task. This is an
advanced course that follows ITEC4020 "Internet Client-Server Systems" and ITEC3020
"Introduction to Web Technology". It covers advanced topics and recent research topics
in Web mining.
The major objectives of this course are to introduce Web mining technology from a practical
point of view and to give students a solid grasp of how Web mining techniques can be
applied to solve problems in real-world applications.
Web mining aims to discover useful information or knowledge from Web hyperlinks, page
contents and usage data. Due to the richness and diversity of information and other Web-specific
characteristics, Web mining is not just an application of data mining. Web mining has developed
many of its own methods, ideas, models and algorithms. This course will cover the following
topics:
• Introduction to WWW and Web Mining Systems
• Learning and Knowledge Discovery from the Web
• Information Retrieval (IR) and Web Search
• Web Crawling and Information Integration
• Web Link Analysis such as Social Network Analysis, PageRank and HITS
• Opinion and Sentiment Mining
• Web Aspect Search and Mining
• Web Usage Mining
• Web Mining Applications such as Web Blog Mining and Online Medical Data Analysis
Required Textbook
Bing Liu. Web Data Mining: Exploring Hyperlinks, Contents and Usage Data. Springer, 2007.
532 pages. ISBN: 978-3-540-37881-5
Evaluation
The grade for this course is made up of two parts:
Three assignments: 70%
Final exam: 30%
Late Policy
You are given one (1) grace day to use during the term: once, and once only, you may submit an
assignment up to 24 hours late with no penalty. The grace day will be applied to the first late
assignment; if you submit a second assignment late, it will not be marked. In
exceptional cases, late assignments may be accepted provided that medical or other acceptable
documentation is presented. When seeing a doctor, please use the form available at:
http://www.registrar.yorku.ca/services/petitions/forms.htm#6
If you miss the final exam for medical reasons, you must apply for a deferred final examination
within one week of the exam date.
Academic Honesty
Assignments must be produced through independent work. You may talk to your
classmates, but the final form of each assignment must be your own. The penalty for
electronically copied assignments is a grade of zero plus possible disciplinary action.