Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
AP/ITEC 4305M Web mining Winter 2013 Course Outline Time and Location Monday 11:30-14:30pm Tel 1016 Instructor Mariam Daoud TEL 3052 Office hours: Monday 3pm:4pm [email protected] Important Dates First day of class, Monday, January 7, 2013 (Tel 1016, 11:30-14:30pm) Last day of class, Monday, April 1, 2013. Reading week - Saturday, February 16 to Friday, February 22, 2013. Winter 2013 Final Exam - Wednesday, April 10 to Friday, April 26, 2013. Midterm exams: TBA Final exam: TBA Course Description The World Wide Web (or the Web for short) is officially defined as a "wide area hypermedia information retrieval initiative aiming to give universal access to a large universe of documents". The rapid growth of the Web in the last decade makes it the largest publicly accessible data source in the world. The Web has many unique characteristics, which make mining useful information and knowledge from the Web a fascinating and challenging task. This course is an advanced course after the courses ITEC4020 "Internet Client-Server Systems" and ITEC3020 "Introduction to Web Technology". It covers some advanced topics and the latest research topics on Web mining. The major objectives of this course are to introduce Web mining technology from a practical point of view and for the students to obtain a solid grasp of how techniques in Web mining technology can be applied to solve problems in real-world applications. Web mining aims to discover useful information or knowledge from Web hyperlinks, page contents and usage data. Due to the richness and diversity of information and other Web specific characteristics, Web mining is not just an application of data mining. Web mining has developed many of its own methods, ideas, models and algorithms. This course will cover the following topics: Introduction to WWW and Web Mining Systems Learning and Knowledge Discovery from the Web Information Retrieval (IR) and Web Search Web Crawling and Information Integration Web Link Analysis such as Social Network Analyis, PageRank and HITS Opinion and Sentiments Mining Web Aspect Search and Mining Web Usage Mining Web Mining Applications such as Web Blogs Mining and Online Medical Data Analysis Required Textbook Web Data Mining: Exploring Hyperlinks, Contents and Usage Data. 532 pages. Bing Liu Springer, 2007 ISBN: 978-3-540-37881-5 Evaluation The grade for this course will be made up from two parts: Three assignments: 70% Final exam: 30% Late Policy You are given one (1) grace day to use during the term: once, and once only, you may submit an assignment up to 24 hours late with no penalty. The grace day will be applied to the first late assignment; if you submit two assignments late, the second one will not even be marked. In exceptional cases, late assignments may be accepted provided that medical or other acceptable documentation is presented. When going to see a doctor please use the form downloaded from: http://www.registrar.yorku.ca/services/petitions/forms.htm#6 If you miss the final exam for medical reasons you have to apply for deferred final examination within a week from the exam date. Academic Honesty Assignments are supposed to be produced through independent work. You may talk to your classmates but the final form of the assignments must be your own. The penalty for electronically copied assignments is a zero plus the possibility of a disciplinary action.