Download Information

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Nonlinear dimensionality reduction wikipedia , lookup

Transcript
Open Systems and Electronic Commerce
Instructor: Jeffrey Yu [email protected]
Tutors: Hong Pan (PhD) [email protected]
Zechao Shang [email protected]
Class Schedules: Wednesday 7:00–9:30pm, LSB LT1.
Course web page:
http://www.se.cuhk.edu.hk/~seem5770
http://www.se.cuhk.edu.hk/~eclt5840
All emails about the course should be sent to:
[email protected]
[email protected]
This is a course combined of SEEM5770 and ECLT5840.
Big Data
• “Big data is a collection of data sets so large and
complex.”
– Volume, Various, Velocity, Veracity, Value
• “Businesses, governments and society are only
starting to tap its vast potential.”
Big Data – a growing torrent
[McKinsey Global Institute]
• $600 to buy a disk drive that can store all of the
world’s music.
• 5 billion mobile phones in used in 2010
• 30 billion pieces of content shared on Facebook
every month
• 40% projected growth in global data generated per
year vs 5% growth in global IT spending
• 235 terabytes data collected by the US Library of
Congress by April 2011
• 15 out of 17 sectors in the United States have more
data stored per company than the US Library of
Congress
Big Data – capturing its value
[McKinsey Global Institute]
• $600 billion potential annual consumer
surplus from using personal location data
globally
• 60% potential increase in retailers’ operating
margins possible with big data
Three Main Topics
• Data Movement: Open Systems (5 lectures)
– Computer Networks and Internet
– Application, Transport, Network, and Link Layers
• Database Querying (3 lectures)
– Querying Databases using SQL
– Different database systems to handle big data.
• Data Analysis: Knowledge Discovery for ECommerce (5 lectures)
– Introduction to Data Mining,
– Classification,
– Association Rule, and
– Clustering.
References
• Computer Networking: A Top-Down Approach Featuring
the Internet (5th edition) by James F. Kurose and Keith W.
Ross, Addison Wesley, 2009.
• Database System Concepts (5th Edition) by Avi Silberschatz,
Henry F. Korth, and S. Sudarshan, McGraw-Hill, 2010.
• Introduction to Data Mining, by Pang-Ning Tan, Michael
Steinbach, and Vipin Kumar, Addison Wesley, 2005.
• Data Mining – Concepts and Techniques (2nd Edition), by
Jiawei Han and Micheline Kamber, Morgan Kaufmann, 2006.
• Electronic Commerce 2010 – A Managerial Perspective, by
Efraim Turban, Jae K Lee, David King, Ting Peng Liang,
Deborrah Turban, Prentice Hall, 2010.
Assessments
• Individual Assignments 15% (networking)
• Individual Assignments 15% (database)
• Individual Assignments 20% (data mining)
• Final-exam (December 04, 2013) 50%
Important Notes:
Pay enough attention to Honesty in
Academic Work: A Guide for Students
and Teachers