Download Visualization

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
ACADEMIC
VISUALIZATION
AN INSIGHT INTO
ACADEMIA
CHANGFENG LIU
ACEMAP GROUP
MAY 31, 2016
OUTLINE
• Background
• Work Review
• SchoolDrag
• Future work
May 31, 2016
Academic Visualization
2
• Q&A
OUTLINE
• Background
• What is visualization?
• Why academic visualization?
• Work Review
• SchoolDrag
• Future work
May 31, 2016
Academic Visualization
3
• Q&A
WHAT IS
VISUALIZATION
• Wikipedia:
• Information visualization is the study of
(interactive) visual representations of abstract
data to reinforce human cognition.
• An easy way to understand the world!
Visualization
Academic Visualization
http://echarts.baidu.com/demo.html#heatmap-map
4
http://www.air-level.com/rank
May 31, 2016
WHY ACADEMIC
VISUALIZATION
• Prof. Wang:
May 31, 2016
Academic Visualization
5
• We have got massive academic data,
which is like raw food materials. What
we should do is to cook delicious
dishes!
• How to be a good chef?
WHY ACADEMIC
VISUALIZATION
Information visualization!
May 31, 2016
Academic Visualization
Data warehousing, DBMS
Data mining, knowledge discovery
6
Crawling, data cleaning
WHY ACADEMIC
VISUALIZATION
• Why we need to build our own academic visualization?
• Because traditional academic search engines
provide few visualizations…
May 31, 2016
Academic Visualization
Microsoft academic
dblp
7
Google Scholar
OUTLINE
• Background
• Work Review
• SchoolDrag
• Future work
May 31, 2016
Academic Visualization
8
• Q&A
WORK REVIEW
• One paper accepted by WWW BigScholar workshop
May 31, 2016
Academic Visualization
9
• Two patents submitted
WORK REVIEW
Local network analysis and hierarchical clustering
By clicking the nodes, networks of
the references are expanded and
collapsed.
Find multiple paths to the
center paper and highlight
them in the graph.
The cluster graph
shows fields in
computer science.
May 31, 2016
Academic Visualization
10
Zhaowei Tan, Changfeng Liu, et al., "AceMap: A Novel Approach towards Displaying Relationship among Academic
Literatures", WWW BigScholar Workshop, 2016
BEYOND PAPER + PAPER
WE CAN SEE MORE!
• Paper + University?
• Paper + Timeline?
• Paper + Location?
May 31, 2016
Academic Visualization
11
abcd
OUTLINE
• Background
• Work Review
• SchoolDrag
•
•
•
•
Motivation
Visualization tool selection
Handle the data
Real-time querying
• Future work
May 31, 2016
Academic Visualization
12
• Q&A
SCHOOLDRAG
• Our team implemented three visualizations this
semester
• Organization
• Geographical location
• Hierarchical structure
May 31, 2016
Academic Visualization
13
• I will introduce the one about
organizations -- SchoolDrag
MOTIVATION
• Select good graduate
schools in a certain field
• Find out the developing
trends of different universities
• Which university is the
groundbreaker? Which
university is the follower?
May 31, 2016
Academic Visualization
14
• No academic search engines
provide such a function.
VISUALIZATION TOOL
SELECTION
• There are many tools available
• D3 and vega
• Tableau
• Gephi
• Eigenfactor
• How to choose?
Open-source
Easy to code
Community support
Web-based
May 31, 2016
Academic Visualization
15
•
•
•
•
HANDLE THE DATA -GOAL
• To know the numbers of published papers of every
organization in every year.
• To get the values of corresponding authors of
every organization in every year.
May 31, 2016
Academic Visualization
16
• To demonstrate the developing trends and
academic potential of different organizations in an
interactive way.
HANDLE THE DATA -OTHER TOOLS
• MySQL to manage and retrieve data
MySQL
• C++ & Python to do the computation
Batch files
• JSON to store the data
C++
Python
JSON
May 31, 2016
Academic Visualization
17
• Batch files to do the repeating tasks
HANDLE THE DATA -RESULTS
• Two fields according to CCF recommendation
list
• Network
• Artificial intelligence
• Ten L1 or L2 fields according to MAG
datasets
• Computer security
• Computer vision
• Embedded system
May 31, 2016
Academic Visualization
18
• ……
• More fields are coming
May 31, 2016
Academic Visualization
http://acemap.sjtu.edu.cn/schoolDrag/networktop100.html
19
DEMO TIME
OFFLINE DEMO
1. Each circle stands for an
academic organization.
2. The more published
papers an organization
has in the specified year,
the bigger the circle is.
3. The curve shows the
developing trends of
the total number of
published papers and
authors of an
organization.
May 31, 2016
Academic Visualization
20
schoolDrag in AI field
REAL-TIME QUERYING
• Real-time Querying V.S. Retrieving JSON files
• More storage efficient – 50k fields in MAG dataset
• More flexible – what if a new field joins
• Less time efficient – minute-level waiting is
unbearable
• How to deal with this trade-off?
• Rapid sampling[1]
May 31, 2016
Academic Visualization
21
[1] Kim, Albert, et al. "Rapid sampling for visualizations with ordering guarantees." Proceedings of the VLDB Endowment
8.5 (2015): 521-532.
REAL-TIME QUERYING -RAPID SAMPLING
• Main ideas
• Part stands for the whole
population
• Only focusing on what
we care about
• Confidence intervals
count
May 31, 2016
Academic Visualization
22
Kim, Albert, et al. "Rapid sampling for visualizations with ordering guarantees." Proceedings of the VLDB Endowment 8.5
(2015): 521-532.
REAL-TIME QUERYING -RAPID SAMPLING
• For our scenario
• Focus on ordering rather than exact paper
numbers
• Compute in back-end and returning JSON files
• Combine static JSON with rapid sampling
May 31, 2016
Academic Visualization
23
Kim, Albert, et al. "Rapid sampling for visualizations with ordering guarantees." Proceedings of the VLDB Endowment 8.5
(2015): 521-532.
OUTLINE
• Background
• Work Review
• SchoolDrag
• Future work
May 31, 2016
Academic Visualization
24
• Q&A
FUTURE WORK
• Building an interactive visualization for Junxian’s
paper
May 31, 2016
Academic Visualization
25
• Focusing on clustering algorithms at theory group
(how to cook food)
May 31, 2016
Academic Visualization
26
Q&A
THANK
YOU!
May 31, 2016
Academic Visualization