Download logd-swc2010-presentation - data-gov-wiki

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
TWC LOGD:
A Portal for Linking Open
Government Data
Dominic DiFranzo, Li Ding, John S. Erickson, Xian Li, Tim Lebo, James Michaelis,
Alvaro Graves, Gregory Todd Williams, Jin Guang Zheng, Johanna Flores,
Zhenning Shangguan, Gino Gervasio, Deborah L. McGuinness, Jim Hendler
Tetherless World Constellation
Rensselaer Polytechnic Institute
Semantic Web Challenge 2010
Nov 10, 2010
2
The TWC LOGD Portal for SWC2010
Real World Data
 US, UK, China,…
 Health, energy, economy
Semantic Web in Gov Domain
 Major partner of Data.gov
 8.5 billion triples in LOD
End User Applications
 Community Portal
 Fast, Low-cost Mashups
3
“Semantic Web” and RDF logo showed up on the frontpage of the US Data.gov website
4
5
Major Partner of US Data.gov Project
 Semantic Web Tech deployed in Data.gov
 RDF data, SPARQL endpoint, semantic mashups
6
data.gov relaunch
with semantic web
featured
Oct, 2010
SPARQL
End Point
& RDF data
& Demos
Replicated at
Data.gov
May 21, 2010
data.gov online
May, 2010
May 21, 2009
Government Adoption Process
New Application
published by
a team at DOE
Demos
Tutorials
Videos
SPARQL Endpoint
Two-day Mashathon
in Washington DC
Oct, 2010
Data-gov Wiki
@RPI
online
Aug, 2010
2010 …
2009-2010
July,2009
2009
TWC LOGD
Drupal Site
announced
7
The Largest Real World LOD Dataset
 8.5+ billion triples from real world
 7500+ LOD links
 Accessible via Data Browser, e.g. Tabulator
Smoking Prevalence vs. Tax, Policy …
Extensible and accountable Mashups with NCI
http://logd.tw.rpi.edu/project/popscigrid
Trends in Smoking Prevalence, Tobacco Policy
Coverage and Tobacco Prices (1991-2007)
Extensible Mashups via Linked Data
 Diverse datasets from NIH
 Potentially linking to “unemployment rate”
Accountable Mashups via Provenance
 Annotate datasets used in demos
 Feedback users’ comment to gov contact (e.g. %)
8
9
White House Visitor Search
Leveraging linked data (DBpedia & New York Times)
NYTimes
Wikipedia
dbpedia:Barack_Obama
Semantic Wiki
“POTUS”
The White House
 [Person Mashup] Data.gov (statistics) + DBpedia (personal profiles)+ NYTimes (news)
 [Technologies] Semantic MediaWiki, Google Visualization, IPad Apps available in Apple Store
Created by Dominic DiFranzo, Evan Patton, RPI, http://data-gov.tw.rpi.edu/demo/stable/white-house-visitor/top100-visitees.php
Linking GDP of the US and China
Linking international government data meaningfully
GDP of the US (Billion Dollar)
8.3
6.3
2000
2010
GDP of China (Billion Chinese Yuan )
[Temporal Mashup] bea.gov + federalreserve.gov +stats.gov.cn
10
Reaching Open Source Communities
Linking semantic web with web developers
• Social Semantic Web extensions/modules to
popular CMS, e.g. Semantic Wiki, Drupal
• Process/consume integrated gov data in a
number of different ways: social networks, natural
language technologies, workflows, search…
11
12
TWC LOGD Status: Website Statistics
•
•
•
•
•
378,128 page hits
28,481 visits
16,041 visitors
4126 cities
34 countries
Note: the above statistics are about http://data-gov.tw.rpi.edu. Dataset access not counted.
13
Summary of the TWC LOGD Portal
http://logd.tw.rpi.edu
Real World Data
8.5+ billion triples
400+ datasets
10+ sources
 Many domains
Semantic Web Technology
 completely open source
 Demos/tutorials/videos
Community and Users
 partner of US government
 open source community
 education in university
Beyond just dogfood;
Linking Open Government Data Now!
14
The Team and Sponsors
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
Jim Hendler
Deborah L. McGuinness
Li Ding
Dominic DiFranzo
Sarah Magidson
James Michaelis
Alvaro Graves
Jin Guang Zheng
Xian Li
Gregory Todd Williams
Tim Lebo
Zhenning Shangguan
Devin Gaffney
Peter Coons
Adam Bell
William Cooper
Brian Zaik
Johanna Flores
Government Sponsors
DARPA
NSF
NASA
IARPA
NIH/NCI
…
15
Backups
Many countries
• US
• UK
• Australia
• New Zealand
…
June30,2009
2009
Putting
Government
Data online
May 21, 2010
data.gov online
January 19, 2010
“Openness will strengthen
our democracy and promote
efficiency and effectiveness
in Government.”
--- President Obama
May 21, 2009
January 1, 2009
Data.gov and World-Wide Open Government
Data Activities
data.gov relaunch
with semantic web
featured
2010 …
data.gov.uk online
16
17
Data-gov Wiki: Innovations at RPI
The Data-gov Wiki explores and educates the use of semantic web technologies,
esp. linked data, in producing, processing and utilizing government data from data.gov.
40+ Demos
400+ Datasets
Tutorials & Videos
The Data-gov Wiki is run by the Tetherless World Constellation at RPI, headed by Professors Jim Hendler and Deborah McGuinness and led by Li Ding.
Other student team members include: Dominic DiFranzo, Sarah Magidson ,James Michaelis, Alvaro Graves, Adam Bell, Jin Guang Zheng, Xian Li, Tim
Lebo, Gregory Todd Williams, Peter Coons, Zhenning Shangguan, Devin Gaffney, William Cooper, Brian Zaik, and Johanna Flores .
18
Tech: Abstraction and Versioning
Data publishing stages
Conversion
Layer
Version
LOGD
(raw)
OGD (part1)
Snapshot
Table
Source
LOGD
(e1)
…
OGD (part2)
Snapshot
…
Record
Dataset
…
high
Levels of structural data granularity
low
19
Tech: Provenance in LOGD data
Access
Convert
derive
derive
Enhance
revision
Version
SemDiff
derive
20
Consume LOGD data in Semantic Search
Data-gov Semantic Search
http://data-gov.tw.rpi.edu/
XHTML+RDFa
ARC2
Related documents