Download statistics - VIVA, The Virtual Library of Virginia

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Data assimilation wikipedia , lookup

Time series wikipedia , lookup

Transcript
Statistics on the Web: VIVA and
Beyond
Donna Tolson
Scholars’ Lab
University of Virginia Library
Virginia State University & Radford University
June 6 & 7, 2007
Scholars’ Lab
Today’s topics
General stuff about statistical
information
Finding & using
domestic statistics
Finding & using
international statistics
Statistics on the Web:
VIVA and Beyond
Scholars’ Lab
Texts vs. Statistics
• Qualitative
• Database = collection
of texts or citations
• Key word
• Rarely need metadata
• Majority published
privately
Statistics on the Web:
VIVA and Beyond
• Quantitative
• Database = collection
of observations
• Variable
• Usually need metadata
• Majority published
publicly
Scholars’ Lab
Statistics vs. Data
• Usually aggregated
• Count, %, µ, x:y, S,
ranking
• The “quick answer”
• WYSIWYG
• For use as context,
evidence, or
argument
Statistics on the Web:
VIVA and Beyond
• Usually microdata
• Matrix or spreadsheet
of values
• Never a “quick answer”
• Exists to be analyzed
• For research, testing
hypotheses
Scholars’ Lab
A few things to know….
• Statistics come from data about populations
(enumeration) or samples (survey).
• The bigger the sample, the more reliable the
statistics.
• Aggregated statistics generally can’t be disaggregated, or filtered. This is what
microdata is for.
• Confidentiality is strictly enforced – affects
access to statistics about small populations
and microdata.
Statistics on the Web:
VIVA and Beyond
Scholars’ Lab
The statistical reference interview
How will the information be used?
Context = statistics vs. Analysis = dataset
Statistics about what or who?
Person, household, country, etc. = unit of analysis
When?
Most current vs. time series
Where?
Geographic level as well as precise location,
Single vs. multiple locations
Beware the data fallacy:
Just because someone asks the question doesn’t mean there is an answer!
Statistics on the Web:
VIVA and Beyond
Scholars’ Lab
Locating statistics
• In library catalogs, search for specific variable (e.g.
income) AND statistics
• Web sources may not be cataloged – time to Google
• Work backwards from data sources cited in articles
• Consult comprehensive sources:
– University web sites: Michigan, Virginia
– Government web sites: FedStats
– Published statistical databases: LexisNexis
Statistics on the Web:
VIVA and Beyond
Scholars’ Lab
Domestic Statistics:
Government sources
• Statistics available on most anything regulated
or funded by government
• Shrinking number of pre-analyzed statistics
• Growing number of interactive data sites
• Rarely, government withdraws data – ex.
immigration statistics
• Data you won’t find in government sources:
– Religion data
– Market-based information (who buys what)
– Polling data or opinion surveys
Statistics on the Web:
VIVA and Beyond
Scholars’ Lab
Interactive data sites
+
+
+
+
+
PROs
Increased access to
large datasets
Allows selection of
variables
Customizable queries
Automatically handles
weighting, etc.
Can omit need for
statistical software
Statistics on the Web:
VIVA and Beyond
–
–
–
–
–
CONs
No standard system
Multi-step process
May use short,
confusing variable
names
Rarely includes
analysis
Not for statistical
novices
Scholars’ Lab
Locating U.S. statistical info
• Begin with StatUSA or FedStats or U.S.
Statistical Abstract
• Is geography or subject more important?
Search accordingly.
• If you’re in search of a few statistics,
reports are fine. If you’re in search of many
statistics or data, skip pdfs and look for
online or downloadable data (unless you’re
looking for citations of data sources).
Statistics on the Web:
VIVA and Beyond
Scholars’ Lab
Federal sites
worth getting to know
 American Factfinder for demographics,
economics & housing
 CDC for health data and vital statistics
 BJS for crime & law enforcement
 NCES for education
 BLS for labor force and employment
Statistics on the Web:
VIVA and Beyond
Scholars’ Lab
Locating historical statistics
• Generally, online data only available from
mid-1990s
• Selected sources for historical data:
• FedStats – look for historical statistics links
• Census
• Selected historical data and publications
• Historical Census Browser
• Be ready to help patrons navigate off the
web and into print Documents (oh no!!)
Statistics on the Web:
VIVA and Beyond
Scholars’ Lab
International Statistics
• Several comprehensive sources
• Most countries have statistical agencies, but few
have as much free information as U.S.
• “Global” statistics are part of the data fallacy –
generally an aggregate of as many countries as
possible is the best you can do
• Use IE
Statistics on the Web:
VIVA and Beyond
Scholars’ Lab
Comprehensive Sources
• UN Common Database (274 countries & areas)
• Organization for Economic Cooperation and
Development (30 member countries)
• Eurostat (EU member countries)
• World Bank (185 member countries)
• Nationmaster
Statistics on the Web:
VIVA and Beyond
Scholars’ Lab
Country-specific Sources
Most other countries have one or two agencies in
charge of producing statistical information
•
•
BLS Statistical Sites on the Worldwide Web
Univ. of Auckland Official Statistics on the Web
Remember to:
1. use IE
2. look for English translation (can be tricky!)
Statistics on the Web:
VIVA and Beyond
Scholars’ Lab
Locating NGO statistics
• Public opinion surveys & polls:
• World Public Opinion
• Some polling archives offer some data for free:
• The Gallup Poll – exit polls
• Roper Center for Public Opinion – presidential
approval ratings
• Statistics & data on religion:
• Association of Religion Data Archives
• Adherents.com
• Market-based data
Statistics on the Web:
VIVA and Beyond
Scholars’ Lab
Thank you!
Donna Tolson
[email protected]
Statistics on the Web:
VIVA and Beyond