The Chandra Bibliography Database
Arnold Rots, Sherry Winkelman,
Sarah Blecksmith, John Bright
Chandra Data Archive Operations Group, CXC/SAO
Stéphane Paltani
Observatoire de Marseille
CXC
Summary
 Existing capability
 Identifiers and automatic linking
 Extension of the database
 Attributes
 Database design
 Database maintenance
 Services
 Conclusion and coming attractions
 This presentation is adapted from a paper given at ADASS XIII
 Pages 9-13 are less relevant and are provided FYI only
 The interface on pages 15 and 16 is an old version, provided for illustrative purposes only; please try the prerelease, using the URL on page 18
CXC
2004-01-12
Chandra Users Committee: Chandra Data Archive
2
Existing Capability
 On the part of the archive:
– Links from datasets (observations)
to articles in the ADS
– Scattered links to some specific
articles
 On the part of the ADS:
– Links from articles (bibcodes) to
datasets in data center archives
– General project tags
 This is very valuable, but also very
labor-intensive
3
Existing Capability
4
Identifiers and Automatic Linking
 The ADS, the data centers, and US journal editors have reached
an agreement that will enable authors to insert these links
directly in a manuscript
 Central to such linking are IVOA-compliant dataset identifiers
– Namespace: ivo:
– Authority Id: ADS
– Data collection
– Dataset
Examples:
ivo://ADS/Sa.CXO#214
ivo://ADS/Sa.CXO#M31mosaic
 We will provide services that will enable users to insert these IDs
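The structure of these identifiers can be illustrated with a small parser. This is a sketch only: the function name `parse_dataset_id` and the field names are hypothetical, and the split into authority, data collection, and dataset is inferred from the two example IDs on this page rather than taken from the IVOA specification.

```python
from typing import NamedTuple


class DatasetId(NamedTuple):
    authority: str    # e.g. "ADS"
    collection: str   # e.g. "Sa.CXO"
    dataset: str      # e.g. "214" or "M31mosaic"


def parse_dataset_id(identifier: str) -> DatasetId:
    """Split an identifier of the assumed form ivo://<authority>/<collection>#<dataset>."""
    prefix = "ivo://"
    if not identifier.startswith(prefix):
        raise ValueError(f"not in the ivo: namespace: {identifier!r}")
    # Everything after '#' names the dataset within the collection.
    path, hash_sep, dataset = identifier[len(prefix):].partition("#")
    # The first path segment is the authority; the rest is the collection.
    authority, slash_sep, collection = path.partition("/")
    if not (hash_sep and slash_sep and authority and collection and dataset):
        raise ValueError(f"malformed dataset identifier: {identifier!r}")
    return DatasetId(authority, collection, dataset)


print(parse_dataset_id("ivo://ADS/Sa.CXO#214"))
print(parse_dataset_id("ivo://ADS/Sa.CXO#M31mosaic"))
```

A service that accepts IDs from manuscripts would likely want stricter validation than this; the sketch only shows how the four parts fit together.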
5
Extension of the Database
 Originally, the database contained only journal articles and
conference proceeding papers that can unambiguously be
connected with specific observations, plus an amorphous
collection of papers that are “Chandra-related”
 Extension of subject categorization:
– Referring to specific observations
– Referring to published results
– Predicting Chandra results
– Referring to instrumentation, software, or operations
– Other
 Inclusion of all other types of publications (except preprints!)
6
Attributes
 Subject
– Observations, instruments, software, operations
 Kind of publication
– Book, journal, proceedings, thesis, circular, review, newsletter, internal
 Type of publication
– Article, abstract, memo, data, erratum, article (abstract only available),
title only, electronic
 Number of citations
 Keywords (standard ApJ as well as custom)
 A variety of other items
– Date of publication, refereed or not, etc.
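As an illustration, the attributes listed above could be modeled as a single record type. This is a minimal sketch, not the actual schema: the class and field names are hypothetical, the enumeration values are copied from the lists on this page, and the bibcode and keyword in the example are placeholders.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import List, Optional


class Subject(Enum):
    OBSERVATIONS = "observations"
    INSTRUMENTS = "instruments"
    SOFTWARE = "software"
    OPERATIONS = "operations"


class Kind(Enum):
    BOOK = "book"
    JOURNAL = "journal"
    PROCEEDINGS = "proceedings"
    THESIS = "thesis"
    CIRCULAR = "circular"
    REVIEW = "review"
    NEWSLETTER = "newsletter"
    INTERNAL = "internal"


class PubType(Enum):
    ARTICLE = "article"
    ABSTRACT = "abstract"
    MEMO = "memo"
    DATA = "data"
    ERRATUM = "erratum"
    ARTICLE_ABSTRACT_ONLY = "article (abstract only available)"
    TITLE_ONLY = "title only"
    ELECTRONIC = "electronic"


@dataclass
class BibEntry:
    bibcode: str
    subjects: List[Subject]
    kind: Kind
    pub_type: PubType
    refereed: bool
    citations: int = 0                      # updated automatically later
    keywords: List[str] = field(default_factory=list)
    pub_date: Optional[str] = None          # format left open in this sketch


# Placeholder values only; not a real publication.
entry = BibEntry(
    bibcode="PLACEHOLDER-BIBCODE",
    subjects=[Subject.OBSERVATIONS],
    kind=Kind.JOURNAL,
    pub_type=PubType.ARTICLE,
    refereed=True,
    keywords=["example keyword"],
)
print(entry)
```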
7
Database Design
[Diagram: entity-relationship schema connecting the tables BibTable, ObsId, Subjects, Observation Catalog, Datasets, DatasetObsIds, Keywords, Std Keywords, URLs, and Proposals; the links are labeled with cardinalities 1, 0,1, 0+, and 1+]
8
Database Maintenance
 Management of new entries through a dedicated database:
– Automatic filling of BibWork
– Attributes filled in through GUI
– Entries migrated to BibTable upon completion
– The Datasets and DatasetObsIds tables are shared with the main database
 Automatic updating of number of citations
 Automatic check on validity of bibcodes
[Diagram: the BibWork table with links to ObsId, Datasets, and DatasetObsIds]
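The staging workflow (fill BibWork, complete the attributes, then migrate to BibTable) might look roughly like the following. This is a sketch under stated assumptions: only the table names BibWork and BibTable come from the slides; the columns, the `complete` flag, the function `migrate_completed`, and the use of SQLite are all hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Hypothetical columns; the real tables carry many more attributes.
    CREATE TABLE BibWork  (bibcode TEXT PRIMARY KEY, subject TEXT,
                           complete INTEGER NOT NULL DEFAULT 0);
    CREATE TABLE BibTable (bibcode TEXT PRIMARY KEY, subject TEXT);
""")


def migrate_completed(conn: sqlite3.Connection) -> int:
    """Move entries marked complete from BibWork to BibTable; return the count."""
    with conn:  # commit on success, roll back if any statement fails
        (count,) = conn.execute(
            "SELECT COUNT(*) FROM BibWork WHERE complete = 1").fetchone()
        conn.execute(
            "INSERT INTO BibTable (bibcode, subject) "
            "SELECT bibcode, subject FROM BibWork WHERE complete = 1")
        conn.execute("DELETE FROM BibWork WHERE complete = 1")
    return count


# Placeholder entries: one finished in the GUI, one still in progress.
conn.executemany(
    "INSERT INTO BibWork (bibcode, subject, complete) VALUES (?, ?, ?)",
    [("PLACEHOLDER-A", "observations", 1), ("PLACEHOLDER-B", None, 0)],
)
print(migrate_completed(conn), "entry migrated")
```

Doing the copy and the delete in one transaction keeps an entry from appearing in both tables, or in neither, if the migration is interrupted.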
9
Database Maintenance Interface
Filling the database
10
Database Maintenance Interface
Checking the paper
11
Database Maintenance Interface
Set attributes
12
Database Maintenance Interface
Establish proposal links
13
Services
 Exchange of information with ADS: harvesting of Bibcode –
Dataset Identifier pairs in both directions
 Provide access to datasets through either a Dataset Identifier or
a Bibcode
 Provide information to ADS on Bibcodes that are not related to
specific observations
 Provide access to publications through queries from our archive;
see next page and:
http://cxc.harvard.edu/cgi-gen/cda/bibliography.cgi
 Derive metrics through queries (standardized as well as custom;
see Paul Green’s presentation)
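Access through either a Dataset Identifier or a Bibcode amounts to indexing the harvested pairs in both directions. The following is a minimal in-memory sketch; the function name `build_indexes` is hypothetical, and the bibcodes are placeholders rather than real publications.

```python
from collections import defaultdict
from typing import DefaultDict, Iterable, Set, Tuple

Index = DefaultDict[str, Set[str]]


def build_indexes(pairs: Iterable[Tuple[str, str]]) -> Tuple[Index, Index]:
    """Index (bibcode, dataset_id) pairs so either key finds the other."""
    by_bibcode: Index = defaultdict(set)
    by_dataset: Index = defaultdict(set)
    for bibcode, dataset_id in pairs:
        by_bibcode[bibcode].add(dataset_id)
        by_dataset[dataset_id].add(bibcode)
    return by_bibcode, by_dataset


# Placeholder bibcodes paired with the example dataset IDs from this talk.
pairs = [
    ("PLACEHOLDER-A", "ivo://ADS/Sa.CXO#214"),
    ("PLACEHOLDER-A", "ivo://ADS/Sa.CXO#M31mosaic"),
    ("PLACEHOLDER-B", "ivo://ADS/Sa.CXO#214"),
]
by_bibcode, by_dataset = build_indexes(pairs)
print(sorted(by_dataset["ivo://ADS/Sa.CXO#214"]))
print(sorted(by_bibcode["PLACEHOLDER-A"]))
```

In the database itself the same two-way lookup would presumably be a join through a link table rather than two dictionaries.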
14
Services
Literature search from the archive (shown here is an old version)
15
Services
 (Previous page)
 A simple query example: find all
publications related to Chandra
Crab observations
 This returns 4 articles; be aware that there may be more (e.g., meeting abstracts!) that could not be traced to specific observations
 The bibcodes link to the abstracts
in the ADS
 (This page)
 The link to the ADS provides more
details on all four papers
16
Database Metrics (Oct 2003)
                  All articles                                        Refereed only
Category          1999   2000   2001   2002   2003   Total  No. Cit.  Total  No. Cit.
Observations        53    284    485    485    352    1659      5639    712      5597
Refer to obs.        9     94    333    499    322    1257      5300    897      5231
Instr., etc.        34    141    124     69     18     386      1362    354      1355
Predict result      11     67     21     14     21     135       306     22       296
Unclassified        15     90     70     29     40     244       663    118       650
Total              122    676   1033   1097    753    3681     13270   2103     13129
Reviewed          1011   2507   2735   2758   1859   10870
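One simple derived metric is the number of citations per article in each category. The sketch below uses the "All articles" totals and citation counts from the October 2003 table; the function name `citations_per_article` is hypothetical, and this is not necessarily one of the standardized metrics mentioned earlier.

```python
from typing import Dict, Tuple

# (total articles, total citations) per category, all articles, Oct 2003.
ALL_ARTICLES: Dict[str, Tuple[int, int]] = {
    "Observations": (1659, 5639),
    "Refer to obs.": (1257, 5300),
    "Instr., etc.": (386, 1362),
    "Predict result": (135, 306),
    "Unclassified": (244, 663),
}


def citations_per_article(rows: Dict[str, Tuple[int, int]]) -> Dict[str, float]:
    """Return the mean number of citations per article for each category."""
    return {name: cites / articles for name, (articles, cites) in rows.items()}


for name, ratio in citations_per_article(ALL_ARTICLES).items():
    print(f"{name:15s} {ratio:5.2f}")
```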
17
Conclusion
 We have developed a comprehensive database that is capable
of tracking all mission-related publications and preserving all
relevant information
 Added to this are a database and GUI that make maintenance
(i.e., data entry) as painless as possible
 Services include cross-linking with the ADS, a powerful literature
search from the Chandra archive, and metrics
 The entire package is reasonably mission-independent, and we are happy to provide it to other data centers
 Try the new interface at:
http://cxc.harvard.edu/cgi-gen/cda/bibliography.cgi
18
Coming Attractions
 FITS keyword database:
an interactive web-based tool that allows users to look up the meaning
and use of all keywords used in CXC FITS files, and to construct
compliant headers
 Special Requests:
a web-based tool that allows users to make special data requests,
backed up by a database that tracks the status of these requests
– Request for previous data versions
– Request for special processing
– Request for data on physical medium
– Request for custom database query
– Anything else (reasonable)
19