* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Archive - Chandra X
Entity–attribute–value model wikipedia , lookup
Extensible Storage Engine wikipedia , lookup
Concurrency control wikipedia , lookup
Microsoft Jet Database Engine wikipedia , lookup
Relational model wikipedia , lookup
Clusterpoint wikipedia , lookup
ContactPoint wikipedia , lookup
The Chandra Bibliography Database Arnold Rots, Sherry Winkelman, Sarah Blecksmith, John Bright Chandra Data Archive Operations Group, CXC/SAO Stéphane Paltani Observatoire de Marseille CXC Summary Existing capability Identifiers and automatic linking Extension of the database Attributes Database design Database maintenance Services Conclusion and coming attractions This presentation is adapted from a paper given at ADASS XIII Pages 9-13 are less relevant and provided FYI only The interface on pages 15, 16 is an old version, provided for illustrative purposes only; please try the prerelease, using the URL on page 18 CXC 2004-01-12 Chandra Users Committee: Chandra Data Archive 2 Existing Capability On the part of the archive: – Links from datasets (observations) to articles in the ADS – Scattered links to some specific articles On the part of the ADS: – Links from articles (bibcodes) to datasets in data center archives – General project tags This is very valuable, but also very labor-intensive CXC 2004-01-12 Chandra Users Committee: Chandra Data Archive 3 Existing Capability CXC 2004-01-12 Chandra Users Committee: Chandra Data Archive 4 Identifiers and Automatic Linking The ADS, the data centers, and US journal editors have reached an agreement that will enable authors to insert these links directly in a manuscript Central to such linking are IVOA-compliant dataset identifiers – – – – Namespace: ivo: Authority Id: ADS Data collection Dataset ivo://ADS/Sa.CXO#214 ivo://ADS/Sa.CXO#M31mosaic We will provide services that will enable users to insert these IDs CXC 2004-01-12 Chandra Users Committee: Chandra Data Archive 5 Extension of the Database Originally, the database contained only journal articles and conference proceeding papers that can unambiguously be connected with specific observations, plus an amorphous collection of papers that are “Chandra-related” Extension of subject categorization: – – – – – Referring to specific observations Referring to published results Predicting Chandra results Referring to instrumentation, software, or operations Other Inclusion of all other types of publications (except preprints!) CXC 2004-01-12 Chandra Users Committee: Chandra Data Archive 6 Attributes Subject – Observations, instruments, software, operations Kind of publication – Book, journal, proceedings, thesis, circular, review, newsletter, internal Type of publication – Article, abstract, memo, data, erratum, article (abstract only available), title only, electronic Number of citations Keywords (standard ApJ as well as custom) A variety of other items – Date of publication, refereed or not, etc. CXC 2004-01-12 Chandra Users Committee: Chandra Data Archive 7 Database Design ObsId 1 0+ 1+ 1+ BibTable 1+ 1+ 1 1 Subjects 1 1 Observation Catalog 0+ 0+ Datasets 1 1 Keywords 0,1 1 URLs Proposals 1+ DatasetObsIds 1 Std Keywords CXC 2004-01-12 Chandra Users Committee: Chandra Data Archive 8 Database Maintenance Management of new entries through a dedicated database: – Automatic filling of BibWork – Attributes filled in through GUI – Migrate entries to BibTable upon completion – The Datasets and DatasetObsIds table are common with the main database BibWork ObsId Datasets DatasetObsIds Automatic updating of number of citations Automatic check on validity of bibcodes CXC 2004-01-12 Chandra Users Committee: Chandra Data Archive 9 Database Maintenance Interface Filling the database CXC 2004-01-12 Chandra Users Committee: Chandra Data Archive 10 Database Maintenance Interface Checking the paper CXC 2004-01-12 Chandra Users Committee: Chandra Data Archive 11 Database Maintenance Interface Set attributes CXC 2004-01-12 Chandra Users Committee: Chandra Data Archive 12 Database Maintenance Interface Establish proposal links CXC 2004-01-12 Chandra Users Committee: Chandra Data Archive 13 Services Exchange of information with ADS: harvesting of Bibcode – Dataset Identifier pairs in both directions Provide access to datasets through either a Dataset Identifier or a Bibcode Provide information to ADS on Bibcodes that are not related to specific observations Provide access to publications through queries from our archive; see next page and: http://cxc.harvard.edu/cgi-gen/cda/bibliography.cgi Derive metrics through queries (standardized as well as custom; see Paul Green’s presentation) CXC 2004-01-12 Chandra Users Committee: Chandra Data Archive 14 Services Literature search from the archive (shown here is an old version) CXC 2004-01-12 Chandra Users Committee: Chandra Data Archive 15 Services (Previous page) A simple query example: find all publications related to Chandra Crab observations This renders 4 articles – be aware that there may be more (e.g., meeting abstracts!) that could not be traced to specific observations The bibcodes link to the abstracts in the ADS (This page) The link to the ADS provides more details on all four papers CXC 2004-01-12 Chandra Users Committee: Chandra Data Archive 16 Database Metrics (Oct 2003) All Articles Category 1999 2000 2001 2002 Referee d only 2003 Total No. Cit. Total No. Cit. Observations 53 284 485 485 352 1659 5639 712 5597 Refer to obs. 9 94 333 499 322 1257 5300 897 5231 Instr., etc. 34 141 124 69 18 386 1362 354 1355 Predict result 11 67 21 14 21 135 306 22 296 Unclassified 15 90 70 29 40 244 663 118 650 122 676 1033 1097 753 3681 13270 2103 13129 1011 2507 2735 2758 1859 10870 Total Reviewed CXC 2004-01-12 Chandra Users Committee: Chandra Data Archive 17 Conclusion We have developed a comprehensive database that is capable of tracking all mission-related publications and preserving all relevant information Added to this are a database and GUI that make maintenance (i.e., data entry) as painless as possible Services include cross-linking with the ADS, a powerful literature search from the Chandra archive, and metrics The entire package is reasonably mission independent and we are happy to provide it to other data centers Try the new interface at: http://cxc.harvard.edu/cgi-gen/cda/bibliography.cgi CXC 2004-01-12 Chandra Users Committee: Chandra Data Archive 18 Coming Attractions FITS keyword database: an interactive web-based tool that allows users to look up the meaning and use of all keywords used in CXC FITS files, and to construct compliant headers Special Requests: a web-based tool that allows users to make special data requests, backed up by a database that tracks the status of these requests – Request for previous data versions – Request for special processing – Request for data on physical medium – Request for custom database query – Anything else (reasonable) CXC 2004-01-12 Chandra Users Committee: Chandra Data Archive 19