Download specific data types - Tetherless World Constellation

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Big data wikipedia , lookup

Data Protection Act, 2012 wikipedia , lookup

Data center wikipedia , lookup

Data model wikipedia , lookup

Database model wikipedia , lookup

Forecasting wikipedia , lookup

Data analysis wikipedia , lookup

Data vault modeling wikipedia , lookup

3D optical data storage wikipedia , lookup

Information privacy law wikipedia , lookup

Business intelligence wikipedia , lookup

Transcript
Data types and persistent identifiers in
the Deep Carbon Observatory Data Portal
Xiaogang (Marshall) Ma ([email protected]), John Erickson, Patrick West, Stephan Zednik, Peter Fox
Tetherless World Constellation, Rensselaer Polytechnic Institute, 110 8th Street, Troy, NY, USA
Background
The Research Data Alliance (RDA) - Data Type
Registry (DTR) working group addressed a core
issue of data interoperability: to parse,
understand, and potentially reuse data
retrieved from others. The RDA - Persistent
Identifier Information Types (PIT) working
group addressed the essential types of
information associated with persistent
identifiers (PID). The Deep Carbon Observatory
(DCO; http://deepcarbon.net) Data Portal
enables centrally-managed digital object
identification, object registration and metadata
management. DCO anticipated a large number
of digital object registrations, and therefore
needed an appropriate mechanism to curate
and reuse the registered information.
Initial Results
Updates to the DCO Ontology:
• A new class dco:DataType. Each specific data type is an instance of it
• An object property dco:hasDataType linking a dataset and a data type
• A collection of other classes and properties associated with dco:DataType
Research Questions
•
•
Each registered data type
needs a stable & resolvable
PID
Annotate data types with
formal & specific semantics
Each registered object, such as a
dataset or a data type, has a unique
identifier called DCO ID, which
implements the global Handle System.
(Image credit: Ainsley Seago, PLoS Biology)
Nature of efforts
• The primitives in a DTR are comparable to a
list of BASIC DATA TYPE classes in the
DCO ontology, such as Dataset, Image,
Video, and Audio, etc.
• A registered DCO dataset is regarded as an
instance of one of those basic data type
classes.
• It is possible to further annotate a registered
dataset with the SPECIFIC DATA TYPES
defined within a DTR, and each data type
has a unique PID.
Scan to get a copy
of the poster:
Future Works
• Analyze more use cases
relevant to data types in the
DCO community
• Refine the schema for the
annotation and provenance of
specific data types
• Interoperability between DCO
specific data types and data
types registered in other DTRs