Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Data Protection Act, 2012 wikipedia , lookup
Entity–attribute–value model wikipedia , lookup
Data center wikipedia , lookup
Clusterpoint wikipedia , lookup
Data analysis wikipedia , lookup
Forecasting wikipedia , lookup
3D optical data storage wikipedia , lookup
Information privacy law wikipedia , lookup
Data vault modeling wikipedia , lookup
Database on Oceanography and Marine Ecosystems Janus Larsen, E-mail: [email protected] ICES, H. C. Andersens Boulevard 44-46, DK-1553, Copenhagen V. Tel: 33386735. Why? What? DOME is a relational database to store oceanographic, environmental and fisheries data at the ICES Data Centre, supporting a marine ecosystem approach at the data level. Phase 1 of the development is ongoing and it includes biota, fish disease, sediment and water bottle data. In phase 2 high-resolution oceanographic data such as CTD data and biological community data will be included. Some fisheries data will be considered for a 3rd phase of DOME’s development. User requirements The primary advantage of an integrated multidisciplinary database is facilitation of the ecosystem approach. For example, providing data for an investigation of biological data of a specific area in relation to the physical environment such as fish disease data in oxygen depleted areas. Another major advantage is easier access to data from many disciplines through a single interface. Data Centre requirements A driving force is lower maintenance costs. Operating several diverse database and coding systems is expensive. By integrating our databases, ICES lowers operating costs while gaining staff flexibility through harmonization of databases as well as data processing. How? Data structure Codes Administrative tables (User definitions & settings) Position, platform, country Etc. Code request DOME interface Code return Code lookup Stored procedure Code return Water bottle data Biota data Sediment data Definitions: Codes, methods, owner, release date, etc. Measurements comprise DOME’s middle level. They are organised into separate structures corresponding to each of the major data categories. The separate structures are necessary due to the different natures of the data categories and their respective difference in types of data. DOME Result Request new code Look up Unknown code The data structure includes 3 common parts; • top level: time, position, sampling platform, etc. • bottom level: definitions of codes, methods, access, etc. • system level: administrative tables. Standard codes and handling of data from different scientific communities is essential for integration. Relevant multi-disciplinary codes are implemented. However, specialized codes have been incorporated as necessary. Code table DOME draws on 2 standard code catalogues, the BODC data dictionary and the Integrated Taxonomic Information System (ITIS). Specific ICES codes have been standardized in our RECO database. Add code Code fetch ITIS RECO BODC DD Codes are stored in master databases which serves all ICES databases including DOME. For optimal performance, relevant codes are kept up-to-date by automated replication to DOME. Types Example Reporting format and fixed field databases. DEPHU PERCE SIGND UNCRT DETLI LMQNT 35 NULL NULL 0.4 0.001 NULL Type value DEPHU 35 UNCRT 0.4 DETLI 0.001 DOME is normalized with Types and Values The database is normalized by type/value combinations. This makes the database very flexible. New data types and meta-data can be stored without any structural changes to the database and without any adjustment of the database interfaces. This approach also eliminates most null values. Reporting format The example to the right illustrates the use of codes and types/values to store data from a measurement of mercury in a muscle sample. Species information is stored higher up in the hierarchy. The essential data, namely, the value, the parameter code and the matrix code, are stored in fixed fields. The rest of the data are either linked to the measurement via a many-to-many relation to a code table (blue fields) or stored in a ‘types/values’ table which is linked to the measurement table (red fields). It is easy to add new types of information. If necessary, new definitions are added to the respective common table, and the measurement can then be associated with those definitions – no changes to the database structure are needed. MATRX Matrix analysed MU DEPHU Pressure/depth 35 PARAM Parameter code HG MUNIT Measurement unit BASIS Basis of determination Wet AMLNK Analytical methods link 1 VFLAG Validity flag QFLAG Qualifier flag < VALUE Value 12.3 PERCR Percentage recovery SIGND Significant digits UNCRT Uncertainty value .4 METCU Method of calculating uncertainty SD DETLI Limit of detection value .001 LMQNT Limit of quantification DOME Types & values Codes Where? DOME will be equipped with 2 interfaces: an internal interface for handling data input and output, and a web interface to allow searching and downloading data from the ICES website. Although the interfaces are separate, they use the same search and export code as shown in the figure to the right. This object-oriented design allows easy maintenance of the system. Firewall DOME interface Internal interface The purpose of the internal interface is to promote efficient data import and export within the ICES secretariat. The internal interface will feature facilities such as import and update of data, assignment of access rights. DOME web interface VB.NET windows application Web interface ASP.NET application DOMELib VB.NET dll The web interface will provide users with facilities to download data directly from the ICES website. Various levels of data restriction will be enforced: some data will be freely available, whereas others will be password protected with various degrees of restriction, including explicit permission of the data owners. Eventually, the web interface will be extended with a range of data products, see the Gantt chart under ‘When?’. DOME MS SQL server Conclusion The DOME database design succeeds in unifying access to multidisciplinary data, and will facilitate integrated data products for use in implementing an ecosystem approach. DOME harmonizes structural aspects as well as many common elements of the data such as code definitions. However, the choice of coding systems, the application of meta-data, and the approach to data quality assurance are still data category-dependent. These differences are, to a large degree, due to different traditions in different scientific communities. DOME is being developed in 3 phases: • Phase 1 (ongoing): develop database and interfaces for biota, sediment and water bottle data. • Phase 2. Add high resolution oceanographic data and biological community data and modify the interfaces accordingly. Add data plot to web interface. • Phase 3. Include selected fisheries data (under consideration). When?