Download ppt - ResistVir

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
ResistVir-Db
The database of
ResistVir European Project
Co-ordination of Research on Genetic
Resistance to Plant Pathogenic Viruses, and
their Vectors, in European Crops
Work Package no 2
Survey and database of European resources
and centres of activity
Organization
Scientists and
other DB users
Computer staff
• Steering committee
• DB development team
• WP2 participants
• Quadrate DM support
• Consortium members
• Consortium members
• Professional users
• Others
• Public users
Technical choices
• PostgreSQL
open source object-relational database
management system
http://www.postgresql.org
• eZ publish
open source content management system and
development framework
http://ez.no
• Apache
open source web server
http://www.apache.org
• Linux
An Object-Relational Database
• User’s view :
– Tables
rows = instances of objects of the same type
columns = fields for this type of data
rows can be ordered by fields values
columns can be chosen and ordered
– Forms (pages)
one instance per page
all available fields
– Links
a table row -> an instance form
an instance form field -> another instance form
an instance form URL field -> a Web page
Database Web Query Interface (1)
• Open a session
– As anonymous
– As Consortium member
• Create a set of requests
– Single criterion request
• Choose type of data
• Choose criterion
• Define operator and values
– Combination of 2 requests
• Choose 2 requests (same type of data)
• Choose operators
– Query execution -> count of instances
Database Web Query Interface (2)
• Display the set of instances corresponding
to a request
– Choose columns (or default columns)
– Choose instances order
– Jump scrolling
• Export data (.csv)
• Display specific instance form
• Display links …
A Multilingual Database
• Default language : English
– All reference data and interface terms in
English
– All questionnaires in English
• Other languages
Acceptance criteria for another language :
– Provide reference data and all interface terms
– Second set of questionnaires
• Technical choices :
– ISO 639 2-letters language code
– Encoding system : UTF-8
Which data ?
References
• Main Reference Entities
–
–
–
–
–
–
–
–
Disease
Virus
Vector
Plant : species, populations, lines, …
Genetic resistance
Locus
QTL
Gene
• Some are also research topics
Which data ?
• Centres of Activity
Centres (1)
– Organization, Research Unit, …, Team
– Dependency links
– Description : acronym, full name, type of centre,
address, postal code, city, country, phone, fax,
email, url
– Staff (whole centre) : total, permanent, total
scientists, total permanent scientists, PhD
students
– Staff (ResistVir fields) : total, permanent, total
scientists, total permanent scientists, PhD
students
Which data ?
Centres (2)
• Centres of Activity
–
–
–
–
–
–
–
–
–
Persons
Studied viruses
Studied plants
Studied vectors
Studied diseases
Studied genetic resistances
Studied genes
Research topics
Know-how and technologies
Which data ?
•
•
•
•
•
Research results
Main research results
Main published bibliographic references
Thesis
Other documents
Patents
Which data ?
•
•
•
•
•
Plant resources
Diagnosis services
Biotechnological resources
Agrochemical resources
Agricultural resources
Tools
Links with ontologies
• ISO Codes (countries, languages, …)
• Structured controlled vocabularies
– Our own data …
– … linked whenever possible with existing ontologies
databases (GO, TO, PO, taxon, …)
• When ontologies databases don’t exist today :
– Simple reference tables
– Creation of new ontologies ?
Identifiers
• General Identifier
– Identify one instance whatever its type
– http://www.resistvir-db.org/query?
id=rstvr:0000123&lang=ES
• Specific Identifier
– Identify both data type and instance
– http://www.resistvir-db.org/query?
id=GR0000067&lang=CA
Data Access Policy
• Public
– Some instances will be hidden
– Some fields will be hidden
• Consortium
– Personal accounts with password (and term ?)
– All (validated ?) instances
– All fields
• Private
• Project
– Consortium view + specific data
• DBA
Origin of Data
• Initial loading period
– Questionnaires (spreadsheet .sxc, .xls, .csv)
– Specific files from existing databases
– Batch loading
• Then
– Questionnaires with batch loading
– Web Update Interface (?)
• only by authorized for update accounts
• only for data of each one’s organization
Data Validation Policy
• Restrictive policy
– No data published without Committee
agreement
– An a priori control
– Least update requires new validation
• Laxist policy
–
–
–
–
Data published under provider responsibility
An a fortiori control
Updated instances remain validated
Good practice agreement
State Transition Diagram
Object current and new versions
Consortium
submitted
Private
current
version
Current and
private new
versions
Consortium validated
Dismissed
Public submitted
Public validated
Data Traceability
• Minimal level
– Person and organization responsible for each
instance
– Creation and last update dates
• Increased traceability
– Description of all state change events
• Maximal traceability
– Record all consecutive versions of each instance
Related documents