Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
GBIF Global Biodiversity Information Facility Background Presentation for Cyberinfrastructure for Environmental Research & Education Workshop Boulder, Colorado 31 October 2002 Meredith Lane Communications Officer, GBIF [email protected] www.gbif.org WCMC ABI GenBank EMBL WDCM FishNet CONABIO INBio AETFAT ANGIS NGO & science-driven resources and networks, mid-late 1990s EIOnet NBII NABIN IABIN CONABIO INBio ERIN Politically-driven content networks, mid-late 1990s MEGASCIENCE FORUM of the OECD (became Global Science Forum after the GBIF recommendation was adopted) Examples of Working Groups: Neutron Sources Nuclear Physics Radio Astronomy Biological Informatics (1996–1999) • Subgroup : Biodiversity Informatics • Subgroup : Neuroinformatics Recommended that the Megascience Forum endorse development of the Global Biodiversity Information Facility Nature IS Networks EIOnet WCMC ABI NBII NBII GenBank NABIN EMBL WDCM FishNet IABIN CONABIO CONABIO AETFAT INBio INBio ERIN ANGIS GBIF vision Characteristics of this information domain: Biodiversity Itself Developed World Developing World Biodiversity Information RATIONALE for a global biodiversity informatics effort The biodiversity information domain is vast, complex, and critically important to society. However, most existing biodiversity data and information are not at present dynamically accessible, and therefore are not yet fully useful. Recent technological and political developments provide exciting research opportunities in biodiversity informatics. GBIF What is GBIF? GBIF • An international scientific co-operative project based on a multilateral agreement (MoU) between countries and international organisations, dedicated to: • with links to molecular, genetic, ecological data levels • establishing an interoperable, distributed network of databases containing scientific biodiversity information, in order to: • make the world’s scientific biodiversity data universally and freely available via the internet, • with initial focus on species- and specimen-level data, and GBIF PRINCIPLES • • • • • • • • Equitable sharing of information and its benefits Providers maintain and upgrade their own data Public-private partnerships User-driven Transparent Best-practice use of technologies, standards and procedures Protection of intellectual property rights International and inter-institutional coordination GBIF The Market for GBIF’s Products: • Biodiversity knowledge management at the global level • • Global biodiversity policy • Biological research support at global, regional, national and local levels Natural resource management at regional and national levels GBIF Relationships GLOBAL Global Biodiversity Information Facility (GBIF) Clearinghouse Mechanism (CHM) REGIONAL European Information Organization Network (EIOnet) Inter-American Biodiversity Information Network (IABIN) North American Biodiversity Information Network (NABIN) NATIONAL LOCAL CBIN (Canada) CONABIO (Mexico) ERIN (Australia) InBIO (Costa Rica) NBII (U.S.A.) Many examples: Natural History Museum Collections, research databases, etc. GBIF Relationships GBIF Participant Nodes GOVERNMENT Clearinghouse Mechanism (CHM) WCMC NCEAS ORGANIZATION NatureServe ETI BIOSIS CABI COMMUNITY Species 2000 TDWG OBIS GenBank Protein DataBank RESEARCH FishBase FlyBase HerpNet Characteristics of a Megascience Effort Something that cannot be undertaken by only one country expense no one country has access to all the data Some components of the research can be done at the national or regional levels, but some must be truly global Usually infrastructural in nature (e.g. CERN) Involves Public-Private Partnerships Scientists do some research in their own laboratories and some in the megafacility Involves collaboration among many scientists and others The topic is hugely inclusive and affects many disciplines • Retroactive Data Capture is a Megascience Activity Molecular Biological Informatics “bioinformatics” Age of “molecular biology” virtually equals age of computers (ca. 50 yr); Many of the data automatically share > 95% of all data ATGC, amino acids, etc.) with binary structure are digitized Biodiversity & Ecosystem Informatics Knowledgebase is 5X older than computers (ca. 250 yr); < 5% is digitized common language (i.e., Data languages are immensely complex on biological and sociological levels (no standardization; not binary) Minimum of $500M spent per year on "bioinformatics" $50M per year spent on biodiversity informatics, even though a minimum of $1B is spent per year on environmental observations globally How GBIF’s Tasks Were Chosen Content area responsibilities of GBIF Biological Specimen Data GenBank, et al. Electronic Catalog of Names Sequence Data (RNA, protein, etc.) Geospatial Data Climate Data Specialized Search Engines Data Access/ Interoperability GBIF will enable synergism among existing investments that is not possible at present Ecosystems Data Ecological Data Existing responsibilities of other agencies What is needed? Computational Capacities Connectivity Content Content Content Content Content Content Content Content Content Content Content Content Content Content ContentContent • 1.5 - 3 B specimens worldwide Data Description, Cleansing, Indexing Tools Data Entry Tools Hardware Legacy Active Databases Active Databases Datasets (disciplinary) (cross-disciplinary) Data from static media Software Content New data from research and monitoring RAW DATA Appropriate Processing Capacity Cross-Database Linking Tools STRUCTURED DATA Presentation Tools ANALYZED DATA INFORMATION EDUCATION ENTERTAIN MENT Data Management Tools Dynamic, Append-only Data Storage Data Correlation & Analysis Tools Reporting Tools Bulk Data Information Transport Delivery (T3 & up) (T1, ASDL) Environmental Decision-Support Example: Species distribution User = Decisionmaker working on location of Wildlife Reserves GIS layers Single species model Using Biodiversity and other data interoperably “Species Richness” and/or “Habitat conservation” models Example: User = Decisionmaker working on location of Wildlife Reserves Combined single species models Presentation tools can simplify the analysis for end users Dave Peters Tasmanian Parks Dept. •250 years of literature • Sequences … • Art • Images • Film / Sound COMPLEX LEGACY Megascience is Strategic Beneficiaries of this Megascience Effort Biodiversity-rich countries repatriation of data access to global biodiversity knowledge-base Information networks (CHM, IABIN, EIOnet, et al.) Governments and others responsible for management of living natural resources Private Sector, e.g. agriculture, forestry, fishing... publishing and biotechnology pharmaceuticals and other health-related areas Scientific research Education (formal and informal) Characteristics of GBIF GBIF • Equitable sharing of information and its benefits • Providers retain control over their own data • Protection of intellectual property rights • Complement existing biodiversity information networks • Open facility architecture • Reduces duplications of effort • Coordinated by Secretariat staff, but • Most of the real work de-centralized GBIF Membership • Voting membership is open to any country or economy that agrees to: Sign MoU Make required financial contribution Share scientific biodiversity data Establish a national GBIF node • Associate (non-voting) membership is open to any country, economy or organisation that agrees to: Sign MoU Share scientific biodiversity data Establish a GBIF node Established: 2001 GBIF • March: 1st Governing Board Meeting (GB1) • • • Elected Christoph Häuser Chair Established Ebbe Nielsen prize June: GB2: Denmark (University of Copenhagen) selected to host the GBIF Secretariat September: GB3: Selected Dr. James L. Edwards to be Executive Secretary Established Science and Budget Committees March 2002: GB4: Awarded first Ebbe Nielsen Prize Established Participant Node Managers Committee Where we are now: GBIF • October 2002: GB5: Work Programme Participant Nodes • 24 Voting Participants • 25 Associate Participants (11 countries, 14 organizations) as of 31 Oct 2002 What we’ve done - YEAR 1 Secretariat established in Copenhagen • IT infrastructure designed and in process of implementation • Work Programme developed by Secretariat and approved by Science Committee and the Governing Board GBIF • • • Secretariat staff hired STAG meetings held to aid in developing the priority work programmes GBIF Work Programmes • Data Access and Database Interoperability (DADI) • Electronic Catalogue of Names of Known Organisms (ECAT) • Digitization of Natural History Collections (DIGIT) • Outreach and Capacity Building (OCB) • Down the road: “Species Bank” Digital Biodiversity Literature Resources GBIF Relationships & IT Services GLOBAL Global Biodiversity Information Facility (GBIF) CBD Clearinghouse Mechanism (CHM) REGIONAL European Information Organization Network (EIOnet) Inter-American Biodiversity Information Network (IABIN) North American Biodiversity Information Network (NABIN) NATIONAL CBIN (Canada) CONABIO (Mexico) ERIN (Australia) INBio (Costa Rica) NBII (U.S.A.) LOCAL Natural History Museum Collections, libraries, research databases, etc. What GBIF will do - YEAR 2 - GBIF working with the Participant Nodes: • Adopt standards for interoperability (DADI) • Development of appropriate search engines, the GBIF central portal and Participant portals (IT) • Build on the Catalog of Names of Known Organisms (ECAT) • Increase rate of digitisation of biodiversity data (DIGIT) • Plan for installing GBIF hardware and software and for training individuals in their use (OCB) • Data-rich biodiversity databases agree to affiliate with GBIF via Participant Nodes (Participants) GBIF The Participants are GBIF ... Supporting partnerships Open Architecture Access and Interoperability Capacity Building GBIF Data-rich Nodes (institutions, organizations) Human Resources (GB, Committees, STAGS, GBIF users & data providers) Computing and Communications Databases (including legacy data) GBIF GBIF WILL: Provide comprehensive information to Respond to environmental questions; Avoid duplication of effort; Cost-efficiently leverage funds; Track data, software and human resources; and Inform internationally in support of Coordination of informatics efforts in order to Achieve sustainability goals and Learn about biodiversity. TOXICOLOGY NEUROSCIENCE DATA DATA MOLECULAR CHEMICAL DATA AND INFORMATION BIODIVERSITY GENETIC EDUCATION AND DATA ECOSYSTEMS HEALTH ENTERTAIN MENT DATA SCIENCES PHYSIOLOGY DATA DATA PHYLOGENETIC DATA industry policy management public informal formal