Download PenningtonBEAM200412014intro

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Climatic Research Unit documents wikipedia , lookup

Transcript
Introduction for
BEAM Ecological Niche
Modeling Working Meeting
Deana Pennington
University of New Mexico
December 14, 2004
SEEK Project
•NSF-funded
•Information Technology Research (ITR)
•5 years (starting year 3)
•50+ researchers and developers
•9 institutions
Grand Challenges
in Ecology




Alterations in biodiversity…exotic
species, infectious disease
Altered biogeochemical cycles at
multiple spatial scales
Climate change and variability,
including ecosystem reponse to
change
Coupled human-natural
ecosystems
Ecoinformatics
Tackling these question will require the use of all of
the information available to us






200+ years of data collection in US, 300+ globally
Large and widely distributed data sets
Data heterogeneity (text, Excel, GIS, DB, etc.)
New data collection techniques: in situ sensor arrays
Remotely-sensed imagery
Scaling issues: space, time, levels (taxon)
Biodiversity and ecosystem informatics R&D has been
identified as a critical national priority
–Computer-mediated collaboration
–New tools for synthetic understanding
Science and Technology
Data-intensive
Data mining
Bio-inspired algorithms
Exp. Data Analysis
Visualization
Compute-intensive
Parallel processing
High throughput
Grid technologies
EcoGrid
Analysis & Modeling
Domain-intensive
User interfaces
Human cognition
Ontologies
Semantic mediation
Technologic Systems
for Scientists
Technology-enabled
Science
Data-intensive
Science-focused
Computeintensive
Domainintensive
Kepler
Workflow
System
Informatics and
the Research Cycle
Data-intensive
Data mining
Bio-inspired algs.
Exp. Data Analysis
Visualization
Domainintensive
User interfaces
Human cognition
Ontologies
Sem. mediation
Inductive, Descriptive
Statistics
Mental
Model
Research
Design
Collect
Data
Conduct
Analyses
Share
Results
Metadata
Deductive, Prescriptive
Mechanistic
Computeintensive
Parallel processing
High throughput
Grid technologies
Scientific Workflow
Source: NIH BIRN (Jeffrey Grethe, UCSD)
Promoter Identification
Workflow (PIW)
Source: Matt Coleman (LLNL)
Species Distribution Workflow
EcoGrid
DataBase
Species
pres. & abs.
points
Species
pres. & abs.
points
Test sample
+A2
+A3
EcoGrid
Query
Physical
Transformation
Sample
Data
EcoGrid
DataBase
Training
sample
GARP
rule set
Model
Calculation
Validation
GARP
rule set
Integrated
layers
Env.
layers
EcoGrid
DataBase
EcoGrid
Query
EcoGrid
DataBase
Model quality
parameters
+A1
Integrated
layers
Layer
Integration
Native range
prediction map
Map
Generation
User
Selected
prediction
maps
Scaling
Archive
To Ecogrid
Generate
Metadata
ENM workflows
Climate change
 Species invasion
 Macroanalysis
 Cross-validation
 Calibration
 Environmental monitoring
 Time-specific predictions
 Zoonotic disease

Past year
Conceptual
Workflows
Executable
Workflows

Scripting/Visual modeling
 Single environment
 Single platform

Workflows:
 Cross-platform
 Cross-environment
 Distributed data &
analyses
Data & Analysis Sharing:
EcoGrid
What is a workflow?
Research Design
•Data integration
•Analysis integration (data transformation)
Reporting
Sharing
Starting point: Ptolemy II
Edward Lee et al. http://ptolemy.eecs.berkeley.edu/ptolemyII
Kepler Additions

Grid-enabled data and analysis sharing







Local
Shared
Web application
Web service
Statistical library: R (open source)
GIS library: GDAL/GRASS (open source)
Domain specific functionality (GARP, etc.)
Kepler
Contributors, Projects, Sponsors






















Ilkay Altintas SDM
Chad Berkley SEEK
Shawn Bowers SEEK
Tobin Fricke ROADNet
Jeffrey Grethe BIRN
Christopher H. Brooks Ptolemy II
Zhengang Cheng SDM
Dan Higgins SEEK
Efrat Jaeger GEON
Matt Jones SEEK
Edward A. Lee Ptolemy II
Kai Lin GEON
Ashraf Memon GEON
Bertram Ludaescher BIRN, GEON,
SDM, SEEK
Steve Mock NMI
Steve Neuendorffer Ptolemy II
Jing Tao SEEK
Mladen Vouk SDM
Xiaowen Xin SDM
Yang Zhao Ptolemy II
Bing Zhu SEEK
•••
E-Science Link-up Project
Recommended for NEON
Agenda
Goal: To give you the knowledge and training needed to begin
to develop grid-enabled applications in Kepler





Prototype project: ENM Mammal Project
Resource sharing and grid technologies (Tues am)
Metadata requirements (Tues pm)
Kepler training (Wed am)
Kepler applications in ENM



What you can do now (or very soon) (Wed am/pm)
What expanded functionality needs to be added (Thurs am/pm)
Feedback and planning (Thurs pm)
Important Disclaimer

Kepler is a CIS research project in its
EARLY stage… there are many, many still
to be done. If:





Something crashes….it’s a work in progress
Something looks weird…it’s a work in progress
Something doesn’t work…it’s a work in
progress
Something should be done a different way…it’s
a work in progress
 best to keep a sense of humor 
Acknowledgements
This material is based upon work supported by the National Science
Foundation under awards 0225676 for SEEK and 0225673 (AWSFL008-DS3)
for GEON and by the Department of Energy under Contract No. DE-FC0201ER25486 for SciDAC/SDM and by DARPA under Contract No. F33615-00C-1703 for Ptolemy. Any opinions, findings and conclusions or
recomendations expressed in this material are those of the author(s) and do
not necessarily reflect the views of the National Science Foundation (NSF).
The National Center for Ecological Analysis and Synthesis, a Center funded
by NSF (Grant Number 0072909), the University of California, and the UC
Santa Barbara campus.
The Andrew W. Mellon Foundation.
PBI Collaborators: NCEAS, University of New Mexico (Long Term Ecological
Research Network Office), San Diego Supercomputer Center, University of
Kansas (Center for Biodiversity Research)
Kepler contributors: SEEK, Ptolemy II, SDM/SciDAC, GEON