Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
A Web-based Collaboratory for Supporting Environmental Science Research Xiaorong Xiang Yingping Huang Greg Madey Department of Computer Science and Engineering University of Notre Dame Steve Cabaniss Department of Chemistry University of New Mexico WSS’03: WI/IAT 2003 Workshop on Applications, Products of Web-based Support Systems October 13, 2003, Halifax This research was supported in part by NSF ITR Grant No. 0112820 Introduction Combination of words “collaboration” and “laboratory” first coined by William Wulf (1996): Richard T. Kouzes, James D. Myers, and William A. Wulf. Collaboratories: Doing science on the internet. IEEE Computer, 1996 Diesel Collaboratory: C. M. Pancerella, L. A. Rahn, and C. L. Yang: The diesel combustion collaboratory: combustion researchers collaborating over the internet. In Proceedings of the 1999 ACM/IEEE conference on Supercomputing BioCoRE: http://ks.uiuc.edu/Research/biocore EMSL Collaboratory: http://www.emsl.pnl.gov:2080/docs/collab An example of E-Science G. Fox (2002): E-science meets computational science and information technology. Computing & Engineering R. M. Jakobovits, J. F. Brinkley, C. Rosse, and E.Weinberger (1998): Enabling clinicians, researchers, and eductors to build custom Web-based biomedical information system The NOM Collaboratory Interdisciplinary project Objectives Supporting research on Natural Organic Matter (NOM) Understanding NOM behavior is an important environmental research area Simulations of NOM in the soil and groundwater Information and model sharing Data repository and analysis support tools Electronic communication tools No installation or maintenance of computation resources needed by the environmental scientists NOM Collaboratory Components NOM simulators Search engine NOML upload Molecule editor Molecule validation Simulation finder Automatic restart Completion predictor Data Analysis Graphical reports XML reports Data mining Communication Tools Discussion board Chat room File sharing Web Technologies Sun Java 2 Enterprise Edition (J2EE) RDBMS (Oracle): Java Servlets, Java Server Pages (JSP) Enterprise Java Beans (EJB) Java Transaction Service/API (JTS/JTA) Java DataBase Connectivity (JDBC) Data analysis packages Data mining technologies XML NOML XSQL Web-based Interface Start a new one Submitted simulations Sign up Simulation reports Parameter input Terminate New user Login Dynamic running time prediction Email notification Database Find similar simulations Restarter Invoke simulation Static running time prediction Simulation engines Web-based Interface Logic Input the simulation parameters Invoke the simulation Stop the simulation View the realtime simulation results Web Interface Implementation Example of Interface Interface Features Email notification Running time prediction Static number of molecules number of time steps Dynamic current time step current wall clock time Interface Features (cont) Find similar simulations Environment parameters Molecule types and distributions Retrieve the data sets from database Points on a high dimension space Euclidean distance Ordered list Review the simulation results or restart Automatic restarter Save the state of each objects in the system to database every check point Load the state to the core simulation engine XML-based NOM Markup Language NOML: Facilitates communication Standard data format Environment.dtd, Molecules.dtd, Setup.dtd Environment.xml, Molecules.xml, Setup.xml User ==== User Application ==== Application User ==== Application Extensions planned NOML Uploader Data Input Options Other Tools Molecule editor Molecule validator Define new molecule type Authorized persons (Chemists) to validate data Share the molecule type Search engine Ad-hoc query View results of the completed simulations Restart some simulations Architecture NOM Simulation Engine Design XML Databases Input Packages Agent-based Stochastic Discrete event Swarm RePast Languages Java SQL & PL/SQL Simulation Engine Output Forms NOM Simulation Engine Read simulation parameter from the database (JDBC) Environmental parameters (pH, temperature, light intensity, and so on) Molecule types and distributions User defined time has been separated to a large number of equal size time steps Write relevant data into the database every time step (JDBC) Trace the dynamic properties of individuals and the system over time Data Analysis Analysis SQL Statistics Data mining Presentation Oracle Reports XML/XSLT XSQL Report Example Conclusion Web-based Collaboratory A Web-based simulation architecture (multiple simulation servers, database servers, and data mining technologies) A Web-based configuration interface NOML data upload Future Work Collaboratory More communication tools More simulation models for NOM study NOML extensions JDBC performance Audio Video AutoCommit issues Batch inserts Use of sqlloader User testing Thank You ! Questions?