Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
The University of Washington eScience Institute This afternoon: Phyllis Wise, Provost Ed Lazowska, Computer Science & Engineering Dan Fay, Microsoft Research Martin Savage, Physics David Baker, Biochemistry Andy Connolly, Astronomy eScience: Computational Science for the 21st Century Ed Lazowska Bill & Melinda Gates Chair in Computer Science & Engineering Interim Director, eScience Institute November 2008 http://eScience.washington.edu/ Theory Experiment Observation Theory Experiment Observation Theory Experiment Observation Theory Experiment Observation Computational Science Protein interactions in striated muscles Tom Daniel lab QCD to study interactions of nuclei David Kaplan lab Gas Stars Study of dark matter Dark Matter Tom Quinn lab Theory Experiment Observation Computational Science eScience eScience is driven by data Massive volumes of data from sensors and networks of sensors Apache Point telescope, SDSS 15TB of data (15,000,000,000,000 bytes) Large Synoptic Survey Telescope (LSST) 30TB/day, 60PB in its 10-year lifetime Large Hadron Collider 700MB of data per second, 60TB/day, 20PB/year Illumina Genome Analyzer ~1TB/day Regional Scale Nodes of the NSF Ocean Observatories Initiative 2000 km of fiber optic cable on the seafloor, connecting thousands of chemical, physical, and biological sensors The Web 20+ billion web pages x 20KB = 400+TB One computer can read 30-35 MB/sec from disk => 4 months just to read the web Point-of-sale terminals eScience is about the analysis of data The automated or semi-automated extraction of knowledge from massive volumes of data There’s simply too much of it to look at The technologies of eScience Sensors and sensor networks Databases Data mining Machine learning Data visualization Cluster computing at enormous scale eScience will be pervasive Computational science has been transformational, but to some extent it has been a niche As an institution (e.g., a university), you didn’t need to employ it broadly in order to be competitive eScience capabilities must be broadly available and broadly practiced If not, the institution will simply cease to be competitive The University of Washington eScience Institute Mission Help position the University of Washington at the forefront of research both in modern eScience techniques and technologies, and in the fields that depend upon these techniques and technologies Strategy Increase the sharing of expertise and facilities Bootstrap a cadre of Research Scientists Add faculty in key fields Make the entire University more effective Launched July 1 with $1 million in permanent funding from the Washington State Legislature Sought, and need, $2 million Steering Committee Appointed by Provost Phyllis Wise Ed Lazowska, CSE and eScience Institute Interim Director Mary Lidstrom (chair), Vice Tom Ackerman, Provost for Research Atmospheric Sciences Matt O’Donnell, Engineering Ginger Armbrust, Tom Quinn, Astronomy Oceanography Chance Reschke, eScience Tom Daniel, Biology Institute Technical Coordinator David Goodlett, Medicinal Mani Soma, EE and Office of Chemistry the VP for Research Terry Gray, UW Technology Werner Stuetzle, Arts & Ron Johnson, CTO Sciences David Kaplan, Physics Peter Tarczy-Hornoch, Richard Karpen, Arts & Biomedical & Health Sciences Informatics Activities Direction-setting interviews with UW research leaders regarding technology needs 124 interviews thus far Top researchers of all ages in all fields Technology needs, in priority order 1. Data management facilities • Storage, backup, security 2. Shared expertise • 3. 4. 5. 6. Data management specifically, technology in general Computing power and high-bandwidth network access Data collection and analysis Communication and collaboration technologies Shared laboratories and pricing Initial staffing Research Scientist recruited for cluster computing Chance Reschke Research Scientist being recruited for data management Consulting model developed Jeff Gardner as “TeraGrid Champion” Data management consultancy under development Overall coordination coming on-board Erik Lundberg First faculty search underway Werner Stuetzle chairing search committee Laying the groundwork for broadly shared facilities Data center space coordination and planning UW Tower scheduled to come online in late 2009 ~600KW for research computing EPIC Intelligent use of the research allocation in UW Tower Coordinated, cost-effective compute and storage solutions for the UW eScience community Active exploration of alternative approaches to facilities Amazon Web Services Google/IBM cloud Microsoft Dryad and Azure Participation in proposal preparation Moore Foundation Sequencing Center NSF Data Net - The GRADD Collaboration NSF Track 2d (with PNNL, PSC, CMU) Community building Web site for general information http://eScience.washington.edu/ SIG for eScience technical staff http://staff.washington.edu/reschke/escience-sig/SIG.pdf Monthly technical “brown bag lunch” Regular discussions with research groups across campus regarding their eScience needs We can help you (some currently, better shortly) with … Facilities Proposals Data management issues See posters Email [email protected]