Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Virtual Observatories Wolfgang Voges Max-Planck-Institut für extraterrestrische Physik Garching Workshop ‘‘Astronomie mit Großgeräten‘‘ Am 17.Oktober 2003 in Potsdam Wolfgang Voges 1 Virtual Observatories Overview: Historical roots * What’s happening in the world: IVOA European VO-activities German VO-activities (GAVO) Federation of local data-sets Next-generation search engine Grid Theory in GAVO Outlook * Viewgraphs partly copied from other presentations Workshop ‘‘Astronomie mit Großgeräten‘‘ Am 17.Oktober 2003 in Potsdam Wolfgang Voges 2 Virtual Observatories Historical remarks 1. VO meeting at Caltec in Pasadena (June 2000) Astronomers mostly from the US Very enthusiastic talks Big vision of the future Foundation of the NVO (US) Since then similar meeting in Europe (Garching) Foundation of national European and later Other VOs Wolfgang Voges 3 Data Knowledge ? The exponential growth of data volume (and complexity, quality) driven by the exponential growth in information technology … 1000 100 10 1 0.1 1970 1975 1980 1985 1990 1995 2000 CCDs Glass … But our understanding of the universe increases much more slowly -- Why? Methodological bottleneck VO is the answer Human wetware limitations … AI-assisted discovery NGVO? Wolfgang Voges 4 How and Where are Discoveries Made? • Conceptual Discoveries: e.g., Relativity, QM, Brane World, Inflation … Theoretical, may be inspired by observations • Phenomenological Discoveries: e.g., Dark Matter, QSOs, GRBs, CMBR, Extrasolar Planets, Obscured Universe … Empirical, inspire theories, can be motivated by them New Technical Capabilities IT/VO Observational Discoveries Phenomenological Discoveries: Pushing along some parameter space axis Making new connections (e.g., multi-) Theory (VO) VO useful VO critical! Understanding of complex astrophysical phenomena requires complex, information-rich data (and simulations?) Wolfgang Voges 5 Why is VO a Good Scientific Prospect? • Technological revolutions as the drivers/enablers of the bursts of scientific growth • Historical examples in astronomy: – 1960’s: the advent of electronics and access to space Quasars, CMBR, x-ray astronomy, pulsars, GRBs, … – 1980’s - 1990’s: computers, digital detectors (CCDs etc.) Galaxy formation and evolution, extrasolar planets, CMBR fluctuations, dark matter and energy, GRBs, … – 2000’s and beyond: information technology The next golden age of discovery in astronomy? VO is the mechanism to effect this process Wolfgang Voges 6 A Schematic Illustration of the VO-Based Science Primary Data Providers Surveys Observatories Missions Survey and Mission Archives Secondary Data Providers VO Data Services --------------Data Mining and Analysis, Target Selection Follow-Up Telescopes and Missions Results Digital libraries Wolfgang Voges VO as an integral part of the whole system … 7 The Changing Style of Observational Astronomy The Old Way: Now: Future: Pointed, heterogeneous observations (~ MB - GB) Large, homogeneous sky surveys (multi-TB, ~ 106 - 109 sources) Multiple, federated sky surveys and archives (~ PB) Small samples of objects (~ 100 - 103) Archives of pointed observations (~ TB) Wolfgang Voges Virtual Observatory 8 This quantitative change in the information volume and complexity will enable the Science of a Qualitatively Different Nature: • Statistical astronomy done right – Precision cosmology, Galactic structure, stellar astrophysics … – Discovery of significant patterns and multivariate correlations – Poissonian errors unimportant • Systematic exploration of the observable parameter spaces (NB: Energy content = Information content) – Searches for rare or unknown types of objects and phenomena – Low surface brightness universe, the time domain … • Confronting massive numerical simulations with massive data sets Wolfgang Voges 9 Panchromatic Views of the Universe: A More Complete, Less Biased Picture Radio Far-Infrared Visible Dust Map Visible + X-ray Wolfgang Voges Galaxy Density10Map Examples of Possible VO Projects: • A Panchromatic View of AGN and Their Evolution – Cross-matching of surveys, radio to x-ray – Understanding of the selection effects – Obscuration, Type-2 AGN, a complete census Evolution and net energetics, diffuse backgrounds • A Phase-Space Portrait of Our Galaxy – Matching surveys: visible to NIR (stars), FIR to radio (ISM) – A 3-D picture of stars, gas, and dust, SFR … – Proper motions and gas velocities: a 6-D phase-space picture Structure, dynamics, and formation of the Galaxy • Galaxy Clusters as Probes of the LSS and its Evolution – Cluster selection using a variety of methods: galaxy overdensity, x-rays, S-Z effect … – Understanding of the selection effects Probing the evolution of the LSS, cosmology Wolfgang Voges 11 Exploration of new domains of the observable parameter space: the Time Domain Faint, Fast Transients (Tyson et al.) Existing and Forthcoming surveys: Microlensing experiments (OGLE, MACHO …) Solar System patrols, GRB patrols … DPOSS plate overlaps (Mahabal et al.) QUEST-2 and NEAT at Palomar … and many, many others … The future: LSST ? Wolfgang Voges Megaflares on normal 12 MS stars (DPOSS) Data Mining in the Image Domain: Can We Discover New Types of Phenomena Using Automated Pattern Recognition? (Every object detection algorithm has its biases and limitations) – Effective parametrization of source morphologies and environments – Multiscale analysis (Also: in the time/lightcurve domain) Wolfgang Voges 13 Exploration of observable parameter spaces and searches for rare or new types of objects Wolfgang Voges 14 Advantages of a Virtual Observatory new, more, better, faster, and easier science comparative analysis of multi-instrument data, permit new approaches to research and multi-wavelength exploration, opening discovery capabilities not otherwise possible This is clearly the primary mandate of all VO efforts minimise redundancy: data collected by a single telescope / instrument can be re-used multiple times by different teams and for different scientific purposes data integrity: data are archived and documented in a controlled and uniform fashion, ensuring long-term scientific usage improving calibrations and creating more higher-level data products to make data more science-ready Wolfgang Voges 15 Advantages of a Virtual Observatory interoperability of archives: - strengthening connections to other archives, catalogues and abstract services for broader research parameter space and links to the literature advancing technologies for computers, networks, data compression, and storage media: - to retrieve and analyse more information more readily at lower cost efficient serving of data to the public: - there will be different levels of end-user from professional astronomers to interested (high-school) students and enthusiastic amateurs – many of whom may undertake projects which are simply unrealisable by large institutes data-mining with new software tools and new catalogues of object properties: - to enable higher-order research based on questions posed in scientific terms Wolfgang Voges 16 Advantages of a Virtual Observatory improving the preparation, development, building of new groundbased and space-based projects improving new observation proposals comparison of real data with simulated data – to provide feedback to new insights, new models, new physics Wolfgang Voges 17 International Virtual Observatory Alliance Korea, Japan, China, Australia, India, Russia, Hungary, Italy, France, Germany, Europe (ESO++), Canada, USA Wolfgang Voges 18 Virtual Observatories What’s happening in the world: IVOA International STANDARDS are needed Registry Data-Models VO-Table VO-Query Uniform Content Descriptors (UCD) Simple Image Access (SID) GRID-standards Tools e.g. data-mining Wolfgang Voges 19 Virtual Observatories European VO-activities In the AVO (euro-vo.org) under the leadership of ESO/ESA the following institutes/groups are collaborating: ESO ESA/STECF University of Edinburgh CDS Strasbourg University Louis Pasteur Centre National de la Reserche Scientifique Delegation Paris The Victoria University of Manchester GAVO (RDS:MPE,AIP,HS,MPA) Wolfgang Voges 20 German Astrophysical Virtual Observatory GAVO-Team: Wolfgang Voges (PI) Hans-Martin Adorf, Gerard Lemson, Achim Bohnet, Joachim Paul Max-Planck-Institut für extraterrestrische Physik, Garching Matthias Steinmetz (Co-I) Harry Enke, Detlef Elstner Astrophysikalisches Institut Potsdam Dieter Reimers, (Co-I) Dieter Engels, Peter Hauschildt Hamburger Sternwarte Simon White, Anthony Banday, Volker Springel Max-Planck-Institut für Astrophysik, Garching Other institutes are most welcome to join >>>www.g-vo.org<<< Wolfgang Voges 21 Why do we need a German AVO? to remain internationally competitive (proposals, data utilisation, quality of science output) to make available VO services to everyone and provide support for the science community and public in Germany to prepare and maintain datasets obtained from German facilities for GAVO and IVO to establish a network, within which the needs of the German science community and public are coordinated to obtain financial support from German agencies for such a national task Wolfgang Voges 22 Activities and responsibilities of partners Wolfgang Voges 23 Activities and responsibilities of partners Main goal is science driven, but it will drive science, too - fast access to all kinds of astronomical and related data - capability to use highly sophisticated software tools for new studies - GAVO will provide interoperability of distributed archives over a high speed network through a set of interface/infrastructure tools - GAVO ultimately will be incorporated into larger IVO federation - Astronomical institutes will require expert data centres of different local character e.g. for providing key data archives, documentations, “simple” analysis-, correlation- and visualisation tools - computer science groups will develop data handling and novel analysis tools and are responsible for their maintenance Wolfgang Voges 24 Activities and responsibilities of partners - - university institutes will be able to use GAVO for teaching and will provide a simple gateway to the public and to schools the “service community” will be responsible for designing and developing the interface/infrastructure tools necessary for communication between the users Wolfgang Voges 25 Archive publication through GAVO • • • • • ROSAT source catalogs published in IVOA compliant manner: – simple cone search – webservices RASS Photons stored in PostgreSQL database – Spatial index using HEALPix – Cone search, webservices Federation: fast match between ROSAT source catalogues and RASS photons. Published first proposal for unified datamodel to serve as an ontology for the IVOA. Plans: – Extend query capabilities – Publish ROSAT fields and pointed observations – Federate with SDSS mirror at MPA – Federate ROSAT catalogues with external catalogues for classification of X-ray sources (in collaboration with ClassX team). Wolfgang Voges 26 The local GAVO activities Top priority during initial stage of development federation of local key datasets and provision of key applications ROSAT, SDSS, Planck, RAVE development of meta-data standards, especially for simulations development of common query tools for the local archives need ability to query/compare both real sky and simulated data post-processing tools - must be platform-independent installation of visualisation packages existing software provides a strong foundation to allow extension to different types of data and archives Wolfgang Voges 27 The local GAVO activities Technical challenges and requirements archive standards: rules for ingestion, data quality, associated meta-data schema, data attributes archive maintenance/evolution: migration of data with new technology and enhancements in data attributes meta-data requirements/standards for different data-sets (observations, simulation, calibration) federation of archives, interoperability high-speed networking, streaming formats for data distributed processing power – GRID concept seeking active cooperation with industry in many of these areas Wolfgang Voges 28 Next Generation Search Engine • Download Manager – Retrieves data from multiple distributed databases • Matcher – Matches sources based on sky-position (astronomical sources have no unique identifier) • Classifier – Uses multi-wavelength data for identification purposes Wolfgang Voges 29 NextGen Search Engine (cntd.) Simple Cone Search Service #1 VOTable One or more SCS Queries BaseURLs Simple Cone Search Service #2 VOTable Multi-Catalogue MultiCone Search "Download Manager" Simple Cone Search Service #3 VOTable VOTables VOTable Processor Matcher DataSets Probabilistic Matcher Table Internet BaseURLs Service Registry Table on Local Disk VOTables VOTables Local Disk Wolfgang Voges Table Local Disk 30 Download Manager • Features – Tool … • … accesses registry at JHU – User … • … selects distributed catalogues • … specifies one or more sky-locations – Tool … • … queries remote catalogues • … retrieves datasets for further processing Wolfgang Voges 31 Download Manager (cntd.) Wolfgang Voges 32 Matcher • Essential for data mining • Prototype features • Performs “fuzzy” match between pairs of source lists from different catalogues • Computes probability of real match • Moving matcher into production use – Collaboration with Canadian Virtual Observatory (CVO) – Feeding ROSAT source matches to classifier Wolfgang Voges 33 Matcher (cntd.) Wolfgang Voges 34 Classification of ROSAT X-ray sources • ClassX: in collaboration with US-VO • Requires data from several large skysurveys • • • • X-ray: ROSAT (BSC + FSC) Optical: SDSS, USNO B1.0 Infrared: 2MASS Radio: FIRST, NVSS, SUMSS • True multi-wavelength VO-application Wolfgang Voges 35 Correlation of ROSAT and SDSS data Probing the large-scale structure of the universe with clusters of galaxies Project outline: - (ideally on single photon/galaxy basis) (Schuecker, Boehringer,Voges) identify a sample of galaxy clusters using X-ray/optical correlation >>>>>> see next 3 viewgraphs utilise optical multi-colour images (u,g,r,i,z) to derive photometric redshifts quantify completeness and selection limits by comparison to simulated cluster data search for IR correlation and quantify galaxy evolution in clusters determine correlation with radio surveys to identify the frequency of radio galaxies and AGN in clusters, search for radio halos optical correlation to identify AGN in clusters identify correlations with microwave/sub-mm data to search for the Sunyaev-Zeldovich (SZ) effect (distance measurements, velocity determination) Wolfgang Voges 36 Search for clusters of galaxies Maximum likelihood contours based on RASS-3 X-ray photons (upper panel, 1, 2 .. contours), SDSS galaxies (middle panel, >10), and the combined maximum likelihood contours of RASS-3 and SDSS data (lower panel, >10). Crosses mark the position of the deepest X-ray clustersamples available sofar (REFLEX-2, X-ray flux limit 1.8 .10-12 erg s-1cm-2). Squares mark the position of the X-ray clusters of the final sample. Wolfgang Voges 37 Search for clusters of galaxies Cumulative X-ray cluster number counts of the RASS/SDSSclusters (histograms) for a log-likelihood minimum of 15 applied to SDSS data (continuous line), for 25 (lower dashed line), and for 5 (upper dashed line). The RASS/SDSS cluster counts are compared with results obtained with other surveys (squares: RDCS, REFLEX,REFLEX-2). No corrections for variations of the angular survey-sensitivity (effective survey area) are applied to the RASS/SDSS and REFLEX-2 data. The figure shows that with the combination of RASS and SDSS data a 10 times deeper X-ray flux limit can be obtained compared to traditional X-ray cluster surveys like REFLEX. Wolfgang Voges 38 Search for clusters of galaxies General remarks: Our first results are quite important as a guideline for future X-ray missions like ROSITA and DUO. For the latter, about 8,000 X-ray clusters are expected to be detectable with standard methods. The application of the matchedfilter technique allows the extraction of about 30,000 X-ray clusters with DUO. Such large numbers of X-ray clusters are needed for precise tests of the dark energy and alternative gravitational theories. Wolfgang Voges 39 Correlation of ROSAT and SDSS data VO functionality required: - - - federation of relevant datasets including interchange/merging of meta-data identification of candidate cluster members by appropriate query applications to optical and/or X-ray catalogues acquire multi-colour information to determine photometric redshifts identification of candidate radio galaxy cluster members by querying radio catalogues with search criteria (e.g. location) tailored to the derived cluster sample identification of associated SZ by specific queries in existing catalogues; if no candidate SZ cluster can be identified apply suitable search algorithms to Cosmic Microwave Background (CMB) sky surveys to determine effect or limits thereon visualisation of multi-wavelength cluster data deprojection algorithms to allow study of morphology in survey data conversion of simulation data to the space of observable parameters 3D-interface for visualisation (to schools) Wolfgang Voges 40 Query example Correlation between radio, IR, optical and X-ray sources Search for SDSS QSO´s with 1 < z < 2, which are variable in one of the 4 wavelength-bands - search-engine: which datasets do exist and in which archive? multiple availability? parallel handling on different servers data available for different epochs? comparison of fluxes, light curves, period-search Source catalogues available? - radio: FIRST, NVSS, … IR: IRAS, 2MASS, … Optical: SDSS, Tycho-2, HST-GSC, USNO-2… X-ray: ROSAT, ASCA, XMM-Newton, Chandra, … Wolfgang Voges 41 Query example - if no catalogue entry exists postage-stamp (pixel-image with/without contour-lines) creation of light curves, fluxes, spectra, etc. by using original-data - high demand of CPU? GRID implementation - search for publications on derived variable SDSS QSO objects Wolfgang Voges 42 Grid Technology for GAVO GAVO grid : integration of all GAVO-workstations at MPE and AIP into a cluster Basic services on GAVO-grid: CertificationAuthority provides single-sign-on/access-all facility via proxy-ca Resource discovery and runtime information, network-weather for the grid Running distributed applications Running MPI-based applications on the GAVO-cluster Wolfgang Voges 43 Virtual Observatories Theory and GAVO Simulations Comparison of Simulations and observations Wolfgang Voges 44 Simulations in the Virtual Observatory Merging of the Milky Way with the Andromeda galaxy (M31) (3 Mio particles, cluster of 16 CPU’s, 1 week of CPU time) (30 k particles would need 25 minutes of cluster-CPU time) Wolfgang Voges 45 Simulations in the Virtual Observatory COMA type cluster of galaxies (>1000 galaxies, 10^15Msolar, 7 Mio particles) 8 CPUs, runtime:2 days; Gravitation, Hydrodynamics, not included: cooling, star formation (Volker Springel, MPA) Wolfgang Voges 46 The Role of Datasets from Theoretical Astrophysics • Direct Comparisons with Observations – Verification (or not) of Models • Data Mining for Both Observations and Theory – New Applications – Buried Physics • Resource for Education and Outreach Wolfgang Voges 47 Theory and the Virtual Observatory • Size of Datasets Appropriate to VO – Large Scale Simulations, Parameter Space Libraries Imply 10GB – 10 TB Datasets • Rich Complement to Observational Side • Same/Similar Tools as for Obs. Datasets • Use of VO Infrastructure – Grid Technology, Portals, etc. Wolfgang Voges 48 Conclusions • Theoretical Astrophysics is an Essential Part of the Virtual Observatory Concept – Provides Benefits to Theorists – Provides Benefits to Observers – Provides Benefits to Education/Outreach • Drives New Science Wolfgang Voges 49 GAVO efforts on the TVO • • • • Published IVOA whitepaper on “Theory in the VO” Leading theory subgroup in IVOA data modeling effort. Chair in IVOA special interest group on theory Plans: – Publish simulation archives at AIP, LMU-Obs., andMPA – Collaborate with UPitt on publishing services on theoretical datasets (NSF grant proposal) – Collaboration with Technion Haifa to publish observed and simulated Ly- forest spectra (GIF proposal) – Collaboration in RTN proposal for comparison of simulated and observed X-Ray clusters (Boehringer et al@MPE) Wolfgang Voges 50 Virtual Observatories Outlook Great enthusiasm among astronomical community that VO will work and will make life easier BUT still a lot to be done IVOA is combining all available forces to attack the manyfold problems There is still the need to incorporate other fields like mathematics, informatics,computer science, networks etc. since there is parallel work in progress GRID paradigm, fast data-links, super-computer access, etc. MORE MANPOWER NEEDED Wolfgang Voges 51 The end Wolfgang Voges 52