Exploring Space in Cyberspace: Cyber-Enabled Research and

Size: px
Start display at page:

Download "Exploring Space in Cyberspace: Cyber-Enabled Research and"

Transcription

1 011 Exploring Space in Cyberspace: Cyber-Enabled Research and Discovery in Astronomy S. George Djorgovski (Caltech) CDI Workshop, Seattle, Nov Overview Astronomy in the era of information abundance Exponential growth in data volume and complexity The Virtual Observatory concept and status A domain-specific, community-wide, distributed, open framework for science with massive and complex data sets Technology-enabled, but science-driven Examples of VO science drivers Exploration of parameter spaces Exploration of the time domain: distributed analysis and mining of massive data streams Some general comments and musings on Cyber-science What is really new here? What are the important trends? The enhanced science and technology synergies

2 Galactic Center Region (a tiny portion) 2MASS NIR Image Digital Sky Surveys The dominant source of data in astronomy today, typically several tens of TB each, ~ sources detected, ~ attributes per source; all wavelengths, radio to!-ray Examples: SDSS, Palomar surveys, 2MASS, A single survey feeds a broad range of science, from statistical studies of major constituents of the universe, to discovery of rare types of objects Federated in the Virtual Observatory framework Current surveys are mainly single-snapshot; the next generation will be synoptic (multi-pass), opening the time-domain astronomy (cosmic cinematography); Peta-scale data sets Examples: PanSTARRS, LSST; SKA; etc.

3 Radio Far-Infrared Panchromatic Views of the Universe Visible Crab Star forming complex Data Fusion! A More Complete, Less Biased Picture Visible + X-ray Radio + IR Understanding of complex phenomena requires complex data Theoretical Simulations Are Also Becoming More Complex and Generate Many TB s of Data Structure formation in the Universe Supernova explosions Numerical simulations are not just a weak substitute for the analytical theory - they are an inevitable methodology to study theoretically many complex phenomena, e.g., star or galaxy formation, etc.

4 An Example of Where We Are Heading Large digital sky surveys are becoming the dominant source of data in astronomy: ~ TB/survey (soon PB), ~ sources/survey, many wavelengths Data sets many orders of magnitude larger, more complex, and more homogeneous than in the past Data! Knowledge? doubling t! 1.5 yrs The exponential growth of data volume (and also complexity, quality) driven by the exponential growth in detector and computing technology but our understanding of the universe increases much more slowly! (from CCDs A. Szalay) Glass

5 The Virtual Observatory Concept Astronomy community s response to the scientific and technological challenges posed by the exponential growth of data sets and data complexity Technology-enabled, but science-driven: harness the IT advances in service of astronomy A complete, dynamical, distributed, web-based, open research environment for astronomy with massive and complex data sets Provide content (data, metadata) services, standards, and analysis/compute services Federate the existing + forthcoming digital sky surveys and archives, facilitate data inclusion and distribution Develop and provide data exploration and discovery tools From Traditional to Survey to VO-Based Science Traditional: Survey-Based: Another Survey/Archive? Telescope Data Analysis Survey Telescope Archive Target Selection Data Mining Follow-Up Telescope Results Results Highly successful and increasingly prominent, but inherently limited by the information content of individual surveys What comes next, beyond survey science is the VO science

6 Nat l Virtual Observatory: A Systemic View Primary Data Providers Surveys Observatories Missions Digital libraries Survey and Mission Archives Numerical Sim s User Community NVO Data Services: Data discovery Warehousing Federation Standards Compute Services: Data Mining and Analysis, Statistics, Visualization Networking Secondary Data Providers Follow-Up Telescopes and Missions International VO s Virtual Observatory Is Real! ivoa.net

7 VO as a New Research Environment The VO is not yet another data center, archive, mission, or a traditional project It does not fit into any of the usual structures today It is inherently distributed, and web-centric It is based on a rapidly developing technology (IT/CS) It transcends the traditional boundaries between different wavelength regimes, agency domains It has an unusually broad range of constituents and interfaces It is inherently multidisciplinary It is inherently trans-national in its reach The VO represents a novel type of a scientific organization for the era of information abundance Many other fields are building VOs of their own They are always discipline-based, not institution-based Virtual Observatory Enabled Science Statistical astronomy done right Precision cosmology, Galactic structure, stellar astrophysics Discovery of significant patterns and multivariate correlations Poissonian errors unimportant Systematic exploration of the observable parameter spaces Searches for rare or unknown types of objects and phenomena Low surface brightness universe, the time domain Multi-wavelength data fusion to disentangle complex processes and superpositions e.g., interpretation of the precision CMBR measurements Confronting massive numerical simulations with massive data sets + things we have not thought of yet

8 Integrated SZ Grav. Lensing Integ. Sachs-Wolfe Understanding the CMBR Foregrounds CMB Signal Gal. Nonthermal Galactic Thermal Radio Sources Galaxies (SF) Exploration of Parameter Spaces How many different types of objects are there? Which ones are identifiable with known, physically distinct types (e.g., stars, galaxies, quasars, etc.)? Are there rare and/or previously unknown classes, seen as outliers? Are there intermediate or transition types? Are there negative clusters? Anomalies possibly indicative of problems with the data? Are there new multivariate correlations?

9 An Example: Discoveries of High-Redshift Quasars and Type-2 Quasars in DPOSS Known, astrophysically interesting but rare types of objects, with a known or predictable parameter space signature Color-color parameter space used for selection Spectroscopic identification Normal Stars High-z QSOs Type-2 QSOs Rare Types of Objects Discovered as Outliers in a Color Parameter Space Peculiar types of quasars: Peculiar types of stars: DQ White Dwarf Highly unusual CV (Fan et al., SDSS; Djorgovski et al., DPOSS)

10 Dwarf Planets, Flying Rocks, and Snowballs Dwarf planets and KBOs M. Brown et al. Quauar Sedna, Xena,? NEAT, Catalina, etc. Killer Asteroids Tunguska A Rich Variety of Time-Domain Phenomena Flaring stars Novae, Cataclysmic Variables Supernovae Gamma-Ray Bursts Gravitational Microlensing Accretion to SMBHs

11 Blazars: Accelerators in the Sky They are quasars where we are looking straight down the relativistic jet (" ~ ). Instabilities and shocks produce strong variability Known sources of!-rays (up to a few TeV), and probable sources of ultrahigh energy cosmic rays (up to ~ ev ~ 10 8! LHC!) The future of particle (astro)physics? Probes of relativistic physics, AGN, and cosmic star formation history TeV!-ray Detections Donald Rumsfeld s Epistemology There are known knowns, There are known unknowns, and There are unknown unknowns

12 And Some Mysteries Megaflares in normal stars! An example from DPOSS: A normal, mainsequence star which underwent an outburst by a factor of > 300 (orders of magnitude more than the Solar flares). The cause, duration, and frequency of these bursts is currently unknown Archival optical transients! Seen in many surveys (DPOSS, DLS, PQ, SN surveys, ). Their physical nature is unknown The Palomar-Quest Event Factory Detect ~ 1-2!10 6 sources per half-night scan Compare with the baseline sky R tonight baseline Find ~ 10 3 apparent transients (in the data) Remove instrum. artifacts Identify ~ 2-4!10 2 real transients (on the sky) Remove asteroids Identify ~ 1-10 possible Astrophysical transients I Classification and follow-up

13 The VOEventNet Project A telescope sensor network with a feedback Scientific measurements spawning other measurements and data analysis in the real time (time scales ~ minutes/hours/days) Immediate web-based dissemination and publishing Please see P48 External archives PQ Event Factory Compute resources VOEN Engine PI: R. Williams Spons. NSF/DDDAS P60 Robotic telescope network Raptor Web Event Archive Paritel Follow-up obs. Broader and Societal Benefits of a VO Professional Empowerment: Scientists and students anywhere with an internet connection would be able to do a first-rate science A broadening of the talent pool in astronomy, democratization of the field Interdisciplinary Exchanges: The challenges facing the VO are common to most sciences and other fields of the modern human endeavor Intellectual cross-fertilization, feedback to IT/CS Education and Public Outreach: Unprecedented opportunities in terms of the content, broad geographical and societal range, at all levels Astronomy as a magnet for the CS/IT education Weapons of Mass Instruction

14 VO Education and Public Outreach Google Sky: uses DSS, SDSS, HST data, etc., for easy sky browsing Soon also: Microsoft s World Wide Telescope Transformation and Synergy We are entering the second phase of the IT revolution: the rise of the information/data driven computing The impact is like that of the industrial revolution and the invention of the printing press, combined All science in the 21st century is becoming cyber-science (aka e-science) - and with this change comes the need for a new scientific methodology The challenges we are tackling: Management of large, complex, distributed data sets Effective exploration of such data! new knowledge These challenges are universal There is a great emerging synergy of the computationally enabled science, and the science-driven IT

15 Information Technology! New Science The information volume grows exponentially Most data will never be seen by humans! The need for data storage, network, database-related technologies, standards, etc. Information complexity is also increasing greatly Most data (and data constructs) cannot be comprehended by humans directly! The need for data mining, KDD, data understanding technologies, hyperdimensional visualization, AI/Machineassisted discovery We need to create a new scientific methodology on the basis of applied CS and IT VO is the framework to effect this for astronomy A Modern Scientific Discovery Process Data Gathering (e.g., from sensor networks, telescopes ) Data Farming: Storage/Archiving Key Technical Challenges Indexing, Searchability Data Fusion, Interoperability +feedback } Database Technologies Data Mining (or Knowledge Discovery in Databases): Pattern or correlation search Clustering analysis, automated classification Outlier / anomaly searches Hyperdimensional visualization Data Understanding New Knowledge Key Methodological Challenges

16 The key role of data analysis is to replace the raw complexity seen in the data with a reduced set of patterns, regularities, and correlations, leading to their theoretical understanding However, the complexity of data sets and interesting, meaningful constructs in them is starting to exceed the cognitive capacity of the human brain Universal Challenges: Towards The New Scientific Methodology Data farming and harvesting Semantic webs, computational and data grids, universal or transdisciplinary standards and ontologies Digital scholarly publishing and curation (libraries) data, metadata, virtual data, hierarchical data products; legacy vs. dynamical; open vs. proprietary; data, knowledge, and codes; persistency; peer review; web samizdat vs. officially blessed and supported; mandates; etc., etc. Data mining and understanding, knowledge extraction Scalable DM algorithms Hyperdimensional visualization Empirical validation of numerical models Computer science as the new mathematics The art and science of scientific software systems Architecture, design, implementation, validation

17 Some Distinguishing Characteristics of Data/Comp. Enabled (e-)science Data-intensive: massive (TB-scale and beyond) data sets Poissonian errors not important, systematics dominate Data complexity: multi-wavelength and/or multi-scale and/or multi-epoch data sets, 100 s or 1000 s parameters per source, combining imaging, spectroscopy, etc. Heterogeneity and visualization are key issues Computationally intensive Traditional solutions do not scale to the scope of new problems Need new tools and scalable algorithms Data and computing resources (an experise) are generally geographically distributed Inherently cross-cutting in many ways (CS/Astro, multi-# ) Some Thoughts on CyberScience Enables a broad spectrum of users and contributors From large teams, to small teams, to individuals Data volume ~ team size, but scientific returns! f (team size) Human talent is distributed very broadly geographically Open, distributed, web-based nature of new science is a key feature Transition from data-poor to data-rich science Chaotic! Organized regulation vs. creative freedom Can we learn to ask a new kind of questions? Information is cheap, but expertise is expensive Just like the hardware/software situation Computer science as the new mathematics It plays the role in relation to other sciences which mathematics did in ~ 17 th - 20 th century

18 Summary Comments Cyber-enabled (computationally and data-enabled) science is a practical necessity Complex problems! simulations, complex and massive data sets Distributed resources (data, facilities, people )! virtual scientific organizations (VO is an example) It is IT-enabled, and has a potential to drive transformative scientific and practical advances The key challenge is to think differently (computationally) Remember the origins of WWW; now grid, semantic web, knowledge extraction tools, MP/HPC design and apps There is a great deal of methodological commonality between different fields and this commonality can lubricate some genuine multi- or interdisciplinary research, with a great discovery potential Let s avoid wasteful replication of efforts, share the tools, methods

Data Mining Challenges and Opportunities in Astronomy

Data Mining Challenges and Opportunities in Astronomy Data Mining Challenges and Opportunities in Astronomy S. G. Djorgovski (Caltech) With special thanks to R. Brunner, A. Szalay, A. Mahabal, et al. The Punchline: Astronomy has become an immensely datarich

More information

The Big Picture: Information 01100 Technology Revolution, and 1010011 Science in the 21st Century 00101000

The Big Picture: Information 01100 Technology Revolution, and 1010011 Science in the 21st Century 00101000 011 The Big Picture: Information 01100 Technology Revolution, and 1010011 Science in the 21st Century 00101000 Roy & George s Excellent Adventure 1110100011 001001110110110 100101010001011101 Lecture 4

More information

Learning from Big Data in

Learning from Big Data in Learning from Big Data in Astronomy an overview Kirk Borne George Mason University School of Physics, Astronomy, & Computational Sciences http://spacs.gmu.edu/ From traditional astronomy 2 to Big Data

More information

Astrophysics with Terabyte Datasets. Alex Szalay, JHU and Jim Gray, Microsoft Research

Astrophysics with Terabyte Datasets. Alex Szalay, JHU and Jim Gray, Microsoft Research Astrophysics with Terabyte Datasets Alex Szalay, JHU and Jim Gray, Microsoft Research Living in an Exponential World Astronomers have a few hundred TB now 1 pixel (byte) / sq arc second ~ 4TB Multi-spectral,

More information

LSST and the Cloud: Astro Collaboration in 2016 Tim Axelrod LSST Data Management Scientist

LSST and the Cloud: Astro Collaboration in 2016 Tim Axelrod LSST Data Management Scientist LSST and the Cloud: Astro Collaboration in 2016 Tim Axelrod LSST Data Management Scientist DERCAP Sydney, Australia, 2009 Overview of Presentation LSST - a large-scale Southern hemisphere optical survey

More information

The Tonnabytes Big Data Challenge: Transforming Science and Education. Kirk Borne George Mason University

The Tonnabytes Big Data Challenge: Transforming Science and Education. Kirk Borne George Mason University The Tonnabytes Big Data Challenge: Transforming Science and Education Kirk Borne George Mason University Ever since we first began to explore our world humans have asked questions and have collected evidence

More information

How To Teach Data Science

How To Teach Data Science The Past, Present, and Future of Data Science Education Kirk Borne @KirkDBorne http://kirkborne.net George Mason University School of Physics, Astronomy, & Computational Sciences Outline Research and Application

More information

Data Literacy For All: Astrophysics and Beyond (Astronomy is evidence-based forensic science, thus it is a data & information science)

Data Literacy For All: Astrophysics and Beyond (Astronomy is evidence-based forensic science, thus it is a data & information science) Data Literacy For All: Astrophysics and Beyond (Astronomy is evidence-based forensic science, thus it is a data & information science) Kirk Borne George Mason University, Fairfax, VA www.kirkborne.net

More information

The Virtual Observatory: What is it and how can it help me? Enrique Solano LAEFF / INTA Spanish Virtual Observatory

The Virtual Observatory: What is it and how can it help me? Enrique Solano LAEFF / INTA Spanish Virtual Observatory The Virtual Observatory: What is it and how can it help me? Enrique Solano LAEFF / INTA Spanish Virtual Observatory Astronomy in the XXI century The Internet revolution (the dot com boom ) has transformed

More information

Analytics-as-a-Service: From Science to Marketing

Analytics-as-a-Service: From Science to Marketing Analytics-as-a-Service: From Science to Marketing Data Information Knowledge Insights (Discovery & Decisions) Kirk Borne George Mason University, Fairfax, VA www.kirkborne.net @KirkDBorne Big Data: What

More information

College of Science George Mason University Fairfax, VA 22030

College of Science George Mason University Fairfax, VA 22030 College of Science George Mason University Fairfax, VA 22030 Dr. Sidney Wolff and the LSST Board of Directors LSST Corporation 933 N. Cherry Avenue Tucson, AZ 85721-0009 June 14, 2010 Dear Dr. Wolff and

More information

Top 10 Discoveries by ESO Telescopes

Top 10 Discoveries by ESO Telescopes Top 10 Discoveries by ESO Telescopes European Southern Observatory reaching new heights in astronomy Exploring the Universe from the Atacama Desert, in Chile since 1964 ESO is the most productive astronomical

More information

Example application (1) Telecommunication. Lecture 1: Data Mining Overview and Process. Example application (2) Health

Example application (1) Telecommunication. Lecture 1: Data Mining Overview and Process. Example application (2) Health Lecture 1: Data Mining Overview and Process What is data mining? Example applications Definitions Multi disciplinary Techniques Major challenges The data mining process History of data mining Data mining

More information

and the VO-Science Francisco Jiménez Esteban Suffolk University

and the VO-Science Francisco Jiménez Esteban Suffolk University The Spanish-VO and the VO-Science Francisco Jiménez Esteban CAB / SVO (INTA-CSIC) Suffolk University The Spanish-VO (SVO) IVOA was created in June 2002 with the mission to facilitate the international

More information

MANAGING AND MINING THE LSST DATA SETS

MANAGING AND MINING THE LSST DATA SETS MANAGING AND MINING THE LSST DATA SETS Astronomy is undergoing an exciting revolution -- a revolution in the way we probe the universe and the way we answer fundamental questions. New technology enables

More information

A Preliminary Summary of The VLA Sky Survey

A Preliminary Summary of The VLA Sky Survey A Preliminary Summary of The VLA Sky Survey Eric J. Murphy and Stefi Baum (On behalf of the entire Science Survey Group) 1 Executive Summary After months of critical deliberation, the Survey Science Group

More information

CYBERINFRASTRUCTURE FRAMEWORK FOR 21 st CENTURY SCIENCE AND ENGINEERING (CIF21)

CYBERINFRASTRUCTURE FRAMEWORK FOR 21 st CENTURY SCIENCE AND ENGINEERING (CIF21) CYBERINFRASTRUCTURE FRAMEWORK FOR 21 st CENTURY SCIENCE AND ENGINEERING (CIF21) Goal Develop and deploy comprehensive, integrated, sustainable, and secure cyberinfrastructure (CI) to accelerate research

More information

Conquering the Astronomical Data Flood through Machine

Conquering the Astronomical Data Flood through Machine Conquering the Astronomical Data Flood through Machine Learning and Citizen Science Kirk Borne George Mason University School of Physics, Astronomy, & Computational Sciences http://spacs.gmu.edu/ The Problem:

More information

The Challenge of Data in an Era of Petabyte Surveys Andrew Connolly University of Washington

The Challenge of Data in an Era of Petabyte Surveys Andrew Connolly University of Washington The Challenge of Data in an Era of Petabyte Surveys Andrew Connolly University of Washington We acknowledge support from NSF IIS-0844580 and NASA 08-AISR08-0081 The science of big data sets Big Questions

More information

Science and the Taiwan Airborne Telescope

Science and the Taiwan Airborne Telescope Cosmic Variability Study in Taiwan Wen-Ping Chen Institute of Astronomy National Central University, Taiwan 2010 November 16@Jena/YETI Advantages in Taiwan: - Many high mountains - Western Pacific longitude

More information

Taming the Internet of Things: The Lord of the Things

Taming the Internet of Things: The Lord of the Things Taming the Internet of Things: The Lord of the Things Kirk Borne @KirkDBorne School of Physics, Astronomy, & Computational Sciences College of Science, George Mason University, Fairfax, VA Taming the Internet

More information

Data Driven Discovery In the Social, Behavioral, and Economic Sciences

Data Driven Discovery In the Social, Behavioral, and Economic Sciences Data Driven Discovery In the Social, Behavioral, and Economic Sciences Simon Appleford, Marshall Scott Poole, Kevin Franklin, Peter Bajcsy, Alan B. Craig, Institute for Computing in the Humanities, Arts,

More information

UCLA Graduate School of Education and Information Studies UCLA

UCLA Graduate School of Education and Information Studies UCLA UCLA Graduate School of Education and Information Studies UCLA Peer Reviewed Title: Slides for When use cases are not useful: Data practices, astronomy, and digital libraries Author: Wynholds, Laura, University

More information

Science Standard 4 Earth in Space Grade Level Expectations

Science Standard 4 Earth in Space Grade Level Expectations Science Standard 4 Earth in Space Grade Level Expectations Science Standard 4 Earth in Space Our Solar System is a collection of gravitationally interacting bodies that include Earth and the Moon. Universal

More information

Computational Science and Informatics (Data Science) Programs at GMU

Computational Science and Informatics (Data Science) Programs at GMU Computational Science and Informatics (Data Science) Programs at GMU Kirk Borne George Mason University School of Physics, Astronomy, & Computational Sciences http://spacs.gmu.edu/ Outline Graduate Program

More information

Software challenges in the implementation of large surveys: the case of J-PAS

Software challenges in the implementation of large surveys: the case of J-PAS Software challenges in the implementation of large surveys: the case of J-PAS 1/21 Paulo Penteado - IAG/USP pp.penteado@gmail.com http://www.ppenteado.net/ast/pp_lsst_201204.pdf (K. Taylor) (A. Fernández-Soto)

More information

Observer Access to the Cherenkov Telescope Array

Observer Access to the Cherenkov Telescope Array Observer Access to the Cherenkov Telescope Array IRAP, Toulouse, France E-mail: jknodlseder@irap.omp.eu V. Beckmann APC, Paris, France E-mail: beckmann@apc.in2p3.fr C. Boisson LUTh, Paris, France E-mail:

More information

The Very High Energy source catalogue at the ASI Science Data Center

The Very High Energy source catalogue at the ASI Science Data Center The Very High Energy source catalogue at the ASI Science Data Center a,b, F. Lucarelli a,b, L. A. Antonelli a,b, P. Giommi b E-mail: alessandro.carosi@oa-roma.inaf.it a INAF, National Institute for Astrophysics,

More information

Engineering the Data Processing Pipeline

Engineering the Data Processing Pipeline Engineering the Data Processing Pipeline Mark Stalzer Center for Advanced Computing Research California Institute of Technology stalzer@caltech.edu October 29, 2009 A systems engineering view of computational

More information

The Sino-French Gamma-Ray Burst Mission SVOM (Space-based multi-band astronomical Variable Objects Monitor)

The Sino-French Gamma-Ray Burst Mission SVOM (Space-based multi-band astronomical Variable Objects Monitor) The Sino-French Gamma-Ray Burst Mission SVOM (Space-based multi-band astronomical Variable Objects Monitor) Didier BARRET on behalf of the SVOM collaboration didier.barret@cesr.fr Outline SVOM background

More information

WIMP dark matter and the isotropic radio signal Roberto A. Lineros R. Instituto de Física Corpuscular - CSIC/U. Valencia @Roberto_Lineros Outline Introduction Cosmic ray propagation Synchrotron emission

More information

Commentary on Techniques for Massive- Data Machine Learning in Astronomy

Commentary on Techniques for Massive- Data Machine Learning in Astronomy 1 of 24 Commentary on Techniques for Massive- Data Machine Learning in Astronomy Nick Ball Herzberg Institute of Astrophysics Victoria, Canada The Problem 2 of 24 Astronomy faces enormous datasets Their

More information

I. Justification and Program Goals

I. Justification and Program Goals MS in Data Science proposed by Department of Computer Science, B. Thomas Golisano College of Computing and Information Sciences Department of Information Sciences and Technologies, B. Thomas Golisano College

More information

Information Management course

Information Management course Università degli Studi di Milano Master Degree in Computer Science Information Management course Teacher: Alberto Ceselli Lecture 01 : 06/10/2015 Practical informations: Teacher: Alberto Ceselli (alberto.ceselli@unimi.it)

More information

Stony Brook University Caltech Palomar Observatory Partnership

Stony Brook University Caltech Palomar Observatory Partnership Stony Brook University Caltech Palomar Observatory Partnership Proposal Stanimir Metchev, Jin Koda, Frederick Walter, Michal Simon Department of Physics & Astronomy, Stony Brook University, NY 11794 and

More information

Solar Ast ro p h y s ics

Solar Ast ro p h y s ics Peter V. Foukal Solar Ast ro p h y s ics Second, Revised Edition WI LEY- VCH WILEY-VCH Verlag Co. KCaA Contents Preface 1 1.1 1.2 1.3 1.4 1.5 1.6 1.7 1.8 1.9 2 2.1 2.1.1 2.1.2 2.2 2.2.1 2.2.2 2.2.3 2.3

More information

First Discoveries. Asteroids

First Discoveries. Asteroids First Discoveries The Sloan Digital Sky Survey began operating on June 8, 1998. Since that time, SDSS scientists have been hard at work analyzing data and drawing conclusions. This page describes seven

More information

Data analysis of L2-L3 products

Data analysis of L2-L3 products Data analysis of L2-L3 products Emmanuel Gangler UBP Clermont-Ferrand (France) Emmanuel Gangler BIDS 14 1/13 Data management is a pillar of the project : L3 Telescope Caméra Data Management Outreach L1

More information

Hubble Diagram S George Djorgovski. Encyclopedia of Astronomy & Astrophysics P. Murdin

Hubble Diagram S George Djorgovski. Encyclopedia of Astronomy & Astrophysics P. Murdin eaa.iop.org DOI: 10.1888/0333750888/2132 Hubble Diagram S George Djorgovski From Encyclopedia of Astronomy & Astrophysics P. Murdin IOP Publishing Ltd 2006 ISBN: 0333750888 Institute of Physics Publishing

More information

Information Visualization WS 2013/14 11 Visual Analytics

Information Visualization WS 2013/14 11 Visual Analytics 1 11.1 Definitions and Motivation Lot of research and papers in this emerging field: Visual Analytics: Scope and Challenges of Keim et al. Illuminating the path of Thomas and Cook 2 11.1 Definitions and

More information

Einstein Rings: Nature s Gravitational Lenses

Einstein Rings: Nature s Gravitational Lenses National Aeronautics and Space Administration Einstein Rings: Nature s Gravitational Lenses Leonidas Moustakas and Adam Bolton Taken from: Hubble 2006 Science Year in Review The full contents of this book

More information

Data-Driven Discovery through e-science Technologies

Data-Driven Discovery through e-science Technologies Data-Driven Discovery through e-science Technologies Kirk D. Borne George Mason University, QSS Group Inc., and NASA-GSFC Kirk.borne@gsfc.nasa.gov Abstract Future space missions and science programs will

More information

Summary of Data Management Principles Dark Energy Survey V2.1, 7/16/15

Summary of Data Management Principles Dark Energy Survey V2.1, 7/16/15 Summary of Data Management Principles Dark Energy Survey V2.1, 7/16/15 This Summary of Data Management Principles (DMP) has been prepared at the request of the DOE Office of High Energy Physics, in support

More information

Class 2 Solar System Characteristics Formation Exosolar Planets

Class 2 Solar System Characteristics Formation Exosolar Planets Class 1 Introduction, Background History of Modern Astronomy The Night Sky, Eclipses and the Seasons Kepler's Laws Newtonian Gravity General Relativity Matter and Light Telescopes Class 2 Solar System

More information

Data Pipelines & Archives for Large Surveys. Peter Nugent (LBNL)

Data Pipelines & Archives for Large Surveys. Peter Nugent (LBNL) Data Pipelines & Archives for Large Surveys Peter Nugent (LBNL) Overview Major Issues facing any large-area survey/search: Computational power for search - data transfer, processing, storage, databases

More information

August, 2000 LIGO-G000313-00-D. http://www.ligo.caltech.edu/~smarka/sn/

August, 2000 LIGO-G000313-00-D. http://www.ligo.caltech.edu/~smarka/sn/ LIGO-G000313-00-D Proposal to the LSC for Entry of LIGO into the Supernova Nova Early Warning System (SNEWS) and Prototype Development of Real-Time LIGO Supernova Alert 1 Barry Barish, Kenneth Ganezer(CSUDH),

More information

Scientific Computing Meets Big Data Technology: An Astronomy Use Case

Scientific Computing Meets Big Data Technology: An Astronomy Use Case Scientific Computing Meets Big Data Technology: An Astronomy Use Case Zhao Zhang AMPLab and BIDS UC Berkeley zhaozhang@cs.berkeley.edu In collaboration with Kyle Barbary, Frank Nothaft, Evan Sparks, Oliver

More information

EPSRC Cross-SAT Big Data Workshop: Well Sorted Materials

EPSRC Cross-SAT Big Data Workshop: Well Sorted Materials EPSRC Cross-SAT Big Data Workshop: Well Sorted Materials 5th August 2015 Contents Introduction 1 Dendrogram 2 Tree Map 3 Heat Map 4 Raw Group Data 5 For an online, interactive version of the visualisations

More information

Making the Most of Missing Values: Object Clustering with Partial Data in Astronomy

Making the Most of Missing Values: Object Clustering with Partial Data in Astronomy Astronomical Data Analysis Software and Systems XIV ASP Conference Series, Vol. XXX, 2005 P. L. Shopbell, M. C. Britton, and R. Ebert, eds. P2.1.25 Making the Most of Missing Values: Object Clustering

More information

HTRA Instrumentation I

HTRA Instrumentation I HTRA Instrumentation I Phil Charles (Univ. of Southampton) Lecture 1: 1. Introduction, basic concepts of HTRA 2. History (~60yrs) of HTRA 3. Space science developments multi-wavelength 4. Fast timing in

More information

A short history of telescopes and astronomy: Galileo to the TMT

A short history of telescopes and astronomy: Galileo to the TMT A short history of telescopes and astronomy: Galileo to the TMT Telescopes in the last 400 years Galileo 1608 Hans Lippershey applied for a patent for seeing things far away as if they were nearby 1609

More information

ASKAP Science Data Archive: Users and Requirements CSIRO ASTRONOMY AND SPACE SCIENCE (CASS)

ASKAP Science Data Archive: Users and Requirements CSIRO ASTRONOMY AND SPACE SCIENCE (CASS) ASKAP Science Data Archive: Users and Requirements CSIRO ASTRONOMY AND SPACE SCIENCE (CASS) Jessica Chapman, Data Workshop March 2013 ASKAP Science Data Archive Talk outline Data flow in brief Some radio

More information

Astronomical Data Analysis Software & Systems XVI

Astronomical Data Analysis Software & Systems XVI Astronomical Data Analysis Software & Systems XVI 15-18 October 2006 Tucson, Arizona, USA Events ADASS XVI Today Calendar Conference Schedule Meeting Program Recent News Birds of a Feather Banquet Conference

More information

Canadian Astronomy Data Centre. Séverin Gaudet David Schade Canadian Astronomy Data Centre

Canadian Astronomy Data Centre. Séverin Gaudet David Schade Canadian Astronomy Data Centre Canadian Astronomy Data Centre Séverin Gaudet David Schade Canadian Astronomy Data Centre Data Activities in Astronomy Features of the astronomy data landscape Multi-wavelength datasets are increasingly

More information

What is the Sloan Digital Sky Survey?

What is the Sloan Digital Sky Survey? What is the Sloan Digital Sky Survey? Simply put, the Sloan Digital Sky Survey is the most ambitious astronomical survey ever undertaken. The survey will map one-quarter of the entire sky in detail, determining

More information

Community Training: Partitioning Schemes in Good Shape for Federated Data Grids

Community Training: Partitioning Schemes in Good Shape for Federated Data Grids : Partitioning Schemes in Good Shape for Federated Data Grids Tobias Scholl, Richard Kuntschke, Angelika Reiser, Alfons Kemper 3rd IEEE International Conference on e-science and Grid Computing Bangalore,

More information

An ArrayLibraryforMS SQL Server

An ArrayLibraryforMS SQL Server An ArrayLibraryforMS SQL Server Scientific requirements and an implementation László Dobos 1,2 --dobos@complex.elte.hu Alex Szalay 2, José Blakeley 3, Tamás Budavári 2, István Csabai 1,2, Dragan Tomic

More information

MAST: The Mikulski Archive for Space Telescopes

MAST: The Mikulski Archive for Space Telescopes MAST: The Mikulski Archive for Space Telescopes Richard L. White Space Telescope Science Institute 2015 April 1, NRC Space Science Week/CBPSS A model for open access The NASA astrophysics data archives

More information

Bringing the Night Sky Closer: Discoveries in the Data Deluge

Bringing the Night Sky Closer: Discoveries in the Data Deluge EARTH AND ENVIRONMENT Bringing the Night Sky Closer: Discoveries in the Data Deluge Alyssa A. Goodman Harvard University Curtis G. Wong Microsoft Research Th r o u g h o u t h i s t o r y, a s t r o n

More information

Introduction to Data Mining

Introduction to Data Mining Bioinformatics Ying Liu, Ph.D. Laboratory for Bioinformatics University of Texas at Dallas Spring 2008 Introduction to Data Mining 1 Motivation: Why data mining? What is data mining? Data Mining: On what

More information

Galaxy Survey data analysis using SDSS-III as an example

Galaxy Survey data analysis using SDSS-III as an example Galaxy Survey data analysis using SDSS-III as an example Will Percival (University of Portsmouth) showing work by the BOSS galaxy clustering working group" Cosmology from Spectroscopic Galaxy Surveys"

More information

Undergraduate Studies Department of Astronomy

Undergraduate Studies Department of Astronomy WIYN 3.5-meter Telescope at Kitt Peak near Tucson, AZ Undergraduate Studies Department of Astronomy January 2014 Astronomy at Indiana University General Information The Astronomy Department at Indiana

More information

The Gaia Archive. Center Forum, Heidelberg, June 10-11, 2013. Stefan Jordan. The Gaia Archive, COSADIE Astronomical Data

The Gaia Archive. Center Forum, Heidelberg, June 10-11, 2013. Stefan Jordan. The Gaia Archive, COSADIE Astronomical Data The Gaia Archive Astronomisches Rechen-Institut am Zentrum für Astronomie der Universität Heidelberg http://www.stefan-jordan.de 1 2 Gaia 2013-2018 and beyond Progress with Gaia 3 HIPPARCOS Gaia accuracy

More information

LSST Data Management. Tim Axelrod Project Scientist - LSST Data Management. Thursday, 28 Oct 2010

LSST Data Management. Tim Axelrod Project Scientist - LSST Data Management. Thursday, 28 Oct 2010 LSST Data Management Tim Axelrod Project Scientist - LSST Data Management Thursday, 28 Oct 2010 Outline of the Presentation LSST telescope and survey Functions and architecture of the LSST data management

More information

Astronomy & Physics Resources for Middle & High School Teachers

Astronomy & Physics Resources for Middle & High School Teachers Astronomy & Physics Resources for Middle & High School Teachers Gillian Wilson http://www.faculty.ucr.edu/~gillianw/k12 A cosmologist is.... an astronomer who studies the formation and evolution of the

More information

Challenges and Solutions for Big Data in the Public Sector:

Challenges and Solutions for Big Data in the Public Sector: Challenges and Solutions for Big Data in the Public Sector: Digital Government Institute s Annual Big Data Conference, October 9, Washington, DC Reagan Building Dr. Brand Niemann Director and Senior Data

More information

The High-Redshift Universe Bologna, June 5-6

The High-Redshift Universe Bologna, June 5-6 The High-Redshift Universe Bologna, June 5-6 Why a workshop on the High-redshift Universe? To report on the state of the art in the high-redshift Universe field To combine expertise in high-redshift AGN

More information

Sanjeev Kumar. contribute

Sanjeev Kumar. contribute RESEARCH ISSUES IN DATAA MINING Sanjeev Kumar I.A.S.R.I., Library Avenue, Pusa, New Delhi-110012 sanjeevk@iasri.res.in 1. Introduction The field of data mining and knowledgee discovery is emerging as a

More information

Data Requirements from NERSC Requirements Reviews

Data Requirements from NERSC Requirements Reviews Data Requirements from NERSC Requirements Reviews Richard Gerber and Katherine Yelick Lawrence Berkeley National Laboratory Summary Department of Energy Scientists represented by the NERSC user community

More information

Overcoming the Technical and Policy Constraints That Limit Large-Scale Data Integration

Overcoming the Technical and Policy Constraints That Limit Large-Scale Data Integration Overcoming the Technical and Policy Constraints That Limit Large-Scale Data Integration Revised Proposal from The National Academies Summary An NRC-appointed committee will plan and organize a cross-disciplinary

More information

Collaborations between Official Statistics and Academia in the Era of Big Data

Collaborations between Official Statistics and Academia in the Era of Big Data Collaborations between Official Statistics and Academia in the Era of Big Data World Statistics Day October 20-21, 2015 Budapest Vijay Nair University of Michigan Past-President of ISI vnn@umich.edu What

More information

NASA s Future Missions in X-ray Astronomy

NASA s Future Missions in X-ray Astronomy NASA s Future Missions in X-ray Astronomy Nicholas E. White NASA s Goddard Space Flight Center Laboratory for High Energy Astrophysics Greenbelt, MD 20771 USA nwhite@lheapop.gsfc.nasa.gov Abstract The

More information

Indiana University Science with the WIYN One Degree Imager

Indiana University Science with the WIYN One Degree Imager Indiana University Science with the WIYN One Degree Imager Katherine Rhode (Indiana University, WIYN SAC member) Indiana University Department of Astronomy Nine faculty members, plus active emeritus faculty

More information

The Virtual Observatory in Action

The Virtual Observatory in Action The Virtual Observatory in Action VO drivers VO vision VO progress World AstroGrid VO Desktop demo Oxford erc Andy Lawrence Jan 2008 VO drivers : science science services several trends lead to science

More information

The World-Wide Telescope, an Archetype for Online Science

The World-Wide Telescope, an Archetype for Online Science The World-Wide Telescope, an Archetype for Online Science Jim Gray, Microsoft Research Alex Szalay, Johns Hopkins University June 2002 Technical Report MSR-TR-2002-75 Microsoft Research Microsoft Corporation

More information

A long time ago, people looked

A long time ago, people looked Supercool Space Tools! By Linda Hermans-Killam A long time ago, people looked into the dark night sky and wondered about the stars, meteors, comets and planets they saw. The only tools they had to study

More information

The Legacy Value of Large Public Surveys: the SDSS Archive. Alexander Szalay The Johns Hopkins University

The Legacy Value of Large Public Surveys: the SDSS Archive. Alexander Szalay The Johns Hopkins University The Legacy Value of Large Public Surveys: the SDSS Archive Alexander Szalay The Johns Hopkins University Sloan Digital Sky Survey The Cosmic Genome Project Started in 1992, finished in 2008 Data is public

More information

UNIVERSITY OF HAWAI I AT MĀNOA ASTRONOMY & ASTROPHYSICS. The University of Hawai i is an equal opportunity/affirmative action institution.

UNIVERSITY OF HAWAI I AT MĀNOA ASTRONOMY & ASTROPHYSICS. The University of Hawai i is an equal opportunity/affirmative action institution. UNIVERSITY OF HAWAI I AT MĀNOA ASTRONOMY & The University of Hawai i is an equal opportunity/affirmative action institution. A M E SSAG E F R O M THE PROGRAM U N D E R G R A D U AT E P R O G R A M S I

More information

Introduction to Data Mining and Business Intelligence Lecture 1/DMBI/IKI83403T/MTI/UI

Introduction to Data Mining and Business Intelligence Lecture 1/DMBI/IKI83403T/MTI/UI Introduction to Data Mining and Business Intelligence Lecture 1/DMBI/IKI83403T/MTI/UI Yudho Giri Sucahyo, Ph.D, CISA (yudho@cs.ui.ac.id) Faculty of Computer Science, University of Indonesia Objectives

More information

Evolution of Close Binary Systems

Evolution of Close Binary Systems Evolution of Close Binary Systems Before going on to the evolution of massive stars and supernovae II, we ll think about the evolution of close binary systems. There are many multiple star systems in the

More information

High Energy Physics (HEP) Program Status Update to the AAAC Meeting. June 1, 2015

High Energy Physics (HEP) Program Status Update to the AAAC Meeting. June 1, 2015 OFFICE OF SCIENCE High Energy Physics (HEP) Program Status Update to the AAAC Meeting June 1, 2015 Kathy Turner Program Manager, Cosmic Frontier Office of High Energy Physics Office of Science, U.S. Department

More information

STFC s Science and Technology Strategy

STFC s Science and Technology Strategy STFC s Science and Technology Strategy Director, Science Programmes Nuclear Physics and Particle Physics Town Meetings April 2008 Addressing the challenges of the twenty-first century The twenty-first

More information

HPC & Visualization. Visualization and High-Performance Computing

HPC & Visualization. Visualization and High-Performance Computing HPC & Visualization Visualization and High-Performance Computing Visualization is a critical step in gaining in-depth insight into research problems, empowering understanding that is not possible with

More information

Science Drivers for Big Data Joseph Lazio SKA Program Development Office & Jet Propulsion Laboratory, California Institute of Technology

Science Drivers for Big Data Joseph Lazio SKA Program Development Office & Jet Propulsion Laboratory, California Institute of Technology Science Drivers for Big Data Joseph Lazio SKA Program Development Office & Jet Propulsion Laboratory, California Institute of Technology 2010 California Institute of Technology. Government sponsorship

More information

EUDAT. Towards a pan-european Collaborative Data Infrastructure

EUDAT. Towards a pan-european Collaborative Data Infrastructure EUDAT Towards a pan-european Collaborative Data Infrastructure Damien Lecarpentier CSC-IT Center for Science, Finland EISCAT User Meeting, Uppsala,6 May 2013 2 Exponential growth Data trends Zettabytes

More information

CYBERINFRASTRUCTURE FRAMEWORK FOR 21 ST CENTURY SCIENCE, ENGINEERING, AND EDUCATION (CIF21)

CYBERINFRASTRUCTURE FRAMEWORK FOR 21 ST CENTURY SCIENCE, ENGINEERING, AND EDUCATION (CIF21) CYBERINFRASTRUCTURE FRAMEWORK FOR 21 ST CENTURY SCIENCE, ENGINEERING, AND EDUCATION (CIF21) Overview The Cyberinfrastructure Framework for 21 st Century Science, Engineering, and Education (CIF21) investment

More information

Exploring the roles and responsibilities of data centres and institutions in curating research data a preliminary briefing.

Exploring the roles and responsibilities of data centres and institutions in curating research data a preliminary briefing. Exploring the roles and responsibilities of data centres and institutions in curating research data a preliminary briefing. Dr Liz Lyon, UKOLN, University of Bath Introduction and Objectives UKOLN is undertaking

More information

Please note that only the German version of the Curriculum is legally binding. All other linguistic versions are provided for information only

Please note that only the German version of the Curriculum is legally binding. All other linguistic versions are provided for information only Please note that only the German version of the Curriculum is legally binding. All other linguistic versions are provided for information only Curriculum for the Erasmus Mundus Joint Master Program in

More information

Visualization of Large Multi-Dimensional Datasets

Visualization of Large Multi-Dimensional Datasets ***TITLE*** ASP Conference Series, Vol. ***VOLUME***, ***PUBLICATION YEAR*** ***EDITORS*** Visualization of Large Multi-Dimensional Datasets Joel Welling Department of Statistics, Carnegie Mellon University,

More information

SYLLABUS FORM WESTCHESTER COMMUNITY COLLEGE Valhalla, NY l0595. l. Course #:PHYSC 151 2. NAME OF ORIGINATOR /REVISOR: PAUL ROBINSON

SYLLABUS FORM WESTCHESTER COMMUNITY COLLEGE Valhalla, NY l0595. l. Course #:PHYSC 151 2. NAME OF ORIGINATOR /REVISOR: PAUL ROBINSON SYLLABUS FORM WESTCHESTER COMMUNITY COLLEGE Valhalla, NY l0595 l. Course #:PHYSC 151 2. NAME OF ORIGINATOR /REVISOR: PAUL ROBINSON NAME OF COURSE: ASTRONOMY 3. CURRENT DATE: OCTOBER 26, 2011. Please indicate

More information

Challenges in e-science: Research in a Digital World

Challenges in e-science: Research in a Digital World Challenges in e-science: Research in a Digital World Thom Dunning National Center for Supercomputing Applications National Center for Supercomputing Applications University of Illinois at Urbana-Champaign

More information

Taught Postgraduate programmes in the School of Physics and Astronomy

Taught Postgraduate programmes in the School of Physics and Astronomy Astrophysics (MSc) Theoretical Physics (MSc) Physics: Advanced Materials (MSc) Physics: Energy and the Environment (MSc) Physics: Global Security (MSc) Physics: Life Sciences (MSc) Physics: Nuclear Technology

More information

Software Development for Virtual Observatories

Software Development for Virtual Observatories Software Development for Virtual Observatories BRAVO Workshop February 2007 Rafael Santos 1 Warning! This presentation is biased. I'll talk about VO software development, including some under the hood

More information

Data centres in the. Virtual Observatory. F. Genova, IVOA Small Project meeting, September 2006 1

Data centres in the. Virtual Observatory. F. Genova, IVOA Small Project meeting, September 2006 1 Data centres in the Virtual Observatory F. Genova, IVOA Small Project meeting, September 2006 1 VO status (1) Many national projects Very different contexts/financing agencies A really world-wide, global

More information

Data Centric Systems (DCS)

Data Centric Systems (DCS) Data Centric Systems (DCS) Architecture and Solutions for High Performance Computing, Big Data and High Performance Analytics High Performance Computing with Data Centric Systems 1 Data Centric Systems

More information

165 points. Name Date Period. Column B a. Cepheid variables b. luminosity c. RR Lyrae variables d. Sagittarius e. variable stars

165 points. Name Date Period. Column B a. Cepheid variables b. luminosity c. RR Lyrae variables d. Sagittarius e. variable stars Name Date Period 30 GALAXIES AND THE UNIVERSE SECTION 30.1 The Milky Way Galaxy In your textbook, read about discovering the Milky Way. (20 points) For each item in Column A, write the letter of the matching

More information

Modeling Galaxy Formation

Modeling Galaxy Formation Galaxy Evolution is the study of how galaxies form and how they change over time. As was the case with we can not observe an individual galaxy evolve but we can observe different galaxies at various stages

More information

Good morning. It is a pleasure to be with you here today to talk about the value and promise of Big Data.

Good morning. It is a pleasure to be with you here today to talk about the value and promise of Big Data. Good morning. It is a pleasure to be with you here today to talk about the value and promise of Big Data. 1 Advances in information technologies are transforming the fabric of our society and data represent

More information

Use the following image to answer the next question. 1. Which of the following rows identifies the electrical charge on A and B shown above?

Use the following image to answer the next question. 1. Which of the following rows identifies the electrical charge on A and B shown above? Old Science 30 Physics Practice Test A on Fields and EMR Test Solutions on the Portal Site Use the following image to answer the next question 1. Which of the following rows identifies the electrical charge

More information

Is Big Data a Big Deal? What Big Data Does to Science

Is Big Data a Big Deal? What Big Data Does to Science Is Big Data a Big Deal? What Big Data Does to Science Netherlands escience Center Wilco Hazeleger Wilco Hazeleger Student @ Wageningen University and Reading University Meteorology PhD @ Utrecht University,

More information