Montage: Architecture and Applications of an Astronomical Image Mosaic Engine

Similar documents
Creating A Galactic Plane Atlas With Amazon Web Services

MONTAGE: A GRID ENABLED ENGINE FOR DELIVERING CUSTOM SCIENCE-GRADE MOSAICS ON DEMAND

Software challenges in the implementation of large surveys: the case of J-PAS

MONTAGE An Astronomical Image Mosaic Service for the National Virtual Observatory

Montage: a grid portal and software toolkit for science-grade astronomical image mosaicking

Efficient data reduction and analysis of DECam images using multicore architecture Poor man s approach to Big data

The Virtual Observatory: What is it and how can it help me? Enrique Solano LAEFF / INTA Spanish Virtual Observatory

Scientific Computing Meets Big Data Technology: An Astronomy Use Case

The Cost of Doing Science on the Cloud: The Montage Example

Einstein Rings: Nature s Gravitational Lenses

Using Photometric Data to Derive an HR Diagram for a Star Cluster

Libraries and Large Data

A Preliminary Summary of The VLA Sky Survey

Data Sharing Options for Scientific Workflows on Amazon EC2

Data Mining Challenges and Opportunities in Astronomy

HOW WILL ASTRONOMY ARCHIVES SURVIVE THE DATA TSUNAMI?

Migrating a (Large) Science Database to the Cloud

LSST Resources for Data Analysis

Description of the Dark Energy Survey for Astronomers

ASKAP Science Data Archive: Users and Requirements CSIRO ASTRONOMY AND SPACE SCIENCE (CASS)

StruxureWare TM Center Expert. Data

The Application of Cloud Computing to Scientific Workflows: A Study of Cost and Performance

MAST: The Mikulski Archive for Space Telescopes

What is the Sloan Digital Sky Survey?

Astro 301/ Fall 2005 (48310) Introduction to Astronomy

AST 4723 Lab 3: Data Archives and Image Viewers

Visualization of Large Multi-Dimensional Datasets

Enabling Science in the Cloud: A Remote Sensing Data Processing Service for Environmental Science Analysis

StruxureWare TM Data Center Expert

The Status of Astronomical Control Systems. Steve Scott OVRO

Data Pipelines & Archives for Large Surveys. Peter Nugent (LBNL)

The Astronomical Data Warehouse

Indiana University Science with the WIYN One Degree Imager

Canadian Astronomy Data Centre. Séverin Gaudet David Schade Canadian Astronomy Data Centre

Introduction to LSST Data Management. Jeffrey Kantor Data Management Project Manager

Cloud Computing. Lecture 5 Grid Case Studies

Astrophysics with Terabyte Datasets. Alex Szalay, JHU and Jim Gray, Microsoft Research

Observing the Universe

Summary of Data Management Principles Dark Energy Survey V2.1, 7/16/15

MONTAGE An Astronomical Image Mosaic Service for the National Virtual Observatory

The Application of Cloud Computing to Astronomy: A Study of Cost and Performance

Chapter 6 Telescopes: Portals of Discovery. How does your eye form an image? Refraction. Example: Refraction at Sunset.

Reduced data products in the ESO Phase 3 archive (Status: 15 May 2015)

Data Management Plan Extended Baryon Oscillation Spectroscopic Survey

Product: Order Delivery Tracking

165 points. Name Date Period. Column B a. Cepheid variables b. luminosity c. RR Lyrae variables d. Sagittarius e. variable stars

How To Process Data From A Casu.Com Computer System

The application of cloud computing to scientific workflows: a study of cost and performance

Challenges of Managing Scientific Workflows in High-Throughput and High- Performance Computing Environments

Science Data Center Overview

PISGAH ASTRONOMICAL RESEARCH INSTITUTE AND ASTRONOMICAL PHOTOGRAPHIC DATA ARCHIVE.

Galaxy Morphological Classification

COMPARISON OF GAS AND DUST COOLING RATES IN NEARBY GALAXIES

Spitzer Space Telescope proposal process

Chapter 15.3 Galaxy Evolution

Enterprise Architectures for Large Tiled Basemap Projects. Tommy Fauvell

Autodesk Inventor on the Macintosh

Silviu Panica, Marian Neagul, Daniela Zaharie and Dana Petcu (Romania)

LSST Data Management. Tim Axelrod Project Scientist - LSST Data Management. Thursday, 28 Oct 2010

On the Use of Cloud Computing for Scientific Workflows

Class 2 Solar System Characteristics Formation Exosolar Planets

Science Drivers for Big Data Joseph Lazio SKA Program Development Office & Jet Propulsion Laboratory, California Institute of Technology

Data analysis of L2-L3 products

Undergraduate Studies Department of Astronomy

The Virtual Grid Application Development Software (VGrADS) Project

Resultados Concurso Apex 2014-A

The Application of Cloud Computing to the Creation of Image Mosaics and Management of Their Provenance

Astronomical Data Analysis Software & Systems XVI

MANAGING AND MINING THE LSST DATA SETS

Science Standard 4 Earth in Space Grade Level Expectations

Cross-Matching Very Large Datasets

DataGrids 2.0 irods - A Second Generation Data Cyberinfrastructure. Arcot (RAJA) Rajasekar DICE/SDSC/UCSD

First Discoveries. Asteroids

ALMA Overview. Leonardo Testi (European Southern Observatory)

Parallel and Distributed Astronomical Data Analysis on Grid Datafarm

The PACS Software System. (A high level overview) Prepared by : E. Wieprecht, J.Schreiber, U.Klaas November, Issue 1.

Software Development for Virtual Observatories

Lecture 5b: Data Mining. Peter Wheatley

An ArrayLibraryforMS SQL Server

Globus Striped GridFTP Framework and Server. Raj Kettimuthu, ANL and U. Chicago

J-PAS: low-resolution (R 50) spectroscopy over 8000 deg 2

How can new technologies can be of service to astronomy? Community effort

MY FIRST STEPS IN SLIT SPECTROSCOPY

Rest optical spectroscopy of FeLoBALs and LoBALs at intermediate redshift

Connecting Scientific Data to Scientific Experiments with Provenance

SURVEY ON THE ALGORITHMS FOR WORKFLOW PLANNING AND EXECUTION

Data Centre Alliance - Science

Redmine: A project management software tool. January, 2013

Modeling Galaxy Formation

IBM Tivoli Composite Application Manager for WebSphere

Bringing the Night Sky Closer: Discoveries in the Data Deluge

GridSolve: : A Seamless Bridge Between the Standard Programming Interfaces and Remote Resources

Galaxy Classification and Evolution

Towards the Detection and Characterization of Smaller Transiting Planets

Commentary on Techniques for Massive- Data Machine Learning in Astronomy

Association of Universities for Research in Astronomy

Learning from Big Data in

Desktop and Professional Editions

Exploring Multiwavelength AGN Variability with Swift Archival Data

Big Data Visualization on the MIC

Transcription:

Montage: Architecture and Applications of an Astronomical Image Mosaic Engine ESTO http://montage.ipac.caltech.edu/ Bruce Berriman, John Good and Anastasia Laity IPAC Joseph C. Jacob and Daniel S. Katz JPL Ewa Deelman, Gurmeet Singh and Mei-Hui Su ISI Thomas Prince Caltech

Science Drivers for Montage Imaging surveys are addressing fundamental questions in astronomy Montage overcomes limitations in maximizing scientific return from these image datasets Many astrophysical structures are extended and structure on the sky contaminated by variable background radiation from instrument or sky Survey results are published in widely varying coordinates, map projections, sizes and spatial resolutions => limits study of structures and sources over multiple wavelengths Montage delivers science grade image mosaics Preserves astrometry and flux of input images Delivers mosaic according to users specifications of projection, coordinates, spatial sampling, mosaic size, image rotation Rectifies background radiation to a common level SDSS DPOSS 2MASS IRAS

Montage Design Drivers Major design drivers Support generation of data products by incorporating Montage into pipelines & processing environments Support variety of hardware architectures used by astronomers: single processor; a cluster of multiple processors with a shared file system; multiple clusters, each with a shared file system; grid processing Support on-request image mosaic requests submitted through a web interface Montage Architecture Stand-alone core processing modules for performing steps in developing image mosaic Reprojection of input images; rectification of background radiation to a common level; co-addition of rectified, reprojected images to form mosaic Utilities for managing mosaics - tiling, creating browse products, Processing flow controlled by calls to simple executives Code written in ANSI C for performance and portability

Design Drivers cont Montage Architecture - Parallel Technologies Message Passing Interface (MPI) Grid Technology Best performance Requires a set of processors with a shared file system Very good performance Built-in fault tolerance Can use multiple sets of processors Wall Clock Times to Build 6 x 6 Deg mosaic for MPI and Grid 6 x 6 degree 2MASS Mosaic, NCSA TeraGrid Pegasus Portal

Applications of Montage - I Generation of Science Data Products NEW NEW Visible and Infrared Survey Telescope for Astronomy (VISTA) (under evaluation) Spitzer Space Telescope Legacy Projects NEW SWIRE - trace the evolution of dusty, star-forming galaxies, evolved stellar populations, and active galactic nuclei to redshifts of z ~ 3 GLIMPSE and MIPSGAL - infrared surveys of galactic plane to study global history of star formation and energetics of interstellar medium COSMOS Hubble Treasury Program Multi-wavelength studies of the large scale distribution of matter in the Universe The INT/WFC Photometric H-alpha Survey (IPHAS) of the Northern Galactic Plane Deep survey in the red (Sloan R and I bands) National Virtual Observatory All-Sky 2MASS mosaic

Applications of Montage -II Incorporation into Science Processing and Data Access Environments NEW Spitzer Space Telescope Outreach Generating as educational products distributed through the Cool Cosmos website Leverages Montage s capabilities of generating mosaics in uncommon projections, often the best one for E/PO products IRSA (NASA s InfraRed Science Archive) Supports services for accessing all-sky image surveys Includes creation of browse products, image cutout services NSF National Virtual Observatory (NVO) Atlasmaker project Hyperatlas grid, a standard set of image parameters for displaying image data sets originally in different projections, coordinates,... Leveraging Montage to deliver tools that will generate map images showing spatial coverage of astronomical data sets UK Astrogrid Virtual Observatory Montage adopted as a key application for end users to generate custom mosaics

Applications - VISTA Visible and Infrared Survey Telescope for Astronomy (2007) 4-m class imaging survey telescope in Chile 67-million pixel camera (1.6 deg FoV) Evaluation of Montage Six stepped exposures fill a "Tile" with at least two exposures Automated pipeline processes each "pawprint", Montage is under evaluation as part of postpipeline backend to stitch the pawprints together to create the tile Compute the image geometry to high precision and co-add the individual images Compensate for variability in sky conditions Reprojection of supporting observations for quality assurance Field of View of the 16 2048 x 2048 arrays Observing Mode

GLIMPSE and MIPSGAL: Complementary Surveys of the Galactic Plane Survey Instrument Bands (μm) Field-of-View (arcmin) GLIMPSE IRAC 3.5, 4.5, 5.8, 8.0 5.2. x 5.2 MIPSGAL MIPS 24 5.4 x 5.4 70 5.25 x 2.6 160 0.5 x 0.5 Generate multi-wavelength images of the Galactic as a resource for studying star formation - process IRAC, MIPS, Midcourse Space Experiment (8.3, 12.1, 14.6 and 21.3 μm) and 2MASS (1.2, 1.6 and 2.2 μm) Three-color section of the Galactic Plane measured by the IRAC

Applications -2MASS All Sky Mosaic Why Build An All-Sky Mosaic? Value-added 2MASS product combines overlapping images and includes deep survey data Demonstrate applicability of computing at scale to astronomical image data sets pathfinder for future missions Demonstrate management of data sets at scale Shakedown TeraGrid hardware at SDSC Input : 4,121,440 files, 2 MB in size (32-bits) Output 1734 x 4 GB plates 6 deg on a side for 3 bands (64 bits) Each plate tiled into a 12x12 array of 26-MB files The total size is about 20 TB in 750,000 tiles. Processing Statistics MPI Python module managed the process 13,000 CPU hours Files on mosaic boundaries opened multiple times 14 TB in 6,275,494 files opened Below: Section of 3-color 2MASS mosaic along the Galactic plane

Applications - All Sky Image Access http://irsa.ipac.caltech.edu/applications/finderchart IRSA s Finder Chart Service Creates on-demand 3-color images of three major sky surveys - SDSS, 2MASS, DSS Quick Look Quality Assurance of fields astronomers are observing Are moving objects real or artificial? Are objects of unusual color inherent in the object or the field it is in? Left: 3-color images of the the Galaxy NGC 5584 in the 2MASS and DSS surveys

An On-Request Compute Service Enter mosaic request Monitor job status View messages View and download mosaic

On-Request Service Architecture IPAC TeraGrid

Request Handling Request Management Environment (ROME). Developed at IPAC for the NVO Enterprise Java Beans (EJBs) Accept processing requests from users (via servelets) Manage processing queues Ensure that resources distribute processing among users Handles monitoring and user notification

Process Plan Generation Montage application implemented via a Condor DAG (Directed Acyclic Graph) Generate an abstract DAG Process input parameters, set up storage space Query resources to discover images needed Pegasus processes abstract DAG into a Condor-specific DAG targeting TeraGrid resources. Pegasus in turn utilizes the Globus Replication Location Service (RLS) to find physical location of file resources Custom to Montage

TeraGrid Processing TeraGrid tools. This environment processes the mosaic Marshals compute resources and intermediate file storage space, Handles errors by rescheduling subtasks, reports results back through the submitting machine to ROME and thence to the user. Uses copy of the 2MASS data stored at SDSC in a distributed file system (/gpfs-wan) as input Places results in a URL-accessible location within SDSC s Storage Resource Broker (SRB) system. Flexible design - backend can be, e.g., a Condor pool IPAC installing 20-machine cluster of Dell Dual Core 2-processor 3.2 GHz Xeon compute servers at IPAC.

Science Result - Verification of a Bar in the Spiral Galaxy M31 Beaton et al. Ap J Lett in press Montage used to generate mosaic of M31 galaxy Background rectification brings out boxy bulge - signature of bar

Summary Use of Montage in astronomy is growing Generation of science products Performing scientific research Performing pathfinder studies in demonstrating processing of images at scale On-request service under evaluation by astronomers and will be publicly deployed in Fall 2006 To learn more, visit the project web site at http://montage.ipac.caltech.edu/ Right: 3-color mosaic of the star formation region NGC 6357 War and Peace Nebula Combines MSX and 2MASS data sets; 2.5 degree square mosaic - 9,000 pixels on a side