Usage statistics and archiving process of VizieR data in the VO context



Similar documents
Organization of VizieR's Catalogs Archival

THE US NATIONAL VIRTUAL OBSERVATORY. IVOA WebServices. William O Mullane The Johns Hopkins University

Astro Runtime An API for the Virtual Observatory

taplint: TAP validator tool

Exploring Gaia data with TOPCAT and the Virtual Observatory

How To Understand And Understand The Science Of Astronomy

Data centres in the. Virtual Observatory. F. Genova, IVOA Small Project meeting, September

Introduction Recall of the Theory Group context and SimDM Accessing theoretical data The SimDAL proposal Remarks Conclu.

Proposal for Quality Assurance of Services in the VO

Lecture 5b: Data Mining. Peter Wheatley

The astronomical Virtual Observatory : lessons learnt, looking forward. Françoise Genova - Forum VO-PDC d après ADASS XXI, Paris, nov.

MultiMimsy database extractions and OAI repositories at the Museum of London

MOC HEALPix Multi-Order Coverage map Version 1.0

SITools2 as VO service provider: an example with Herschel at IDOC (Integrated Data and Operation Center)

The NOAO Science Archive and NVO Portal: Information & Guidelines

Virtual Archive as a prototype distributed data system for scientific knowledge base

Taking full advantage of the medium does also mean that publications can be updated and the changes being visible to all online readers immediately.

CoSADIE Data Centre Forum. Summary and conclusions

2007 to 2010 SharePoint Migration - Take Time to Reorganize

Multidimensional Data in the Virtual Observatory

The Virtual Observatory: What is it and how can it help me? Enrique Solano LAEFF / INTA Spanish Virtual Observatory

Structured Content: the Key to Agile. Web Experience Management. Introduction

VisIVO: data exploration of complex data

D. Briukhov, L. Kalinichenko, i D. Martynov, N. Skvortsov, S.Stupnikov, A. Vovchenko, V. Zakharov, O. Zhelenkova

How To Use Open Source Software For Library Work

Technical concepts of kopal. Tobias Steinke, Deutsche Nationalbibliothek June 11, 2007, Berlin

DRIVER Providing value-added services on top of Open Access institutional repositories

Long Term Knowledge Retention and Preservation

EUR-Lex 2012 Data Extraction using Web Services

The FAO Open Archive: Enhancing Access to FAO Publications Using International Standards and Exchange Protocols

Audit TM. The Security Auditing Component of. Out-of-the-Box

Building integration environment based on OAI-PMH protocol. Novytskyi Oleksandr Institute of Software Systems NAS Ukraine

BusinessObjects XI R2 Product Documentation Roadmap

Configuring SharePoint 2013 Document Management and Search. Scott Jamison Chief Architect & CEO Jornata scott.jamison@jornata.com

CERN Document Server

--Preliminary-- Science Data Access Architectures Mike Martin, 11/20/06

Data providers technical feedback

Archiving Systems. Uwe M. Borghoff Universität der Bundeswehr München Fakultät für Informatik Institut für Softwaretechnologie.

SAP Data Services 4.X. An Enterprise Information management Solution

VisIVO, an open source, interoperable visualization tool for the Virtual Observatory

ERwin R8 Reporting Easier Than You Think Victor Rodrigues. Session Code ED05

AN INTEGRATION APPROACH FOR THE STATISTICAL INFORMATION SYSTEM OF ISTAT USING SDMX STANDARDS

The use of file validation tools in the University of St Andrews digital archive for research data

CAE DATA & PROCESS MANAGEMENT WITH ANSA

BI xpress Product Overview

SAS BI Course Content; Introduction to DWH / BI Concepts

Metadata driven framework for the Canada Research Data Centre Network

Archive I. Metadata. 26. May 2015

EHR Interoperability Framework Overview

ICOM 6005 Database Management Systems Design. Dr. Manuel Rodríguez Martínez Electrical and Computer Engineering Department Lecture 2 August 23, 2001

Data Warehouses in the Path from Databases to Archives

Best Practices for Structural Metadata Version 1 Yale University Library June 1, 2008

Ex Libris Rosetta: A Digital Preservation System Product Description

Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya

EuroPlaNet-RI / EuroPlaNet-Table Access Protocol

Oracle Warehouse Builder 10g

EPrints Preservation Update

DSpace: An Institutional Repository from the MIT Libraries and Hewlett Packard Laboratories

THE BRITISH LIBRARY. Unlocking The Value. The British Library s Collection Metadata Strategy Page 1 of 8

IVOA Interop Meeting Kyoto May VO Web Services Basic Profile. (reference document VO-WS-Basic-Profile-0.21)

SQL Server 2012 Business Intelligence Boot Camp

Creating an Enterprise Reporting Bus with SAP BusinessObjects

Instrument Location Service User Manual Version 0.1

Analysing log files. Yue Mao Supervisor: Dr Hussein Suleman, Kyle Williams, Gina Paihama. University of Cape Town

Logi Ad Hoc Reporting System Administration Guide

FreeForm Designer. Phone: Fax: POB 8792, Natanya, Israel Document2

Implementing an Integrated Digital Asset Management System: FEDORA and OAIS in Context

In ediscovery and Litigation Support Repositories MPeterson, June 2009

Release 2.1 of SAS Add-In for Microsoft Office Bringing Microsoft PowerPoint into the Mix ABSTRACT INTRODUCTION Data Access

1. Put your Webapp in the Cloud 2. Virtual Observatory 3. Web Services

METADATA-DRIVEN QLIKVIEW APPLICATIONS AND POWERFUL DATA INTEGRATION WITH QLIKVIEW EXPRESSOR

Research Data Management Guide

CADC and CANFAR: Extending the role of the data centre. Séverin Gaudet Canadian Astronomy Data Centre

Service Oriented Architecture

Digital Preservation. OAIS Reference Model

Chapter 3. Database Environment - Objectives. Multi-user DBMS Architectures. Teleprocessing. File-Server

Automated Workflow for the Ingest and Preservation of Electronic Journals. Evan Owens Chief Technology Officer Portico

Functional Requirements for Digital Asset Management Project version /30/2006

Beginning ASP.NET 4.5

Search and Real-Time Analytics on Big Data

Storage Virtualisation in the Cloud

Karl Lum Partner, LabKey Software Evolution of Connectivity in LabKey Server

Two new DB2 Web Query options expand Microsoft integration As printed in the September 2009 edition of the IBM Systems Magazine

Preserving digital data - risk assessment and digital preservation strategies

SIF 3: A NEW BEGINNING

The Czech Digital Library and Tools for the Management of Complex Digitization Processes

B SVF - Bavaria Long Term Preservation

Draft Response for delivering DITA.xml.org DITAweb. Written by Mark Poston, Senior Technical Consultant, Mekon Ltd.

Improving the visualisation of statistics: The use of SDMX as input for dynamic charts on the ECB website

<Insert Picture Here> Oracle SQL Developer 3.0: Overview and New Features

Invenio: A Modern Digital Library for Grey Literature

Microsoft SQL Server 2012: What to Expect

Islandora: An Open Source Institutional Repository Solution. Consortium of MnPALS Libraries Annual Meeting April 2014

FUSE-ESB4 An open-source OSGi based platform for EAI and SOA

Wave Analytics Platform Setup Guide

Oracle BI 11g R1: Build Repositories

OVERVIEW OF JPSEARCH: A STANDARD FOR IMAGE SEARCH AND RETRIEVAL

The Institutional Repository at West Virginia University Libraries: Resources for Effective Promotion

Tools and Services for the Long Term Preservation and Access of Digital Archives

Hypertable Architecture Overview

Transcription:

Usage statistics and archiving process of VizieR data in the VO context

VO Implementation status VO Implementation status Application VOTable (1.1 1.2 1.3) Semantic Data Access Layer Data Model MOC SAMP UCD1 UCD1+ Simple Cone Search TAP 1.1 SIA (V1) SSA (V1) ObsTAP Photometry Model (IVOA note) ObsCore Available through Aladin and to query tables using a MOC Need some arrangements to facilitate client access! beta release planned before the end of the year Providing Photometric Data Measurements Description in VOTables (S.Derriere) mandatory items only VizieR catalogues and the TAP VizieR service are in registries IVOA 2015 (VizieR) - Gilles Landais 2/10

VO Output statistics VOTable output statistics Output type evolution CDS Xmatch API Number of queries per mounth (log) 1,00E+08 1,00E+07 1,00E+06 1,00E+05 1,00E+04 1,00E+03 1,00E+02 VOTable TSV HTML 700 IP/day Number queries average (VOTable) 2014 ~410,000 queries/day 2015 ~190,000 queries/day 92% of VOTable output comes from Simple Cone Search queries Date Output type repartition 35000000 Cone search in VOTable output 100% 90% 80% 70% 60% 50% 40% 30% 20% 10% 0% HTML TSV VOTable 30000000 Cone Search 25000000 VOTable 20000000 15000000 10000000 5000000 0 2014/03 2014/07 2014/11 2015/03 2014/01 2014/05 2014/09 2015/01 2015/05 Date IVOA 2015 (VizieR) - Gilles Landais 3/10

VO Output statistics TAPVizieR statistics TAPVizier contains all VizieR tables (except obsolete catalogues) Statistics 2015, without bots and registry queries (~26,000 queries/day): Num ber of queries 8000 7000 6000 5000 4000 3000 2000 1000 sync Number of TAP queries per mounth async 10 IP/day ~150 queries/day 7 10% queries/day contain ADQL geometrical functions 0 Need some arrangements to facilitate client access! Difficulties to work with TAPVizieR, because: Custom use of schema due to the the important number of columns Columns/tables names which need quotes. To provide quoted names in TAP_SCHEMA and VOSI output? IVOA 2015 (VizieR) - Gilles Landais 4/10

The VO visibility The VO visibility The VO is an adapted framework to provide data in the preservation context VO standards (protocols, formats, registries (OAI-PMH)) guaranty reusable data Matches with definition of the Access layer of OAIS (Open Archive Information system) OAIS architecture VO framework IVOA 2015 (VizieR) - Gilles Landais 5/10

The VizieR contents Assigning UCD The most popular UCD in VizieR Usage number CDS documentalists pay particular attention to UCD attribution A distribution with a long tail Main UCD (position, magnitude) are well assigned But, important usage of generic UCD and sometimes not optimum 50% columns 40000 35000 30000 25000 20000 15000 10000 5000 0 The difficulty to have a perfect matching! 75% UCD attribution repartition 90% sorted by ranking and restricted to the 500 most popular UCD rank others stat.error meta.record meta.id;meta.main meta.note meta.id pos.eq.ra;meta.main pos.eq.dec;meta.main meta.code.error meta.code meta.number meta.ref.url time.epoch phot.mag;em.opt.v phys.abund stat.fit.param spect.line.eqwidth meta.ref phot.mag;em.opt.b phot.mag;em.opt.i meta.ref;pos.frame others 434,867 columns 3350 different UCD 6,5% columns with no UCD IVOA 2015 (VizieR) - Gilles Landais 6/10

The VizieR contents The UCD assignment CDS documentalists set UCD1 for each columns using a UCD1 builder UCD1+ is constructed from UCD1 and other meta-data The reason to assign UCD1: The Simple Cone Search needs UCD1 for main positions Easier to work with a simple and restricted list than to construct UCD1+ UCD1 photometry is used to describe filters of the magnitude columns example: UCD1 Filter PHOT_JHN_B PHOT_COUS_I PHOT_HST_F170W PHOT_WLRV_W Johnson, B Cousins I HST/WFPC2, F814W Walraven, W IVOA 2015 (VizieR) - Gilles Landais 7/10

Providing and preserving data for the VO To provide and preserve through the VO Data preservation : the original data in input containing meta-data (ex: FITS) and the data provided (ex: VOTable) Author, journals ASCII table, FITS?VOTable ASCII, FITS, HTML VOTable IVOA 2015 (VizieR) - Gilles Landais 8/10

Providing and preserving data for the VO To accept Votable in input? Currently no VOTable are stored in the VizieR repository FITS headers are not well standardized VOTable containing rich metadata could improve the pipelines To provide the VOTable in their original format using SIA, SSA To promote VO standards in input VizieR encourage today the space agencies to provide VO standards for the VizieR logs pipeline (B/* catalogues updated weekly..) Nextly, VOTable will be soon accepted in VizieR for Associated data (spectra/time series/images using Saada and indexed with ObsCore) Note: Saada extract the metadata from simple VOTable IVOA 2015 (VizieR) - Gilles Landais 9/10

Providing and preserving data for the VO Maintenance/actions needed to preserve and provide rich meta-data Adding new mandatory items in output DM requires significant efforts for CDS (documentalists, engineer) to search the informations. In particular for old catalogue So Please, we would like to have the possibility of notset or noinfo In Input, rich meta-data in VOTable (DML, utype or other): Needs libraries for authors/instruments for the VOTable generation Need to continue to provide classicals header for current clients Increases the maintenance for preservation because: The evolution of the VO standards Obsolete VOTable needs migration which includes obsolete search processing, format migration and search adding information IVOA 2015 (VizieR) - Gilles Landais 10/10