International Data Sharing Framework Including ICSU World Data System
Dr. Yasuhiro Murayama
ICSU-WDS Scientific Committee ex officio
Member of Cabinet Office Expert Panel of Open Science
Associate member, Science Council of Japan
Director, Integrated Science Data Systems Research Lab., Natl. Inst. Information & Communications Technology
WDS International Programme Office: hosted by NICT, based in Tokyo, Japan
Today's Talk
Open Research Data (International Political? Landscape)
International Coordination of Data Sharing (WDS)
A view of research information management
Example: Practices in Japan
Open Research Data in the International Community
G8 2013: Science Ministers' Agreement on Open Research Data; Open Government Data
Society and Science: scholarly papers and data
Conventional scientific method: Question, Hypothesis, Experiment/Observation, Analysis/Discussion, Conclusion
Outputs: Data, Paper
Who does it? Immediately? Mandatory?
General society / political decision making
Community consensus forming: open discussion and criticism, shared in the research community
An article is not sufficient for validating results: reproducibility issue, research integrity issue
Data as first-class research output
Social information asset, provided to the general society
Essential for irreproducible natural phenomena: global change, space, living organisms, health...
Mutual trust between Science and Society
Landscape of WDS and Open Science/Open Data (from my viewpoint)
[Diagram] Research fields: Earth, Space, Physics, Informatics, Seismology; 2008-; Earth Science, Space Sci., Computer Sci., Physics; WDS-IPO; Earth Sci., Atmosphere, Ionosphere; WDC Ionosphere, Space Weather; Linguistics, History, Psychology, Social Science, etc. (91 Member Bodies)
[Diagram] Japanese bodies: Science Council of Japan; Cabinet Office of Japan, Council for Sci. Tech. & Innovation; Ministry of Education, Culture, Sports, Science and Technology (MEXT); Japan Sci. & Tech Agency; GEOSS/DIAS (U Tokyo, JAXA, JAMSTEC, NIES, etc.); MOU; MOU
[Diagram] International initiatives: Future Earth (ICSU, UNESCO, UNEP, UNU, Belmont Forum, ...); RDA (Research Data Alliance); G8 Science Minister Meeting (2013.6); OECD Open Science WG (2012-)
International Coordination of Data Sharing
Creation of ICSU-World Data System
ICSU 29th General Assembly decision (October 28, 2008)
PAST (since 1950s):
WDC (World Data Center): up to 50 WDCs
FAGS (Federation of Astronomical and Geophysical Data Analysis Services)
PRESENT (2008~):
ICSU International Scientific Unions' data bodies
ICSU National Members' data bodies
ICSU Interdisciplinary Bodies' data activities
92 Members (June 2015):
60 Regular: data curation & data analysis services
10 Network: networks of Regular Members & umbrella organizations
4 Partner: do not deal directly with data stewardship, but support ICSU-WDS
18 Associate: organizations interested in the WDS endeavour
Inauguration of WDS Intl. Programme Office
Opening Ceremony of WDS Intl. Programme Office (May 2012): NICT President Prof. Miyahara; ICSU President Prof. Y.T. Lee (2012); SCJ former Vice President Prof. Doi; Minister of Internal Affairs & Communications; Vice Minister of MEXT; SCJ President Prof. Ohnishi
ICSU (Intl. Council for Science) established ICSU-WDS in October 2008.
Science Council of Japan (SCJ) cooperated in hosting the WDS Intl. Programme Office at NICT, Japan.
WDS-IPO Executive Director: Dr. Mustapha Mokrane
Scientific Committee [Y. Murayama, 2012] [F. Kasuga, 2013]
ICSU-WDS Objectives
Enable universal and equitable access to quality-assured multi-disciplinary scientific data;
Ensure long-term data preservation/stewardship;
Foster compliance with agreed-upon data standards and conventions;
Provide mechanisms to facilitate and improve access to data and data products.
In harmony with GEOSS open data policy (to which ICSU/CODATA contributed)
Examples of Implementing Actions
WDS-RDA Data Publication Working Group
Global Registry for Trusted Digital Services
DSA-WDS Certification Working Group
WDS Metadata Catalogue, WDS Knowledge Network
Support for other international programmes: Future Earth, Urban Health, etc.
ICSU requires that Future Earth be supported by WDS and CODATA regarding research data management.
Fitness for use [M. Mokrane, M. Diepenbroek, 2014]
The Long Tail
Managed & published data
Large-scale monitoring, computed data, and disciplinary data centers
Somewhat managed & open-access data
Unmanaged & non-published
Data from individual scientists, labs, or smaller projects
Total volume of scientific
Research Data Alliance
New initiative that started in 2013.
Kicked off by the G8+O6 data infrastructure WG.
So far supported by the USA, EU, and Australia.
For data sharing: to form community consensus [RDA secretariat, 2015]
http://www.icsu-wds.org/community/webinars/webinar-2/rdawdspublishingdataigwebinarintro.pdf
A view of research information management
Data Sharing, the Informatics Way: DOI (Digital Object Identifier) for Research Data [Adapted from Hideaki Takeda (2015)]
Data Life Cycle, Its Steps, and Its Stakeholders
[Diagram] Stakeholders: Research Institute, Library, Research Project, Researchers
[Diagram] Managed items: Identifiers, Metadata, Domain Metadata, Domain Science Contents
[Diagram] Steps: Produce, Register, Revise, Preserve, Publish, Discard
Example of Metadata for Research Data DOI [Adapted from H. Takeda (2015)]
ITEM               FIELD NAME         DESCRIPTION
DOI                DOI                DOI
URL                URL                URL
Title              Title              Data title
Subject            Subject            Subject, keywords, class, etc.
Creator            Creator            Names of data producers
Affiliation        Affiliation        Affiliation of Creators
Researcher ID      Researcher ID      Person ID such as ORCID, e-Rad, KAKEN, etc.
Publication Date   Publication Date   Year, month, day of data publication
Data Publisher     Publisher          Data Publisher (research institute, university, etc.)
Contributor        Contributor        Data manager, product manager, funding agency, etc.
Edition            Edition            Variation (publisher ver., authors ver., etc.)
Version (1.0, 2.1, etc.)
Format (file format)
... and others: signature, related-object information, location information, etc.
Management information for all scientific domains (similar to bibliographic (book-management) metadata)
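The metadata items listed on this slide could be held in a small record type. The following is a minimal sketch, not the schema used by JaLC or any registration agency: the Python attribute names, the choice of which fields count as mandatory, and the example URL, creator, publisher, and date values are all assumptions made for illustration. Only the DOI and the title are taken from the slides.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class DatasetMetadata:
    """One metadata record; attribute names loosely follow the slide's items."""
    doi: str
    url: str
    title: str
    creator: List[str]
    publisher: str
    publication_date: str  # e.g. "YYYY-MM-DD"
    subject: List[str] = field(default_factory=list)
    affiliation: List[str] = field(default_factory=list)
    researcher_id: List[str] = field(default_factory=list)  # e.g. ORCID
    contributor: List[str] = field(default_factory=list)
    edition: Optional[str] = None
    version: Optional[str] = None
    data_format: Optional[str] = None

    def missing_fields(self) -> List[str]:
        """Return the names of assumed-mandatory fields that are empty."""
        # Assumption: these six fields are treated as mandatory in this sketch.
        required = ("doi", "url", "title", "creator",
                    "publisher", "publication_date")
        return [name for name in required if not getattr(self, name)]


record = DatasetMetadata(
    doi="10.17591/55838dbd6c0ad",              # DOI shown in the slides
    url="https://example.org/landing-page",    # hypothetical landing page
    title=("Mesospheric wind velocity data (30min. mean) observed with "
           "MF radar at Poker Flat, Alaska"),
    creator=["(data producer name)"],          # placeholder value
    publisher="(data publisher)",              # placeholder value
    publication_date="2015-01-01",             # placeholder value
    version="1.0",
)
print(record.missing_fields())
```

An empty list from `missing_fields()` means every assumed-mandatory field has a value; a real registration workflow would apply whatever rules its registration agency defines.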
Example: Practices in Japan
Japanese Metadata Framework
IUGONET (inter-univ. upper atmos. obs. network)
IUGONET Metadata Database and search system
MD Schema Extension for Ground-based Observations
MD Schema Partnership
EU-ESPAS Project: Ontology & MD
NASA's Metadata Schema: SPASE
Japan Link Center's Experimental Project for minting/registration of dataset DOIs
DOI (Digital Object Identifier) is for citing an object digitally.
DOI is managed by the Intl. DOI Foundation (IDF).
A DOI can be given only by a Registration Agency (RA) under IDF.
Japan Link Center (JaLC) is the only RA in Japan.
JaLC experiment project: Japan's first attempt to register dataset DOIs.
Project steering committee: universities, natl. research institutes, & NDL.
[Diagram] Japanese Data Centers; Assign doi:10.xxxx (DOI prefix); Web Interface; Register DOI-URL mapping; DOI System; 9 RAs [Nose et al., 2014]
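The two operations named in the diagram, assigning a DOI prefix and registering a DOI-URL mapping, can be illustrated with a toy in-memory registry. This is a sketch under stated assumptions: `ToyRegistrationAgency`, its method names, and the example URL and data-center name are hypothetical and do not represent JaLC's or the DOI System's actual interface.

```python
from typing import Dict, Tuple


def split_doi(doi: str) -> Tuple[str, str]:
    """Split 'doi:10.17591/abc' or '10.17591/abc' into (prefix, suffix)."""
    name = doi.strip()
    if name.lower().startswith("doi:"):
        name = name[4:]
    prefix, sep, suffix = name.partition("/")
    if not (prefix.startswith("10.") and sep and suffix):
        raise ValueError(f"not a DOI name: {doi!r}")
    return prefix, suffix


class ToyRegistrationAgency:
    """In-memory stand-in for a registration agency (hypothetical)."""

    def __init__(self) -> None:
        self._prefix_owner: Dict[str, str] = {}  # DOI prefix -> data center
        self._doi_to_url: Dict[str, str] = {}    # DOI name -> landing page URL

    def assign_prefix(self, prefix: str, data_center: str) -> None:
        """Record which data center may mint DOIs under this prefix."""
        self._prefix_owner[prefix] = data_center

    def register(self, doi: str, url: str) -> str:
        """Store the DOI-URL mapping; the prefix must be assigned first."""
        prefix, suffix = split_doi(doi)
        if prefix not in self._prefix_owner:
            raise KeyError(f"prefix {prefix} has not been assigned")
        name = f"{prefix}/{suffix}"
        self._doi_to_url[name] = url
        return name

    def resolve(self, doi: str) -> str:
        """Look up the landing page URL registered for a DOI."""
        prefix, suffix = split_doi(doi)
        return self._doi_to_url[f"{prefix}/{suffix}"]


ra = ToyRegistrationAgency()
ra.assign_prefix("10.17591", "Example Data Center")  # hypothetical owner
ra.register("doi:10.17591/55838dbd6c0ad",
            "https://example.org/landing/55838dbd6c0ad")  # hypothetical URL
print(ra.resolve("10.17591/55838dbd6c0ad"))
```

Keeping the mapping separate from the dataset itself is the point of the design: the landing page URL can change while the DOI cited in papers stays the same.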
Example of DOI-minting to Earth Science database [Modified from Nose et al. (2013)]
First Data-DOI Registration by Japanese Platform
Mesospheric wind velocity data (30min. mean) observed with MF radar at Poker Flat, Alaska
doi:10.17591/55838dbd6c0ad
Digital data: http://www2.nict.go.jp/isd/doilandingpage/wds/10.17591 5 5838dbd6c0ad.html
Data plot
Landing Page: data description, data format, link to data, etc.
Data citation: cited by Kinoshita, T., Y. Murayama, and S. Kawamura (2015), Tidal modulations of mesospheric gravity wave kinetic energy observed with MF radar at Poker Flat Research Range, Alaska, J. Geophys. Res., 120,
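A DOI written in the `doi:` form can be turned into a clickable link through the public resolver at doi.org. The helper below is a minimal sketch; the function name `doi_resolver_url` is an assumption, and the code only builds the URL without contacting the resolver.

```python
from urllib.parse import quote


def doi_resolver_url(doi: str) -> str:
    """Build a https://doi.org/ link from a DOI given in a common notation."""
    name = doi.strip()
    # Strip a leading 'doi:' label or an existing resolver address, if present.
    for lead in ("doi:", "https://doi.org/", "http://dx.doi.org/"):
        if name.lower().startswith(lead):
            name = name[len(lead):]
            break
    if not name.startswith("10.") or "/" not in name:
        raise ValueError(f"not a DOI name: {doi!r}")
    # Percent-encode characters that are unsafe in a URL path, keeping '/'.
    return "https://doi.org/" + quote(name, safe="/")


print(doi_resolver_url("doi:10.17591/55838dbd6c0ad"))
# prints https://doi.org/10.17591/55838dbd6c0ad
```

Citing the dataset by this link rather than by the landing page address keeps the citation valid if the data center later moves the page.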
Two WDS Members in Japan
Successful practices of world-wide data sharing since 1957-58:
World Data Center for Geomagnetism, Kyoto
World Data Center for Ionosphere and Space Weather
Now the data-sharing style is to be renewed. With metadata (MD) brokering? MD catalogues? Semantic approach?...
Big data of New Met. Satellites: Himawari-8
Japan Met. Agency (JMA) operates the new geostationary met. satellite Himawari-8.
Data size will be 150 TB per year, 50 times larger than the former Himawari's data.
Partners are trying to archive/service the data.
JMA's Himawari-8 partners: NICT, U. Tokyo, Chiba U.,
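The two figures on this slide (150 TB per year, 50 times the former satellite's volume) imply a few derived quantities that are useful when sizing an archive. The sketch below works them out under assumptions not stated in the slide: decimal terabytes (10^12 bytes), a 365-day year, and data arriving at a uniform rate.

```python
def himawari8_estimates(annual_tb: float = 150.0, ratio: float = 50.0) -> dict:
    """Derive rough archive-sizing figures from the slide's two numbers."""
    bytes_per_tb = 10**12                 # assumption: decimal terabytes
    seconds_per_year = 365 * 24 * 3600    # assumption: 365-day year
    annual_bytes = annual_tb * bytes_per_tb
    return {
        # Volume implied for the former Himawari, from the 50x statement.
        "former_tb_per_year": annual_tb / ratio,
        # Average daily volume if data arrive uniformly through the year.
        "tb_per_day": annual_tb / 365,
        # Average sustained rate in decimal megabytes per second.
        "mean_mb_per_s": annual_bytes / seconds_per_year / 10**6,
    }


est = himawari8_estimates()
for key, value in est.items():
    print(f"{key}: {value:.2f}")
```

Under these assumptions the former satellite's data come to about 3 TB per year, and the new stream averages roughly 0.4 TB per day, or a little under 5 MB/s sustained.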
Concluding remarks
Ideal science practice may include openness and sharing, for an optimal use of Science for Society.
But this is difficult in practice. Some ask: is it just a dream?
Information management covers part of science.
We are facing a realistic and practical difficulty in sharing our own specific science data.
Technical: data format, metadata schema
Social/Community: brokering service, interoperable framework, system of systems (scale-free network concept?)
E-infrastructure managed in a layered model! (persistent preservation layer, computer tech. layer, librarian/archivist layer, scientist layer, social layer, ...)