General concepts: DDI
1 General concepts: DDI Irena Vipavc Brvar, ADP SEEDS Kick-off meeting, Lausanne, May 2015
2
3 How to describe our survey
What we have learned so far: if we want to use data at some point in the future, the data need to be properly documented and saved in a trusted place. Users need to be able to find and access them.
How do we achieve that? -> By using a standard.
We would like surveys in our institutions to be described in the same way, so that every new colleague knows how to do it. If possible, we would also like to use a standard that is already used in similar organizations, which gives interoperability between data archives (DAs).
DDI stands for Data Documentation Initiative. -> Its aim is to establish a standard for technical documentation describing social science data.
4 History
The idea was to produce a metadata specification for the description of social science data resources.
- Initiated in 1994 (ICPSR) / XML DTD already in 1997.
Contributors to the DDI effort come from social science data archives and libraries in the USA, Canada and the EU, and from major producers of statistical data (such as the US Bureau of the Census, the US Bureau of Labor Statistics, Statistics Canada and Health Canada).
- The aim was to replace the existing and widely used OSIRIS Codebook/data dictionary standard with a more modern and Web-aware specification.
- The first official version of the DDI specification (version 1.0) was published in March V V3.0 (Ryssevik, 2001)
5 Two development lines
DDI-Codebook: DDI-Codebook (DDI-C) is the more lightweight version of the standard, intended primarily to document simple survey data. Originally DTD-based, DDI-C is now available as an XML Schema. The current version of DDI-C is 2.5.
DDI-Lifecycle: Encompassing all of the DDI-Codebook specification and extending it, DDI-Lifecycle (DDI-L) is designed to document and manage data across the entire life cycle, from conceptualization to data publication, analysis and beyond. Based on XML Schemas, DDI-Lifecycle is modular and extensible. The current version of DDI-L is 3.2.
6 BASIC STRUCTURE OF DDI 2.*
- Section 1.0 Document Description consists of bibliographic information that can be considered the header, whose elements uniquely describe the full contents of the compliant DDI file.
- Section 2.0 Study Description consists of information about the data collection. This section includes information about who collected and who distributes the data, about the scope and coverage, sampling (if relevant), data collection methods and processing, citation requirements, etc.
7 BASIC STRUCTURE OF DDI 2.*
- Section 3.0 Data Files Description provides information about the data file(s).
- Section 4.0 Variable Description provides a detailed description of variables, including (when relevant) the variable type, variable and value labels, literal questions, computation or imputation methods, instructions to interviewers, universe, descriptive statistics, etc.
- Section 5.0 Other Study Related Materials allows for the inclusion of other materials related to the study such as questionnaires, user manuals, computer programs, interviewer manuals, maps, coding information, etc.
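The five sections can be pictured as the five top-level children of a DDI-Codebook document. The following is a minimal sketch in Python, assuming the usual DDI-C element names (codeBook, docDscr, stdyDscr, fileDscr, dataDscr, otherMat). The function name, the sample title, the file name and the variable are hypothetical, and the XML namespace and most mandatory elements are left out, so the result illustrates the layout only and is not a schema-valid DDI file.

```python
import xml.etree.ElementTree as ET


def build_codebook_skeleton(title: str) -> ET.Element:
    """Build an illustrative (not schema-valid) DDI 2.* skeleton."""
    code_book = ET.Element("codeBook")

    # Section 1.0 Document Description: describes the DDI file itself.
    doc_dscr = ET.SubElement(code_book, "docDscr")
    doc_titl_stmt = ET.SubElement(ET.SubElement(doc_dscr, "citation"), "titlStmt")
    ET.SubElement(doc_titl_stmt, "titl").text = f"DDI documentation for: {title}"

    # Section 2.0 Study Description: who collected the data, scope, methods.
    stdy_dscr = ET.SubElement(code_book, "stdyDscr")
    stdy_titl_stmt = ET.SubElement(ET.SubElement(stdy_dscr, "citation"), "titlStmt")
    ET.SubElement(stdy_titl_stmt, "titl").text = title

    # Section 3.0 Data Files Description: one entry per data file.
    file_dscr = ET.SubElement(code_book, "fileDscr")
    file_txt = ET.SubElement(file_dscr, "fileTxt")
    ET.SubElement(file_txt, "fileName").text = "survey.sav"  # hypothetical name

    # Section 4.0 Variable Description: labels, question text, statistics.
    data_dscr = ET.SubElement(code_book, "dataDscr")
    var = ET.SubElement(data_dscr, "var", name="Q1")  # hypothetical variable
    ET.SubElement(var, "labl").text = "Satisfaction with life"
    qstn = ET.SubElement(var, "qstn")
    ET.SubElement(qstn, "qstnLit").text = "How satisfied are you with your life?"

    # Section 5.0 Other Study Related Materials: questionnaires, manuals, etc.
    other_mat = ET.SubElement(code_book, "otherMat")
    ET.SubElement(other_mat, "labl").text = "Questionnaire"

    return code_book


if __name__ == "__main__":
    root = build_codebook_skeleton("Example Survey 2015")
    ET.indent(root)  # pretty-printing helper, Python 3.9+
    print(ET.tostring(root, encoding="unicode"))
```

In practice a tool such as Nesstar Publisher produces this structure, so the sketch is only meant to show how the section numbers on the slides map onto the XML.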
8 Controlled vocabularies (CESSDA topic classification, ELSST, DDI vocabulary) Multilingual support -> CESSDA Catalogue Approximate number of elements in each specification DDI DDI DDI DDI 2 Lite - 80
9 PREPARING METADATA
- Prepare a form in which the researcher enters the information about the survey that you need.
- Obtain clean data and other materials.
- Prepare the data and materials for long-term preservation and distribution.
- Prepare the metadata description of the survey using the information in the form (it is important to record who the authors are (main and other) and to add the project ID and funding so that the record is OpenAIRE compatible).
- Use tools // possible export of question text, basic frequencies and descriptive statistics.
- Distribute metadata (web, Nesstar etc.).
- Make XML openly available: CESSDA catalogue // question bank
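The "basic frequencies and descriptive statistics" that a documentation tool exports for each variable can be sketched in a few lines of Python. This is an illustration using only the standard library, not the way Nesstar or any other tool computes them. The function name, the sample codes and the value labels are hypothetical, and missing values are assumed to be stored as None.

```python
from collections import Counter
from statistics import mean, median


def summarise_variable(values, labels):
    """Return counts and simple statistics for one coded survey variable.

    values: list of numeric codes, with None marking a missing answer.
    labels: dict mapping each code to its value label.
    """
    valid = [v for v in values if v is not None]
    counts = Counter(valid)
    # Frequencies keyed by value label, ordered by code.
    frequencies = {labels.get(code, str(code)): counts[code] for code in sorted(counts)}
    return {
        "valid": len(valid),
        "missing": len(values) - len(valid),
        "mean": mean(valid) if valid else None,
        "median": median(valid) if valid else None,
        "frequencies": frequencies,
    }


if __name__ == "__main__":
    answers = [1, 2, 2, 3, None, 2]  # hypothetical responses
    value_labels = {1: "low", 2: "medium", 3: "high"}
    print(summarise_variable(answers, value_labels))
```

Summaries of this kind are what ends up in the variable description section of the DDI file, next to the variable and value labels.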
10 Some data about Nesstar usage
Nesstar is currently run by most archives in Europe and by a reasonable number of data libraries in the US and Canada. Nesstar was originally developed by and for archives, and is designed to fit many important documentation and dissemination use cases for data archives. Nesstar was also the first tool to support DDI, which is still a highly relevant standard for data documentation.
There are currently more than 130 instances of Nesstar Server worldwide, from Vancouver to Taiwan and from South Africa to Iceland.
In volume, the International Household Survey Network (IHSN) is the most important Nesstar user. IHSN does not use Nesstar Server, but it uses Nesstar Publisher as a documentation tool for statistical agencies in a large number of (developing) countries on all continents.
11 Nesstar also fully supports multilingual metadata, which makes it possible to document data in more than one language (without duplicating data).
Nesstar Server comes with a set of APIs that allow third-party integration with data, metadata and functionality (e.g. tabulation and download operations) on the server. Because of the APIs and the DDI support, the Nesstar platform is also very easy to repurpose for other services, e.g. the CESSDA Portal and the DwB Data Discovery portal.
Important/high-profile users of Nesstar include: European Social Survey, UK Data Service, GESIS ZACAT
12 Nesstar Publisher (located on the desktop)
Nesstar Publisher is a sophisticated authoring environment that can publish data from a variety of sources (including SPSS, SAS, Excel etc.). The tool includes a specialised metadata editor, data and metadata validation routines, and metadata templates that provide standardisation and control.
- Easy editing/creation and export of DDI-documented datasets with no XML experience needed.
- Tools to validate metadata and variables.
- The ability to include automatically generated frequency and summary statistics for each variable.
- Tools to compute/recode/label new or existing variables to be added to a dataset before publishing.
- The ability to import and export data to the most common statistical formats, including delimited files.
- Multilingual: Arabic, Chinese, English, French, Portuguese, Russian, Spanish and more.
13 Nesstar Publisher
14 Nesstar Server (located on the server)
Nesstar Server includes an SQL-based metadata management system, a data storage system, a powerful statistical engine and a flexible access control system.
Nesstar WebView is a fully customisable and configurable layer that presents the search, browse, display, analysis and retrieval options to the user.
- Able to seamlessly handle survey data, cubes and other resources.
- Multiple cross-tabulation and recoding.
- Regression and correlation analysis.
15 Nesstar WebView
16 Using Common Metadata for Harmonisation for Data Integration
The CESSDA portal is an example of integrating data held in heterogeneous, autonomous resources (data archives) by using harmonised descriptive metadata represented in a common metadata standard, together with controlled vocabularies and code schemes.
Harmonisation of the metadata is done by the DAs. The harmonised metadata are made available on local servers for harvesting and for presentation in the CESSDA portal. <- This retains the autonomy of the resources/DAs.
More in Deliverable 7.1 and of DwB project
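The harmonisation step can be illustrated with a small sketch: each archive keeps its own local keywords and maps them to terms of a shared controlled vocabulary before the metadata are harvested. The Python below is an assumption-laden illustration rather than the procedure used by CESSDA. The function name is hypothetical, and the mapping entries are invented examples, not actual terms from the CESSDA topic classification or ELSST.

```python
def harmonise_topics(local_topics, mapping):
    """Map an archive's local keywords to terms of a shared vocabulary.

    local_topics: keywords as recorded by the local archive.
    mapping: dict from normalised local keyword to controlled term.
    Returns (harmonised terms without duplicates, keywords with no mapping).
    """
    harmonised = []
    unmapped = []
    for topic in local_topics:
        key = topic.strip().lower()  # normalise before the lookup
        if key in mapping:
            term = mapping[key]
            if term not in harmonised:
                harmonised.append(term)
        else:
            # Left for the archive to resolve manually.
            unmapped.append(topic)
    return harmonised, unmapped


if __name__ == "__main__":
    # Hypothetical local-to-common mapping maintained by one archive.
    topic_map = {
        "employment": "LABOUR AND EMPLOYMENT",
        "labour market": "LABOUR AND EMPLOYMENT",
        "health": "HEALTH",
    }
    print(harmonise_topics(["Employment", " labour market ", "Health", "Astrology"], topic_map))
```

Keeping the mapping with each archive reflects the point on the slide: the harmonised output is shared, while the local description stays under the archive's control.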
17 Events
- EDDI yearly conference (since 2009) / also in the US
- DDI workshops at Schloss Dagstuhl (since 2007)
- DDI-related presentations at IASSIST conferences
- Trainings organized by CESSDA archives / CESSDA expert seminars
18
- DDI Alliance
- IHSN: Metadata Editor (Nesstar Publisher 4.0.9)
- IHSN (2007): Quick Reference Guide for Data Archivists [ ecklist_od_ pdf, ]
- Ryssevik, J. (2001): The Data Documentation Initiative (DDI) metadata specification. Paper prepared for MetaNet 2001, Voorburg, Netherlands.
- Martinez, L. (2008): The Data Documentation Initiative (DDI) and Institutional Repositories