THE DATA CITATION INDEX Advancing the discovery, access, and citing of research data Dr Ning Ning Solution Consultant Smart discovery starts here
BIG DATA Data Never Sleeps. Every minute: Google receives 2,000,000 search queries; Facebook users share 684,478 pieces of content; email users send 204,166,667 messages (Source: DOMO). 1 exabyte = 10^18 bytes = 1,000,000,000 GB = 1,000,000 TB
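The unit arithmetic on this slide can be checked with a minimal sketch. It assumes decimal (SI) units, which is what the slide's figures imply; the constant and function names are illustrative only.

```python
# Decimal (SI) storage units, matching the figures on the slide.
BYTES_PER_GB = 10**9
BYTES_PER_TB = 10**12
BYTES_PER_EB = 10**18


def exabytes_to(unit_bytes: int, exabytes: int = 1) -> int:
    """Convert a whole number of exabytes into the unit whose size in bytes is given."""
    return exabytes * BYTES_PER_EB // unit_bytes


gb_per_eb = exabytes_to(BYTES_PER_GB)  # gigabytes in one exabyte
tb_per_eb = exabytes_to(BYTES_PER_TB)  # terabytes in one exabyte
print(f"{gb_per_eb:,} GB, {tb_per_eb:,} TB")
```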
SOUND DISCOVERY RELIES UPON SOLID SUPPORTING DATA With the movement toward open sharing of research data, a great and growing volume of research data is now available. NATURE News: Gene data to hit milestone* With close to one million gene-expression data sets now in publicly accessible repositories, researchers can identify disease trends without ever having to enter a laboratory. This article describes how Stanford investigators used the publicly available Gene Expression Omnibus research data repository to identify a new drug target for diabetes. The investigators explain that the beauty of analyzing data from multiple experiments is that biases should cancel out between data sets, stating that there is safety in numbers. * Nature News, Nature Publishing Group, Jul 18, 2012. Copyright 2012, Rights Managed by Nature Publishing Group
RESEARCH DATA: CHALLENGES However, gaining a clear understanding of what data exists, and where, is a challenge. Research data repositories are numerous and separately maintained, with varying levels of searchability and structure.
RESEARCH DATA: MAKING IT DISCOVERABLE, ACCESSIBLE, & CITABLE Single point of access to quality research data from repositories across disciplines and across the globe.
REPOSITORY EVALUATION AND SELECTION As with all Thomson Reuters resources, quality is extremely important. Key repository content is identified, evaluated, and selected for inclusion on these criteria: Editorial content, ensuring that material is desirable to the research community. Persistence and stability of the repository, with a steady flow of new information. Thoroughness and detail of descriptive information. Links from data to research literature.
INDEXING REPOSITORIES Thomson Reuters (TR) takes a descriptive metadata feed from the repository. TR analyzes the repository's raw metadata and adds metadata of its own. The result is a set of Data Citation Index records for the data repository, its data studies, and its data sets. Data Citation Index records link back to the source: to the repositories, data studies, and data sets themselves.
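As a rough illustration of the indexing flow on this slide, the sketch below merges a repository's raw metadata with added metadata and keeps the link back to the source. It is not the actual Thomson Reuters process; the function name and every field name (`title`, `source_url`, `discipline`) are hypothetical.

```python
from typing import Dict


def build_index_record(raw_metadata: Dict[str, str],
                       added_metadata: Dict[str, str]) -> Dict[str, str]:
    """Combine raw repository metadata with indexer-added metadata.

    Field names are hypothetical. The source link from the raw feed is
    always preserved so the record points back to the repository item.
    """
    if "source_url" not in raw_metadata:
        raise ValueError("raw metadata must carry a link back to the source")
    record = dict(raw_metadata)       # start from the repository's own description
    record.update(added_metadata)     # layer the added metadata on top
    record["source_url"] = raw_metadata["source_url"]  # never overwrite the link back
    return record


record = build_index_record(
    {"title": "Example data study", "source_url": "https://example.org/studies/1"},
    {"discipline": "Life Sciences"},
)
print(record["source_url"])
```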
DOCUMENT TYPES Repository/Source, Data Study, Data Set. Repository/Source: Comprises data studies, data sets, etc.; stores and provides access to the raw data. Data Study: A description of a study or experiment, with the associated data used in that study. Data Set: A single or coherent set of data, or a data file, provided by the repository as part of a collection, data study, or experiment.
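The relationship between the three document types can be pictured as a small containment model. This is only a sketch under the definitions above; the class and attribute names are assumptions, not the Data Citation Index schema.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class DataSet:
    """A single or coherent set of data, or a data file."""
    title: str


@dataclass
class DataStudy:
    """A described study or experiment together with its associated data sets."""
    title: str
    data_sets: List[DataSet] = field(default_factory=list)


@dataclass
class Repository:
    """A source that comprises data studies and data sets."""
    name: str
    data_studies: List[DataStudy] = field(default_factory=list)
    data_sets: List[DataSet] = field(default_factory=list)  # data sets held outside any study

    def all_data_sets(self) -> List[DataSet]:
        """Return data sets held directly plus those belonging to each study."""
        found = list(self.data_sets)
        for study in self.data_studies:
            found.extend(study.data_sets)
        return found


repo = Repository(
    name="Example repository",
    data_studies=[DataStudy("Study A", [DataSet("A-1"), DataSet("A-2")])],
    data_sets=[DataSet("Standalone file")],
)
print(len(repo.all_data_sets()))
```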
COVERAGE Nearly 2.8 million records from quality repositories. 500,000 records added each year. Reciprocal links to/from DCI and Web of Science records. [Chart: Repositories by Discipline. Categories: Arts and Humanities, Life Sciences, Multi-discipline, Physical Sciences, Social Sciences. Values shown: 20%, 7%, 2%, 23%, 48%.]
Example search: ischemic heart disease. The Data Citation Index is presented within the Web of Knowledge platform with the same look and feel as other resources, such as Web of Science.
Data Citation Index presents all of the powerful Web of Knowledge options for exploring search results.
Our main interest in this example is to locate data studies, so we refine the results by the Data Study document type.
After refining by Data Study, a quick scan of the results reveals this study, and we click to view the full record.
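Conceptually, refining by document type is a filter over the result list. The sketch below assumes each result is a dictionary with a hypothetical `document_type` key; it does not represent the Web of Knowledge interface or any real API.

```python
from typing import Dict, List


def refine_by_document_type(results: List[Dict[str, str]],
                            document_type: str) -> List[Dict[str, str]]:
    """Keep only results whose document type matches, ignoring letter case."""
    wanted = document_type.casefold()
    return [r for r in results if r.get("document_type", "").casefold() == wanted]


# Hypothetical search results for illustration only.
results = [
    {"title": "Example repository", "document_type": "Repository"},
    {"title": "Example study", "document_type": "Data Study"},
    {"title": "Example file", "document_type": "Data Set"},
]
studies = refine_by_document_type(results, "data study")
print([r["title"] for r in studies])
```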
The full record presents fundamental information about this data study: an abstract, data type, miscellaneous descriptors, and basic taxonomic data. By recommending a standard format for citing research data, we hope to influence the research community's citing practices, facilitating the capture and unification of citations to research data going forward.
The full record serves as a central point from which to collect information around this data study, and link to related information such as the articles that have referenced this Data Study.
Above all, though, the Data Citation Index is about getting users to the research data itself. Link to the Data Set information within the repository.
Remaining within the Data Citation Index, link to all records associated with this data study -- or link out directly to associated data sets.
Information may, of course, be printed, e-mailed, archived within EndNote Web or EndNote, or added to one's ResearcherID publications list.
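Use data to power research. Make research data easier to discover, faster to access, and simpler to cite! No more duplicated effort; save research resources. Stand on the shoulders of giants; start from the same starting line.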
THANK YOU! Dr Ning Ning, Solution Consultant