Why are data sharing and reuse so difficult?
|
|
|
- Collin Baldwin
- 10 years ago
- Views:
Transcription
1 Why are data sharing and reuse so difficult? Chris9ne L. Borgman Professor and Presiden9al Chair in Informa9on Studies University of California, Los Angeles hmp://chris9neborgman.info and UCLA Knowledge Infrastructures Team: Peter Darch, Milena Golshan, Irene PasqueMo, Ashley Sands, Sharon Traweek FaceBase All Hands Mee9ng Informa9on Sciences Ins9tute, Marina del Rey, CA Thursday, January 8,
2 hmp://knowledgeinfrastructures.gseis.ucla.edu/
3 Knowledge Infrastructures Project Research Design Ramping up data collec9on Big Data Large Synop9c Survey Telescope (LSST) Small Data Center for Dark Energy Biosphere Inves9ga9ons (C- DEBI) Ramping down data collec9on Sloan Digital Sky Survey, Parts I & II (SDSS) Center for Embedded Network Sensing (CENS) Knowledge Infrastructures
4 Knowledge Infrastructures Image: Alyssa Goodman, Seamless Astronomy, Harvard- CfA
5 hmp://knowledgeinfrastructures.org
6 Precondi9on: Researchers share data 6
7 Researchers perspec9ves on data sharing Rewards Responsibility Data Incen9ves Persistent URL: photography.si.edu/searchimage.aspx?id=5799 Repository: Smithsonian Ins9tu9on Archives 7
8 Researchers perspec9ves on data sharing Rewards Responsibility Data Incen9ves Persistent URL: photography.si.edu/searchimage.aspx?id=5799 Repository: Smithsonian Ins9tu9on Archives 8
9 Publica9ons Grants Awards and honors Teaching Service Technologies Data Rewards may vary hmp://blog.stargreshtoday.com/portals/170402/images/improve- credit- score1.jpg
10 Researchers perspec9ves on data sharing Rewards Responsibility Data Incen9ves Persistent URL: photography.si.edu/searchimage.aspx?id=5799 Repository: Smithsonian Ins9tu9on Archives 10
11 Responsibility Publica9ons are arguments made by authors, and data are the evidence used to support the arguments. C.L. Borgman (2015). Big Data, Li*le Data, No Data: Scholarship in the Networked World. MIT Press
12 Responsibility Publica9ons Independent units Authorship is nego9ated Data Compound objects Ownership is rarely clear AMribu9on Long term responsibility: Inves9gators Exper9se for interpreta9on: Data collectors and analysts hudsonalpha.org
13 AMribu9on of data Legal responsibility Licensed data Specific amribu9on required Scholarly credit: contributorship Author of data Contributor of data to this publica9on Colleague who shared data Somware developer Data collector Instrument builder Data curator Data manager Data scien9st Field site staff Data calibra9on Data analysis, visualiza9on Funding source Data repository Lab director Principal inves9gator University research office Research subjects Research workers, e.g., ci9zen science 13
14 Researchers perspec9ves on data sharing Rewards Responsibility Data Incen9ves Persistent URL: photography.si.edu/searchimage.aspx?id=5799 Repository: Smithsonian Ins9tu9on Archives 14
15 What are data? NASA Astronomy Picture of the Day Marie Curie s notebook aip.org hudsonalpha.org ncl.ucar.edu 15 hmp://
16 16
17 Center for Embedded Networked Sensing NSF Science & Tech Ctr, universi9es, plus partners 300 members Computer science and engineering Science applica9on areas Slide by Jason Fisher, UC-Merced, Center for Embedded Networked Sensing (CENS) 17
18 Documen9ng Data for Interpreta9on Engineering researcher: Temperature is temperature. CENS Robo9cs team Biologist: There are hundreds of ways to measure temperature. The temperature is 98 is low- value compared to, the temperature of the surface, measured by the infrared thermopile, model number XYZ, is 98. That means it is measuring a proxy for a temperature, rather than being in contact with a probe, and it is measuring from a distance. The accuracy is plus or minus.05 of a degree. I [also] want to know that it was taken outside versus inside a controlled environment, how long it had been in place, and the last Qme it was calibrated, which might tell me whether it has drired.."
19 Center for Dark Energy Biosphere Inves9ga9ons Repository for seafloor cores. Photo: Peter Darch Interna9onal Ocean Discovery Program Iodp.tamu.org NSF Science & Tech Ctr, universi9es, plus partners (35 ins9tu9ons) 90 scien9sts Biological sciences Physical sciences 19
20 Researchers perspec9ves on data sharing Rewards Responsibility Data Incen9ves Persistent URL: photography.si.edu/searchimage.aspx?id=5799 Repository: Smithsonian Ins9tu9on Archives 20
21 Incen9ves Publica9ons that report the research Vs. Data that are reusable by others Image: Alyssa Goodman, Harvard Astronomy 21
22 22 Pepe, A., Mayernik, M. S., Borgman, C. L. & Van de Sompel, H. (2010). From Ar9facts to Aggrega9ons: Modeling Scien9fic Life Cycles on the Seman9c Web. Journal of the American Society for Informa9on Science and Technology, 61(3):
23 Metadata Metadata is structured informa9on that describes, explains, locates, or otherwise makes it easier to retrieve, use, or manage an informa9on resource.* descrip9ve structural administra9ve *Na9onal Informa9on Standards Organiza9on 2004 photo
24 Provenance Libraries: Origin or source Museums: Chain of custody Internet: Provenance is informa9on about en99es, ac9vi9es, and people involved in producing a piece of data or thing, which can be used to form assessments about its quality, reliability or trustworthiness.* *World Wide Web Consor9um (W3C) Provenance working group Bri9sh Library, provenance record: Bes9ary - cap9on: 'Owl mobbed by smaller birds'
25 Reuse across place and 9me Reuse by inves9gator Reuse by collaborators Reuse by colleagues Reuse by unaffiliated others Reuse at later 9mes Months Years Decades Centuries hmp://chandra.harvard.edu/photo/2013/kepler/kepler_525.jpg 25
26 Economics of the Knowledge Commons Subtractability / Rivalry Exclusion Difficult Low Public Goods General knowledge Public domain data High Common- pool resources Libraries Data archives Easy Toll or Club Goods Subscrip9on journals Subscrip9on data Private Goods Printed books Raw or compe99ve data Adapted from C. Hess & E. Ostrom (Eds.), Understanding knowledge as a commons: From theory to pracqce. MIT Press. 26
27 Q to explore in FaceBase community How do you assign credit and responsibility for data crea9on, cura9on, use, and reuse? How will you balance discipline/species- specific data models and policies with integra9ve models? What data do you expect you to share, with whom, how, and for how long? What scien9fic value do you expect to gain from sharing data via FaceBase?
28 Q to explore in FaceBase community Who invest in data cura9on, and at what stages of sharing and reuse? What is the scope of overlap between contributors and users of FaceBase data? What scien9fic value can users obtain from these data, with what kinds of investments?
29 Acknowledgements UCLA Data Practices team Peter Darch, Milena Golshan, Irene Pasquetto, Ashley Sands, Sharon Traweek Former members: Rebekah Cummings, David Fearon, Ariel Hernandez, Elaine Levia, Jaklyn Nunga, Matthew Mayernik, Alberto Pepe, Kalpana Shankar, Katie Shilton, Jillian Wallis, Laura Wynholds, Kan Zhang Research funding: National Science Foundation, Alfred P. Sloan Foundation, Microsoft Research University of Oxford: Balliol College, Oliver Smithies Fellowship, Oxford Internet Institute, Oxford eresearch Center, Bodleian Library
Big Data Research at DKRZ
Big Data Research at DKRZ Michael Lautenschlager and Colleagues from DKRZ and Scien:fic Compu:ng Research Group Symposium Big Data in Science Karlsruhe October 7th, 2014 Big Data in Climate Research Big
Science Gateways What are they and why are they having such a tremendous impact on science? Nancy Wilkins- Diehr [email protected]
Science Gateways What are they and why are they having such a tremendous impact on science? Nancy Wilkins- Diehr [email protected] What is a science gateway? science gateway /sī əәns gāt wā / n. 1. an
Open Science, Big Data and Research Reproducibility. Tony Hey Senior Data Science Fellow escience Ins>tute University of Washington tony.hey@live.
Open Science, Big Data and Research Reproducibility Tony Hey Senior Data Science Fellow escience Ins>tute University of Washington [email protected] The Vision of Open Science Vision for a New Era of Research
(Why) Should Research Universi6es Have Schools of Educa6on?
Spencer F!ndation Annual Lecture (Why) Should Research Universi6es Have Schools of Educa6on? Deborah Loewenberg Ball April 14, 2009 San Diego, California A closer look at the ques6on It s a real ques6on...
Mission. To provide higher technological educa5on with quality, preparing. competent professionals, with sound founda5ons in science, technology
Mission To provide higher technological educa5on with quality, preparing competent professionals, with sound founda5ons in science, technology and innova5on, commi
AstroFIt Astronomy Fellowship in Italy FP- 7 Grant Agreement n. 267251
AstroFIt Astronomy Fellowship in Italy FP- 7 Grant Agreement n. 267251 G. Micela Scien,fic Coordinator ) INAF co- funded programme to promote the interna9onal mobility of young astronomers (at post- doc
QEM /CAREER Workshop. Wanda E. Ward, PhD. Office of International and Integrative Activities (IIA) National Science Foundation March 14, 2014
QEM /CAREER Workshop Wanda E. Ward, PhD Office of International and Integrative Activities (IIA) National Science Foundation March 14, 2014 First Lady Michelle Obama speaking at the White House in connec;on
NSF/Intel Partnership on Cyber- Physical Systems Security and Privacy (CPS- Security)
NSF Webinar on NSF Solicita9on 14-571 NSF/Intel Partnership on Cyber- Physical Systems Security and Privacy (CPS- Security) Farnam Jahanian, Keith Marzullo, Angelos D. Keromy9s, David Corman Jeremy Epstein,
Why do we do what we do?
Why do we do what we do? Dissemina/on of Prac/ce Doctorate Scholarship: Impact Needed Julee Waldrop, DNP, FAANP School of Nursing University of North Carolina Flip Side: Research 1 3/26/15 How Do We Communicate
Astrophysics with Terabyte Datasets. Alex Szalay, JHU and Jim Gray, Microsoft Research
Astrophysics with Terabyte Datasets Alex Szalay, JHU and Jim Gray, Microsoft Research Living in an Exponential World Astronomers have a few hundred TB now 1 pixel (byte) / sq arc second ~ 4TB Multi-spectral,
The Importance of Intellectual Property Management in Universi9es and Public Research Organiza9ons The Brazilian Experience
The Importance of Intellectual Property Management in Universi9es and Public Research Organiza9ons The Brazilian Experience Shirley Cou9nho Dar es Salaam United Republic of Tanzania 12 March, 2013 Outline
How To Useuk Data Service
Publishing and citing research data Research Data Management Support Services UK Data Service University of Essex April 2014 Overview While research data is often exchanged in informal ways with collaborators
OT- Med: Objec,va Terra - Mediterraneum. Joël Guiot
OT- Med: Objec,va Terra - Mediterraneum Joël Guiot Context The OT- Med Labex has been defined in this context q The Mediterranean Basin has been a key area of human- environment interac@ons for thousands
Research at the Department of Computer Science and Software Engineering. Professor Yong Yue BEng, PhD, CEng, FIET, FIMechE 17 October 2014
Research at the Department of Computer Science and Software Engineering Professor Yong Yue BEng, PhD, CEng, FIET, FIMechE 17 October 2014 Research Areas Ar%ficial intelligence Robo%cs Data mining Image
Workshop : Open and Big Data for Life Imaging
Workshop : Open and Big Data for Life Imaging Chris'an Barillot Michel Dojat March 2015 FLI- IAM 1 Many Good Reasons for Sharing Data and Tools in In Vivo Imaging Scien'fic At Least 3. «Power failure:
MSc Data Science at the University of Sheffield. Started in September 2014
MSc Data Science at the University of Sheffield Started in September 2014 Gianluca Demar?ni Lecturer in Data Science at the Informa?on School since 2014 Ph.D. in Computer Science at U. Hannover, Germany
Lesson 3: Data Management Planning
Lesson 3: CC image by Joe Hall on Flickr What is a data management plan (DMP)? Why prepare a DMP? Components of a DMP NSF requirements for DMPs Example of NSF DMP CC image by Darla Hueske on Flickr After
The Data Reservoir. 10 th September 2014. Mandy Chessell FREng CEng FBCS Dis4nguished Engineer, Master Inventor Chief Architect, Informa4on Solu4ons
Mandy Chessell FREng CEng FBCS Dis4nguished Engineer, Master Inventor Chief Architect, Solu4ons The Reservoir 10 th September 2014 A growing demand Business Teams want Open access to more informa4on More
AVOIDING SILOED DATA AND SILOED DATA MANAGEMENT
AVOIDING SILOED DATA AND SILOED DATA MANAGEMENT Dalton Cervo Author, Consultant, Management Expert September 2015 This presenta?on contains extracts from books that are: Copyright 2011 John Wiley & Sons,
Program Model: Muskingum University offers a unique graduate program integra6ng BUSINESS and TECHNOLOGY to develop the 21 st century professional.
Program Model: Muskingum University offers a unique graduate program integra6ng BUSINESS and TECHNOLOGY to develop the 21 st century professional. 163 Stormont Street New Concord, OH 43762 614-286-7895
Second EUDAT Conference, October 2013 Data Management Plans and Certification Motivation: increasing importance of Data Management Planning
Second EUDAT Conference, October 2013 Data Management Plans and Certification Motivation: increasing importance of Data Management Planning Simon Lambert Scientific Computing Department STFC Rutherford
Vision of Interoperability Jamie Ferguson, Stan Huff, Cris Ross
Vision of Interoperability Jamie Ferguson, Stan Huff, Cris Ross Evolu&on of Interoperability As HIE evolves, the interoperability framework standards advance for reliable exchange and data integra=on across
DTCC Data Quality Survey Industry Report
DTCC Data Quality Survey Industry Report November 2013 element 22 unlocking the power of your data Contents 1. Introduction 3 2. Approach and participants 4 3. Summary findings 5 4. Findings by topic 6
31 December 2011. Dear Sir:
Office of Science and Technology Policy on behalf of National Science and Technology Council Attention: Ted Wackler, Deputy Chief of Staff Re: Response to Notice for Request for Information: Public Access
Digital Public Library of America (DPLA)
Digital Public Library of America (DPLA) Front End Design and Implementation Request For Proposal Summary The Digital Public Library of America (DPLA) seeks a skilled interactive agency to design and develop
To outsource or not to outsource?
To outsource or not to outsource? Tips and tools for the society publisher Caitlin Meadows, Publishing Services Director, The Charlesworth Group caitlin.meadows@charlesworth- group.com Why I m here! Our
Data Management at UT
Data Management at UT Maria Esteva, TACC, [email protected] Colleen Lyon, UT Libraries, [email protected] Angela Newell, ITS, [email protected] What is data management? systematic organization
Databases & Data Infrastructure. Kerstin Lehnert
+ Databases & Data Infrastructure Kerstin Lehnert + Access to Data is Needed 2 to allow verification of research results to allow re-use of data + The road to reuse is perilous (1) 3 Accessibility Discovery,
Managing Social Media as Official Records
Managing Social Media as Official Records Archives & Records: Ensuring Access COSA, NAGARA, SAA Joint Annual Mee?ng August 10-16, 2014 Washington, D.C. Geof Huth, Director of Government Records Services,
We are pleased to offer the following program to Woodstock Area Educators:
DATE: Spring 2016 TO: RE: Woodstock Area Educators Upcoming Cohort Programs Presently, many teachers are enrolled in cohort graduate programs through partnerships between local regional offices of education,
The FDA s Mini- Sen*nel Program and the Learning Health System
info@mini- sen*nel.org 1 The FDA s Mini- Sen*nel Program and the Learning Health System Richard PlaB, MD, MS Harvard Pilgrim Health Care Ins*tute Harvard Medical School October 1, 2014 Vision We seek the
Pa"ent Reported Outcomes Useful for Whom? Industry s Perspec/ve. Pri/ Jhingran, Ph.D. GlaxoSmithKline
Pa"ent Reported Outcomes Useful for Whom? Industry s Perspec/ve Pri/ Jhingran, Ph.D. GlaxoSmithKline AGENDA Why PROs? Applica0ons of PROs in Drug Development US Healthcare Reform Enhanced Value of PROs
How To Learn At Caluniversity.Edu
California Intercontinental University New Student Orientation 1 Introduc8on Welcome to the CALUNIVERSITY Community! Our curriculum is a challenge calling upon you to think independently, discover sources
Interna'onal Standards Ac'vi'es on Cloud Security EVA KUIPER, CISA CISSP [email protected] HP ENTERPRISE SECURITY SERVICES
Interna'onal Standards Ac'vi'es on Cloud Security EVA KUIPER, CISA CISSP [email protected] HP ENTERPRISE SECURITY SERVICES Agenda Importance of Common Cloud Standards Outline current work undertaken Define
GAME-CHANGING TRENDS IN SUPPLY CHAIN
customer teams FIRST focused ANNUAL on serving REPORT override system designations BY THE of SUPPLY available CHAIN MANAGEMENT FACULTY AT THE The research partners at UNIVERSITY Ernst and Young OF TENNESSEE
How To Teach Distance Education In Russia
University of Maryland University College Distance Education in Dual Degree Partnerships: Challenges and Opportunities Distance Educa-on in the United States Unlike Russia, the U.S. does not dis3nguish
Contemporary Composers Web Archive (CCWA): Progress in Collaboratively Collecting Composers' Websites
Contemporary Composers Web Archive (CCWA): Progress in Collaboratively Collecting Composers' Websites June 24, 2015 IAML/IMS Anna Perricci, Columbia University Laura Stokes, Brown University What is CCWA?
1. Harvard University. (1) State: Massachusetts
1. Harvard University (1) State: Massachusetts (2) Climate of Massachusetts: Massachusetts has a humid continental climate. Winters are cold, with average January temperatures below freezing nearly throughout
Research in Simulation: Research and Grant Writing 101
Research in Simulation: Research and Grant Writing 101 Amar Patel, MS, NREMT-P, CFC Director, Center for Innovative Learning WakeMed Health & Hospitals Geoff Miller Director Eastern Virginia Medical School
Na#onal Asbestos Forum 2013: Advance in Medical Research on Asbestos- Related Diseases
Na#onal Asbestos Forum 2013: Advance in Medical Research on Asbestos- Related Diseases Professor Nico van Zandwijk Asbestos Diseases Research Ins#tute Content List of Asbestos- Related Diseases Epidemiology
