Big Data and the Uses and Disadvantages of Scientificity for Social Research



Similar documents
Big Data Analytics of Multi-Relationship Online Social Network Based on Multi-Subnet Composited Complex Network

Research Note What is Big Data?

International Conference PRSSA meeting

SOCIAL MEDIA & THE JOB SEARCH. Using Today s Most Popular Online Communities for Job-Hunting

Danny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank

!!! The Fallacy of Big Data! Brian Fine and Con Menictas!

Barack Obama won the battle on social media too!

Big Data in Communication Research: Its Contents and Discontents

Developing the SMEs Innovative Capacity Using a Big Data Approach

RMM has been advising and helping organisations use social media, technologies and platforms since We re a team of 15 analysts, trainers,

understanding media metrics WEB METRICS Basics for Journalists FIRST IN A SERIES

UNIVERSITY OF BELGRADE FACULTY OF PHILOSOPHY. Part two: INFORMATION ON DEGREE PROGRAMS

CORRALLING THE WILD, WILD WEST OF SOCIAL MEDIA INTELLIGENCE

Business Intelligence. Data Mining and Optimization for Decision Making

Big Data and the brave new world of social media research

Sponsorship & Social/ Digital Media Brent Barootes May 28, WHITE PAPER Presented by Partnership Group Sponsorship Specialists

6 TWITTER ANALYTICS TOOLS. SOCIAL e MEDIA AMPLIFIED

Data Driven Discovery In the Social, Behavioral, and Economic Sciences

Analyzing Big Data: The Path to Competitive Advantage

Available online at ScienceDirect. Procedia Economics and Finance 11 ( 2014 )

Using Social Media. to improve your Career prospects

Welcome. Opening Session Internet Archives & Research Potential Building Community: Research Highlights. Discussion and Challenges

Exploring Big Data in Social Networks

Big Data Introduction, Importance and Current Perspective of Challenges

Carmel Byers University of Hertfordshire

SOCIAL MEDIA CHRIS SIGFRIDS SENIOR ONLINE MARKETING MANAGER CARIE FREIMUTH VICE PRESIDENT, ASSOCIATE PUBLISHER

Data Analytics in Organisations and Business

Big Data a threat or a chance?

Shawn O Neal. Driving the Data Engine: How Unilever is Using Analytics to Accelerate Customer Understanding. An interview with

The Ideological and Political Education in China's Universities Based on Big Data Thought. Abstract

The British Academy of Management. Website and Social Media Policy

Computer Programming for the Social Sciences

Grounded Theory. 1 Introduction Applications of grounded theory Outline of the design... 2

BIG DATA IN SUPPLY CHAIN MANAGEMENT: AN EXPLORATORY STUDY

Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, Viswa Sharma Solutions Architect Tata Consultancy Services

Delivering new insights and value to consumer products companies through big data

Social Media, Youth Participation and Australian Elections

ENHANCING INTELLIGENCE SUCCESS: DATA CHARACTERIZATION Francine Forney, Senior Management Consultant, Fuel Consulting, LLC May 2013

A WEB WITHOUT ADVERTISING

Of all the data in recorded human history, 90 percent has been created in the last two years. - Mark van Rijmenam, Think Bigger, 2014

COURSE DESCPRIPTIONS. ZSEM international summer school June 27-July Zagreb School of Economics and Management, Zagreb, Croatia

ASSESSING THE EFFECTS OF SANDY ON THE ELECTION: THE DOG THAT DIDN T BARK

Big Data for Marketing:

AUDIENCE ENGAGEMENT DISCOVERY VS. SEARCH VS. SOCIAL

Utilizing big data to bring about innovative offerings and new revenue streams DATA-DERIVED GROWTH

Network Theory: 80/20 Rule and Small Worlds Theory

Web Archiving and Scholarly Use of Web Archives

Social Media Intelligence

Ethnography and Big Data

Big Data: How can it enhance your strategy?

Corporate Finance: Mergers & Acquisitions

LinkedIn Tutorial. An Introduction to Today s Leading Job-Search Social Network

A Scientific Approach to Implementing Change

Ethnography and Big Data: A Rapprochement?

How to Build Online Brand Authority

Statistical Challenges with Big Data in Management Science

An Introduction to Data Mining. Big Data World. Related Fields and Disciplines. What is Data Mining? 2/12/2015

RESEARCH 33 STATS TO KNOW WHEN MARKETING TO THE CONSTRUCTION INDUSTRY

2010 Brightcove, Inc. and TubeMogul, Inc Page 2

See how social media listening and engagement can help your business

Big Data and Open Data

How to Optimize Your Web Presence for Lead Generation

Introduction to Social Media

1

Focus Experts Briefing: Five Ways Modern ERP Solutions Increase Business Agility

Are You Ready for Big Data?

Searching the Social Network: Future of Internet Search?

You ve Got A Social Media Site Now What? Liz Gross Social Media Strategist, Great Lakes Presented at the WASFAA Conference April 14, 2014

A Future Without Secrets. A NetPay Whitepaper. more for your money

Your Individual Website Assessment Includes comparison to June 2008 manufacturing study data NAME of COMPANY for WEBSITENAME

Transcription:

Big Data and the Uses and Disadvantages of Scientificity for Social Research Ralph Schroeder, Professor Eric Meyer, Research Fellow Linnet Taylor, Researcher SPRU, May 24, 2013

Source: Leonard John Matthews, CC BY SA (http://www.flickr.com/photos/mythoto/3033590171)

Big data are data that are unprecedented in scale and scope in relation to a given phenomenon. They are often streams of data (rather than fixed datasets), accumulating large volumes, often at high velocity. Is the tail of the availability of big data and computational methods wagging the dog of good research questions and advancing social science? If not, how do big data advance research? What are the opportunities and challenges?

Business Value versus Academic Value Strategic Knowledge Generally time limited (with exceptions) Value comes from knowing what your competitors don t Often has high monetary value if it can be exploited

Business Value versus Academic Value Durable Knowledge Less time limited (with exceptions) Value comes from adding to the world s knowledge (the global brain is cumulative/scientific) Rarely has direct monetary value, but has value in terms of creating the possibility both of future knowledge and of future exploitation and commercial uses

Commercial/Governmental versus Social Science Research: Diverging Aims, with Overlap Manipulation of Behaviour: For aims limited to research in social science. The threat of social science knowledge, and of commercial/governmental knowledge and control of the natural environment.

Big Data Analytics Access to data Cost of analytical tools Skills to use the tools Why should anyone share? How different skills and disciplines work together Starting with questions, or starting with data? Prediction? A/B and other experiments Gaps? Futures

From Big Data to Big (Hi res) Picture Marketing Tailoring Forecasting Prediction Complex Trends Linking datasets plus modelling

See http://www.oii.ox.ac.uk/research/projects/?id=98

Twitter bots OII master s students Alexander Furnas and Devin Gaffney saw a large spike in then US presidential candidate Mitt Romney s Twitter followers, and decided to look at the new followers: Furnas, A. and Gaffney, D. (2012). Statistical Probability That Mitt Romney's New Twitter Followers Are Just Normal Users: 0%. The Atlantic, July 31, http://www.theatlantic.com/technology/archive/2012/07/statistical probability that mitt romneys new twitter followers are just normal users 0/260539/ (accessed August 31, 2012).

Source: http://www.flickr.com/photos/nakedcharlton/597075830/ Source: http://www.flickr.com/photos/jamescridland/613445810/

the distinctiveness of the network of mathematical practitioners is that they focus their attention on the pure, contentless form of human communicative operations: on the gestures of marking items as equivalent and of ordering them in series, and on the higher order operations which reflexively investigate the combinations of such operations mathematical rapid discovery science the lineage of techniques for manipulating formal symbols representing classes of communicative operations

Research computing Supercomputing The Grid Web 2.0 Clouds Big Data

Digital transformations of research Computational Manipulability + Research Technologies (Mathematization) Socio Technical Organization (Computerization movements) Transformations of Research Front (For different fields)

Case 1: Search engine behaviour Waller s analysis of Australian Google Users Key findings: Mainly leisure > 2% contemporary issues No perceptible class differences Novel advance: Unprecedented insight into what people search for Challenge: Replicability Securing access to commercial data

??? Surprisingly,? the distribution of? types of search query did not? vary significantly across the different? Lifestyle Groups (p>0.01).?? Source: Waller, V. (2011). Not Just Information:Who Searches for What on the Search Engine Google? Journal of the American Society for Information Science & Technology 62(4): 761 775.

Case 2: Large scale text analysis Michel et al. culturomic analysis of 5 Million Digitized Google Books and Heuser & Le Khac of 2779 19th Century British Novels Key findings: Patterns of key terms Industrialization tied to shift from abstract to concrete words Novel advance: Replicability, extension to other areas, systematic analysis of cultural materials Challenge: Data quality

J Michel et al. Science 2011;331:176-182

Platform Paper Size of Data in relation to phenomenon investigated Theoretical question/practical aim Key findings Facebook Backstrom et al. (2012) 69 billion friendship links between 721 million Facebook users Ugander et al. (2012) 54 million invitation emails to Facebook users Re examine Milgram s six degrees of separation online How does structure of contacts affect invitation acceptance? Four degrees of separation on Facebook Not number of contacts, but number of distinct contexts, matters for acceptance Bond et al. (2012) 600000 Facebook users Facebook experiment about how to mobilize voters Voters can be mobilized via Facebook friends more than via informational messages Twitter Kwak et al. (2010) 1.47 billion directed Twitter relations Cha et al. (2010) 1.7 billion tweets among 54 million users Is Twitter a broadcast medium or a social network? Who influences whom? Most use is for information, not as a social network Top influentials dominate, but some variation by topic Bakshy et al. (2011) 1.6 million Twitter users Who influences whom? Ordinary user influencers can sometimes be more effective than top influencers Wikipedia Loubser (2009) All Wikipedia activity How is editing organized? Administrators can impact negatively on participation Yasseri, Kertesz (2012) West, Weber and Castillo (2012) Editorial activity on Wikipedia, especially reverts Wikipedia contributions related to Yahoo! browsing Understanding conflict and collaboration What characterizes Wikipedia contributors information behaviour compared to Wikipedia readers and non readers Types of conflicts can be modelled Wikipedia contributors are more information hungry, especially about their topics

Scientificity and Big Data: Pro and Con Pro Replicability, extension to new domain Total datasets, whole universe No sampling needed, data for all behaviour and over whole existence Ready made manipulability Powerful relation of data to object Con Limited access to object, skills needed for manipulability Not known who users are often Company does not say how data gathered Researcher does not ask what is of interest without givenness Datasets capture limited dimensions, and about one object Object in isolation, not framed for social change significance

Conclusions Savage and Burrows?, who ask are commercial data outpacing social science? Boyd and Crawford?, who ask if big data raise ethical and epistemological conundrums?... No... The connection between research technologies and the advance of knowledge The threats and opportunities represented by unprecedented windows into people s minds and thoughts Does this lead to more scientific (i.e. cumulative) social sciences and humanities?

Implications For research Develop theoretical frame in which to embed big data (for new media), including power/function, relation to traditional media, and role in society For research policy Robust base for advancing research, including shared and open databases For society Awareness of how research can generate transparency and manipulability

Additional readings and references Bond, Robert et al. (2012). A 61 million person experiment in social influence and political mobilization, Nature 489: 295 298. Bruns, A. and Liang, Y.E. (2012). Tools and methods for capturing Twitter data during natural disasters, First Monday, 17 (4 2), http://firstmonday.org/htbin/cgiwrap/bin/ojs/index.php/fm/article/viewarticle/3937/3193 Furnas, A. and Gaffney, D. (2012). Statistical Probability That Mitt Romney's New Twitter Followers Are Just Normal Users: 0%. The Atlantic, July 31, http://www.theatlantic.com/technology/archive/2012/07/statisticalprobability that mitt romneys new twitter followers are just normal users 0/260539/ (accessed August 31, 2012). Giles, J. (2012). Making the Links: From E mails to Social Networks, the Digital Traces left Life in the Modern World are Transforming Social Science, Nature, 488: 448 50. Kwak, H. et al. (2010). What is Twitter, a Social Network or a News Media? Proceedings of the 19th International World Wide Web (WWW) Conference, April 26 30, 2010, Raleigh NC. Manyika, J. et al. (2011). Big data: the next frontier for innovation, competition and productivity, McKinsey Global Institute, available at: http://www.mckinsey.com/insights/mgi/research/technology_and_innovation/ big_data_the_next_frontier_for_innovation (last accessed August 29, 2012). Silver, Nate. (2012). The Signal and the Noise: The Art and Science of Prediction. London: Allen Lane. Tancer, B. (2009). Click: What Millions of People are Doing Online and Why It Matters. New York: Harper Collins, 2009. Wu, S., J.M. Hofman, W.A. Mason, and D.J. Watts, (2011). Who says what to whom on twitter, Proceedings of the 20th international conference on World Wide Web. (on Duncan Watts webpage, http://research.microsoft.com/en us/people/duncan/, last accessed August 29, 2012).

Ralph Schroeder ralph.schroeder@oii.ox.ac.uk http://www.oii.ox.ac.uk/people/?id=26 Oxford Internet Institute See http://www.oii.ox.ac.uk/research/projects/?id=98 With support from: