Using R for Social Media Analytics



Similar documents
Exploring the Structure of Government on the Web

Text Mining with R Twitter Data Analysis 1

Wiley. Automated Data Collection with R. Text Mining. A Practical Guide to Web Scraping and

CPSC 340: Machine Learning and Data Mining. Mark Schmidt University of British Columbia Fall 2015

Web Data Visualization

See how social media listening and engagement can help your business

MLg. Big Data and Its Implication to Research Methodologies and Funding. Cornelia Caragea TARDIS November 7, Machine Learning Group

6 TWITTER ANALYTICS TOOLS. SOCIAL e MEDIA AMPLIFIED

Network Maps for End Users: Collect, Analyze, Visualize and Communicate Network Insights with Zero Coding

BUDT 758B-0501: Big Data Analytics (Fall 2015) Decisions, Operations & Information Technologies Robert H. Smith School of Business

Social Media Analytics, from Data to Deliverables. REQUEST A FREE TRIAL. Visit or call

Social Media Measurement Meeting Robert Wood Johnson Foundation April 25, 2013 SOCIAL MEDIA MONITORING TOOLS

Assignment 5: Visualization

Computer-Based Text- and Data Analysis Technologies and Applications. Mark Cieliebak

Socialbakers Analytics User Guide

IC05 Introduction on Networks &Visualization Nov

Data Analytics at NICTA. Stephen Hardy National ICT Australia (NICTA)

Big Data Analytics in R

Spatio-Temporal Patterns of Passengers Interests at London Tube Stations

Manual. Netumo NETUMO HELP MANUAL Copyright Netumo 2014 All Rights Reserved

An Ontology Based Text Analytics on Social Media

QUICK START GUIDE. Cloud based Web Load, Stress and Functional Testing

The 2012 State of Web and Social Media Analytics in Higher Education

Digital Data Analysis:

Social Relationship Analysis with Data Mining

The 4 Pillars of Technosoft s Big Data Practice

SOCIAL MEDIA FOR MSMEs A turning point. By DR. PRALAY DEY National Small Industries Corporation (NSIC)

Location-Based Social Media Intelligence

JustClust User Manual

The Matsu Wheel: A Cloud-based Scanning Framework for Analyzing Large Volumes of Hyperspectral Data

a Data Science Univ. Piraeus [GR]

Data Mining, Predictive Analytics with Microsoft Analysis Services and Excel PowerPivot

Cymon.io. Open Threat Intelligence. 29 October 2015 Copyright 2015 esentire, Inc. 1

Creating Usable Customer Intelligence from Social Media Data:

Is a Data Scientist the New Quant? Stuart Kozola MathWorks

Lavastorm Analytic Library Predictive and Statistical Analytics Node Pack FAQs

Website, Blogs, Social Sites : Create web presence in the world of Internet rcchak@gmail.com, June 21, 2015.

Information Management course

Above the fold: It refers to the section of a web page that is visible to a visitor without the need to scroll down.

How we analyzed Twitter social media networks with NodeXL

SnapTrends v2.6. User Guide.

A Platform for Supporting Data Analytics on Twitter: Challenges and Objectives 1

Leveraging Big Data in Introductory Programming Courses. David Reed Department of Journalism, Media & Computing Creighton University

Innovation Fellowship Final Assessment

WebFOCUS RStat. RStat. Predict the Future and Make Effective Decisions Today. WebFOCUS RStat

Bayesian networks - Time-series models - Apache Spark & Scala

NVivo 10 for Windows and NVivo for Mac

CUSTOMER Presentation of SAP Predictive Analytics

Combining Social Data and Semantic Content Analysis for L Aquila Social Urban Network

Automating Big Data Management, by DISIT Lab Distributed [Systems and Internet, Data Intelligence] Technologies Lab Prof. Ph.D. Eng.

Certification In SAS Programming. Introduction to SAS Program

Exploring Big Data in Social Networks

Volunteer Software Feature Comparison

Social Network Mining

Detecting Spam Bots in Online Social Networking Sites: A Machine Learning Approach

Social Media Business Partner Advertising Kit. Click to add text IBM Corporation

Professional Diploma in Digital Marketing

Tweets as big data. Rob Procter and Alex Voss. andrews.ac.

Visualization methods for patent data

How To Create A Data Science System

How To Make Sense Of Data With Altilia

Research Article International Journal of Emerging Research in Management &Technology ISSN: (Volume-4, Issue-4) Abstract-

White Paper. How Streaming Data Analytics Enables Real-Time Decisions

Customer Behaviour Analytics: Billions of Events to one Customer-Product Graph. Budapest BI Forum, 6th November 2013 Presented by Paul Lam

We all need to be able to capture our profit generating keywords and understand what it is that is adding the most value to our business in 2014.

LDIF - Linked Data Integration Framework

Hadoop & SAS Data Loader for Hadoop

R Tools Evaluation. A review by Global BI / Local & Regional Capabilities. Telefónica CCDO May 2015

BIG DATA IN BUSINESS ENVIRONMENT

Using Data Mining for Mobile Communication Clustering and Characterization

Big Data Analytics. Lucas Rego Drumond

Jan 28 th, 2015 FREE Webinar by

Introduction to Data Mining

Search and Data Mining: Techniques. Introduction Anna Yarygina Boris Novikov

A GENERAL TAXONOMY FOR VISUALIZATION OF PREDICTIVE SOCIAL MEDIA ANALYTICS

Digital Media Technologies. Web & Social Media Marketing. Australian College of Information Technology Gold Cost Brisbane AUSTRALIA

Big Data and Society: The Use of Big Data in the ATHENA project

Annotea and Semantic Web Supported Collaboration

Technology overview for the HPE Living Progress Challenge

Collective Behavior Prediction in Social Media. Lei Tang Data Mining & Machine Learning Group Arizona State University

the beginner s guide to SOCIAL MEDIA METRICS

Social Media Monitoring: Engage121

Data Science and Business Analytics Certificate Data Science and Business Intelligence Certificate

Speaker Monique Sherrett

Semantic Sentiment Analysis of Twitter

CLUSTER ANALYSIS WITH R

ISSN: CONTEXTUAL ADVERTISEMENT MINING BASED ON BIG DATA ANALYTICS

Use of social media data for official statistics

Social Media Planning. for TRIUMF

Social Media Analytics. Social Media Workshop Twitter Facebook Instagram

Pattern Insight Clone Detection

Final Project Report

Multichannel Customer Listening and Social Media Analytics

Six Degrees of Separation in Online Society

Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap

An Introduction to Data Mining. Big Data World. Related Fields and Disciplines. What is Data Mining? 2/12/2015

Final Report by Charlotte Burney, Ananda Costa, Breyuana Smith. Social Media Engagement & Evaluation

Magento 1.4 Themes Design

Impressive Analytics

CASE STUDY: SPIRAL16

Transcription:

Using R for Social Media Analytics Presentation to Tools for Teaching and Learning of Social Media Analytics Blue Sky workshop, 2015 International Communication Association conference (San Juan, Puerto Rico, 21-25 May) Dr Robert Ackland Virtual Observatory for the Study of Online Networks (VOSON) Lab Research School of Social Sciences Australian National University homepage: https://researchers.anu.edu.au/researchers/ackland-rj VOSON Lab: http://voson.anu.edu.au May 2015

Introduction R is free and open source software for statistical computing and graphics www.r-project.org The R syntax and approach presents a bit of a learning curve (especially for social scientists) but benefits are worth the effort: incredibly wide range of R packages/libraries are available promotes scientific collaboration via open source code and data R is hard to pass up if you want to do social media research and (intermediate-to-advanced) teaching Three R packages being developed by the VOSON Lab (and associates): vosonr client for VOSON software for WWW hyperlink network and website text content collection and analysis SocialMediaLab collection and analysis of social media data AdaptiveSampling network sampling (not discussed here) 2

VOSON VOSON collection and analysis of WWW hyperlink and text content VOSON 2.0 web interface works with Firefox, Chrome, Safari, ipad Available for academics/students since 2006, for general use since 2010 VOSON is not free and open source (free version and paid versions) For VOSON accounts: http://www.uberlink.com 3

vosonr R client for VOSON software (via web service) With vosonr, you can schedule a crawl and import networks into R (as igraph graph objects). Also exports graphml. ICA2015 alpha version for download: http://voson.anu.edu.au/tools Beta version will be released soon Contributors: Robert Ackland (vosonr, VOSON core development team) Tim Graham (vosonr) Jamsheed Shorish (VOSON core development team) Email me if would like to know more... 4

SocialMediaLab There are many R packages that can be used for collection and analysis of social media data However some of these packages require advanced programming skills... SocialMediaLab is an R package being developed by Tim Graham (Univ. Queensland) and Robert Ackland (ANU) Provides a virtual lab for the collection and analysis of social media data A wrapper around many other R packages, making it easier for nonprogrammers to do social media analytics research and teaching in R The package is being developed for use in Big Data for Social Scientists ACSPRI short course (29 Jun - 3 Jul, Brisbane): http://voson.anu.edu.au/big-data-analysis We will release alpha version of SocialMediaLab package and associated open source courseware by 29 June 5

SocialMediaLab data collection Twitter for supplied search term (e.g. hashtag), produces: networks (graphml files, igraph graph objects): actor network (users who tweeted term or mentioned in these tweets), bimodal network (actors and extracted terms), semantic network of extracted terms csv file containing tweet payloads Youtube for supplied video id, produces: networks: actor network (thread network from video comments), bimodal network (actors and extracted terms), semantic network (extracted terms) csv file containing comments Facebook fan pages for supplied fan page, produces: networks: actor network (based on co-comments, co-likes), bimodal network (actors and posts), semantic network of extracted terms csv file containing fan page posts and comments WWW hyperlink networks (via vosonr) 6

SocialMediaLab & course analysis Network analysis (core): Intro to SNA, covering main network-level and node-level metrics, and clustering ( community detection ) Visualisation (core): Network visualisations, interactive networks (on the web), word clouds and dendrograms from network data. Text analysis (core): Supervised machine learning (e.g. support vector machines), unsupervised machine learning (topic modelling), sentiment analysis, hierarchical clustering, and descriptive analytics. 7

SocialMediaLab & course analysis Temporal analysis (advanced): Analysing networks and text over time, identifying significant changes in behaviour of individual nodes, clusters or entire networks. Filtering and sampling (advanced): Techniques for targeting data collection and analysis on a particular set of actors. Reducing the scale of datasets via sampling. Scaling up to very large datasets (advanced): What if your dataset is too large for your computer? 8

R packages used in SocialMediaLab (planned) igraph (network analysis and visualisation) twitter (for collecting Twitter data) tm (text mining) RTextTools (machine learning package for automatic text classification) RCurl (collecting WWW data) XML (reading and creating XML documents) R.utils (programming utilities) wordcloud (text word clouds) ape and dendextend (dendograms, hierarchical clustering) FactoMineR and homals (multiple correspondence analysis) plyr and stringr (text sentiment analysis) 9