Rise of Crowdsourcing. Crowdsourcing = Harvesting society s wisdom, skill, creativity, and scale to solve a task

Similar documents
International Journal of Advanced Engineering Research and Applications (IJAERA) ISSN: Vol. 1, Issue 6, October Big Data and Hadoop

Crowdsourcing for Big Data Analytics

Big Data and Analytics: Challenges and Opportunities

Technology Services...Ahead of Times. Enterprise Application on ipad

COMP9321 Web Application Engineering

REQUESTER BEST PRACTICES GUIDE

Big Data Analytics. Chances and Challenges. Volker Markl

workforceiq Job Despatch and Workforce Management software for Smartphones

Participatory Cloud Computing and the Privacy and Security of Medical Information Applied to A Wireless Smart Board Network

Grid SM for Help Desk Spring Market Presence. Satisfaction. Zendesk Desk.com. Freshdesk. Parature. Kayako IssueTrak. TeamSupport.

Technology Enablement

GigaSpaces Real-Time Analytics for Big Data

Web analytics: Data Collected via the Internet

Real-Time Analytics on Large Datasets: Predictive Models for Online Targeted Advertising

The Challenges of Geospatial Analytics in the Era of Big Data

Constant Contact. Responsyssys. VerticalResponse. Bronto. Satisfaction

The Future of Business Analytics is Now! 2013 IBM Corporation

Basheer Al-Duwairi Jordan University of Science & Technology

Integrating Social Media into Emergency Management Planning

Zeenov Agora High Level Architecture

Big Data: Key Concepts The three Vs

A Crowdsourcing Model for Receiving Design Critique

People centric sensing Leveraging mobile technologies to infer human activities. Applications. History of Sensing Platforms

Grid SM for E-Signature Winter 2014

Workshop on Android and Applications Development

Crowdsourcing and Cloud Computing Towards Trustful Pavement Condition Ranking

DIGITAL MARKETING TRAINING

SECURITY AND REGULATORY COMPLIANCE OVERVIEW

CSci 8980 Mobile Cloud Computing. MCC Overview

Computers, Smartphones & Tablets Sales:

Chapter 6. Attracting Buyers with Search, Semantic, and Recommendation Technology

Professional Diploma in Digital Marketing

Augmented Search for Software Testing

Android Developer Applications

Big Data, Physics, and the Industrial Internet! How Modeling & Analytics are Making the World Work Better."

Digital marketing strategy

Context Aware Predictive Analytics: Motivation, Potential, Challenges

MASHUPS FOR THE INTERNET OF THINGS

Best Practices Social Roadmap from American Marketing Association:

Human Activity Recognition

Graph Distances in the Streaming Model

A System for Human-Machine Interactive Learning. Kevin Jamieson University of Wisconsin - Madison / UC Berkeley AMP Lab

media kit 2014 PUBLISH / DEVELOP Global Mobile Ad Network

How To Create A Spam Detector On A Web Browser

Feature Rich and Intuitive Sophisticated, yet easy to use, tools for top marketers.

The future: Big Data, IoT, VR, AR. Leif Granholm Tekla / Trimble buildings Senior Vice President / BIM Ambassador

Using Crowdsourcing to Enhance Crisis Management

A Scalable Network Monitoring System as a Public Service on Cloud

WHITEPAPER: CROWDSOURCED TRANSLATIONS FOR ANDROID APP LOCALIZATION: HOWS & WHYS. Publication by. Guide for Android developers

The approach Microsoft has taken with its Windows Phone 7 platform is

Mobile Cloud Computing In Business

This Conference brought to you by

New Possibilities for Extending LivePerson Solutions

Inbound Marketing Services

Your Next Customer Is Looking For Your Products & Services On Local Search Websites, Apps & Maps, RIGHT NOW!

URBAN MOBILITY IN CLEAN, GREEN CITIES

Experts in mobility business strategy and industry verticals

media kit 2014 Advertise Global Mobile Ad Network

GEOGRAPHIC CONTEXT ANALYSIS OF VOLUNTEERED INFORMATION

Open-Source, Web-Based, Framework for Integrating Applications with Social Media Services and Personal Cloudlets

Collaborations between Official Statistics and Academia in the Era of Big Data

C O M M U N I C A T I O N S I N C. Marketing. Presented by Claudia Guerrero

The Berkeley AMPLab - Collaborative Big Data Research

GRAPHICAL USER INTERFACE, ACCESS, SEARCH AND REPORTING

Webcasting vs. Web Conferencing. Webcasting vs. Web Conferencing

Leveraging Mobile Context for Effective Collaboration and Task Management

Cloud Computing Trends

THE ENTERPRISE GAMING COOKBOOK

Rapid Application Development

BIG DATA: FROM HYPE TO REALITY. Leandro Ruiz Presales Partner for C&LA Teradata

BIG DATA TRENDS AND TECHNOLOGIES

BIG DATA Alignment of Supply & Demand Nuria de Lama Representative of Atos Research &

IoT and Big Data- The Current and Future Technologies: A Review

How To Teach A Mobile Operating System To An It Project

Statistical Challenges with Big Data in Management Science

Sworn to software quality SWORN TO SOFTWARE QUALITY

LEVERAGE BIG DATA ANALYTICS TO IMPROVE CUSTOMER EXPERIENCE

Digital Tactics for Community Engagement Marketing

Exploring Big Data in Social Networks

MOBILE MICROAPPS. The shortest path to enterprise mobility

It Takes a Village to Raise a Machine Learning Model. Lucian

Google Apps Education Edition Kommits Conference

Monetizing Mobile Applications How to maximize investment, move up the value chain and expand into new markets

GERONIMO. SUCCESS STORY. KMS Technology Inc. 550 Pharr Road NE, Suite 525, Atlanta, GA (678) 813-1KMS

Big Data Big Deal? Salford Systems

The Real Questions about. Social Media Monitoring/Web Listening

Turker-Assisted Paraphrasing for English-Arabic Machine Translation

Voice. Internet. Apps. Data Center. Wide Area Networks. Business is better in the cloud

Sound-Proof: Usable Two-Factor Authentication Based on Ambient Sound

Internet Marketing with Yieldbot and Site Visits

Mobile App Development Using App Inventor

Five Strategies for Performance Testing Mobile Applications

UNB s Mobility Strategy

Using Stranger as Sensors: Temporal and Geo-sensitive Question Answering via Social Media

Security challenges for internet technologies on mobile devices

Appscend Mobile Platform Whitepaper

Tomer Shiran VP Product Management MapR Technologies. November 12, 2013

MOBILE APP DEVELOPMENT CUSTOM CROSS PLATFORM APPLICATIONS

A REVIEW PAPER ON THE HADOOP DISTRIBUTED FILE SYSTEM

Building HTML5 and hybrid mobile apps using cloud services. Andrei Glazunov

Transcription:

Rise of Crowdsourcing Crowdsourcing: Opportunities and Challenges Deepak Ganesan Associate Professor UMass Amherst Crowdsourcing = Harvesting society s wisdom, skill, creativity, and scale to solve a task... Examples of crowdsourcing systems

Examples of crowdsourcing systems Classifying by complexity Outline What is the Amazon Mechanical Turk? What is crowdsourcing? Human computation using Mechanical Turk Crowdsourced data collection using mcrowd Course Overview Project Ideas

AMT Basics AMT as a Research Enabler Mechanical Turk provides a set of primitives HIT properties (reward, instructions, requirements, qualifications) Assignment can be approved or rejected Worker can be bonused Worker can be blocked Qualification can be assigned or revoked No explicit requester reputation but several websites (e.g. turkopticon) provide information on good/bad requesters. Advancing Computer Vision with Humans in the Loop (ACVHL) Why is AMT popular for research? Challenges in using AMT Scalable: 50K+ humans in steady state. Fast: Rapid responses from thousands of workers. Cheap: One or few cents per task. Hassle-free: Subject anonymity/identifiability/prescreening/diversity Speed Price Accuracy How much to pay workers? How to reduce delay for responses? How to maximize accuracy?

How much to pay workers? How to reduce delay? 1 disinterest vs spam Oveall Delay Compared! 160 140 120 Cumulative Prob 0.1 0.01 100 200 300 400 500 600 700 800 900 1000 Time(s) C01 C03 C05 100 80 60 40 20 0 1/29/2014 3/1/2014 4/1/2014 5/1/2014 6/1/2014 7/1/2014 avg(min) Outline Mobile crowdsourcing for data creation What is crowdsourcing? Human computation using Mechanical Turk Crowdsourced data collection using mcrowd Course Overview Project Ideas Data creation leverage millions of smartphone users to provide real-time, contextaware data about environment, transportation, health, civic issues, etc

Rise in Mobile Crowdsourcing Apps mcrowd: A Task Market for Mobile Sensing Marketplace TASKS Geo-tagged sensor data (audio, video, image,...) User surveys Activity traces Event reporting Q&A Annotation tasks mcrowd Architecture mcrowd: Viewing Tasks Web Services API mcrowd client Web Services API Admin GPS traces Signal quality

mcrowd: Creating a Task Why mcrowd? Enable micro-crowdsourcing efforts Gather water quality info on a stream near home. Setup crowdsourcing system for students on a field trip. Specify widgets Blacklist/Whitelist Geo-scope Deadline Incentives Data publishing Data visualization Mobile Clients Why mcrowd? Enable micro-crowdsourcing efforts Avoid fragmented user base Enable micro-crowdsourcing efforts Avoid fragmented user base Explore diverse incentives for user retention Percentage Retention Why mcrowd? 100 points $$ rewards time-varying incentives location-based incentives 75 50 25 0 30 days 60 days Flurry Analytics 90 days

Why mcrowd? Enable micro-crowdsourcing efforts Avoid fragmented user base Explore diverse incentives for user retention Study data quality assessment issues Raw Data: Noise, Bias, Error, Redundancy Outline What is crowdsourcing? Human computation using Mechanical Turk Crowdsourced data collection using mcrowd Course Overview Project Ideas Filtered Data: Verified, unbiased, relevant What do I expect from you? Taking course for one credit: Two paper presentations Written reviews for any ten papers Taking course for three credits: Two paper presentations Written reviews for any twenty papers Significant course project All reviews will be online on the course webpage. Course structure Invited Talks Nathan Eagle (MIT) - Mobile crowdsourcing Panos Ipeirotis (NYU) - Data quality management Arvind Thiagarajan (MIT) - Traffic crowdsourcing Jordan Boyd-Graber (UMD) - NLP crowdsourcing Chris Callison-Burch (JHU) - NLP crowdsourcing John Horton (Harvard) - Policy/Economics & crow... Rob Baker (Ushahidi) - Disaster relief & management Shaili Jain (Yale) - Incentives in crowdsourcing Murat Demirbas (UBuffalo) - Twitter for sensing..

Course structure Papers from several conferences/workshops NAACL, MobiSys, Mobicom, Ubicomp, CHI, EC, AAAI, KDD, SenSys, HCOMP... Need four volunteers for papers next week: Games with a purpose (IEEE Computer) Predicting the present with Google Trends (Google TR) TurKit: Human Computation Algorithms on... (UIST 2010) Who are the crowdworkers? Shifting demogra...(chi 2010) Paper presentations Paper presentation: 15 minutes Discuss the main ideas in the paper. Focus on new aspects that have not been discussed earlier in the seminar. End with discussion points: you are responsible for leading a discussion on the paper. 10 minute discussion of the paper. Outline Available Software for AMT Projects What is crowdsourcing? Human computation using Mechanical Turk Crowdsourced data collection using mcrowd Course Overview Project Ideas http://data.doloreslabs.com

Project Idea 1: Using AMT to process data Pick a data corpus that is hard to process using automated computer algorithms. Should improve on existing approaches, and/or demonstrate a new application domain for AMT. Example: Data cleaning engine for sensor data Accelerometer, GPS traces, temperature traces,..etc Can AMT workers help us make better sense of this data than ML algorithms to detect events? Project Idea 2: Toolkit to Optimize Delay, Quality, Cost Existing toolkits focus on data quality. Design toolkit that offers delay-quality-cost tradeoffs in using AMT. Model the online behavior of Turkers for the task: time-of-day effects price vs delay behavior data quality for individuals. Use learnt behavior to iteratively improve performance over time. Project Idea 3: Crowdsourced Measurement Infrastructure Use AMT + mcrowd to augment existing PlanetLabbased internet measurement infrastructure. Internet measurement is largely based on using fixed infrastructure (e.g. iplane). But large swaths of the Internet are not measured using this framework. Project Idea 4: Fine-grained model of AMT Monitor behavior of AMT at fine-granularity (minutes) using PlanetLab. Validate power-law distribution and other conclusions from existing studies. Can we utilize crowdsourcing to augment existing infrastructure?

Project Idea 5: MapReduce for AMT Tasks Available software for mobile crowdsourcing Design a MapReduce-like programming framework for using AMT Commonalities Large task divided into smaller sub-problems Work distributed among worker nodes (turkers) Collect all answers and combine them Challenges: Delay variability in obtaining responses Use Turker reputation to determine map function Support for delay-cost-quality tradeoffs Crisis crowdsourcing: Support for several phones, web-backend, report validation engine, Mechanical Turk filtering engine... mcrowd: support for iphone; surveys, image, audio collection; $$ + points incentives; REST APIs; Web backend (contact: Moaj Musthag) RCP: Crowdsourced data collection campaigns. Web backend; Android code for instances. (based on idea from Omar Alonso, Microsoft) Project Idea 6: Data Quality filtering engine using mcrowd + AMT Project Idea 7: A Privacy Engine for mcrowd Privacy is an important problem when dealing with data from phones. Design a privacy engine for mcrowd that: redundancy bad data wrong species provides simple & effective policies for data provider Use Crowdsourcing for data quality assessment Image processing Filter using untrained masses Expert validation supports backend data anonymization/perturbation/..

Project Idea 8: Have an application in mind? Project Idea 9: Come up with something that excites you... Design a mobile crowdsourcing application for your favorite cause. Given the time constraints of course: keep development time small (1.5 months) focus on deployment study with a reasonable number of users.