Reading News with Maps p.1/13

Reading News with Maps
Hanan Samet
hjs@cs.umd.edu
Department of Computer Science
University of Maryland
College Park, MD 20742, USA

Extend GIS Notions to Textually Specified Spatial Data
- Location-based vs. feature-based queries
  1. Determine all documents mentioning location/region R
  2. Determine all locations/regions mentioned by articles about topic T
     - T is not necessarily known a priori
     - Topics are ranked by importance, which could be defined by the number of documents that comprise them
- Extend further to spatial data specified by direct manipulation actions such as pointing, pan, and zoom
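The two query types can be sketched with a pair of in-memory lookups. This is only a minimal illustration under stated assumptions: the toy corpus `DOCS`, the topic labels, and the function names `build_indexes` and `rank_topics` are hypothetical, and a real system would derive topics and locations from the text itself.

```python
from collections import defaultdict

# Hypothetical toy corpus: each document carries a topic label and the
# set of locations it mentions.
DOCS = {
    "d1": {"topic": "election", "locations": {"Tehran", "Paris"}},
    "d2": {"topic": "earthquake", "locations": {"Port-au-Prince"}},
    "d3": {"topic": "election", "locations": {"Tehran"}},
}


def build_indexes(docs):
    """Build location -> documents and topic -> locations lookups."""
    docs_by_location = defaultdict(set)
    locations_by_topic = defaultdict(set)
    for doc_id, doc in docs.items():
        for loc in doc["locations"]:
            docs_by_location[loc].add(doc_id)      # location-based query
        locations_by_topic[doc["topic"]] |= doc["locations"]  # feature-based query
    return docs_by_location, locations_by_topic


def rank_topics(docs):
    """Rank topics by the number of documents that comprise them."""
    counts = defaultdict(int)
    for doc in docs.values():
        counts[doc["topic"]] += 1
    # Most documents first; ties broken alphabetically for determinism.
    return sorted(counts, key=lambda t: (-counts[t], t))


docs_by_location, locations_by_topic = build_indexes(DOCS)
```

Here `docs_by_location["Tehran"]` answers query type 1 and `locations_by_topic["election"]` answers query type 2.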

STEWARD: A Spatio-Textual Search Engine
1. Spatio-Textual Extraction on the Web Aiding Retrieval of Documents
2. Sample spatio-textual query:
   - Keyword: rock concert
   - Location: near College Park, MD
3. Result documents are relevant to both keyword and location
   - Mention of rock concert
   - Spatial focus near College Park, MD
4. Issues with results from conventional search engines:
   - Is it the intended College Park?
   - What about spatial synonyms such as rock concerts in Hyattsville or Greenbelt?
   - Don't usually understand the various forms of specifying geographic content
     - More than just postal addresses!
   - Results often based on other measures, e.g., link structure
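A spatio-textual query of this kind can be sketched as a keyword filter combined with a distance filter on each document's spatial focus. This is an illustrative sketch, not STEWARD's implementation: the documents, the approximate coordinates, the 10 km radius, and the function names are all assumptions.

```python
import math


def haversine_km(a, b):
    """Great-circle distance in km between two (lat, lon) pairs in degrees."""
    lat1, lon1 = map(math.radians, a)
    lat2, lon2 = map(math.radians, b)
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * 6371.0 * math.asin(math.sqrt(h))


# Hypothetical documents; "focus" is an approximate (lat, lon) of the
# document's spatial focus.
DOCS = [
    {"id": "a", "text": "rock concert tonight", "focus": (38.9897, -76.9378)},      # College Park, MD
    {"id": "b", "text": "rock concert in the park", "focus": (38.9559, -76.9455)},  # Hyattsville, MD
    {"id": "c", "text": "rock concert downtown", "focus": (33.6534, -84.4494)},     # College Park, GA
    {"id": "d", "text": "city council meeting", "focus": (39.0046, -76.8755)},      # Greenbelt, MD
]


def spatio_textual_search(docs, keyword, center, radius_km):
    """Return ids of documents matching the keyword within radius_km, nearest first."""
    hits = []
    for doc in docs:
        if keyword.lower() not in doc["text"].lower():
            continue  # fails the textual part of the query
        dist = haversine_km(center, doc["focus"])
        if dist <= radius_km:
            hits.append((dist, doc["id"]))
    return [doc_id for _, doc_id in sorted(hits)]


college_park_md = (38.9897, -76.9378)
results = spatio_textual_search(DOCS, "rock concert", college_park_md, 10.0)
```

The Hyattsville document is returned as a spatial synonym of the query location, while the College Park in Georgia is excluded by distance even though its name matches.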

STEWARD Is Not Google Local
1. Google Local geocodes postal addresses into points on the map
   - Address strings are well-formatted
   - Most results drawn from online yellow pages
2. STEWARD works on unstructured text documents
   - Document is a bag of words
   - Goal is more than searching for addresses in documents, which is easier
3. STEWARD Goals:
   - Identify all geographic locations mentioned in document (i.e., Geotagging)
   - Identify geographic focus of document
   - Retrieve documents by spatio-textual proximity

STEWARD
http://donar.umiacs.umd.edu/steward/index.shtml
E.g., Colonias

NewsStand: Spatio-Textual Aggregation of News and Display
1. Extend GIS Notions to Textually Specified Spatial Data
2. Apply STEWARD ideas to a real-time collection of news articles
3. Crawls the web looking for news sources and feeds
   - Indexing 8,000 news sources
   - About 50,000 news articles per day
4. Aggregate news articles by both content similarity and location
   - Articles about the same event are grouped into clusters
5. Rank clusters by importance, which is based on:
   - Number of articles in cluster
   - Number of unique newspapers in cluster
   - Event's rate of propagation to other newspapers
6. Associate each cluster with its geographic focus or foci
7. Display each cluster at the positions of the geographic foci
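The three ranking factors in item 5 could be combined in many ways; the slides do not give a formula. The sketch below assumes a simple weighted sum, with hypothetical weights, a hypothetical `Article` record, and propagation rate approximated as unique sources per hour.

```python
from dataclasses import dataclass


@dataclass
class Article:
    source: str               # newspaper that published the article
    hours_since_first: float  # hours after the first article in the cluster


def importance(articles, w_count=1.0, w_sources=2.0, w_rate=5.0):
    """Score a cluster from its size, source diversity, and propagation rate.

    The weights and the linear form are assumptions for illustration only.
    """
    if not articles:
        return 0.0
    n_articles = len(articles)
    sources = {a.source for a in articles}
    # Time span of the cluster, floored at one hour to avoid division by zero.
    span_hours = max(max(a.hours_since_first for a in articles), 1.0)
    rate = len(sources) / span_hours  # unique newspapers reached per hour
    return w_count * n_articles + w_sources * len(sources) + w_rate * rate


fast_spreading = [Article("NYT", 0.0), Article("BBC", 1.0), Article("CNN", 2.0)]
single_source = [Article("NYT", 0.0), Article("NYT", 5.0), Article("NYT", 10.0)]
```

With these assumed weights, a story picked up by three newspapers within two hours outranks a story covered three times by one newspaper over ten hours.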

Goal: Change News Reading Paradigm
- Use a map to read news for all media (e.g., text, photos, videos)
- Choose place of interest and find topics/articles relevant to it
- Topics/articles determined by location and level of zoom
- No predetermined boundaries on sources of articles
- Application: monitoring hot spots
  1. Investors
  2. National security
  3. Disease monitoring
- One-stop shopping for spatially-oriented news reading
  1. Summarize the news: What are the top stories happening?
  2. Explore the news: What is happening in Darfur?
  3. Discover patterns in the news: How are the Olympics and Darfur related?
- Overall goal: Make the map the medium for presenting all information with a spatial component

Existing News Readers
1. Microsoft Bing
   - Rather primitive, with top stories presented linearly
   - Little or no classification by topic
2. Google News Reader
   - Classifies articles by topic
   - Local news search
     a. Aggregates articles by zip code or city, state specification
        E.g., articles mentioning College Park, MD
     b. Provides a limited number of articles (9 at the moment)
     c. Seems to be based on the host of the articles
        E.g., LA Times provides local articles for Los Angeles, CA
     d. Seems to use Google Search with location names as search keys
        E.g., articles for ZIP 20742 are those mentioning College Park, MD or University of Maryland
     e. Has no notion of story importance in the grand scheme
     f. International versions use international news sources

General Geotagging Issues
1. Identify geographical references in text
   - Does Jefferson refer to a person or a geographical location?
2. Disambiguate a geographical reference
   - Does London mean London, UK; London, Ontario; or one of 2570 other Londons in our gazetteer?
3. Determine spatial focus of a document
   - Is Singapore relevant to a news article about Hurricane Katrina?
   - Not so, if the article appeared in Singapore's Straits Times
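Issue 2 can be illustrated with a very simple heuristic: prefer the gazetteer candidate whose context words appear in the text, and fall back to the most populous candidate. This is a sketch under stated assumptions, not the method used by NewsStand; the tiny `GAZETTEER`, its context words, and its rough population figures are hypothetical.

```python
# Hypothetical two-entry gazetteer; population figures are rough and
# serve only as a tie-breaking prior.
GAZETTEER = {
    "London": [
        {"name": "London, UK",
         "context": {"uk", "england", "thames"},
         "population": 8_800_000},
        {"name": "London, Ontario",
         "context": {"ontario", "canada"},
         "population": 420_000},
    ],
}


def disambiguate(toponym, text, gazetteer=GAZETTEER):
    """Pick the candidate with the most context words in the text.

    Ties (including no context at all) go to the larger population.
    Returns None when the toponym is not in the gazetteer.
    """
    candidates = gazetteer.get(toponym, [])
    if not candidates:
        return None
    # Crude tokenization: lowercase, strip commas and periods, split on spaces.
    words = set(text.lower().replace(",", " ").replace(".", " ").split())

    def score(candidate):
        return (len(candidate["context"] & words), candidate["population"])

    return max(candidates, key=score)["name"]
```

For example, `disambiguate("London", "Flooding hit London, Ontario on Monday.")` selects the Canadian city because "ontario" appears in the text, while a sentence with no context words falls back to the population prior.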

NewsStand
http://newsstand.umiacs.umd.edu
http://newsstand.umiacs.umd.edu/news/light

TwitterStand: News from Tweets
- News gathering system using Twitter
  - Twitter is a popular social networking website
  - Tweets are 140-character messages akin to SMS
  - Mostly non-news, often frivolous
- TwitterStand is a spontaneous news medium
- Idea: Users of Twitter help gather news
  - Distributed news gathering
  - Scooping tool bypassing reporters or newspapers
  - E.g., Michael Jackson's death, Iranian election, Haitian earthquake
- Key challenges: Managing the Deluge
  - Twitter is a noisy medium, as most Tweets are not news
    - Challenge: Extract news Tweets from the mountain of non-news Tweets
  - Tweets arrive at a furious pace
    - Tweets capture the pulse of the moment
    - So storing and processing them in batches is not a good strategy
    - TwitterStand uses online algorithms, which work without access to the entire dataset (unlike offline algorithms)
  - Determine spatial focus of stories, enabling news reading on a map
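The idea of an online algorithm can be sketched with a leader-follower style clusterer that sees each Tweet once and never revisits earlier ones. This is an illustrative stand-in, not TwitterStand's actual algorithm: the class name, the bag-of-words cosine similarity, and the 0.3 threshold are all assumptions.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two term-frequency Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class OnlineClusterer:
    """Assign each incoming tweet to a cluster as it arrives, in one pass."""

    def __init__(self, threshold=0.3):
        self.threshold = threshold  # hypothetical similarity cutoff
        self.centroids = []         # summed term counts per cluster
        self.members = []           # tweet ids per cluster

    def add(self, tweet_id, text):
        vec = Counter(text.lower().split())
        best, best_sim = None, 0.0
        for i, centroid in enumerate(self.centroids):
            sim = cosine(vec, centroid)
            if sim > best_sim:
                best, best_sim = i, sim
        if best is not None and best_sim >= self.threshold:
            # Close enough to an existing story: fold the tweet into it.
            self.centroids[best].update(vec)
            self.members[best].append(tweet_id)
            return best
        # Otherwise the tweet starts a new cluster.
        self.centroids.append(vec)
        self.members.append([tweet_id])
        return len(self.centroids) - 1


clusterer = OnlineClusterer(threshold=0.3)
first = clusterer.add("t1", "michael jackson dies in los angeles")
second = clusterer.add("t2", "michael jackson dies at 50")
third = clusterer.add("t3", "earthquake strikes haiti")
```

Only the cluster summaries are kept in memory, so the cost of handling a new Tweet depends on the number of active clusters rather than on the number of Tweets seen so far.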

Ex: Tweets about Michael Jackson's Death
- Notice that Twitter beat the LA Times by more than two hours

Live Demo: TwitterStand System
http://donar.umiacs.umd.edu:4000/news