Big Data Analytics in Mobile Environments



Transcription:

Big Data Analytics in Mobile Environments
Professor Hui Xiong
Rutgers, the State University of New Jersey
2012-10-2

Why Big Data: A Historical View
- Productivity versus complexity (interrelatedness, ambiguity)
- Complex versus complicated: while the complicated can be unfolded for analysis, the complex cannot.
- Diagram labels: Big Machines, Software, Connectivity, Knowledge for Business Operation, Hardware, Storage, Big Data

Similarities Between Data Miners and Doctors
- Very often, no standardized solutions
- Data characteristics
- Data mining techniques
- Medical devices

So What is Big Data?
Big Data refers to datasets that grow so large that they are difficult to capture, store, manage, share, analyze, and visualize with typical database software tools.

What Makes it Big Data?
- Example sources: social, blog, smart meter
- Volume, Velocity, Variety, Value
- Big is also a relative concept: Data Size / Solution-Time-Window >= Computing Capacity Per Time Unit
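The relative definition on this slide is a rate comparison: a workload counts as "big" when the data that must be processed per unit of time meets or exceeds what the available system can handle. A minimal sketch, assuming sizes in gigabytes and time in hours; the function name `is_big_data` and the numbers in the usage note are illustrative and do not come from the slides.

```python
def is_big_data(data_size_gb: float,
                time_window_hours: float,
                capacity_gb_per_hour: float) -> bool:
    """Apply the slide's inequality:
    data size / solution time window >= computing capacity per time unit.
    Units (GB, hours) are an assumption made for this sketch."""
    if time_window_hours <= 0:
        raise ValueError("time window must be positive")
    # Rate at which data must be processed to finish inside the window.
    required_rate = data_size_gb / time_window_hours
    return required_rate >= capacity_gb_per_hour
```

For example, 1,000 GB that must be analyzed within 2 hours on a system that handles 100 GB per hour satisfies the inequality, while 100 GB within 10 hours on a system that handles 50 GB per hour does not.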

Big Data Use Cases

Healthcare
  Today's challenge: Expensive office visits, hospital dynamics
  New data: Remote patient monitoring, hospital sensors
  What's possible: Preventive care, reduced hospitalization, reduced human mistakes

Manufacturing
  Today's challenge: In-person support
  New data: Product sensors
  What's possible: Automated diagnosis, customized support

Location-Based Services
  Today's challenge: Based on home zip code
  New data: Real-time location data
  What's possible: Geo-advertising, urban computing, mobile recommendation

Finance
  Today's challenge: Fast-paced, variety
  New data: Social media, high-frequency trading data
  What's possible: Sentiment analysis, finance engineering

Retail
  Today's challenge: One-size-fits-all marketing
  New data: Market basket data, user behavior logs
  What's possible: Personalized recommendation, segmentation

10 Ways Mobile Tech Is Changing Our World
1. Elections Will Never Be The Same
2. Doing Good By Texting
3. Bye-Bye, Wallets
4. The Phone Knows All
5. Your Life Is Fully Mobile
6. The Grid Is Winning
7. A Camera Goes Anywhere
8. Toys Get Unplugged
9. Gadgets Go To Class
10. Disease Can't Hide

Human Mobility
Human mobility refers to people's movement trajectories, which can be:
- phone traces or trajectories of driving routes
- a sequence of posts (such as geo-tweets, geo-tagged photos, or check-ins)
- indoor traces and outdoor traces

Urban Geography
Urban geography is the set of geographic characteristics of a city, including road networks, public transportation, places of interest (POIs), and regional functions.

Public transportation data

Points of Interest (POIs)

Outdoor Location Traces
Taxi GPS trajectories

Data Miners in Big Data Analytics
- Understand goals of business
- Collaborate in interdisciplinary teams
- Integrate large volumes of structured and unstructured data
- Formulate problems, develop solutions
- Blend statistical modeling, data mining, forecasting, optimization
- Develop/run integrated software solutions
- Gain higher visibility
- Change business operation

Big Data Application Requirements
- Timely observation
- Timely analysis
- Timely solution

Big Data Application Trends
- Micro-scope: personalized recommendation
- Macro-scope: urban computing
- Figure axis (applications/techniques): micro-scope, classic, macro-scope

Data Driven Solutions
- Theoretical top-down solutions
- Data-driven bottom-up solutions

Big Data Application Requirements
- Technical knowledge
- Domain knowledge
- Big data application

Big Data Experiences
- Understanding data characteristics: data distribution, data quality, etc.
- Feature engineering
  Feature engineering is one of the key strategies for the success of big data analytics. The goal is to explicitly reveal important information to the model through feature selection or feature generation:
  - original features
  - different encodings of the features
  - combined features
- Instance selection (particularly in mobile environments)
  The goal is to select the right instances/objects for the underlying data analytics.
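The three kinds of generated features and the idea of instance selection can be illustrated on a single GPS record. This is only a sketch under assumptions: the field names, the cyclical hour encoding, the rush-hour windows, and the bounding-box filter are choices made for illustration, not techniques named in the slides.

```python
import math
from datetime import datetime


def engineer_features(lon: float, lat: float, ts: datetime, occupied: bool) -> dict:
    """Build a feature dictionary from one (hypothetical) taxi GPS record."""
    hour = ts.hour + ts.minute / 60.0
    angle = 2 * math.pi * hour / 24.0
    return {
        # Original features, passed through unchanged.
        "lon": lon,
        "lat": lat,
        # A different encoding of the timestamp: the hour mapped onto a
        # circle so that 23:00 and 01:00 end up close together.
        "hour_sin": math.sin(angle),
        "hour_cos": math.cos(angle),
        "is_weekend": ts.weekday() >= 5,
        # A combined feature built from two original ones (assumed windows).
        "occupied_at_rush_hour": occupied
        and (7 <= ts.hour < 10 or 16 <= ts.hour < 19),
    }


def select_instances(records, min_lon, max_lon, min_lat, max_lat):
    """Instance selection: keep only records inside a bounding box."""
    return [
        r for r in records
        if min_lon <= r["lon"] <= max_lon and min_lat <= r["lat"] <= max_lat
    ]
```

In practice the selection criterion would depend on the analysis at hand; a bounding box is simply one of the cheapest ways to discard irrelevant mobile records before modeling.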

Mobile Recommender Systems
Background
- Revolution in mobile devices: GPS, WiFi, mobile phones
- The urgent demand for better service: driving route suggestion, mobile tourist guides
Definition
- Mobile pervasive recommendation promises to give mobile users access to personalized recommendations anytime, anywhere.

Mobile Recommender Systems
- Route recommendation
- Travel recommendation

Challenges for Mobile Recommendation (I)
- Complexity of the mobile data: heterogeneous, spatially and temporally auto-correlated, noisy
- The validation problem: no ratings
- The generality problem: different application domains with different recommendation techniques

Challenges for Mobile Recommendation (II)
- The cost constraints: time, price
- The life cycle problem
- The transplantation problem: it is difficult to apply traditional recommendation techniques to mobile recommendation

The Characteristics of Mobile Data
Two cases
- Case 1. Location traces of taxi drivers
- Case 2. The tourism data
Why these two?
- They give good coverage of the unique characteristics of mobile data
- They can be naturally exploited for developing mobile recommender systems
- They are real-world data

The Characteristics of Mobile Data
Case 1. Location traces of taxi drivers
Data description
- GPS traces: location information (longitude, latitude) and timestamp
- The operation status (with or without passengers)
- Experienced drivers usually have more driving hours and higher occupancy rates
- Inexperienced drivers tend to have fewer driving hours and lower occupancy rates
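The two quantities used to contrast drivers, driving hours and occupancy rate, can be derived directly from records of the kind described on this slide. A minimal sketch, assuming each record carries longitude, latitude, a timestamp, and an occupied flag; the class name `GpsPoint` and the rule that a status holds until the next record are assumptions made here.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass
class GpsPoint:
    longitude: float
    latitude: float
    timestamp: datetime
    occupied: bool  # True when a passenger is on board


def driving_hours_and_occupancy(trace):
    """Return (driving hours, occupancy rate) for one driver's trace.

    Assumption: the status recorded at a point holds until the next point.
    """
    points = sorted(trace, key=lambda p: p.timestamp)
    total_seconds = 0.0
    occupied_seconds = 0.0
    for prev, nxt in zip(points, points[1:]):
        dt = (nxt.timestamp - prev.timestamp).total_seconds()
        total_seconds += dt
        if prev.occupied:
            occupied_seconds += dt
    hours = total_seconds / 3600.0
    rate = occupied_seconds / total_seconds if total_seconds > 0 else 0.0
    return hours, rate
```

Computing both values per driver gives one simple way to separate experienced from inexperienced drivers in the sense the slide describes.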

The Characteristics of Mobile Data
Case 1. Location traces of taxi drivers
Driving pattern comparison
- The experienced drivers have a wider operation area.
- The experienced drivers know the roads as well as the traffic patterns better.

The Characteristics of Mobile Data
Case 1. Location traces of taxi drivers
Develop a mobile recommender system
- Users ~ taxi drivers
- Items ~ potential pick-up points
What did we learn? The differences between mobile RS and traditional RS:
- The items are application-dependent
- There is some cost to extract items
- The items are not i.i.d.; they exhibit spatial auto-correlation
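The point that items must be extracted at some cost can be made concrete: pick-up points are not given in the raw traces and have to be derived from them. The sketch below is one assumed approach, not the method from the talk: it treats a vacant-to-occupied status change as a pick-up and groups pick-ups into fixed-size grid cells. The function names and the 0.01-degree cell size are hypothetical.

```python
import math
from collections import Counter


def extract_pickup_points(trace):
    """trace: time-ordered list of (lon, lat, occupied) tuples.

    A pick-up is assumed to occur at the first occupied record that
    follows a vacant one.
    """
    pickups = []
    for (_, _, prev_occupied), (lon, lat, occupied) in zip(trace, trace[1:]):
        if not prev_occupied and occupied:
            pickups.append((lon, lat))
    return pickups


def top_pickup_cells(pickups, cell_size=0.01, k=3):
    """Count pick-ups per grid cell and return the k busiest cells.

    Each cell is identified by integer (column, row) indices.
    """
    counts = Counter(
        (math.floor(lon / cell_size), math.floor(lat / cell_size))
        for lon, lat in pickups
    )
    return counts.most_common(k)
```

Because nearby cells tend to have similar pick-up counts, the resulting items are spatially auto-correlated rather than independent, which is the contrast with traditional recommender items drawn on the slide.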

The Characteristics of Mobile Data
Case 2. The travel data
Data description
- Expense records
- Tourists: ID, travel time
- Package: ID, name, landscapes, price, travel days
- Duration: 2000-2010
Recommender system
- Users ~ tourists
- Items ~ packages

The Characteristics of Mobile Data
Case 2. The travel data
Characteristics of tourism data (I)
- Spatial auto-correlation of packages: for example, the 1-day Niagara Falls Tour
- The sparseness: much sparser than the Netflix data set
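Sparseness here can be quantified as the fraction of tourist-package pairs with no recorded purchase. A minimal sketch, assuming purchases are available as (tourist ID, package ID) pairs; the function name and input shape are illustrative, and no figures for the actual tourism or Netflix data are implied.

```python
def sparsity(purchases, n_tourists: int, n_packages: int) -> float:
    """Fraction of empty cells in the tourist-by-package matrix.

    purchases: iterable of (tourist_id, package_id) pairs; repeated
    purchases of the same package by the same tourist count once.
    """
    if n_tourists <= 0 or n_packages <= 0:
        raise ValueError("matrix dimensions must be positive")
    observed = len(set(purchases))
    return 1.0 - observed / (n_tourists * n_packages)
```

A value close to 1.0 means almost every tourist has bought almost none of the packages, which limits how much rating-style collaborative filtering can learn from the data.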

The Characteristics of Mobile Data
Case 2. The travel data
Characteristics of tourism data (II)
- The time dependence
- Packages and tourists have seasonal tendencies
- Packages have a life cycle
- Short-period packages are more popular

Theoretical Abstraction
Given: a set of objects $O = \{O_1, O_2, \ldots, O_n\}$
Find: an ordered subset $S = \{S_1, S_2, \ldots, S_k\} \subseteq O$
The order of $S_1, S_2, \ldots, S_k$ is optimized subject to certain constraints.
- For taxi driver recommendation, the set $O$ is a set of potential pick-up points.
- For travel package recommendation, the set $O$ is a set of landscapes.
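To make the abstraction concrete, the sketch below searches every ordered subset of size k and keeps the one with the smallest total travel distance from a starting position. Both the objective (straight-line distance) and the exhaustive search are assumptions chosen for clarity; the slides do not specify the objective or the constraints, and brute force is only workable for very small n and k.

```python
import math
from itertools import permutations


def route_length(start, route):
    """Total straight-line distance from start through each point in order."""
    total = 0.0
    current = start
    for point in route:
        total += math.dist(current, point)
        current = point
    return total


def best_ordered_subset(objects, k, start):
    """Return the ordered k-subset of objects with the shortest route.

    permutations(objects, k) enumerates every ordered subset S of size k,
    so the order of S_1..S_k is part of what gets optimized.
    """
    return min(permutations(objects, k), key=lambda r: route_length(start, r))
```

With pick-up points as the objects this yields a shortest visiting sequence for a driver; a practical system would need a different objective (for example, expected time to the next passenger) and a search that scales beyond a handful of points.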


JOBS: Projected shortage of 140,000-190,000 people with deep analytical talent in the US by the year 2018.
Source: Big data: The next frontier for innovation, competition, and productivity, McKinsey Global Institute, May 2011.


Thank You!
My website: http://datamining.rutgers.edu
