Geointelligence New Opportunities and Research Challenges in Spatial Mining and Business Intelligence

Size: px
Start display at page:

Download "Geointelligence New Opportunities and Research Challenges in Spatial Mining and Business Intelligence"

Transcription

1 Geointelligence New Opportunities and Research Challenges in Spatial Mining and Business Intelligence Stefan Wrobel Christine Körner, Michael May, Hans Voss

2 Fraunhofer Society Joseph von Fraunhofer, German physicist and entrepreneur Fraunhofer mission: - do state-of-the-art research and use it in challenging customer projects - Funding is 33% research grants, 33% customer projects, 33% institutional funding 57 institutes, 40 locations, employees, 1 bill. annual volume Best-known invention: MP3 2

3 Fraunhofer IAIS: Intelligent Analysis- and Information Systems From sensor data to business intelligence, from media analysis to visual information systems: Our research allows companies to do more with data New name, long-standing experience - Founded in 2006 as a merger of the Fraunhofer institutes AIS and IMK 230 people: scientists, project engineers, technical and administrative staff Located on Fraunhofer Campus Schloss Birlinghoven/Bonn Joint research groups and cooperation with Univ. Bonn 3

4 Fraunhofer IAIS: research and projects Core research areas: Machine learning and adaptive systems Data Mining and Business Intelligence Automated media analysis Interactive access and exploration Autonomous systems 4

5 Outline Introduction to spatial data mining - Project example: Geomarketing The spatial data mining process - Project example: Outdoor media reach estimation The importance of data - Project example: Customer selection in the gas industry Spatial mining tools and visual analytics - CommonGIS and SPIN! Research challenge: track data - Project example: SPR - GeoPKDD 5

6 Outline Introduction to spatial data mining - Project example: Geomarketing The spatial data mining process - Project example: Outdoor media reach estimation The importance of data - Project example: Customer selection in the gas industry Spatial mining tools and visual analytics - CommonGIS and SPIN! Research challenge track data - Project example: SPR - GeoPKDD 6

7 Why Spatial Data Mining now? Almost all data are (or can be) spatially referenced Almost all database and business intelligence systems handle spatial data New data sources push the topic - Satellite data (GPS, Galileo) - Toll collection data - Mobile phone data - RFID - GoogleEarth etc. Spatial Data Mining combines statistics, machine learning, databases, visualization with spatial data 7

8 A classic example for spatial analysis Disease cluster Dr. John Snow Deaths of cholera epidemia London, September 1854 Infected water pump? 8

9 Goals of Spatial Data Mining Identifying spatial patterns Identifying spatial objects that are potential generators of patterns Identifying information relevant for explaining the spatial pattern (and hiding irrelevant information) Presenting the information in a way that is intuitive and supports further analysis 9

10 Spatial Data point objects - located at x, y, (z) coordinates area objects - suitable area description (circle, polygon, path boundary) fields - quantity assumed continuously defined in 2 D or 3 D (+ time!) - e. g. temperature 10

11 Example: without spatial attributes 11

12 Example: with spatial attributes 12

13 Handling spatial data treat as ordinary variables no special algorithms needed spatial properties ignored, e. g. discontiguous areas make spatial relationships explicit e. g. infer topological relationship expensive, but allows normal algorithms to be used specialized algorithms - Neighborhood methods, kriging, Gaussian processes, density-based clustering Use proper combination of data, preprocessing, algorithms, and interaction software! 13

14 Outline Introduction to spatial data mining - Project example: Geomarketing The spatial data mining process - Project example: Outdoor media reach estimation The importance of data - Project example: Customer selection in the gas industry Spatial mining tools and visual analytics - CommonGIS and SPIN! Research challenge track data - Project example: SPR - GeoPKDD 14

15 Project example: Outdoor Advertising Reach - Frequency Atlas Customer: Fachverband für Außenwerbung (FAW; Outdoor Advertising Association) Task: Performance value assessment of advertising media Traffic volume forecast separate for private cars, public transport, pedestrians Spatial data mining, active learning procedures 15

16 Determining reach of a poster board Gesellschaft für Konsumforschung Frequency + Media factories = poster reach 16

17 The project in numbers Complete model for all German cities with more than inhabitants (192 cities) = ca street segments! Complete model includes, for each segment, item - car frequency - pedestrian frequency - public transport frequency The model is presently beeing extended to to all cities with between and inhabitants 17

18 Basic Data: traffic measurements Manual traffic measurement at selected poster locations - 4 times 6 minutes at four days of the week at four times of day Additional empirical model of day totals Properties - Well defined measurements - Distribution of measurements tries to avoid systematic bias - Extended measurement period, so conceptdrift can not be excluded Total of manual measurements 18

19 Secondary data Street network Soxiodemographics + Socioeconomics Points of Interest (POI) Frequency measurements Public transport network DATA MINING Frequency classes 19

20 Smoothing based on flow constraints Measurement errors lead to inconsistencies Need plausible assignment of frequencies Solution: Use Kirchhoff s law as constraint - Sum of inputs = sum of outputs Smoothing algorithm finds locally optimal solution using constraint relaxation 20

21 Numerical prediction with model trees ORTSTEIL = INNENSTADT (LR)... Fussgängerzone: Nein Ja Straßenkategorie: Nebenstr. Hauptstr. Bahnhof Nein Ja Distanz_zu_Bahnhof: <= 150 > 150 Anzahl_Restaurants : <= 5 > 5 Anzahl_Restaurants : <= 15 > 15 X-Koordinate <= > Y-Koordinate LM1 LM2 LM3 LM4 LM5 <= 9.6 > 9.6 LM1 FREQUENZ = * X * ANZAHL_EINKAUF * MESSE LM6 21

22 Final result: frequency atlas (cars, public transport, pedestrians) ~1 ~1Million Millionstreet streetsegments segments predicted based on predicted based on measurements measurements Accuracy Accuracyincreased increasedtwofold twofold 22

23 Outline Introduction to spatial data mining - Project example: Geomarketing The spatial data mining process - Project example: Outdoor media reach estimation The importance of data - Project example: Customer selection in the gas industry Spatial mining tools and visual analytics - CommonGIS and SPIN! Research challenge track data - Project example: SPR - GeoPKDD 23

24 Project example: New customer acquisition for gas supplier Given - nationwide address data with consumer and group data - Response data from original calling campaign To be determined - Nationwide addresses with a high probability of customer interest in a sales representative visit Regional address Interest in visit Nation wide address Consumer attributes Group attributes Interest in visit... yes ??? 24

25 Project example: New customer aquisition for gas supplier 1. Use addresses to transfer consumer and group attributes to the regional sample 2. Construct a model for interest in visits based on the enhanced regional sample 3. Apply the model to the nation wide address data Regional address Interest in visit Consumer attributes Group attributes Nation wide Address Consumer attributes Group attributes Interest in visit... yes ,8% 25

26 Aggregation level of available consumer data 16 federal states 41 districts 441 counties ca zip codes distribution Aggregation ca cities ca statistical districts ca Market Cluster ca voting districts ca market cells ca. 1,5 Mio street segments ca. 20 Mio. Household data 26

27 Outline Introduction to spatial data mining - Project example: Geomarketing The spatial data mining process - Project example: Outdoor media reach estimation The importance of data - Project example: Customer selection in the gas industry Spatial mining tools and visual analytics - CommonGIS and SPIN! Research challenge track data - Project example: SPR - GeoPKDD 27

28 Interactive Exploratory Analysis: CommonGIS and SPIN! Choropleth maps showing distribution of variable(s) in space Parallel Coordinate Plot Combining spatial and non-spatial displays Variables selected and manipulated by the user Powerful for lowdimensional dependencies (3-4) Displays dynamically linked Scatter Plot 28

29 Representation of in the database (Oracle) Klösgen & May 02 A set of relations R 1,...,R n, such that - any relation R i possesses a geometry attribute G i - or an identifier A i which allows joining R i with another relation R k, which in turn possesses a geometry attribute geometry attributes G i consist of sets of x,y-pairs, which define points, lines or polygons different kinds of spatial objects are stored in different relations R i (geographic layers) e.g. streets, rivers, districts, buildings every layer has a single geometry attribute and its own proper set of attributes A 1,..., A n 29

30 Division of labor between Oracle RDBMS Klösgen & May 02 and search manager mining query Database Server Search Algorithm sufficient statistics Mining Server Database integration: efficiently organize mining queries Mining query delivers statistics (aggregations) ufficient for evaluating many hypotheses search in hypothesis space generation and evaluation of hypotheses (subgroup patterns) 30

31 Outline Introduction to spatial data mining - Project example: Geomarketing The spatial data mining process - Project example: Outdoor media reach estimation The importance of data - Project example: Customer selection in the gas industry Spatial mining tools and visual analytics - CommonGIS and SPIN! Research challenge track data - Project example: SPR - GeoPKDD 31

32 Mobility analysis based on GPS-tracks introduction of new pricing model for poster sites based on GPS tracks registration of contact frequencies with poster sites contact extrapolation for target groups: - socio-demographic characteristics - residential areas 32

33 Time patterns Patterns / Questions - How long (days) does it take till x% of objects visit all locations? - How long does it take till x% of objects visit at least one location twice? Applications - determine mobility of a group of people - reach of poster networks - find popularity of locations (theatres, supermarkets, hospitals) 33

34 Challenges of track data Goals: - investigate the relationship between spatio-temporal data and frequency measurements - improve prediction performance with active learning Data: - tracks of mobile phones and / or GPS devices - street-map - possibly frequency measurements Tasks: - track-to-street mapping - prediction of traffic frequencies (regression) 34

35 Track-to-street Mapping Mapping of tracks from cell-level to street-level many possibilities 35

36 Track-to-street Mapping Mapping of tracks from cell-level to street-level Suppose, we have prior knowledge about traffic frequencies highly frequented streets some routes become more likely. 36

37 Frequency Prediction with Track Data Steps in Extrapolation: - count number of intersections of streets and tracks within a certain timeframe (e.g. one week) - extrapolate from sample to population Problems: Y-Coordinate - sample is not representative (biased), e.g. more young people have mobile phones than older people, different trafffic behavior of old and young people mobile data is sensitive, possibly only opt in customers - streets with 0-frequency - large gaps within tracks - censored data (people drop out of survey before end) - noise X-Coordinate 37

38 Probabilistic active track to street mapping [PhD thesis Körner] Tasks: 1. track-to-street mapping 2. extrapolation of traffic frequencies 3. improvement of (online) sampling by active learning 1. mapping 2. extrapolation 3. active learning 38

39 Integration of Spatial Background Knowledge Aggregation of attributes within a buffer of given location buffer spatially defined buffer places within a radius of 200 m driving zones temporally defined bufffer what places can be reached on foot / by car within the next 20 minutes 4 restaurants within 200m of X 2 hospitals to reach within 12 min 39

40 Research Questions 1. Can track data be used for frequency prediction? What problems arise? 2. How can track data and frequency measurements benefit from each other? improvement of track-to-street mapping with frequency data enhancement of frequency prediciton using tracks 3. How to incorporate active learning to improve the data model? How to select places for additional traffic measurements? How to select persons for track monitoring? 40

41 Summary New data sources make spatial mining a very promising topic Spatial data mining is a process consisting of data, preprocessing, algorithms and visualization - Project examples: Geomarketing, outdoor media frequencies Selection of the right data is crucial - Project example: gas industry Spatial data mining is inherently visual - Tools such as CommonGIS Research challenge track data - Project example: SPR GPS tracks, GeoPKDD and and We We are are hiring! hiring! 41

Spatial Data Mining for Customer Segmentation

Spatial Data Mining for Customer Segmentation Spatial Data Mining for Customer Segmentation Data Mining in Practice Seminar, Dortmund, 2003 Dr. Michael May Fraunhofer Institut Autonome Intelligente Systeme Spatial Data Mining, Michael May, Fraunhofer

More information

Introduction to Spatial Data Mining

Introduction to Spatial Data Mining Introduction to Spatial Data Mining 7.1 Pattern Discovery 7.2 Motivation 7.3 Classification Techniques 7.4 Association Rule Discovery Techniques 7.5 Clustering 7.6 Outlier Detection Introduction: a classic

More information

GEO-VISUALIZATION SUPPORT FOR MULTIDIMENSIONAL CLUSTERING

GEO-VISUALIZATION SUPPORT FOR MULTIDIMENSIONAL CLUSTERING Geoinformatics 2004 Proc. 12th Int. Conf. on Geoinformatics Geospatial Information Research: Bridging the Pacific and Atlantic University of Gävle, Sweden, 7-9 June 2004 GEO-VISUALIZATION SUPPORT FOR MULTIDIMENSIONAL

More information

Data Visualization Techniques and Practices Introduction to GIS Technology

Data Visualization Techniques and Practices Introduction to GIS Technology Data Visualization Techniques and Practices Introduction to GIS Technology Michael Greene Advanced Analytics & Modeling, Deloitte Consulting LLP March 16 th, 2010 Antitrust Notice The Casualty Actuarial

More information

Spatial Data Analysis

Spatial Data Analysis 14 Spatial Data Analysis OVERVIEW This chapter is the first in a set of three dealing with geographic analysis and modeling methods. The chapter begins with a review of the relevant terms, and an outlines

More information

Oracle BI and Geo-Spatial Big Data

Oracle BI and Geo-Spatial Big Data Oracle Business Intelligence 11g Antony Heljula Technical Director Peak Indicators Limited 2 Introduction Aim of Presentation 5 Steps for Demonstration Summary Further Information Questions Peak Indicators

More information

CHAPTER-24 Mining Spatial Databases

CHAPTER-24 Mining Spatial Databases CHAPTER-24 Mining Spatial Databases 24.1 Introduction 24.2 Spatial Data Cube Construction and Spatial OLAP 24.3 Spatial Association Analysis 24.4 Spatial Clustering Methods 24.5 Spatial Classification

More information

Seize the Opportunity

Seize the Opportunity Big Data Perspectives for Germany Seize the Opportunity Prof. Dr. Stefan Wrobel Fraunhofer-Institut für Intelligente Analyseund Informationssysteme IAIS Fraunhofer Big Data Initiative www.iais.fraunhofer.de

More information

Exploratory Data Analysis for Ecological Modelling and Decision Support

Exploratory Data Analysis for Ecological Modelling and Decision Support Exploratory Data Analysis for Ecological Modelling and Decision Support Gennady Andrienko & Natalia Andrienko Fraunhofer Institute AIS Sankt Augustin Germany http://www.ais.fraunhofer.de/and 5th ECEM conference,

More information

Spatial Data Mining Methods and Problems

Spatial Data Mining Methods and Problems Spatial Data Mining Methods and Problems Abstract Use summarizing method,characteristics of each spatial data mining and spatial data mining method applied in GIS,Pointed out that the space limitations

More information

A HYBRID APPROACH FOR AUTOMATED AREA AGGREGATION

A HYBRID APPROACH FOR AUTOMATED AREA AGGREGATION A HYBRID APPROACH FOR AUTOMATED AREA AGGREGATION Zeshen Wang ESRI 380 NewYork Street Redlands CA 92373 Zwang@esri.com ABSTRACT Automated area aggregation, which is widely needed for mapping both natural

More information

Tutorial on Geographic and Spatial Data Mining

Tutorial on Geographic and Spatial Data Mining Tutorial on Geographic and Spatial Data Mining 5th Italian Symposium on Advanced Database Systems - SEBD 7 Torre Canne, Italy June 7th Fraunhofer Society Joseph von Fraunhofer, German physicist and entrepreneur

More information

Introduction to GIS (Basics, Data, Analysis) & Case Studies. 13 th May 2004. Content. What is GIS?

Introduction to GIS (Basics, Data, Analysis) & Case Studies. 13 th May 2004. Content. What is GIS? Introduction to GIS (Basics, Data, Analysis) & Case Studies 13 th May 2004 Content Introduction to GIS Data concepts Data input Analysis Applications selected examples What is GIS? Geographic Information

More information

Tracking System for GPS Devices and Mining of Spatial Data

Tracking System for GPS Devices and Mining of Spatial Data Tracking System for GPS Devices and Mining of Spatial Data AIDA ALISPAHIC, DZENANA DONKO Department for Computer Science and Informatics Faculty of Electrical Engineering, University of Sarajevo Zmaja

More information

Predictive Dynamix Inc Turning Business Experience Into Better Decisions

Predictive Dynamix Inc Turning Business Experience Into Better Decisions Overview Geospatial Data Mining for Market Intelligence By Paul Duke, Predictive Dynamix, Inc. Copyright 2000-2001. All rights reserved. Today, there is a huge amount of information readily available describing

More information

Big Data Collection and Utilization for Operational Support of Smarter Social Infrastructure

Big Data Collection and Utilization for Operational Support of Smarter Social Infrastructure Hitachi Review Vol. 63 (2014), No. 1 18 Big Data Collection and Utilization for Operational Support of Smarter Social Infrastructure Kazuaki Iwamura Hideki Tonooka Yoshihiro Mizuno Yuichi Mashita OVERVIEW:

More information

Deep Insights Smart Decisions Motionlogic

Deep Insights Smart Decisions Motionlogic Deep Insights Smart Decisions Motionlogic About Motionlogic Big Data business of Deutsche Telekom 100% subsidiary Analytics of people movement behavior and demographic indicators Using anonymized network

More information

The STC for Event Analysis: Scalability Issues

The STC for Event Analysis: Scalability Issues The STC for Event Analysis: Scalability Issues Georg Fuchs Gennady Andrienko http://geoanalytics.net Events Something [significant] happened somewhere, sometime Analysis goal and domain dependent, e.g.

More information

Big Data and Analytics: Getting Started with ArcGIS. Mike Park Erik Hoel

Big Data and Analytics: Getting Started with ArcGIS. Mike Park Erik Hoel Big Data and Analytics: Getting Started with ArcGIS Mike Park Erik Hoel Agenda Overview of big data Distributed computation User experience Data management Big data What is it? Big Data is a loosely defined

More information

SPATIAL DATA CLASSIFICATION AND DATA MINING

SPATIAL DATA CLASSIFICATION AND DATA MINING , pp.-40-44. Available online at http://www. bioinfo. in/contents. php?id=42 SPATIAL DATA CLASSIFICATION AND DATA MINING RATHI J.B. * AND PATIL A.D. Department of Computer Science & Engineering, Jawaharlal

More information

Visual Analytics and Data Mining

Visual Analytics and Data Mining Visual Analytics and Data Mining in S-T-applicationsS Gennady Andrienko & Natalia Andrienko Fraunhofer Institute AIS Sankt Augustin Germany http://www.ais.fraunhofer.de/and Mining Spatio-Temporal Data

More information

Short-Term Forecasting in Retail Energy Markets

Short-Term Forecasting in Retail Energy Markets Itron White Paper Energy Forecasting Short-Term Forecasting in Retail Energy Markets Frank A. Monforte, Ph.D Director, Itron Forecasting 2006, Itron Inc. All rights reserved. 1 Introduction 4 Forecasting

More information

Digging for Gold: Business Usage for Data Mining Kim Foster, CoreTech Consulting Group, Inc., King of Prussia, PA

Digging for Gold: Business Usage for Data Mining Kim Foster, CoreTech Consulting Group, Inc., King of Prussia, PA Digging for Gold: Business Usage for Data Mining Kim Foster, CoreTech Consulting Group, Inc., King of Prussia, PA ABSTRACT Current trends in data mining allow the business community to take advantage of

More information

Crime Mapping Methods. Assigning Spatial Locations to Events (Address Matching or Geocoding)

Crime Mapping Methods. Assigning Spatial Locations to Events (Address Matching or Geocoding) Chapter 15 Crime Mapping Crime Mapping Methods Police departments are never at a loss for data. To use crime mapping is to take data from myriad sources and make the data appear on the computer screen

More information

Introduction. Introduction. Spatial Data Mining: Definition WHAT S THE DIFFERENCE?

Introduction. Introduction. Spatial Data Mining: Definition WHAT S THE DIFFERENCE? Introduction Spatial Data Mining: Progress and Challenges Survey Paper Krzysztof Koperski, Junas Adhikary, and Jiawei Han (1996) Review by Brad Danielson CMPUT 695 01/11/2007 Authors objectives: Describe

More information

MOBILITY DATA MODELING AND REPRESENTATION

MOBILITY DATA MODELING AND REPRESENTATION PART I MOBILITY DATA MODELING AND REPRESENTATION 1 Trajectories and Their Representations Stefano Spaccapietra, Christine Parent, and Laura Spinsanti 1.1 Introduction For a long time, applications have

More information

Bowdoin Computer Science

Bowdoin Computer Science Bowdoin Science What is computer science, what are its applications in other disciplines, and its impact in society? 101: Introduction to CS Pre-requisites: none Assumes no prior knowledge of programming

More information

Selected references Data mining for manufacturing of individaul medical products

Selected references Data mining for manufacturing of individaul medical products Data mining for manufacturing of individaul medical products Individualization of medical products Collection of all relevant data along the process chain Complete process monitoring Concept creation for

More information

Introduction to Data Mining

Introduction to Data Mining Introduction to Data Mining 1 Why Data Mining? Explosive Growth of Data Data collection and data availability Automated data collection tools, Internet, smartphones, Major sources of abundant data Business:

More information

Geocoding in Law Enforcement Final Report

Geocoding in Law Enforcement Final Report Geocoding in Law Enforcement Final Report Geocoding in Law Enforcement Final Report Prepared by: The Crime Mapping Laboratory Police Foundation August 2000 Report to the Office of Community Oriented Policing

More information

Chapter 6. The stacking ensemble approach

Chapter 6. The stacking ensemble approach 82 This chapter proposes the stacking ensemble approach for combining different data mining classifiers to get better performance. Other combination techniques like voting, bagging etc are also described

More information

Monica Pratesi, University of Pisa

Monica Pratesi, University of Pisa DEVELOPING ROBUST AND STATISTICALLY BASED METHODS FOR SPATIAL DISAGGREGATION AND FOR INTEGRATION OF VARIOUS KINDS OF GEOGRAPHICAL INFORMATION AND GEO- REFERENCED SURVEY DATA Monica Pratesi, University

More information

Technology and Trends for Smarter Business Analytics

Technology and Trends for Smarter Business Analytics Don Campbell Chief Technology Officer, Business Analytics, IBM Technology and Trends for Smarter Business Analytics Business Analytics software Where organizations are focusing Business Analytics Enhance

More information

Prediction of Stock Performance Using Analytical Techniques

Prediction of Stock Performance Using Analytical Techniques 136 JOURNAL OF EMERGING TECHNOLOGIES IN WEB INTELLIGENCE, VOL. 5, NO. 2, MAY 2013 Prediction of Stock Performance Using Analytical Techniques Carol Hargreaves Institute of Systems Science National University

More information

Alison Hayes November 30, 2005 NRS 509. Crime Mapping OVERVIEW

Alison Hayes November 30, 2005 NRS 509. Crime Mapping OVERVIEW Alison Hayes November 30, 2005 NRS 509 Crime Mapping OVERVIEW Geographic data has been important to law enforcement since the beginning of local policing in the nineteenth century. The New York City Police

More information

Big Data Analytics in Mobile Environments

Big Data Analytics in Mobile Environments 1 Big Data Analytics in Mobile Environments 熊 辉 教 授 罗 格 斯 - 新 泽 西 州 立 大 学 2012-10-2 Rutgers, the State University of New Jersey Why big data: historical view? Productivity versus Complexity (interrelatedness,

More information

Fuzzy Spatial Data Warehouse: A Multidimensional Model

Fuzzy Spatial Data Warehouse: A Multidimensional Model 4 Fuzzy Spatial Data Warehouse: A Multidimensional Model Pérez David, Somodevilla María J. and Pineda Ivo H. Facultad de Ciencias de la Computación, BUAP, Mexico 1. Introduction A data warehouse is defined

More information

Chapter ML:XI. XI. Cluster Analysis

Chapter ML:XI. XI. Cluster Analysis Chapter ML:XI XI. Cluster Analysis Data Mining Overview Cluster Analysis Basics Hierarchical Cluster Analysis Iterative Cluster Analysis Density-Based Cluster Analysis Cluster Evaluation Constrained Cluster

More information

Location tracking: technology, methodology and applications

Location tracking: technology, methodology and applications Location tracking: technology, methodology and applications Marina L. Gavrilova SPARCS Laboratory Co-Director Associate Professor University of Calgary Interests and affiliations SPARCS Lab Co-Founder

More information

Practical Data Science @ Etsy. Dr. Jason Davis

Practical Data Science @ Etsy. Dr. Jason Davis Practical Data Science @ Etsy Dr. Jason Davis About me Ph.D. Machine learning & data mining Entrepreneur. Founder of ad startup Adtuitive Engineering Director @ Etsy. Search & Data Topics Etsy's data science

More information

Company Profile. www.valuelab.it

Company Profile. www.valuelab.it Company Profile www.valuelab.it Profile VALUE LAB is an innovative management consulting and IT solutions company specialized in Marketing, Sales and Retailing. Our goal is to help Manufacturers, Retailers

More information

Mining Big Data. Pang-Ning Tan. Associate Professor Dept of Computer Science & Engineering Michigan State University

Mining Big Data. Pang-Ning Tan. Associate Professor Dept of Computer Science & Engineering Michigan State University Mining Big Data Pang-Ning Tan Associate Professor Dept of Computer Science & Engineering Michigan State University Website: http://www.cse.msu.edu/~ptan Google Trends Big Data Smart Cities Big Data and

More information

Information Visualization WS 2013/14 11 Visual Analytics

Information Visualization WS 2013/14 11 Visual Analytics 1 11.1 Definitions and Motivation Lot of research and papers in this emerging field: Visual Analytics: Scope and Challenges of Keim et al. Illuminating the path of Thomas and Cook 2 11.1 Definitions and

More information

Buffer Operations in GIS

Buffer Operations in GIS Buffer Operations in GIS Nagapramod Mandagere, Graduate Student, University of Minnesota npramod@cs.umn.edu SYNONYMS GIS Buffers, Buffering Operations DEFINITION A buffer is a region of memory used to

More information

Customer Analytics. Turn Big Data into Big Value

Customer Analytics. Turn Big Data into Big Value Turn Big Data into Big Value All Your Data Integrated in Just One Place BIRT Analytics lets you capture the value of Big Data that speeds right by most enterprises. It analyzes massive volumes of data

More information

Big Data in Transportation Engineering

Big Data in Transportation Engineering Big Data in Transportation Engineering Nii Attoh-Okine Professor Department of Civil and Environmental Engineering University of Delaware, Newark, DE, USA Email: okine@udel.edu IEEE Workshop on Large Data

More information

The Scientific Data Mining Process

The Scientific Data Mining Process Chapter 4 The Scientific Data Mining Process When I use a word, Humpty Dumpty said, in rather a scornful tone, it means just what I choose it to mean neither more nor less. Lewis Carroll [87, p. 214] In

More information

ESRI Business Analyst for Telecommunications

ESRI Business Analyst for Telecommunications ESRI Business Analyst for Telecommunications GIS Market Analysis Package Drive Business Results with ESRI Business Analyst ESRI Business Analyst helps you Analyze your competition: Track customer churn.

More information

Easily Identify Your Best Customers

Easily Identify Your Best Customers IBM SPSS Statistics Easily Identify Your Best Customers Use IBM SPSS predictive analytics software to gain insight from your customer database Contents: 1 Introduction 2 Exploring customer data Where do

More information

Data Mining Solutions for the Business Environment

Data Mining Solutions for the Business Environment Database Systems Journal vol. IV, no. 4/2013 21 Data Mining Solutions for the Business Environment Ruxandra PETRE University of Economic Studies, Bucharest, Romania ruxandra_stefania.petre@yahoo.com Over

More information

Applications of Dynamic Representation Technologies in Multimedia Electronic Map

Applications of Dynamic Representation Technologies in Multimedia Electronic Map Applications of Dynamic Representation Technologies in Multimedia Electronic Map WU Guofeng CAI Zhongliang DU Qingyun LONG Yi (School of Resources and Environment Science, Wuhan University, Wuhan, Hubei.

More information

Optimal Parameters for Space- Time Cluster Detection of Infectious Disease. Evan Caten Masters Candidate Salem State College May 4, 2009

Optimal Parameters for Space- Time Cluster Detection of Infectious Disease. Evan Caten Masters Candidate Salem State College May 4, 2009 Optimal Parameters for Space- Time Cluster Detection of Infectious Disease Evan Caten Masters Candidate Salem State College May 4, 2009 Presentation Outline Overview of masters thesis Introduction Objectives

More information

How Organisations Are Using Data Mining Techniques To Gain a Competitive Advantage John Spooner SAS UK

How Organisations Are Using Data Mining Techniques To Gain a Competitive Advantage John Spooner SAS UK How Organisations Are Using Data Mining Techniques To Gain a Competitive Advantage John Spooner SAS UK Agenda Analytics why now? The process around data and text mining Case Studies The Value of Information

More information

USING SPATIAL DATA MINING TO DISCOVER THE HIDDEN RULES IN THE CRIME DATA

USING SPATIAL DATA MINING TO DISCOVER THE HIDDEN RULES IN THE CRIME DATA USING SPATIAL DATA MINING TO DISCOVER THE HIDDEN RULES IN THE CRIME DATA Karel, JANEČKA 1, Hana, HŮLOVÁ 1 1 Department of Mathematics, Faculty of Applied Sciences, University of West Bohemia Abstract Univerzitni

More information

Mario Guarracino. Data warehousing

Mario Guarracino. Data warehousing Data warehousing Introduction Since the mid-nineties, it became clear that the databases for analysis and business intelligence need to be separate from operational. In this lecture we will review the

More information

May 2012 Oracle Spatial User Conference

May 2012 Oracle Spatial User Conference 1 May 2012 Oracle Spatial User Conference May 23, 2012 Ronald Reagan Building and International Trade Center Washington, DC USA Amit Ghosh Lead Architect, Nokia How Nokia Uses Oracle Spatial to Create

More information

Challenges and Lessons from NIST Data Science Pre-pilot Evaluation in Introduction to Data Science Course Fall 2015

Challenges and Lessons from NIST Data Science Pre-pilot Evaluation in Introduction to Data Science Course Fall 2015 Challenges and Lessons from NIST Data Science Pre-pilot Evaluation in Introduction to Data Science Course Fall 2015 Dr. Daisy Zhe Wang Director of Data Science Research Lab University of Florida, CISE

More information

Easily add Maps and Geo Analytics in MicroStrategy

Easily add Maps and Geo Analytics in MicroStrategy Easily add Maps and Geo Analytics in MicroStrategy Agenda Introduction Configure to use Maps in MicroStrategy MicroStrategy Geo Analysis Capabilities and Examples Key Takeaways and Q&A Why Geospatial Analysis

More information

NetView 360 Product Description

NetView 360 Product Description NetView 360 Product Description Heterogeneous network (HetNet) planning is a specialized process that should not be thought of as adaptation of the traditional macro cell planning process. The new approach

More information

Customer Classification And Prediction Based On Data Mining Technique

Customer Classification And Prediction Based On Data Mining Technique Customer Classification And Prediction Based On Data Mining Technique Ms. Neethu Baby 1, Mrs. Priyanka L.T 2 1 M.E CSE, Sri Shakthi Institute of Engineering and Technology, Coimbatore 2 Assistant Professor

More information

DIGITS CENTER FOR DIGITAL INNOVATION, TECHNOLOGY, AND STRATEGY THOUGHT LEADERSHIP FOR THE DIGITAL AGE

DIGITS CENTER FOR DIGITAL INNOVATION, TECHNOLOGY, AND STRATEGY THOUGHT LEADERSHIP FOR THE DIGITAL AGE DIGITS CENTER FOR DIGITAL INNOVATION, TECHNOLOGY, AND STRATEGY THOUGHT LEADERSHIP FOR THE DIGITAL AGE INTRODUCTION RESEARCH IN PRACTICE PAPER SERIES, FALL 2011. BUSINESS INTELLIGENCE AND PREDICTIVE ANALYTICS

More information

Big Data and Semantic Web in Manufacturing. Nitesh Khilwani, PhD Chief Engineer, Samsung Research Institute Noida, India

Big Data and Semantic Web in Manufacturing. Nitesh Khilwani, PhD Chief Engineer, Samsung Research Institute Noida, India Big Data and Semantic Web in Manufacturing Nitesh Khilwani, PhD Chief Engineer, Samsung Research Institute Noida, India Outline Big data in Manufacturing Big data Analytics Semantic web technologies Case

More information

BI Tools and Data Flow

BI Tools and Data Flow BI Tools and Data Flow http://web.peralta.edu/indev/peralta business intelligence bi tool Mike Orkin, Ph.D. Associate Vice Chancellor of Academic Affairs Peralta Community College District 1 Business Intelligence

More information

Oracle Spatial and Graph. Jayant Sharma Director, Product Management

Oracle Spatial and Graph. Jayant Sharma Director, Product Management Oracle Spatial and Graph Jayant Sharma Director, Product Management Agenda Oracle Spatial and Graph Graph Capabilities Q&A 2 Oracle Spatial and Graph Complete Open Integrated Most Widely Used 3 Open and

More information

Integration of GPS Traces with Road Map

Integration of GPS Traces with Road Map Integration of GPS Traces with Road Map Lijuan Zhang Institute of Cartography and Geoinformatics Leibniz University of Hannover Hannover, Germany +49 511.762-19437 Lijuan.Zhang@ikg.uni-hannover.de Frank

More information

NHPSS An Automated OTC Pharmaceutical Sales Surveillance System

NHPSS An Automated OTC Pharmaceutical Sales Surveillance System NHPSS An Automated OTC Pharmaceutical Sales Surveillance System Xiaohui Zhang, Ph.D., Reno Fiedler, and Michael Popovich Introduction Development of public health surveillance systems requires multiple

More information

Strengthening Diverse Retail Business Processes with Forecasting: Practical Application of Forecasting Across the Retail Enterprise

Strengthening Diverse Retail Business Processes with Forecasting: Practical Application of Forecasting Across the Retail Enterprise Paper SAS1833-2015 Strengthening Diverse Retail Business Processes with Forecasting: Practical Application of Forecasting Across the Retail Enterprise Alex Chien, Beth Cubbage, Wanda Shive, SAS Institute

More information

CARTOGRAPHIC VISUALIZATION FOR SPATIAL ANALYSIS. Jason Dykes Department of Geography, University of Leicester, Leicester, LE2 ITF, U.K.

CARTOGRAPHIC VISUALIZATION FOR SPATIAL ANALYSIS. Jason Dykes Department of Geography, University of Leicester, Leicester, LE2 ITF, U.K. POSTER SESSIONS 257 CARTOGRAPHIC VISUALIZATION FOR SPATIAL ANALYSIS Jason Dykes Department of Geography, University of Leicester, Leicester, LE2 ITF, U.K. Abstract Characteristics of Visualization in Scientific

More information

DATA QUALITY IN GIS TERMINOLGY GIS11

DATA QUALITY IN GIS TERMINOLGY GIS11 DATA QUALITY IN GIS When using a GIS to analyse spatial data, there is sometimes a tendency to assume that all data, both locational and attribute, are completely accurate. This of course is never the

More information

A quick overview of geographic information systems (GIS) Uwe Deichmann, DECRG <udeichmann@worldbank.org>

A quick overview of geographic information systems (GIS) Uwe Deichmann, DECRG <udeichmann@worldbank.org> A quick overview of geographic information systems (GIS) Uwe Deichmann, DECRG Why is GIS important? A very large share of all types of information has a spatial component ( 80

More information

COPYRIGHTED MATERIAL. Contents. List of Figures. Acknowledgments

COPYRIGHTED MATERIAL. Contents. List of Figures. Acknowledgments Contents List of Figures Foreword Preface xxv xxiii xv Acknowledgments xxix Chapter 1 Fraud: Detection, Prevention, and Analytics! 1 Introduction 2 Fraud! 2 Fraud Detection and Prevention 10 Big Data for

More information

Enhanced Boosted Trees Technique for Customer Churn Prediction Model

Enhanced Boosted Trees Technique for Customer Churn Prediction Model IOSR Journal of Engineering (IOSRJEN) ISSN (e): 2250-3021, ISSN (p): 2278-8719 Vol. 04, Issue 03 (March. 2014), V5 PP 41-45 www.iosrjen.org Enhanced Boosted Trees Technique for Customer Churn Prediction

More information

REGULATIONS FOR THE DEGREE OF MASTER OF SCIENCE IN COMPUTER SCIENCE (MSc[CompSc])

REGULATIONS FOR THE DEGREE OF MASTER OF SCIENCE IN COMPUTER SCIENCE (MSc[CompSc]) 305 REGULATIONS FOR THE DEGREE OF MASTER OF SCIENCE IN COMPUTER SCIENCE (MSc[CompSc]) (See also General Regulations) Any publication based on work approved for a higher degree should contain a reference

More information

Data Catalogs for Hadoop Achieving Shared Knowledge and Re-usable Data Prep. Neil Raden Hired Brains Research, LLC

Data Catalogs for Hadoop Achieving Shared Knowledge and Re-usable Data Prep. Neil Raden Hired Brains Research, LLC Data Catalogs for Hadoop Achieving Shared Knowledge and Re-usable Data Prep Neil Raden Hired Brains Research, LLC Traditionally, the job of gathering and integrating data for analytics fell on data warehouses.

More information

A Visualization is Worth a Thousand Tables: How IBM Business Analytics Lets Users See Big Data

A Visualization is Worth a Thousand Tables: How IBM Business Analytics Lets Users See Big Data White Paper A Visualization is Worth a Thousand Tables: How IBM Business Analytics Lets Users See Big Data Contents Executive Summary....2 Introduction....3 Too much data, not enough information....3 Only

More information

MEng, BSc Computer Science with Artificial Intelligence

MEng, BSc Computer Science with Artificial Intelligence School of Computing FACULTY OF ENGINEERING MEng, BSc Computer Science with Artificial Intelligence Year 1 COMP1212 Computer Processor Effective programming depends on understanding not only how to give

More information

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK A SURVEY ON BIG DATA ISSUES AMRINDER KAUR Assistant Professor, Department of Computer

More information

Data Mining & Data Stream Mining Open Source Tools

Data Mining & Data Stream Mining Open Source Tools Data Mining & Data Stream Mining Open Source Tools Darshana Parikh, Priyanka Tirkha Student M.Tech, Dept. of CSE, Sri Balaji College Of Engg. & Tech, Jaipur, Rajasthan, India Assistant Professor, Dept.

More information

Exploratory Spatial Data Analysis

Exploratory Spatial Data Analysis Exploratory Spatial Data Analysis Part II Dynamically Linked Views 1 Contents Introduction: why to use non-cartographic data displays Display linking by object highlighting Dynamic Query Object classification

More information

ICT Perspectives on Big Data: Well Sorted Materials

ICT Perspectives on Big Data: Well Sorted Materials ICT Perspectives on Big Data: Well Sorted Materials 3 March 2015 Contents Introduction 1 Dendrogram 2 Tree Map 3 Heat Map 4 Raw Group Data 5 For an online, interactive version of the visualisations in

More information

bulzī Mobile Data Services connecting mobile to out of home

bulzī Mobile Data Services connecting mobile to out of home bulzī Mobile Data Services connecting mobile to out of home There are almost as many mobile phones in use as there are people on the planet. Each day trillions of data packets are sent to servers connected

More information

Data Preprocessing. Week 2

Data Preprocessing. Week 2 Data Preprocessing Week 2 Topics Data Types Data Repositories Data Preprocessing Present homework assignment #1 Team Homework Assignment #2 Read pp. 227 240, pp. 250 250, and pp. 259 263 the text book.

More information

Sanjeev Kumar. contribute

Sanjeev Kumar. contribute RESEARCH ISSUES IN DATAA MINING Sanjeev Kumar I.A.S.R.I., Library Avenue, Pusa, New Delhi-110012 sanjeevk@iasri.res.in 1. Introduction The field of data mining and knowledgee discovery is emerging as a

More information

Knowledge Discovery and Data Mining. Structured vs. Non-Structured Data

Knowledge Discovery and Data Mining. Structured vs. Non-Structured Data Knowledge Discovery and Data Mining Unit # 2 1 Structured vs. Non-Structured Data Most business databases contain structured data consisting of well-defined fields with numeric or alphanumeric values.

More information

NTT DATA Big Data Reference Architecture Ver. 1.0

NTT DATA Big Data Reference Architecture Ver. 1.0 NTT DATA Big Data Reference Architecture Ver. 1.0 Big Data Reference Architecture is a joint work of NTT DATA and EVERIS SPAIN, S.L.U. Table of Contents Chap.1 Advance of Big Data Utilization... 2 Chap.2

More information

Course Syllabus For Operations Management. Management Information Systems

Course Syllabus For Operations Management. Management Information Systems For Operations Management and Management Information Systems Department School Year First Year First Year First Year Second year Second year Second year Third year Third year Third year Third year Third

More information

Software for Supply Chain Design and Analysis

Software for Supply Chain Design and Analysis Software for Supply Chain Design and Analysis Optimize networks Improve product flow Position inventory Simulate service Balance production Refine routes The Leading Supply Chain Design and Analysis Application

More information

Big Data and Its Role in the Health IT Space Presented by MobileHelp

Big Data and Its Role in the Health IT Space Presented by MobileHelp Big Data and Its Role in the Health IT Space Presented by MobileHelp With excerpts from an interview with Jean Robichaud, CTO of MobileHelp Big Data and Its Role in the Health IT Space Presented by MobileHelp

More information

Data-Driven Optimization

Data-Driven Optimization Data-Driven Optimization John Turner Assistant Professor The Paul Merage School of Business University of California, Irvine October 24, 2014: UCI Data Science Initiative Analytics Unlocking the Potential

More information

Lesson 15 - Fill Cells Plugin

Lesson 15 - Fill Cells Plugin 15.1 Lesson 15 - Fill Cells Plugin This lesson presents the functionalities of the Fill Cells plugin. Fill Cells plugin allows the calculation of attribute values of tables associated with cell type layers.

More information

STATISTICA. Clustering Techniques. Case Study: Defining Clusters of Shopping Center Patrons. and

STATISTICA. Clustering Techniques. Case Study: Defining Clusters of Shopping Center Patrons. and Clustering Techniques and STATISTICA Case Study: Defining Clusters of Shopping Center Patrons STATISTICA Solutions for Business Intelligence, Data Mining, Quality Control, and Web-based Analytics Table

More information

How To Understand The History Of Navigation In French Marine Science

How To Understand The History Of Navigation In French Marine Science E-navigation, from sensors to ship behaviour analysis Laurent ETIENNE, Loïc SALMON French Naval Academy Research Institute Geographic Information Systems Group laurent.etienne@ecole-navale.fr loic.salmon@ecole-navale.fr

More information

PROGRAM DIRECTOR: Arthur O Connor Email Contact: URL : THE PROGRAM Careers in Data Analytics Admissions Criteria CURRICULUM Program Requirements

PROGRAM DIRECTOR: Arthur O Connor Email Contact: URL : THE PROGRAM Careers in Data Analytics Admissions Criteria CURRICULUM Program Requirements Data Analytics (MS) PROGRAM DIRECTOR: Arthur O Connor CUNY School of Professional Studies 101 West 31 st Street, 7 th Floor New York, NY 10001 Email Contact: Arthur O Connor, arthur.oconnor@cuny.edu URL:

More information

Oracle Big Data Spatial and Graph

Oracle Big Data Spatial and Graph Oracle Big Data Spatial and Graph Oracle Big Data Spatial and Graph offers a set of analytic services and data models that support Big Data workloads on Apache Hadoop and NoSQL database technologies. For

More information

Data Mining Analytics for Business Intelligence and Decision Support

Data Mining Analytics for Business Intelligence and Decision Support Data Mining Analytics for Business Intelligence and Decision Support Chid Apte, T.J. Watson Research Center, IBM Research Division Knowledge Discovery and Data Mining (KDD) techniques are used for analyzing

More information

Similarity Search and Mining in Uncertain Spatial and Spatio Temporal Databases. Andreas Züfle

Similarity Search and Mining in Uncertain Spatial and Spatio Temporal Databases. Andreas Züfle Similarity Search and Mining in Uncertain Spatial and Spatio Temporal Databases Andreas Züfle Geo Spatial Data Huge flood of geo spatial data Modern technology New user mentality Great research potential

More information

MEng, BSc Applied Computer Science

MEng, BSc Applied Computer Science School of Computing FACULTY OF ENGINEERING MEng, BSc Applied Computer Science Year 1 COMP1212 Computer Processor Effective programming depends on understanding not only how to give a machine instructions

More information

Are You Ready for Big Data?

Are You Ready for Big Data? Are You Ready for Big Data? Jim Gallo National Director, Business Analytics April 10, 2013 Agenda What is Big Data? How do you leverage Big Data in your company? How do you prepare for a Big Data initiative?

More information

Numerical Algorithms Group

Numerical Algorithms Group Title: Summary: Using the Component Approach to Craft Customized Data Mining Solutions One definition of data mining is the non-trivial extraction of implicit, previously unknown and potentially useful

More information

High-Performance Visualization of Geographic Data

High-Performance Visualization of Geographic Data High-Performance Visualization of Geographic Data Presented by Budhendra Bhaduri Alexandre Sorokine Geographic Information Science and Technology Computational Sciences and Engineering Managed by UT-Battelle

More information