Geo Data Mining and Visual Analytics

Similar documents
Information Visualization WS 2013/14 11 Visual Analytics

International Journal of Scientific & Engineering Research, Volume 5, Issue 4, April ISSN

Data Mining Applications in Higher Education

Data Mining and Exploration. Data Mining and Exploration: Introduction. Relationships between courses. Overview. Course Introduction

Driving Value From Big Data

Statistics 215b 11/20/03 D.R. Brillinger. A field in search of a definition a vague concept

Database Marketing, Business Intelligence and Knowledge Discovery

MLg. Big Data and Its Implication to Research Methodologies and Funding. Cornelia Caragea TARDIS November 7, Machine Learning Group

Data Mining System, Functionalities and Applications: A Radical Review

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014

Example application (1) Telecommunication. Lecture 1: Data Mining Overview and Process. Example application (2) Health

REFLECTIONS ON THE USE OF BIG DATA FOR STATISTICAL PRODUCTION

Social Data Science for Intelligent Cities

DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM.

How To Become A Data Scientist

The Big Data methodology in computer vision systems

The 4 Pillars of Technosoft s Big Data Practice

Spatial Data Mining Methods and Problems

Chapter 6 - Enhancing Business Intelligence Using Information Systems

Introduction to Data Mining and Machine Learning Techniques. Iza Moise, Evangelos Pournaras, Dirk Helbing

White Paper. How Streaming Data Analytics Enables Real-Time Decisions

Lluis Belanche + Alfredo Vellido. Intelligent Data Analysis and Data Mining

A Statistical Text Mining Method for Patent Analysis

CONNECTING DATA WITH BUSINESS

Chapter 11. Managing Knowledge

A Capability Model for Business Analytics: Part 2 Assessing Analytic Capabilities

Transforming the Telecoms Business using Big Data and Analytics

GEO-VISUALIZATION SUPPORT FOR MULTIDIMENSIONAL CLUSTERING

Introduction. A. Bellaachia Page: 1

DATA ANALYSIS USING BUSINESS INTELLIGENCE TOOL. A Thesis. Presented to the. Faculty of. San Diego State University. In Partial Fulfillment

DATA MINING CONCEPTS AND TECHNIQUES. Marek Maurizio E-commerce, winter 2011

Topics in basic DBMS course

Course DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Data Exploration with GIS Viewsheds and Social Network Analysis

Outline. What is Big data and where they come from? How we deal with Big data?

Text Mining - Scope and Applications

Towards applying Data Mining Techniques for Talent Mangement

International Journal of Innovative Research in Computer and Communication Engineering

SPATIAL DATA CLASSIFICATION AND DATA MINING

relevant to the management dilemma or management question.

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Big Data in Transportation Engineering

BIG DATA: CHALLENGES AND OPPORTUNITIES IN LOGISTICS SYSTEMS

Analysis of Data Mining Concepts in Higher Education with Needs to Najran University

BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON

Manjula Ambur NASA Langley Research Center April 2014

Volume 3, Issue 8, August 2015 International Journal of Advance Research in Computer Science and Management Studies

Introduction to Data Mining and Business Intelligence Lecture 1/DMBI/IKI83403T/MTI/UI

Healthcare Measurement Analysis Using Data mining Techniques

A Spatial Decision Support System for Property Valuation

Chapter 5. Warehousing, Data Acquisition, Data. Visualization

The Challenges of Geospatial Analytics in the Era of Big Data

locuz.com Big Data Services

Lluis Belanche + Alfredo Vellido. Intelligent Data Analysis and Data Mining. Data Analysis and Knowledge Discovery

Adding New Level in KDD to Make the Web Usage Mining More Efficient. Abstract. 1. Introduction [1]. 1/10

Text Mining Approach for Big Data Analysis Using Clustering and Classification Methodologies

A Hurwitz white paper. Inventing the Future. Judith Hurwitz President and CEO. Sponsored by Hitachi

Selection of Optimal Discount of Retail Assortments with Data Mining Approach

Information Management course

CPSC 340: Machine Learning and Data Mining. Mark Schmidt University of British Columbia Fall 2015

Industry Trends & Challenges in Oil & Gas

Visual Analytics. Daniel A. Keim, Florian Mansmann, Andreas Stoffel, Hartmut Ziegler University of Konstanz, Germany

Geographically weighted visualization interactive graphics for scale-varying exploratory analysis

A STUDY REGARDING INTER DOMAIN LINKED DOCUMENTS SIMILARITY AND THEIR CONSEQUENT BOUNCE RATE

Data Isn't Everything

Distributed Database for Environmental Data Integration

North Highland Data and Analytics. Data Governance Considerations for Big Data Analytics

Problems to store, transfer and process the Big Data 6/2/2016 GIANG TRAN - TTTGIANG2510@GMAIL.COM 1

How To Find Out What A Web Log Data Is Worth On A Blog

How To Use Data Mining For Knowledge Management In Technology Enhanced Learning

A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS

How To Create A Text Classification System For Spam Filtering

Doctor of Philosophy (Ph.D.), Major in. Geographic Information Science. Ph.D. Program. Educational Goal. Admission Policy. Advancement to Candidacy

A Framework of Context-Sensitive Visualization for User-Centered Interactive Systems

Big Data Strategies Creating Customer Value In Utilities

2015 Workshops for Professors

Introduction to Text Mining and Semantics. Seth Grimes -- President, Alta Plana

Geospatial Data Integration Open Service Bus Architecture

Software Engineering for Big Data. CS846 Paulo Alencar David R. Cheriton School of Computer Science University of Waterloo

The Value of Visualization for Understanding Data and Making Decisions

Using LSI for Implementing Document Management Systems Turning unstructured data from a liability to an asset.

WEB SITE OPTIMIZATION THROUGH MINING USER NAVIGATIONAL PATTERNS

Where is... How do I get to...

Transcription:

Geo Data Mining and Visual Analytics Beyond Limits Developments in Cadastral Domain Workshop, Zürich 19 March 2015 Susanne Bleisch Institute of Geomatics Engineering School of Architecture, Civil Engineering and Geomatics HABG FHNW University of Applied Sciences and Arts Northwestern Switzerland

Define - Exemplify - Use

Data Science Statistics - Data mining - Machine Learning - Artificial Intelligence Figure: Mamatha Upadhyaya. Capgemini. http://www.slideshare.net/slideshow/embed_code/36866068

(Geographic) Data Mining u u u u Data- and computation-poor to data- and computation-rich environments Traditional (spatial) analytical methods developed for small and homogenous data sets From confirmatory (a priori hypotheses) to exploratory (gain insight) Find hidden information in the data: novel, valid, useful and understandable Miller, H. J., & Han, J. (Eds.). (2009). Geographic Data Mining and Knowledge Discovery (2nd ed., p. 458). Parkway: CRC Press.

Clustering Outlier analysis Typical data mining tasks Classification Modelling/Prediction A A A A A? B B B Association analysis Sequence analysis {wine, cheese} -> {bread} C > A > B

Visual Analytics Humans in the loop Visual Analytics is the science of analytical reasoning supported by interactive visual interfaces.... detect the expected and discover the unexpected... Thomas, J.J. & Cook, K.A., 2005. Illuminating the Path: The Research and Development Agenda for Visual Analytics, IEEE Computer Society. Figure: http://www.visual-analytics.eu/faq/

Visual Analytics Andrienko, G. & Andrienko, N., 2008. Visual Analytics. http://geoanalytics.net/and/.

u Textual data: documents, news, email, tweets,... u Databases and collections u Image data: from satellite to user generated u Sensor data u Video data u Networks, linked data u Structured, semi-structured, unstructured u... Data Challenges: u Heterogeneous sources and types u massive amounts but incomplete, inconsistent, time-varying and potentially inaccessible,...

Spatial is special u u u u u GI is uncertain, imprecise or vague. GI is correlated, i.e. non-random. GI is dynamic, it represents an ever-changing world. GI is scale dependent. GI is heterogeneous.

Define - Exemplify - Use

recaptcha https://www.google.com/recaptcha/intro/index.html

Information from small pieces of data Rudder, C. (2014). Dataclysm: Who We Are (When We Think No One s Looking). London, UK: Crown/Archetype.

Information from small pieces of data http://blog.okcupid.com/index.php/ dont-be-ugly-by-accident/

Geographic information retrieval Midlands Wales Arampatzis, A., van Kreveld, M., Reinbacher, I., Jones, C. B., Vaid, S., Clough, P., Hideo, J. & Sanderson, M. (2006). Web-based delineation of imprecise regions. Computers, Environment and Urban Systems, 30, 436 459.

Geographic information retrieval Hollenstein, Livia; Purves, Ross S (2010). Exploring place through user-generated content: Using Flickr to describe city cores. Journal of Spatial Information Science, 1(1):21-48.

Descriptions > maps We re sitting in Baretto s in the Alan Gilbert Building, across Grattan street is one of the medical buildings. Down the hill along Grattan street the new building being constructed is the Peter Doherty Institute and diagonally across the road (Royal Parade) is Melbourne Hospital. In the other direction... Vasardani, M., Timpf, S., Winter, S. and Tomko, M. (2013) From Descriptions to Depictions: A conceptual Framework. COSIT 2013, Sept. 2th - 6th, Scarborough, UK. In Lecture Notes in Computer Science (LNCS), 8116. 299-319, Springer.

a b c d e f g h i j k l m n o p q r s t u v w 2008 2009 2010 2011

Event causes Event Full moon causes Start of fish migration State allows Event High river flow allows Start of fish migration Bleisch, S., Duckham, M., Galton, A., Laube, P., & Lyon, J., 2014. Mining candidate causal relationships in movement patterns. Int. J. of Geographical Information Science, 28(2), 363 382.

Data Mining Sequence analysis Fish migration Fish migration upstream downstream Moon phases expected observed Bleisch, S., Duckham, M., Galton, A., Laube, P., & Lyon, J., 2014. Mining candidate causal relationships in movement patterns. Int. J. of Geographical Information Science, 28(2), 363 382.

Data Mining Assocation analysis (river flow) Fish migration Fish migration upstream downstream expected observed Bleisch, S., Duckham, M., Galton, A., Laube, P., & Lyon, J., 2014. Mining candidate causal relationships in movement patterns. Int. J. of Geographical Information Science, 28(2), 363 382.

Event causes Event -> event sequences Subsequence Support Count 1 (1) 0.98536210 1279 2 (7) 0.50693374 658 3 (7)-(1) 0.49152542 638 4 (5) 0.42604006 553 5 (5)-(1) 0.40369800 524 6 (1)-(1) 0.38212635 496 7 (5)-(5) 0.33281972 432 8 (10) 0.31432974 408 9 (5)-(5)-(1) 0.30739599 399 10 (10)-(1) 0.29583975 384 11 (2) 0.24653313 320 12 (2)-(1) 0.23959938 311 13 (5)-(7) 0.23497689 305 14 (7)-(5) 0.22804314 296 15 (5)-(7)-(1) 0.22187982 288 16 (7)-(7) 0.21263482 276 17 (7)-(5)-(1) 0.21109399 274 18 (1)-(5) 0.21032357 273 19 (1)-(7) 0.20801233 270 20 (7)-(7)-(1) 0.20801233 270 21 (1)-(7)-(1) 0.19491525 253 22 (5)-(7)-(5) 0.19029276 247 23 (1)-(5)-(1) 0.18489985 240 24 (5)-(7)-(5)-(1) 0.17796610 231 25 (7)-(1)-(1) 0.17642527 229 26 (5)-(5)-(5) 0.17334361 225 84 (7)-(7)-(7)-(1) 0.08089368 105 85 (7)-(1)-(5)-(1) 0.08012327 104 86 (2)-(7) 0.07935285 103 87 (7)-(5)-(7) 0.07935285 103 88 (2)-(5)-(1) 0.07858243 102 89 (5)-(1)-(7) 0.07858243 102 90 (5)-(2) 0.07858243 102 91 (1)-(5)-(7)-(5)-(1) 0.07781202 101 92 (4) 0.07704160 100 93 (10)-(5)-(5) 0.07704160 100 94 (2)-(7)-(1) 0.07627119 99 95 (5)-(5)-(10)-(1) 0.07550077 98 96 (5)-(2)-(1) 0.07473035 97 97 (7)-(7)-(1)-(1) 0.07473035 97 98 (7)-(5)-(7)-(1) 0.07395994 96 99 (4)-(1) 0.07318952 95 100 (5)-(1)-(5)-(5)-(1) 0.07318952 95 101 (5)-(1)-(7)-(1) 0.07318952 95 102 (9)-(7) 0.07318952 95 103 (2)-(1)-(1) 0.07241911 94 104 (5)-(7)-(7)-(5) 0.07241911 94 105 (10)-(5)-(5)-(1) 0.07241911 94 106 (1)-(3) 0.07164869 93 107 (5)-(5)-(1)-(5)-(1) 0.07087827 92 108 (1)-(3)-(1) 0.06933744 90 109 (7)-(1)-(10) 0.06933744 90

Define - Exemplify - Use

Thank you! Beyond Limits Developments in Cadastral Domain Workshop, Zürich 19 March 2015 Susanne Bleisch susanne.bleisch@fhnw.ch