Clustering Components of PySAL



Similar documents
New Tools for Spatial Data Analysis in the Social Sciences

Clustering UE 141 Spring 2013

Constrained Clustering of Territories in the Context of Car Insurance

ARTIFICIAL INTELLIGENCE (CSCU9YE) LECTURE 6: MACHINE LEARNING 2: UNSUPERVISED LEARNING (CLUSTERING)

3/17/2009. Knowledge Management BIKM eclassifier Integrated BIKM Tools

Data Mining Cluster Analysis: Basic Concepts and Algorithms. Lecture Notes for Chapter 8. Introduction to Data Mining

Data Mining Cluster Analysis: Basic Concepts and Algorithms. Lecture Notes for Chapter 8. Introduction to Data Mining

Spatial Data Analysis Using GeoDa. Workshop Goals

Alejandro Betancourt. Curriculum Vitae. Education. Awards. Experience. As researcher

Spatial Analysis with GeoDa Spatial Autocorrelation

Crime Hotspots Analysis in South Korea: A User-Oriented Approach

Clustering. Adrian Groza. Department of Computer Science Technical University of Cluj-Napoca

GeoDa 0.9 User s Guide

Exploring Spatial Data with GeoDa TM : A Workbook

Clustering Connectionist and Statistical Language Processing

MultiExperiment Viewer Quickstart Guide

Territorial Analysis for Ratemaking. Philip Begher, Dario Biasini, Filip Branitchev, David Graham, Erik McCracken, Rachel Rogers and Alex Takacs

Web Work Module User s Guide

Data Mining. Cluster Analysis: Advanced Concepts and Algorithms

Improving Regional PCE Estimates Using Credit Card Transaction Data

A Study of Web Log Analysis Using Clustering Techniques

Environmental Remote Sensing GEOG 2021

Applying Data Analysis to Big Data Benchmarks. Jazmine Olinger

Graduated with honors (Magna Cum Laude). Grade average: 9.6 over 10.

STOCK MARKET TRENDS USING CLUSTER ANALYSIS AND ARIMA MODEL

Raster Operations. Local, Neighborhood, and Zonal Approaches. Rebecca McLain Geography 575 Fall Raster Operations Overview

They can be obtained in HQJHQH format directly from the home page at:

Introduction to Clustering

Master s Programs Offering Dual/Joint Degrees Report for NADD October 2012

Data Mining Applications in Higher Education

Chapter 7. Cluster Analysis

BIRCH: An Efficient Data Clustering Method For Very Large Databases

Citibank Switch Kit Forms

Clustering. Data Mining. Abraham Otero. Data Mining. Agenda

Data Exploration and Preprocessing. Data Mining and Text Mining (UIC Politecnico di Milano)

SPSS Tutorial. AEB 37 / AE 802 Marketing Research Methods Week 7

STATISTICA. Clustering Techniques. Case Study: Defining Clusters of Shopping Center Patrons. and

Parallel Programming Map-Reduce. Needless to Say, We Need Machine Learning for Big Data

Statistical Databases and Registers with some datamining

Tuition and Fees. & Room and Board. Costs

Creating and Manipulating Spatial Weights

Social Media Mining. Data Mining Essentials

A New Method of Estimating Locality of Industry Cluster Regions Using Large-scale Business Transaction Data

How To Get Into A University

How To Create A Large Enterprise Cloud Storage System From A Large Server (Cisco Mds 9000) Family 2 (Cio) 2 (Mds) 2) (Cisa) 2-Year-Old (Cica) 2.5

Curriculum Vita Fall 2015

COST MODEL FOR THE FIXED TELECOMMUNICATIONS NETWORK - PORTUGAL

Cluster Analysis. Alison Merikangas Data Analysis Seminar 18 November 2009

K-means Clustering Technique on Search Engine Dataset using Data Mining Tool

Data Mining 資 料 探 勘. 分 群 分 析 (Cluster Analysis)

An Overview of Knowledge Discovery Database and Data mining Techniques

K-Means Cluster Analysis. Tan,Steinbach, Kumar Introduction to Data Mining 4/18/2004 1

CLUSTER ANALYSIS FOR SEGMENTATION

Cluster Analysis: Advanced Concepts

Map-Reduce for Machine Learning on Multicore

Updated CellTracker software manual

How To Teach A Spatial Analysis Course

How To Teach Deaf And Mute Communication

Data Warehouse Design

Cluster Analysis: Basic Concepts and Algorithms

XIAOBAI (BOB) LI ACADEMIC EXPERIENCE RESEARCH HIGHLIGHTS TEACHING HIGHLIGHTS

University Your selection: 169 universities

Spatial Analysis of Five Crime Statistics in Turkey

Building Data Cubes and Mining Them. Jelena Jovanovic

Configuration Management Training Foundation

Data Mining Clustering (2) Sheets are based on the those provided by Tan, Steinbach, and Kumar. Introduction to Data Mining

Geography. LOWER-DIVISION TRANSFER PATTERN California State University (CSU) Statewide Pattern

FastStats & Dashboard Product Overview

Add-on MODULES FOR THE STONE PROFIT SYSTEM

INTEGRATING GIS AND SPATIAL DATA MINING TECHNIQUE FOR TARGET MARKETING OF UNIVERSITY COURSES

Design & Analysis of Ecological Data. Landscape of Statistical Methods...

Tracking System for GPS Devices and Mining of Spatial Data

RAVEN: A GUI and an Artificial Intelligence Engine in a Dynamic PRA Framework

DR. YUEXIAO DONG. Department of Statistics Fox School of Business Temple University 1810 N. 13th Street Philadelphia, PA 19122

MASTER OF SOFTWARE ENGINEERING DEGREE TRACKS 1. WEB DEVELOPMENT/JAVA:

Corporate Clients. Client List, Feb Aerotek. Bellagio. Booz Allen Hamilton. FirstPic Inc. FranklinCovey. Genesis - United Kingdom

The THREDDS Data Repository: for Long Term Data Storage and Access

Comparative Analysis of EM Clustering Algorithm and Density Based Clustering Algorithm Using WEKA tool.

GeoDa: An Introduction to Spatial Data Analysis

IB 411 Bioinspiration - Syllabus Fall 2015

Fluency With Information Technology CSE100/IMT100

Metamodeling by using Multiple Regression Integrated K-Means Clustering Algorithm

Clustering census data: comparing the performance of self-organising maps and k-means algorithms

Transcription:

Clustering Components of PySAL Sergio Rey 1,2 Juan Carlos Duque 1 Luc Anselin 2,3 1 Regional Analysis Laboratory (REGAL) Department of Geography San Diego State University 2 Regional Economics Application Laboratory (REAL) University of Illinois Urbana Champaign 3 Department of Geography University of Illinois Urbana Champaign Regional Science Association International Las Vegas, Nevada November 10-12, 2005 ergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 1 / 36

Outline 1 PySAL Origins Objectives 2 Clustering Components Definition Methods 3 Aggregation Models in PySAL Forward Planning Up to date... K-means two stages Including Centroids Coordinates AZP-tabu Automatic Regionalization with Initial Seeds Location: ARISeL Minimum Spanning Tree Integrating Local Indicators of Spatial Association (LISA) 4 Future Directions ergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 2 / 36

Outline 1 PySAL Origins Objectives 2 Clustering Components Definition Methods 3 Aggregation Models in PySAL Forward Planning Up to date... K-means two stages Including Centroids Coordinates AZP-tabu Automatic Regionalization with Initial Seeds Location: ARISeL Minimum Spanning Tree Integrating Local Indicators of Spatial Association (LISA) 4 Future Directions ergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 2 / 36

Outline 1 PySAL Origins Objectives 2 Clustering Components Definition Methods 3 Aggregation Models in PySAL Forward Planning Up to date... K-means two stages Including Centroids Coordinates AZP-tabu Automatic Regionalization with Initial Seeds Location: ARISeL Minimum Spanning Tree Integrating Local Indicators of Spatial Association (LISA) 4 Future Directions ergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 2 / 36

Outline 1 PySAL Origins Objectives 2 Clustering Components Definition Methods 3 Aggregation Models in PySAL Forward Planning Up to date... K-means two stages Including Centroids Coordinates AZP-tabu Automatic Regionalization with Initial Seeds Location: ARISeL Minimum Spanning Tree Integrating Local Indicators of Spatial Association (LISA) 4 Future Directions ergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 2 / 36

Outline 1 PySAL Origins Objectives 2 Clustering Components Definition Methods 3 Aggregation Models in PySAL Forward Planning Up to date... K-means two stages Including Centroids Coordinates AZP-tabu Automatic Regionalization with Initial Seeds Location: ARISeL Minimum Spanning Tree Integrating Local Indicators of Spatial Association (LISA) 4 Future Directions ergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 3 / 36

PySAL Origins Collaborative Project OpenSpace (UIUC) STARS (SDSU) ergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 4 / 36

Outline 1 PySAL Origins Objectives 2 Clustering Components Definition Methods 3 Aggregation Models in PySAL Forward Planning Up to date... K-means two stages Including Centroids Coordinates AZP-tabu Automatic Regionalization with Initial Seeds Location: ARISeL Minimum Spanning Tree Integrating Local Indicators of Spatial Association (LISA) 4 Future Directions ergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 5 / 36

Objectives Collaborative Project OpenSpace (UIUC) STARS (SDSU) ergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 6 / 36

Outline 1 PySAL Origins Objectives 2 Clustering Components Definition Methods 3 Aggregation Models in PySAL Forward Planning Up to date... K-means two stages Including Centroids Coordinates AZP-tabu Automatic Regionalization with Initial Seeds Location: ARISeL Minimum Spanning Tree Integrating Local Indicators of Spatial Association (LISA) 4 Future Directions ergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 7 / 36

Contiguity Constrained Clustering Aggregation of N areas to M regions (M < N), such that: 1 Each area belong to only one region. 2 The areas assigned to a region must be geographically connected. N = 47 M = 6 ergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 8 / 36

Outline 1 PySAL Origins Objectives 2 Clustering Components Definition Methods 3 Aggregation Models in PySAL Forward Planning Up to date... K-means two stages Including Centroids Coordinates AZP-tabu Automatic Regionalization with Initial Seeds Location: ARISeL Minimum Spanning Tree Integrating Local Indicators of Spatial Association (LISA) 4 Future Directions ergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 9 / 36

Methods ergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 10 / 36

Outline 1 PySAL Origins Objectives 2 Clustering Components Definition Methods 3 Aggregation Models in PySAL Forward Planning Up to date... K-means two stages Including Centroids Coordinates AZP-tabu Automatic Regionalization with Initial Seeds Location: ARISeL Minimum Spanning Tree Integrating Local Indicators of Spatial Association (LISA) 4 Future Directions ergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 11 / 36

Spatial Aggregation Module Sergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 12 / 36

Outline 1 PySAL Origins Objectives 2 Clustering Components Definition Methods 3 Aggregation Models in PySAL Forward Planning Up to date... K-means two stages Including Centroids Coordinates AZP-tabu Automatic Regionalization with Initial Seeds Location: ARISeL Minimum Spanning Tree Integrating Local Indicators of Spatial Association (LISA) 4 Future Directions ergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 13 / 36

Methods ergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 14 / 36

Menu Sergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 15 / 36

Outline 1 PySAL Origins Objectives 2 Clustering Components Definition Methods 3 Aggregation Models in PySAL Forward Planning Up to date... K-means two stages Including Centroids Coordinates AZP-tabu Automatic Regionalization with Initial Seeds Location: ARISeL Minimum Spanning Tree Integrating Local Indicators of Spatial Association (LISA) 4 Future Directions ergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 16 / 36

ergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 17 / 36

K-means two stages ergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 18 / 36

Outline 1 PySAL Origins Objectives 2 Clustering Components Definition Methods 3 Aggregation Models in PySAL Forward Planning Up to date... K-means two stages Including Centroids Coordinates AZP-tabu Automatic Regionalization with Initial Seeds Location: ARISeL Minimum Spanning Tree Integrating Local Indicators of Spatial Association (LISA) 4 Future Directions ergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 19 / 36

Including Centroids Coordinates ergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 20 / 36

Outline 1 PySAL Origins Objectives 2 Clustering Components Definition Methods 3 Aggregation Models in PySAL Forward Planning Up to date... K-means two stages Including Centroids Coordinates AZP-tabu Automatic Regionalization with Initial Seeds Location: ARISeL Minimum Spanning Tree Integrating Local Indicators of Spatial Association (LISA) 4 Future Directions ergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 21 / 36

Sergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 22 / 36

AZP-tabu ergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 23 / 36

Comparison K-means two stages = 59.346 ARISeL = 45.599 AZP-Tabu = 48.088 Sergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 24 / 36

Outline 1 PySAL Origins Objectives 2 Clustering Components Definition Methods 3 Aggregation Models in PySAL Forward Planning Up to date... K-means two stages Including Centroids Coordinates AZP-tabu Automatic Regionalization with Initial Seeds Location: ARISeL Minimum Spanning Tree Integrating Local Indicators of Spatial Association (LISA) 4 Future Directions ergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 25 / 36

ergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 26 / 36

Continue... ergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 27 / 36

ergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 28 / 36

ARISeL ergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 29 / 36

Outline 1 PySAL Origins Objectives 2 Clustering Components Definition Methods 3 Aggregation Models in PySAL Forward Planning Up to date... K-means two stages Including Centroids Coordinates AZP-tabu Automatic Regionalization with Initial Seeds Location: ARISeL Minimum Spanning Tree Integrating Local Indicators of Spatial Association (LISA) 4 Future Directions ergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 30 / 36

Sergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 31 / 36

ergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 32 / 36

Sergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 33 / 36

Outline 1 PySAL Origins Objectives 2 Clustering Components Definition Methods 3 Aggregation Models in PySAL Forward Planning Up to date... K-means two stages Including Centroids Coordinates AZP-tabu Automatic Regionalization with Initial Seeds Location: ARISeL Minimum Spanning Tree Integrating Local Indicators of Spatial Association (LISA) 4 Future Directions ergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 34 / 36

Integrating Local Indicators of Spatial Association (LISA) LISA yl i = f (w i., y) (1) Integrating into regionalization algorithms Kmeans Select k significant LISAs to serve as initial seeds. Kmeans Extend attribute set to include local statistics Kmeans Combine: LISA seeds and LISA attributes ergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 35 / 36

Future Directions Regionalization Extensions to what we have done here. PySAL Clustering Other aspects of Clustering in PySAL ergio Rey, Juan Carlos Duque, Luc Anselin (San Diego State Clustering University) in PySAL RSAI 2005 36 / 36