Exploratory Data Analysis for Ecological Modelling and Decision Support
|
|
|
- Derick Watkins
- 10 years ago
- Views:
Transcription
1 Exploratory Data Analysis for Ecological Modelling and Decision Support Gennady Andrienko & Natalia Andrienko Fraunhofer Institute AIS Sankt Augustin Germany 5th ECEM conference, Pushchino, Russia, Outline 1. Geo-visualisation s view on ecological modelling: demanding problems and challenging tasks 2. Case study 1: pesticide accumulation 3. Case study 2: forest dynamics 4. A systematic approach to exploratory data analysis (EDA): elements of the general theory 5. Software issues 5th ECEM conference, Pushchino, Russia,
2 A View on Ecological Modelling Complex and multidimensional; May contain errors May have many parameters; May contain errors Need to be interpreted; To be used for informed decision making Input data Model Output data Data Complexity: 1) Space, 2) Time, 3) Multiple attributes & dimensions, 4) Variability of values, abrupt changes 5th ECEM conference, Pushchino, Russia, Outputs of simulations Multiple attributes referring to Simulation scenarios; Spatial locations (objects); Time moments; (e.g. species, age groups etc.) 5th ECEM conference, Pushchino, Russia,
3 Complexities Number of attributes Length of time series Number of spatial objects High dimensionality => huge number of combinations (normally )! Abrupt temporal changes Great variability of values 5th ECEM conference, Pushchino, Russia, Complexities: example 1 5th ECEM conference, Pushchino, Russia,
4 Complexities: example 2 5th ECEM conference, Pushchino, Russia, Our goals Support analysts and decision makers in: Preparing and harmonizing input data; Tuning models and their parameters; Interpreting outputs of simulation; Exploring alternatives for decision making; Justifying and communicating the resulting decisions. Instruments: interactive visualisation enhanced by intelligent aggregation tools and other tools for exploratory data analysis 5th ECEM conference, Pushchino, Russia,
5 Outline 1. Geo-visualisation s view on ecological modelling: demanding problems and challenging tasks 2. Case study 1: pesticide accumulation 3. Case study 2: forest dynamics 4. A systematic approach to exploratory data analysis (EDA): elements of the general theory 5. Software issues 5th ECEM conference, Pushchino, Russia, GIMMI project Multiple simulation scenarios (different crops and active ingredients) about 1000 plots Geographical Information and Mathematic Model Interoperability IST , TXT Italy, EIG Germany, AIS Germany simulation depth: 10+ years several output variables that characterize various environmental aspects (pesticide accumulation etc.) 5th ECEM conference, Pushchino, Russia,
6 Decisions to be made What crop? What active ingredient? In what concentration? for individual plots and for the whole territory 5th ECEM conference, Pushchino, Russia, Pesticide accumulation dynamics In fact, we see the extreme values only! Zoom? 5th ECEM conference, Pushchino, Russia,
7 Zoomed pesticide accumulation The extreme values and the overall view are lost But the details are still not visible! 5th ECEM conference, Pushchino, Russia, Log10 transformed values Let s transform values to log10 Now we can see something! 5th ECEM conference, Pushchino, Russia,
8 Only means, medians, and envelopes Remove individual lines: look only at the dynamics of the general characteristics 5th ECEM conference, Pushchino, Russia, Details of the distribution And finally add more details on the overall level! 5th ECEM conference, Pushchino, Russia,
9 Count plots within intervals 1. Specify classes according to pesticide concentration 2. Count number of plots within each interval for each year 3. Draw the counts as stacked bars 4. Possible extension: use areas or other amounts instead of the counts 5th ECEM conference, Pushchino, Russia, Compare the accumulation in all scenarios for the whole territory Another view on the overall characteristics of all 5 scenarios 5th ECEM conference, Pushchino, Russia,
10 Look at the individual plots 2 nd scenario => 7 th year Aggregated diagram representation supports the evaluation of the scenarios for individual plots Dynamic linking between displays supports selection of interesting plots 5th ECEM conference, Pushchino, Russia, Outline 1. Geo-visualisation s view on ecological modelling: demanding problems and challenging tasks 2. Case study 1: pesticide accumulation 3. Case study 2: forest dynamics 4. A systematic approach to exploratory data analysis (EDA): elements of the general theory 5. Software issues 5th ECEM conference, Pushchino, Russia,
11 Silvics project 104 forest compartments 4 scenarios of development: NATural Selective CUtting Legal RUssian ILLegal Simulation results for 200 years (41 time moments) 6 species 13 age groups 104*4*41*6*13=1,000,000 combinations For these combinations: 20 attributes! SILVICS - Silvicultural Systems for Sustainable Forest Resources Management Univ. Wageningen (NL), EFI (FI), AIS (DE), RAS (RU) INTAS, th ECEM conference, Pushchino, Russia, Compare biomass in two scenarios SCU: rather stable number of forest compartments in all classes; LRU: high temporal variability 5th ECEM conference, Pushchino, Russia,
12 Look at biodiversity: Dominant Species/Age classification Oak, 7 th age group dominates in some compartments Two interactively specified thresholds 1. Presence 2. Dominance level Tilia, 5 th age group is present in some compartments 5th ECEM conference, Pushchino, Russia, Dominant Species and Age Class (1) natural selective Russian illegal 5th ECEM conference, Pushchino, Russia,
13 Dominant Species and Age Class (2) natural selective Russian illegal 5th ECEM conference, Pushchino, Russia, Species Structure (1) 5th ECEM conference, Pushchino, Russia,
14 Species Structure (2) 5th ECEM conference, Pushchino, Russia, Age Structure (1) 5th ECEM conference, Pushchino, Russia,
15 Age Structure (2) 5th ECEM conference, Pushchino, Russia, Outline 1. Geo-visualisation s view on ecological modelling: demanding problems and challenging tasks 2. Case study 1: pesticide accumulation 3. Case study 2: forest dynamics 4. A systematic approach to exploratory data analysis (EDA): elements of the general theory 5. Software issues 5th ECEM conference, Pushchino, Russia,
16 Recap: aggregation tools 1. Several variants of time series aggregation 2. Aggregation of multiple attributes via selection of the dominant attribute both in a spatial context closely integrated with interactive visualisation and data transformation 5th ECEM conference, Pushchino, Russia, Roles of aggregation tools in EDA Aggregation supports grasping the overall characteristics on the processes / scenarios To be instrumental, aggregation tools should be interactive and dynamic for: 1. Flexible and powerful data transformation 2. Immediate feedback on visual displays 3. Analysis of sensitivity to the aggregation parameters 4. Selection of interesting data instances, access to them Intelligent aggregation is important for decision support as a tool for the exploration and evaluation of alternatives 5th ECEM conference, Pushchino, Russia,
17 What Is EDA? Emerged in statistics in 1970ies; originator: John Tukey A philosophy and discipline of unbiased looking at data: What can data tell me? rather than Do they agree with my expectations? Similar to the work of a detective (J.Tukey) Need to look at data focus on visualisation and user interaction with data displays 5th ECEM conference, Pushchino, Russia, Purposes of EDA Uncover peculiarities of the data and, on this basis, understand how the data should be further processed (e.g. filtered, transformed, split into parts, fused, ) Generate hypotheses for further testing (e.g. using statistical methods) Choose proper methods for in-depth analysis (possibly, domain-specific) 5th ECEM conference, Pushchino, Russia,
18 EDA vs. other analyses EDA does not substitute rigor methods of numerical analysis, either general or domain-specific, but should give the understanding what methods and how to apply Original data 1. EDA 2. Data processing Understanding of the data (mental model) Processed data 3. In-depth analysis Conclusions, theories, decisions, 5th ECEM conference, Pushchino, Russia, EDA vs. information presentation EDA makes intensive use of graphics However, nice presentation and reporting are not EDA purposes Primary goal of presentation: convey certain idea or set of ideas to others Understandably Convincingly Aesthetically attractively This requires different visual means than exploration 5th ECEM conference, Pushchino, Russia,
19 Case study 3: EDA for the exploration of forest defoliations Large volume: 6169 spatially-referenced time series Two dimensions: S&T Many missing values No full compatibility across countries, species, time etc. Data from NEFIS project 5th ECEM conference, Pushchino, Russia, General procedure of the EDA 1. See the whole Space + Time 2 complementary views 1) Evolution of spatial patterns in time 2) Distribution of temporal behaviours in space 2. Divide and focus Data are complex Have to be explored by slices and subsets (species, age groups, countries, years, ) 3. Attend to particulars Detect outliers, strange behaviours, unexpected patterns, 5th ECEM conference, Pushchino, Russia,
20 See the whole: Handle large data volumes General approach: Data aggregation Task 1: Explore evolution of spatial patterns Appropriate data transformation: aggregate by small space compartments (regular grid with 4025 cells); separately for different species; various aggregates (mean, max) Gain: no symbol overlapping 5th ECEM conference, Pushchino, Russia, Explore evolution of spatial patterns a) Animated map b) Map sequence Observations: Persistently high values in Poland Improvement in Belarus Mosaic distribution in most countries: great differences between close locations Outliers 5th ECEM conference, Pushchino, Russia,
21 Divide and Focus: Exploration on country level Recommendable due to inconsistencies between countries Observation: abrupt changes between locations spatial smoothing methods are not appropriate 5th ECEM conference, Pushchino, Russia, Explore spatial distribution of temporal behaviours Are behaviours in neighbouring places similar? Step 1. Smoothing supports revealing general patterns and disregarding fluctuations and outliers (we shall look at outliers later) 5th ECEM conference, Pushchino, Russia,
22 Explore spatial distribution of temporal behaviours Are behaviours in neighbouring places similar? Step 2. Temporal comparison (e.g. with particular year, mean for a period) helps to disregard absolute differences in values and thus focus on behaviours Observation: no strong similarity between neighbouring places 5th ECEM conference, Pushchino, Russia, Compare behaviours in plots with different main species Mosaic signs: 6 rows for species; 14 columns for years ; Colours encode defoliation values Observation: behaviours differ for different main species 5th ECEM conference, Pushchino, Russia,
23 Explore overall temporal trends Line overlapping obstructs data analysis apply aggregation 5th ECEM conference, Pushchino, Russia, Aggregation method 1: by quantiles 5th ECEM conference, Pushchino, Russia,
24 Aggregation method 2: by intervals 5th ECEM conference, Pushchino, Russia, Divide and Focus: Germany 5th ECEM conference, Pushchino, Russia,
25 Divide and Focus: age groups 1,3 5th ECEM conference, Pushchino, Russia, Attend to particulars Types of particulars (examples): Extreme values Extreme changes High variability Questions: When? Where? What is around? Why? (a question for further, in-depth analysis) Domain knowledge is essential 5th ECEM conference, Pushchino, Russia,
26 Attend to particulars: extreme values 1. Click on a segment corresponding to extreme values 2. The behaviour(s) is(are) highlighted on the time graph 3. The location(s) is(are) highlighted on the map 5th ECEM conference, Pushchino, Russia, Attend to particulars: what is around? In some neighbouring places the behaviours during the period are somewhat similar 5th ECEM conference, Pushchino, Russia,
27 Attend to particulars: extreme changes 1. Transform the time graph to show changes 2. Select extreme changes in a specific year (here 2003) 5th ECEM conference, Pushchino, Russia, Attend to particulars: high variation 1. Aggregate time graph by quantiles 2. Save counts 3. Visualise e.g. on a scatter plot 4. Select items with high variation 5th ECEM conference, Pushchino, Russia,
28 Attend to particulars: high fluctuation Select items with maximal number of jumps between quantiles 5th ECEM conference, Pushchino, Russia, Attend to particulars: stable extremes Select items being always in the topmost 10% 5th ECEM conference, Pushchino, Russia,
29 Attend to particulars: stable increase 1. Turn the time graph in the segmentation mode 2. Choose increase and set minimum difference 3. Select a sequence of years by clicking 4. Check sensitivity to the time period! 5th ECEM conference, Pushchino, Russia, Recap: Exploration procedure See the whole Evolution of spatial patterns in time Distribution of temporal behaviours in space Divide and focus Data were explored by slices and subsets (species, age groups, countries, years, ) Attend to particulars Extreme values, extreme changes, high variation, high fluctuations, stable growth 5th ECEM conference, Pushchino, Russia,
30 Recap: Tools Visualisation on thematic maps, time graphs, other aspatial displays Aggregation: reduce data volume & symbol overlapping Filtering: divide and focus (select subsets) Marking: see corresponding data on several displays Data transformation: smoothing, computing changes, normalisation etc. It is important to use the tools in combination 5th ECEM conference, Pushchino, Russia, Elements of the theory of EDA Data Have structure and properties Observations, findings, conclusions, decisions Principles Task = Target + Constraints defined by properties of the data Are suitable for specific types of data and tasks Tasks Principles Tools 5th ECEM conference, Pushchino, Russia,
31 Notes for designers of new tools Tool design (in particular, map design) should base on task analysis! Data Potential tasks Assessment of existing tools Data structure Tool requirements Combining existing tools and inventing new ones 5th ECEM conference, Pushchino, Russia, Outline 1. Geo-visualisation s view on ecological modelling: demanding problems and challenging tasks 2. Case study 1: pesticide accumulation 3. Case study 2: forest dynamics 4. A systematic approach to exploratory data analysis (EDA): elements of the general theory 5. Software issues 5th ECEM conference, Pushchino, Russia,
32 Requirements to EDA software Space- and Time-awareness Work with complex multidimensional data Support for uncertain and missing data Scalability Support and encouraging of several complementary views on the same data Dynamic linking and coordination of several data displays From the overall view to particulars of interest From idea generation to hypothesis testing using statistical methods, followed by reporting 5th ECEM conference, Pushchino, Russia, GIS for EDA: major problems Time-awareness Work with complex and multidimensional data Processing uncertain and missing data Scalability Interactivity of visualisations Dynamic linking of multiple displays Idea processing 5th ECEM conference, Pushchino, Russia,
33 Potentially useful tools for EDA Information visualisation tools, for example, HCE & TimeSearcher from HCIL, Univ. Maryland Geovisualisation tools, for example GeoVistaStudio (Penn State Univ.) and Descartes/CommonGIS (Fraunhofer Institute AIS) Graphical statistics tools, for example, Manet & Mondrian (Augsburg Univ.) Usually such systems are research prototypes that implement innovative ideas, but provide restricted functionality and limited user support 5th ECEM conference, Pushchino, Russia, CommonGIS (not a common GIS ) A variety of well-integrated tools for EDA Time-aware maps + statistical graphics; several mechanisms of display coordination Designed to gain synergy of Visualisation Display manipulation Data manipulation Querying Computational techniques, including aggregation and data mining Quick demo? 5th ECEM conference, Pushchino, Russia,
34 Still open issues (for all tools!) Work with qualitative (non-numeric) data Work with fuzzy, uncertain, and missing data Continue scalability efforts Intelligent guidance through the overall process of data analysis, avoiding cognitive complexity Adaptability to user, data, tasks, and hardware Support in processing and management of observations: recording, structuring, browsing, searching, checking, combining, interpreting Help in visual communication of derived data, constructed knowledge, and recommended decisions 5th ECEM conference, Pushchino, Russia, Conclusions 1. EDA is essential in ecological modelling for preparation of data, verification and tuning of models, interpretation of results, and evaluation of decision alternatives 2. Systematic application of EDA requires careful consideration of characteristics of data, relevant analytical tasks, properties of tools 3. EDA tools should combine interactive visualisation with data transformation, dynamic query, and sophisticated computations 4. Still there are many things to do for scientists and for software developers 5th ECEM conference, Pushchino, Russia,
35 To Learn More: Software: Papers, tutorials, on-line demos: Book to appear: Natalia and Gennady Andrienko Exploratory Analysis of Spatial and Temporal data. A Systematic Approach (Springer-Verlag, end 2005) A theoretical framework for linking tasks, tools, and principles of data analysis 5th ECEM conference, Pushchino, Russia, In press, to appear end th ECEM conference, Pushchino, Russia,
GEO-VISUALIZATION SUPPORT FOR MULTIDIMENSIONAL CLUSTERING
Geoinformatics 2004 Proc. 12th Int. Conf. on Geoinformatics Geospatial Information Research: Bridging the Pacific and Atlantic University of Gävle, Sweden, 7-9 June 2004 GEO-VISUALIZATION SUPPORT FOR MULTIDIMENSIONAL
DESCARTES & EFIMOD: An Integrated System for Simulation Modelling and Exploration Data Analysis for Decision Support in Sustainable Forestry
DESCARTES & EFIMOD: An Integrated System for Simulation Modelling and Exploration Data Analysis for Decision Support in Sustainable Forestry A. Komarov a, O. Chertov b, G. Andrienko c, N. Andrienko c,
Exploratory Spatial Data Analysis
Exploratory Spatial Data Analysis Part II Dynamically Linked Views 1 Contents Introduction: why to use non-cartographic data displays Display linking by object highlighting Dynamic Query Object classification
Interactive Analysis of Event Data Using Space-Time Cube
Interactive Analysis of Event Data Using Space-Time Cube Peter Gatalsky, Natalia Andrienko, and Gennady Andrienko Fraunhofer Institute for Autonomous Intelligent Systems Schloss Birlinghoven, D-53754 Sankt
COM CO P 5318 Da t Da a t Explora Explor t a ion and Analysis y Chapte Chapt r e 3
COMP 5318 Data Exploration and Analysis Chapter 3 What is data exploration? A preliminary exploration of the data to better understand its characteristics. Key motivations of data exploration include Helping
Information Visualization WS 2013/14 11 Visual Analytics
1 11.1 Definitions and Motivation Lot of research and papers in this emerging field: Visual Analytics: Scope and Challenges of Keim et al. Illuminating the path of Thomas and Cook 2 11.1 Definitions and
GEOGRAPHIC INFORMATION SYSTEMS
GEOGRAPHIC INFORMATION SYSTEMS WHAT IS A GEOGRAPHIC INFORMATION SYSTEM? A geographic information system (GIS) is a computer-based tool for mapping and analyzing spatial data. GIS technology integrates
Data Mining: Exploring Data. Lecture Notes for Chapter 3. Introduction to Data Mining
Data Mining: Exploring Data Lecture Notes for Chapter 3 Introduction to Data Mining by Tan, Steinbach, Kumar Tan,Steinbach, Kumar Introduction to Data Mining 8/05/2005 1 What is data exploration? A preliminary
The STC for Event Analysis: Scalability Issues
The STC for Event Analysis: Scalability Issues Georg Fuchs Gennady Andrienko http://geoanalytics.net Events Something [significant] happened somewhere, sometime Analysis goal and domain dependent, e.g.
Blending Aggregation and Selection: Adapting Parallel Coordinates for the Visualization of Large Datasets
The Cartographic Journal Vol. 42 No. 1 pp. 49 60 June 2005 # The British Cartographic Society 2005 REFEREED PAPER Blending Aggregation and Selection: Adapting Parallel Coordinates for the Visualization
Knowledge Discovery and Data Mining. Structured vs. Non-Structured Data
Knowledge Discovery and Data Mining Unit # 2 1 Structured vs. Non-Structured Data Most business databases contain structured data consisting of well-defined fields with numeric or alphanumeric values.
Data Exploration Data Visualization
Data Exploration Data Visualization What is data exploration? A preliminary exploration of the data to better understand its characteristics. Key motivations of data exploration include Helping to select
Business Intelligence and Process Modelling
Business Intelligence and Process Modelling F.W. Takes Universiteit Leiden Lecture 2: Business Intelligence & Visual Analytics BIPM Lecture 2: Business Intelligence & Visual Analytics 1 / 72 Business Intelligence
Data Visualization Techniques
Data Visualization Techniques From Basics to Big Data with SAS Visual Analytics WHITE PAPER SAS White Paper Table of Contents Introduction.... 1 Generating the Best Visualizations for Your Data... 2 The
Data Mining: Exploring Data. Lecture Notes for Chapter 3. Introduction to Data Mining
Data Mining: Exploring Data Lecture Notes for Chapter 3 Introduction to Data Mining by Tan, Steinbach, Kumar What is data exploration? A preliminary exploration of the data to better understand its characteristics.
Data Visualization Techniques
Data Visualization Techniques From Basics to Big Data with SAS Visual Analytics WHITE PAPER SAS White Paper Table of Contents Introduction.... 1 Generating the Best Visualizations for Your Data... 2 The
How To Understand The History Of Navigation In French Marine Science
E-navigation, from sensors to ship behaviour analysis Laurent ETIENNE, Loïc SALMON French Naval Academy Research Institute Geographic Information Systems Group [email protected] [email protected]
TIBCO Spotfire Business Author Essentials Quick Reference Guide. Table of contents:
Table of contents: Access Data for Analysis Data file types Format assumptions Data from Excel Information links Add multiple data tables Create & Interpret Visualizations Table Pie Chart Cross Table Treemap
DHL Data Mining Project. Customer Segmentation with Clustering
DHL Data Mining Project Customer Segmentation with Clustering Timothy TAN Chee Yong Aditya Hridaya MISRA Jeffery JI Jun Yao 3/30/2010 DHL Data Mining Project Table of Contents Introduction to DHL and the
A GENERAL TAXONOMY FOR VISUALIZATION OF PREDICTIVE SOCIAL MEDIA ANALYTICS
A GENERAL TAXONOMY FOR VISUALIZATION OF PREDICTIVE SOCIAL MEDIA ANALYTICS Stacey Franklin Jones, D.Sc. ProTech Global Solutions Annapolis, MD Abstract The use of Social Media as a resource to characterize
Data Visualisation and Its Application in Official Statistics. Olivia Or Census and Statistics Department, Hong Kong, China [email protected].
Data Visualisation and Its Application in Official Statistics Olivia Or Census and Statistics Department, Hong Kong, China [email protected] Abstract Data visualisation has been a growing topic of
Business Intelligence System for Monitoring, Analysis and Forecasting of Socioeconomic Development of Russian Territories
Business Intelligence System for Monitoring, Analysis and Forecasting of Socioeconomic Development of Russian Territories The Ministry of Economic Development of the Russian Federation is responsible for
Lecture 2: Descriptive Statistics and Exploratory Data Analysis
Lecture 2: Descriptive Statistics and Exploratory Data Analysis Further Thoughts on Experimental Design 16 Individuals (8 each from two populations) with replicates Pop 1 Pop 2 Randomly sample 4 individuals
Visualization methods for patent data
Visualization methods for patent data Treparel 2013 Dr. Anton Heijs (CTO & Founder) Delft, The Netherlands Introduction Treparel can provide advanced visualizations for patent data. This document describes
Principles of Data Visualization for Exploratory Data Analysis. Renee M. P. Teate. SYS 6023 Cognitive Systems Engineering April 28, 2015
Principles of Data Visualization for Exploratory Data Analysis Renee M. P. Teate SYS 6023 Cognitive Systems Engineering April 28, 2015 Introduction Exploratory Data Analysis (EDA) is the phase of analysis
APPENDIX E THE ASSESSMENT PHASE OF THE DATA LIFE CYCLE
APPENDIX E THE ASSESSMENT PHASE OF THE DATA LIFE CYCLE The assessment phase of the Data Life Cycle includes verification and validation of the survey data and assessment of quality of the data. Data verification
Risk Visualization: Presenting Data to Facilitate Better Risk Management
Track 4: New Dimensions in Financial Risk Management Risk Visualization: Presenting Data to Facilitate Better Risk Management 1:30pm 2:20pm Presenter: Jeffrey Bohn Senior Managing Director, Head of Portfolio
Data, Measurements, Features
Data, Measurements, Features Middle East Technical University Dep. of Computer Engineering 2009 compiled by V. Atalay What do you think of when someone says Data? We might abstract the idea that data are
Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012
Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization GENOME 560, Spring 2012 Data are interesting because they help us understand the world Genomics: Massive Amounts
Table of Contents Find the story within your data
Visualizations 101 Table of Contents Find the story within your data Introduction 2 Types of Visualizations 3 Static vs. Animated Charts 6 Drilldowns and Drillthroughs 6 About Logi Analytics 7 1 For centuries,
CE 504 Computational Hydrology Computational Environments and Tools Fritz R. Fiedler
CE 504 Computational Hydrology Computational Environments and Tools Fritz R. Fiedler 1) Operating systems a) Windows b) Unix and Linux c) Macintosh 2) Data manipulation tools a) Text Editors b) Spreadsheets
DATA WAREHOUSING AND OLAP TECHNOLOGY
DATA WAREHOUSING AND OLAP TECHNOLOGY Manya Sethi MCA Final Year Amity University, Uttar Pradesh Under Guidance of Ms. Shruti Nagpal Abstract DATA WAREHOUSING and Online Analytical Processing (OLAP) are
Tutorial for proteome data analysis using the Perseus software platform
Tutorial for proteome data analysis using the Perseus software platform Laboratory of Mass Spectrometry, LNBio, CNPEM Tutorial version 1.0, January 2014. Note: This tutorial was written based on the information
PRACTICAL DATA MINING IN A LARGE UTILITY COMPANY
QÜESTIIÓ, vol. 25, 3, p. 509-520, 2001 PRACTICAL DATA MINING IN A LARGE UTILITY COMPANY GEORGES HÉBRAIL We present in this paper the main applications of data mining techniques at Electricité de France,
Course 803401 DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Oman College of Management and Technology Course 803401 DSS Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization CS/MIS Department Information Sharing
INTRAFOCUS. DATA VISUALISATION An Intrafocus Guide
DATA VISUALISATION An Intrafocus Guide September 2011 Table of Contents What is Data Visualisation?... 2 Where is Data Visualisation Used?... 3 The Market View... 4 What Should You Look For?... 5 The Key
Introduction to Exploratory Data Analysis
Introduction to Exploratory Data Analysis A SpaceStat Software Tutorial Copyright 2013, BioMedware, Inc. (www.biomedware.com). All rights reserved. SpaceStat and BioMedware are trademarks of BioMedware,
WebFOCUS RStat. RStat. Predict the Future and Make Effective Decisions Today. WebFOCUS RStat
Information Builders enables agile information solutions with business intelligence (BI) and integration technologies. WebFOCUS the most widely utilized business intelligence platform connects to any enterprise
Theoretical Perspective
Preface Motivation Manufacturer of digital products become a driver of the world s economy. This claim is confirmed by the data of the European and the American stock markets. Digital products are distributed
Data Visualization Handbook
SAP Lumira Data Visualization Handbook www.saplumira.com 1 Table of Content 3 Introduction 20 Ranking 4 Know Your Purpose 23 Part-to-Whole 5 Know Your Data 25 Distribution 9 Crafting Your Message 29 Correlation
Geovisual Analytics Exploring and analyzing large spatial and multivariate data. Prof Mikael Jern & Civ IngTobias Åström. http://ncva.itn.liu.
Geovisual Analytics Exploring and analyzing large spatial and multivariate data Prof Mikael Jern & Civ IngTobias Åström http://ncva.itn.liu.se/ Agenda Introduction to a Geovisual Analytics Demo Explore
Iris Sample Data Set. Basic Visualization Techniques: Charts, Graphs and Maps. Summary Statistics. Frequency and Mode
Iris Sample Data Set Basic Visualization Techniques: Charts, Graphs and Maps CS598 Information Visualization Spring 2010 Many of the exploratory data techniques are illustrated with the Iris Plant data
cognitive work analysis.
Microsoft access as a modelling tool for cognitive work analysis Introduction A number of computer based modelling tools have been proposed and some developed for components of cognitive work analysis
What is Visualization? Information Visualization An Overview. Information Visualization. Definitions
What is Visualization? Information Visualization An Overview Jonathan I. Maletic, Ph.D. Computer Science Kent State University Visualize/Visualization: To form a mental image or vision of [some
Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Data Mining: Exploring Data. Lecture Notes for Chapter 3. Slides by Tan, Steinbach, Kumar adapted by Michael Hahsler
Data Mining: Exploring Data Lecture Notes for Chapter 3 Slides by Tan, Steinbach, Kumar adapted by Michael Hahsler Topics Exploratory Data Analysis Summary Statistics Visualization What is data exploration?
Create Cool Lumira Visualization Extensions with SAP Web IDE Dong Pan SAP PM and RIG Analytics Henry Kam Senior Product Manager, Developer Ecosystem
Create Cool Lumira Visualization Extensions with SAP Web IDE Dong Pan SAP PM and RIG Analytics Henry Kam Senior Product Manager, Developer Ecosystem 2015 SAP SE or an SAP affiliate company. All rights
SQL Server Administrator Introduction - 3 Days Objectives
SQL Server Administrator Introduction - 3 Days INTRODUCTION TO MICROSOFT SQL SERVER Exploring the components of SQL Server Identifying SQL Server administration tasks INSTALLING SQL SERVER Identifying
Getting Started With Mortgage MarketSmart
Getting Started With Mortgage MarketSmart We are excited that you are using Mortgage MarketSmart and hope that you will enjoy being one of its first users. This Getting Started guide is a work in progress,
COMP 150-04 Visualization. Lecture 11 Interacting with Visualizations
COMP 150-04 Visualization Lecture 11 Interacting with Visualizations Assignment 5: Maps Due Wednesday, March 17th Design a thematic map visualization Option 1: Choropleth Map Implementation in Processing
3D Interactive Information Visualization: Guidelines from experience and analysis of applications
3D Interactive Information Visualization: Guidelines from experience and analysis of applications Richard Brath Visible Decisions Inc., 200 Front St. W. #2203, Toronto, Canada, [email protected] 1. EXPERT
DICON: Visual Cluster Analysis in Support of Clinical Decision Intelligence
DICON: Visual Cluster Analysis in Support of Clinical Decision Intelligence Abstract David Gotz, PhD 1, Jimeng Sun, PhD 1, Nan Cao, MS 2, Shahram Ebadollahi, PhD 1 1 IBM T.J. Watson Research Center, New
Visualization Quick Guide
Visualization Quick Guide A best practice guide to help you find the right visualization for your data WHAT IS DOMO? Domo is a new form of business intelligence (BI) unlike anything before an executive
Introduction. A. Bellaachia Page: 1
Introduction 1. Objectives... 3 2. What is Data Mining?... 4 3. Knowledge Discovery Process... 5 4. KD Process Example... 7 5. Typical Data Mining Architecture... 8 6. Database vs. Data Mining... 9 7.
Customer Analytics. Turn Big Data into Big Value
Turn Big Data into Big Value All Your Data Integrated in Just One Place BIRT Analytics lets you capture the value of Big Data that speeds right by most enterprises. It analyzes massive volumes of data
PROGRAM DIRECTOR: Arthur O Connor Email Contact: URL : THE PROGRAM Careers in Data Analytics Admissions Criteria CURRICULUM Program Requirements
Data Analytics (MS) PROGRAM DIRECTOR: Arthur O Connor CUNY School of Professional Studies 101 West 31 st Street, 7 th Floor New York, NY 10001 Email Contact: Arthur O Connor, [email protected] URL:
The Masters of Science in Information Systems & Technology
The Masters of Science in Information Systems & Technology College of Engineering and Computer Science University of Michigan-Dearborn A Rackham School of Graduate Studies Program PH: 313-593-5361; FAX:
An Introduction to Point Pattern Analysis using CrimeStat
Introduction An Introduction to Point Pattern Analysis using CrimeStat Luc Anselin Spatial Analysis Laboratory Department of Agricultural and Consumer Economics University of Illinois, Urbana-Champaign
WebSphere Business Monitor
WebSphere Business Monitor Dashboards 2010 IBM Corporation This presentation should provide an overview of the dashboard widgets for use with WebSphere Business Monitor. WBPM_Monitor_Dashboards.ppt Page
Course Descriptions: Undergraduate/Graduate Certificate Program in Data Visualization and Analysis
9/3/2013 Course Descriptions: Undergraduate/Graduate Certificate Program in Data Visualization and Analysis Seton Hall University, South Orange, New Jersey http://www.shu.edu/go/dava Visualization and
Data Exploration and Preprocessing. Data Mining and Text Mining (UIC 583 @ Politecnico di Milano)
Data Exploration and Preprocessing Data Mining and Text Mining (UIC 583 @ Politecnico di Milano) References Jiawei Han and Micheline Kamber, "Data Mining: Concepts and Techniques", The Morgan Kaufmann
Intro to GIS Winter 2011. Data Visualization Part I
Intro to GIS Winter 2011 Data Visualization Part I Cartographer Code of Ethics Always have a straightforward agenda and have a defining purpose or goal for each map Always strive to know your audience
The Scientific Data Mining Process
Chapter 4 The Scientific Data Mining Process When I use a word, Humpty Dumpty said, in rather a scornful tone, it means just what I choose it to mean neither more nor less. Lewis Carroll [87, p. 214] In
NV301 Umoja Advanced BI Navigation. Umoja Advanced BI Navigation Version 5 1
NV301 Umoja Advanced BI Navigation Umoja Advanced BI Navigation Version 5 Last Copyright Modified: United 08-August-13 Nations 1 Agenda Course Introduction Module 1: Analysis Workspaces Module 2: Define
I Newsletter Q1 WELCOME TO OUR FIRST NEWSLETTER EDITION FOR 2015!
I Newsletter Q1 WELCOME TO OUR FIRST NEWSLETTER EDITION FOR 2015! It s been a while since our last newsletter, but don t worry -- it s not you, it s us! We ve had our heads down hard at work speeding up
Oracle Data Miner (Extension of SQL Developer 4.0)
An Oracle White Paper September 2013 Oracle Data Miner (Extension of SQL Developer 4.0) Integrate Oracle R Enterprise Mining Algorithms into a workflow using the SQL Query node Denny Wong Oracle Data Mining
Business Intelligence, Analytics & Reporting: Glossary of Terms
Business Intelligence, Analytics & Reporting: Glossary of Terms A B C D E F G H I J K L M N O P Q R S T U V W X Y Z Ad-hoc analytics Ad-hoc analytics is the process by which a user can create a new report
Environmental Remote Sensing GEOG 2021
Environmental Remote Sensing GEOG 2021 Lecture 4 Image classification 2 Purpose categorising data data abstraction / simplification data interpretation mapping for land cover mapping use land cover class
Data Analysis and Statistical Software Workshop. Ted Kasha, B.S. Kimberly Galt, Pharm.D., Ph.D.(c) May 14, 2009
Data Analysis and Statistical Software Workshop Ted Kasha, B.S. Kimberly Galt, Pharm.D., Ph.D.(c) May 14, 2009 Learning Objectives: Data analysis commonly used today Available data analysis software packages
Sanjeev Kumar. contribute
RESEARCH ISSUES IN DATAA MINING Sanjeev Kumar I.A.S.R.I., Library Avenue, Pusa, New Delhi-110012 [email protected] 1. Introduction The field of data mining and knowledgee discovery is emerging as a
Data Mining for Successful Healthcare Organizations
Data Mining for Successful Healthcare Organizations For successful healthcare organizations, it is important to empower the management and staff with data warehousing-based critical thinking and knowledge
Geostatistics Exploratory Analysis
Instituto Superior de Estatística e Gestão de Informação Universidade Nova de Lisboa Master of Science in Geospatial Technologies Geostatistics Exploratory Analysis Carlos Alberto Felgueiras [email protected]
<no narration for this slide>
1 2 The standard narration text is : After completing this lesson, you will be able to: < > SAP Visual Intelligence is our latest innovation
What you can do:...3 Data Entry:...3 Drillhole Sample Data:...5 Cross Sections and Level Plans...8 3D Visualization...11
What you can do:...3 Data Entry:...3 Drillhole Sample Data:...5 Cross Sections and Level Plans...8 3D Visualization...11 W elcome to North Face Software s software. With this software, you can accomplish
Data Visualization. or Graphical Data Presentation. Jerzy Stefanowski Instytut Informatyki
Data Visualization or Graphical Data Presentation Jerzy Stefanowski Instytut Informatyki Data mining for SE -- 2013 Ack. Inspirations are coming from: G.Piatetsky Schapiro lectures on KDD J.Han on Data
Multivariate Data Visualization
Multivariate Data Visualization VOTech/Universities of Leeds & Edinburgh Richard Holbrey Outline Focus on data exploration Joined VOTech in June, so...... informal introduction Examine some definitions
OPTIMIZATION OF DATABASE STRUCTURE FOR HYDROMETEOROLOGICAL MONITORING SYSTEM
OPTIMIZATION OF DATABASE STRUCTURE FOR HYDROMETEOROLOGICAL MONITORING SYSTEM Ph.D. Robert SZCZEPANEK Cracow University of Technology Institute of Water Engineering and Water Management ul.warszawska 24,
Impact of Data and Task Characteristics on Design of Spatio-Temporal Data Visualization Tools
Exploring Geovisualization J. Dykes, A.M. MacEachren, M.-J. Kraak (Editors) q 2005 Elsevier Ltd. All rights reserved. preprint : November 2004 - do not redistribute. Chapter 10 Impact of Data and Task
Visual Data Mining. Motivation. Why Visual Data Mining. Integration of visualization and data mining : Chidroop Madhavarapu CSE 591:Visual Analytics
Motivation Visual Data Mining Visualization for Data Mining Huge amounts of information Limited display capacity of output devices Chidroop Madhavarapu CSE 591:Visual Analytics Visual Data Mining (VDM)
GeoGebra Statistics and Probability
GeoGebra Statistics and Probability Project Maths Development Team 2013 www.projectmaths.ie Page 1 of 24 Index Activity Topic Page 1 Introduction GeoGebra Statistics 3 2 To calculate the Sum, Mean, Count,
IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH
IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH Kalinka Mihaylova Kaloyanova St. Kliment Ohridski University of Sofia, Faculty of Mathematics and Informatics Sofia 1164, Bulgaria
Introduction to Data Mining
Introduction to Data Mining Jay Urbain Credits: Nazli Goharian & David Grossman @ IIT Outline Introduction Data Pre-processing Data Mining Algorithms Naïve Bayes Decision Tree Neural Network Association
Exploratory Data Analysis
Exploratory Data Analysis Johannes Schauer [email protected] Institute of Statistics Graz University of Technology Steyrergasse 17/IV, 8010 Graz www.statistics.tugraz.at February 12, 2008 Introduction
EXPLORING SPATIAL PATTERNS IN YOUR DATA
EXPLORING SPATIAL PATTERNS IN YOUR DATA OBJECTIVES Learn how to examine your data using the Geostatistical Analysis tools in ArcMap. Learn how to use descriptive statistics in ArcMap and Geoda to analyze
Database Marketing, Business Intelligence and Knowledge Discovery
Database Marketing, Business Intelligence and Knowledge Discovery Note: Using material from Tan / Steinbach / Kumar (2005) Introduction to Data Mining,, Addison Wesley; and Cios / Pedrycz / Swiniarski
Chapter 5. Warehousing, Data Acquisition, Data. Visualization
Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization 5-1 Learning Objectives
ROLAP with Column Store Index Deep Dive. Alexei Khalyako SQL CAT Program Manager [email protected]
ROLAP with Column Store Index Deep Dive Alexei Khalyako SQL CAT Program Manager [email protected] What are we doing now? 1000 of concurrent users 4TB Cube High speed loading Current design 14 th Jan
