Visualizing multivariate data with clustering and heatmaps. Reija Autio School of Health Sciences University of Tampere
1 Visualizing multivariate data with clustering and heatmaps Reija Autio School of Health Sciences University of Tampere
2 Visualization in R: R is powerful software for visualization. The limitation is often between the computer and the chair ;) More advanced illustration options require R packages. Some packages require installing helper software, but that is not the case with clustering and heatmaps.
3 Visualization in R: Publication-quality graphics. Completely programmable and reproducible. Several packages are available. The resulting figures can be viewed as on-screen graphics and saved in formats such as PostScript, PDF, SVG, JPEG, PNG, and TIFF.
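As an illustration of saving a figure to a file, here is a minimal sketch using base R graphics devices. The file name and the temporary directory are assumptions made only for this example.

```r
# Write a scatter plot of the built-in mtcars data to a PDF file.
# The output path is a temporary location chosen only for illustration.
out_file <- file.path(tempdir(), "mtcars_scatter.pdf")

pdf(out_file, width = 6, height = 5)   # open a PDF graphics device
plot(mtcars$wt, mtcars$mpg,
     xlab = "Weight (1000 lbs)", ylab = "Miles per gallon",
     main = "mtcars: weight vs. fuel economy")
dev.off()                              # close the device so the file is written

# Other base devices follow the same open / plot / close pattern,
# for example png(), jpeg(), tiff(), svg(), and postscript().
```

Because the figure is produced by a script, rerunning the script reproduces exactly the same file.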
4 Graphic environments: Low-level: R base graphics (bar plots, scatter plots, line plots, pie charts, boxplots, etc.) and grid. High-level: lattice and ggplot2. In this presentation the focus is on ggplot2.
5 Clustering: Clustering is the classification/partitioning of data objects into groups (clusters) based on their similarity. This similarity is computed according to a distance between variables. Clustering is used in many fields, such as data mining, machine learning, pattern recognition, image analysis, genomics, and systems biology. In machine learning, clustering is defined as a form of unsupervised learning.
6 Why cluster? [Figure: the data before clustering (Pre-Clustering) and after clustering (Post-Clustering)] Clustering data efficiently reveals trends and similarities between the variables. Several clustering methods are available. Clustering is a standard data analysis method in many fields.
7 Types of clustering: Today we focus on hierarchical clustering.
8 Distance methods: There are several distance methods that can be used in clustering: Euclidean distance; binary distance; city-block (Manhattan) distance; and correlation-based distances of the form 1 - r, where r is the Pearson or Spearman correlation.
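The distance methods listed here can be computed in base R roughly as follows. This is a sketch on the built-in mtcars data; the choice of rows and the variable names are assumptions made for illustration.

```r
# Five cars (rows), all variables standardized so that no single
# variable dominates the distances.
x <- scale(mtcars)[1:5, ]

d_euclid    <- dist(x, method = "euclidean")   # the default method of dist()
d_manhattan <- dist(x, method = "manhattan")   # city-block distance

# Binary distance is meant for 0/1 (absent/present) data.
b <- rbind(a = c(1, 0, 1, 1),
           b = c(1, 1, 0, 1),
           c = c(0, 0, 1, 0))
d_binary <- dist(b, method = "binary")

# Correlation-based distances: 1 - r between rows.
# cor() correlates columns, so the matrix is transposed first.
d_pearson  <- as.dist(1 - cor(t(x), method = "pearson"))
d_spearman <- as.dist(1 - cor(t(x), method = "spearman"))

round(as.matrix(d_euclid), 2)
round(as.matrix(d_pearson), 2)
```

All of these are objects of class "dist" and can be passed directly to hclust().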
9 Cluster linkage: single linkage, complete linkage, average linkage.
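A small sketch of how the three linkage criteria can be selected in hclust() and compared. The data set (mtcars) and the choice of three groups are assumptions made for illustration.

```r
# Euclidean distances between cars, computed on standardized variables.
d <- dist(scale(mtcars))

hc_single   <- hclust(d, method = "single")    # nearest members of two clusters
hc_complete <- hclust(d, method = "complete")  # farthest members of two clusters
hc_average  <- hclust(d, method = "average")   # mean of all pairwise distances

# Cut each tree into three groups and compare the group memberships.
groups <- data.frame(
  single   = cutree(hc_single,   k = 3),
  complete = cutree(hc_complete, k = 3),
  average  = cutree(hc_average,  k = 3)
)

# Cross-tabulate two of the solutions to see where they disagree.
table(single = groups$single, complete = groups$complete)
```

The same distance matrix can give quite different trees depending on the linkage, so it is worth checking more than one.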
10 Hierarchical clustering: Hierarchical clustering (HC) is a straightforward method to illustrate the groupings within the data. HC can be used for different types of data. Examples: car data, gene data.
11 Car data: This is the example data set mtcars. Euclidean distance and complete linkage are used in the clustering (the defaults in R).
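A dendrogram of this kind can be reproduced with a few lines of base R. This is a sketch; the plot title is an assumption, and the original figure may have been drawn with different options.

```r
# Distances between cars (rows of mtcars); dist() uses Euclidean distance by default.
d <- dist(mtcars)

# Hierarchical clustering; hclust() uses complete linkage by default.
hc <- hclust(d)

# Draw the dendrogram.
plot(hc, main = "mtcars: Euclidean distance, complete linkage",
     xlab = "", sub = "")
```

Note that the variables are not standardized here, so variables with large numeric ranges (such as disp and hp) dominate the Euclidean distances.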
12 Gene data: Data from the article Tuomela et al. (2013), "Gene Expression Profiling of Immune-Competent Human Cells Exposed to Engineered Zinc Oxide or Titanium Dioxide Nanoparticles", PLoS ONE 8(7): e68415, 22 July 2013. Human Jurkat samples exposed to nanoparticles, clustered with Pearson correlation distance and with Euclidean distance.
13 Hierarchical clustering: HC works iteratively. (1) Identify the clusters or variables with the shortest distance. (2) Group them into a new cluster. (3) Compute the distances between the new cluster and the remaining clusters/variables (the new cluster is now treated as a variable). (4) Return to step 1. Iterate until all the clusters are joined into one big cluster.
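The iteration can be sketched as a deliberately naive R function. This is an illustration of the steps above, not how hclust() is implemented internally; the function name and the use of complete linkage are assumptions made for the example.

```r
# Naive agglomerative clustering with complete linkage.
# Returns the distance at which each merge happens, in merge order.
naive_hc_heights <- function(x) {
  d <- as.matrix(dist(x))                  # pairwise Euclidean distances
  clusters <- as.list(seq_len(nrow(d)))    # start: every object is its own cluster
  heights <- numeric(0)
  while (length(clusters) > 1) {
    best <- c(NA, NA)
    best_d <- Inf
    # Step 1: find the pair of clusters with the shortest distance.
    for (i in seq_len(length(clusters) - 1)) {
      for (j in (i + 1):length(clusters)) {
        # Complete linkage: cluster distance = largest pairwise distance.
        dij <- max(d[clusters[[i]], clusters[[j]]])
        if (dij < best_d) {
          best_d <- dij
          best <- c(i, j)
        }
      }
    }
    # Step 2: group the two closest clusters into a new cluster.
    merged <- c(clusters[[best[1]]], clusters[[best[2]]])
    clusters <- c(clusters[-best], list(merged))
    heights <- c(heights, best_d)
    # Steps 3 and 4: distances to the new cluster are recomputed on the next pass.
  }
  heights
}

x <- scale(mtcars[1:10, c("mpg", "hp", "wt", "qsec")])
naive_hc_heights(x)
hclust(dist(x), method = "complete")$height   # merge heights from hclust(), for comparison
```

The loop runs until a single cluster remains, so n objects always produce n - 1 merges.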
14 Clustering agglomerative (step by step): At each step (labeled 0.2, 0.3, 0.4, and 0.6 in the figure), join the two closest into a cluster. When all the nodes are in one cluster, stop: the clustering is ready.
15 Heatmaps: There are many colormaps to choose from. You can also create your own colormap:
# creates a custom color palette from red through yellow to green
my_palette <- colorRampPalette(c("red", "yellow", "green"))(n = 299)
16 Heatmaps mtcars
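A heatmap of mtcars with a custom palette can be drawn with the base heatmap() function, as in the sketch below. The column scaling and the margins are assumptions made for illustration; the original figure may have used other settings.

```r
# Custom palette from red through yellow to green, 299 colors.
my_palette <- colorRampPalette(c("red", "yellow", "green"))(n = 299)

# heatmap() needs a numeric matrix, not a data frame.
m <- as.matrix(mtcars)

# Rows and columns are reordered by hierarchical clustering
# (Euclidean distance and complete linkage by default).
hm <- heatmap(m,
              scale = "column",     # standardize each variable before coloring
              col = my_palette,
              margins = c(5, 8))

# The returned list holds the row and column order used in the plot.
rownames(m)[hm$rowInd][1:5]
```

Without scaling, the color range would be taken up by the variables with the largest values, and the rest would look uniform.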
17 Clustering
18 Parameters change the resulting heatmap a lot: scaling rows vs. scaling columns.
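The difference between the two scalings can be made explicit with scale(), as in this sketch on mtcars. The variable names are assumptions made for illustration.

```r
m <- as.matrix(mtcars)

# Column scaling: each variable gets mean 0 and standard deviation 1 across cars.
col_scaled <- scale(m)

# Row scaling: each car gets mean 0 and standard deviation 1 across its variables.
# scale() works on columns, so the matrix is transposed before and after.
row_scaled <- t(scale(t(m)))

# The same choice is available directly in heatmap() through the scale argument.
heatmap(m, scale = "column", main = "Scaled by column")
heatmap(m, scale = "row",    main = "Scaled by row")
```

For mtcars, column scaling is usually the more meaningful choice because the variables are measured in different units.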
19 heatmap.2: The heatmap.2 function includes more than 40 visible arguments that can be used to tune the resulting figure. These arguments cover scaling, selecting the clustering method, labeling, showing density info, handling missing values, etc.
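A few of these arguments are shown in the sketch below. It assumes that the gplots package, which provides heatmap.2, is installed; the helper cor_dist and the specific argument values are illustrative choices, not settings taken from the slides.

```r
m <- as.matrix(mtcars)
my_palette <- colorRampPalette(c("red", "yellow", "green"))(n = 299)

# Hypothetical helper: correlation-based distance (1 - Pearson r) between rows.
cor_dist <- function(x) as.dist(1 - cor(t(x)))

# Run the plot only if gplots is available.
if (requireNamespace("gplots", quietly = TRUE)) {
  gplots::heatmap.2(m,
    scale        = "row",              # standardize each row before coloring
    distfun      = cor_dist,           # distance used for both dendrograms
    hclustfun    = function(d) hclust(d, method = "complete"),  # linkage
    col          = my_palette,         # custom colormap
    trace        = "none",             # no trace lines inside the cells
    density.info = "none",             # no density plot in the color key
    na.rm        = TRUE,               # ignore missing values when scaling
    margins      = c(6, 8))            # room for column and row labels
}
```

Swapping distfun for the default (Euclidean distance) while keeping everything else fixed is a quick way to see how much the distance method alone changes the picture.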
20 Left: correlation distance; right: Euclidean distance. Colormap swapped; both standardized by rows; complete linkage.
21 Thank you for your attention! Further reading: Kaufman, L., & Rousseeuw, P. J. (2005). Finding Groups in Data. An Introduction to Cluster Analysis (p. 342). John Wiley & Sons Inc. Maechler, M. (2013). Cluster Analysis Extended Rousseeuw et al. CRAN.