Agenda. R Does Pivot Tables Sparklines (Edward Tufte) Misc. Graphics Questions Jim Holtman

1 Agenda
- 10-Minute Workshop: R
- R Does Pivot Tables
- Sparklines (Edward Tufte)
- Misc. Graphics
- Questions
Jim Holtman

2 What is R?
- Open source language and environment for statistical processing
- Based on the S language developed at Bell Labs by John Chambers in the early 1980s
- John won the 1998 ACM Software System Award for the development of the S language
- I have been using it for the last 25 years
- Over 85 books available on R/S-Plus
- Many recent statistics graduates have a background in R
- Many new statistical procedures use R as their infrastructure
- Gene research makes heavy use of R
- Google "nyt r" for the recent New York Times article on R, "Data Analysts Captivated by R's Power"

3 What is R?
- Object oriented: everything in R is an object
- R is written in R; base routines are written in FORTRAN/C
- Interpreted, but fast
- Functional language
- Interactive development of scripts: cut/paste from a text editor to develop an R script
- Batch processing, similar to UNIX shell files
- Complete programming environment with a learning curve similar to C/Java

4 What is R?
- External interfaces
  - Text files (.csv, .txt, ...)
  - Relational databases (ODBC): Oracle, Informix, DB2, MySQL, Access, ...
  - Statistical systems: SAS, SPSS, Stata, Systat, Minitab, ...
- High quality output
  - Sweave: generates LaTeX output that combines text, computations and graphics

5 What is R?
- Where to find it: Google "r"
- Download binaries for Windows, MacOS & Linux
- Compile from source; a complete build system is provided
- Basic documentation is delivered with the package
- The 100-page "Introduction to R" provides the basic information that you need to start developing R programs
- On-line help has examples of each command that you can run to see how they work

6 Generate & Summarize 1M Random Numbers
[Figure: "Histogram of x"; x-axis: x, y-axis: Frequency]
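The slide's script is not preserved in the transcript. A minimal sketch of what such a session might look like, assuming the numbers are drawn from a standard normal distribution (the seed and the choice of `rnorm` are assumptions):

```r
# Generate one million random numbers and summarize them.
set.seed(42)                 # hypothetical seed, only for reproducibility
x <- rnorm(1e6)              # assumption: standard normal draws

print(summary(x))            # min, quartiles, mean, max
h <- hist(x, plot = FALSE)   # compute the histogram bins without drawing
# hist(x)                    # run this interactively to draw "Histogram of x"
```

Calling `hist(x)` at the console draws the plot with the default title "Histogram of x", which matches the figure title on the slide.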

7 R Objects
The basic object is a vector. This can hold zero or more values of the same type, e.g.:
- character: vector of character strings of varying lengths
- numeric: real (floating point) numbers
- integer: signed integers (typically 32 bits)
- logical: TRUE/FALSE
- complex: complex numbers
- list: a vector of R objects (similar to struct in C/Java)
- matrix: 2-dimensional array of objects of the same type
- array: n-dimensional array of objects
- data frame: 2-dimensional object, where each column can be a different type (think of an Excel spreadsheet)
- POSIXct: date/time class
- user defined: create your own objects and methods to work on them
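To make the list above concrete, here is a small sketch that creates one object of each kind. All names and values are made up for illustration; the `job` class and its print method are hypothetical.

```r
chr <- c("a", "bcd", "efghij")                     # character vector
num <- c(1.5, 2.25, 3)                             # numeric (floating point)
int <- 1:3                                         # integer
lgl <- c(TRUE, FALSE, TRUE)                        # logical
cpx <- complex(real = 1, imaginary = 2)            # complex
lst <- list(name = "cpu", values = num)            # list: elements of mixed types
mat <- matrix(1:6, nrow = 2)                       # 2 x 3 matrix
arr <- array(1:24, dim = c(2, 3, 4))               # 3-dimensional array
df  <- data.frame(id = int, label = chr, ok = lgl) # columns of different types
when <- as.POSIXct("2008-06-01 09:30:00", tz = "UTC")  # date/time (arbitrary value)

# User-defined (S3) class with its own print method; "job" is a hypothetical class.
job <- structure(list(name = "JOB1", cpu = 0.5), class = "job")
print.job <- function(x, ...) cat("job", x$name, "used", x$cpu, "CPU hours\n")

str(df)      # compact description of the data frame
print(job)   # dispatches to print.job
```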

8 R Language
R is a complete programming environment with a number of operators, control structures and functions.
- Operators: +, -, *, /, <, <=, >, >=, ==, !=, &, |, ... and more
- Control structures:
  - if (condition) true else false
  - for (variable in sequence) statement
  - while (condition) statement
  - repeat statement
  - break
- Creating a function: func <- function(parameters) statement
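A short sketch exercising each of the constructs listed above. The function `classify` and its thresholds are hypothetical, chosen only to illustrate the syntax.

```r
# classify() is a hypothetical helper: label a CPU utilization percentage.
classify <- function(util) {
  if (util >= 90) "high" else if (util >= 50) "medium" else "low"
}

# for: accumulate a total over a sequence
total <- 0
for (v in c(10, 20, 30)) total <- total + v

# while: find the smallest n with 2^n >= 1000
n <- 0
while (2^n < 1000) n <- n + 1

# repeat/break: loop until an explicit exit condition is met
k <- 0
repeat {
  k <- k + 1
  if (k >= 5) break
}

print(c(total = total, n = n, k = k))
print(classify(95))
```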

9 Vectors
- Vectors are one-dimensional objects with each element being the same type
- Operations are carried out on all elements of the vector
- The power of R lies in its ability to perform vectorized operations; you do not have to code an explicit for loop
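As an illustration of the point about vectorized operations, the sketch below converts CPU seconds to hours both ways. The data values are invented.

```r
cpu_sec <- c(120, 3600, 45, 7200)   # hypothetical CPU seconds per job

# Vectorized: one expression operates on every element
cpu_hrs <- cpu_sec / 3600
is_long <- cpu_hrs > 0.5            # logical vector, one value per job
long_total <- sum(cpu_hrs[is_long]) # logical indexing selects the long jobs

# Equivalent explicit loop, shown only for comparison
cpu_hrs_loop <- numeric(length(cpu_sec))
for (i in seq_along(cpu_sec)) {
  cpu_hrs_loop[i] <- cpu_sec[i] / 3600
}
```

Both forms give the same result; the vectorized form is shorter and usually much faster on large vectors.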

10 Questions on R?
- See me at the break for any follow-up
- If you have some data that you would like to see processed in R, please feel free to send it to me and I will send back a quick script that does some basic analysis
- Attend my workshop at CMG

11 Pivot Tables & More
- John Van Wagenen's CMG 2008 paper: "Pivot Tables/Charts Magic Beans Without Living in a Fairy Tale"
- Pivot tables are a nice way to slice/dice/aggregate data
- I had been doing similar things in R, so it motivated me to write a paper on another way to get the same information
- I have used his data to illustrate how to do these techniques in R
- Now walk through some examples
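The scripts from the talk are not part of this transcript. As a rough sketch of the idea, a pivot-table style summary can be produced in base R with `xtabs` or `tapply`. The data frame and its column names below are invented and are not the data from the paper.

```r
# Hypothetical job records; column names are assumptions.
jobs <- data.frame(
  system = c("SYSA", "SYSA", "SYSB", "SYSB", "SYSA", "SYSB"),
  shift  = c("PRIME", "WEEKEND", "PRIME", "PRIME", "PRIME", "WEEKEND"),
  cpu    = c(10, 5, 20, 30, 15, 25)
)

# Rows = system, columns = shift, cells = sum of cpu
pivot <- xtabs(cpu ~ system + shift, data = jobs)
print(addmargins(pivot))   # adds row and column totals, like a pivot table

# Same layout with a different aggregate (mean instead of sum)
avg <- tapply(jobs$cpu, list(jobs$system, jobs$shift), mean)
print(avg)
```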

12 Excel Spreadsheet
[Figure: Excel spreadsheet, and the CSV file exported from it (10,696 data lines)]

13 Excel Pivot Table Generated from the Data
Read John's paper for the procedure for generating the pivot table in Excel.

15 This is what the data objects in R look like

16 Casting New Data
From the same melted data, I can create a daily summary and add an indicator for PRIME time.
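The slide's code (which presumably used melt/cast from the reshape package cited in the references) is not preserved. The sketch below shows a comparable daily summary in base R instead, using `aggregate` and `tapply`. The sample data and the definition of PRIME time as 08:00 to 17:00 are assumptions.

```r
# Hypothetical samples: a timestamp plus a measured value
obs <- data.frame(
  time  = as.POSIXct(c("2008-03-03 07:30", "2008-03-03 10:00", "2008-03-03 15:00",
                       "2008-03-04 09:00", "2008-03-04 20:00"), tz = "UTC"),
  value = c(10, 40, 60, 30, 5)
)

obs$day <- as.Date(obs$time)                    # calendar day of each sample
hour <- as.integer(format(obs$time, "%H"))      # hour of day, 0-23
# Assumption: PRIME time is 08:00 up to (not including) 17:00
obs$prime <- ifelse(hour >= 8 & hour < 17, "PRIME", "NONPRIME")

# Long form: one row per day/prime combination
daily <- aggregate(value ~ day + prime, data = obs, FUN = sum)
print(daily)

# Wide form: days down the side, PRIME/NONPRIME across the top
wide <- tapply(obs$value, list(as.character(obs$day), obs$prime), sum)
print(wide)
```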

17 Excel Spreadsheet (24,560 data points)
[Figure: Excel spreadsheet, pivot table and chart]

18 R Script
0.6 seconds to read in 24,560 lines of data, summarize by shift and create the pie chart.
[Figure: pie chart "Breakdown by Shifts" with slices WEEKEND, HOLIDAY, PERIOD2, PERIOD3, PRIME]
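The original script is not in the transcript. A minimal sketch of the same three steps (read, summarize by shift, draw a pie chart) with timing might look like the following. The CSV contents and the column names `shift` and `cpu.sec` are invented so the example is self-contained.

```r
# Write a small hypothetical CSV file to read back in
csv_file <- tempfile(fileext = ".csv")
write.csv(data.frame(
  shift   = c("PRIME", "PRIME", "WEEKEND", "HOLIDAY", "PERIOD2", "PERIOD3"),
  cpu.sec = c(400, 200, 150, 50, 120, 80)
), csv_file, row.names = FALSE)

pdf_file <- tempfile(fileext = ".pdf")

# Time the whole read / summarize / plot sequence
elapsed <- system.time({
  batch    <- read.csv(csv_file)
  by_shift <- tapply(batch$cpu.sec, batch$shift, sum)  # total CPU per shift
  pdf(pdf_file)                                        # draw into a PDF file
  pie(by_shift, main = "Breakdown by Shifts")
  dev.off()
})

print(by_shift)
print(elapsed["elapsed"])
```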

19 batch Data Object in R

20 EDA on the batch Data
[Figure: three histograms (y-axis: Frequency) of batch$cpu.hrs, batch$cpu.hrs[batch$cpu.hrs < 0.03] and batch$cpu.hrs[batch$cpu.hrs < 0.005]]
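The histogram titles on this slide show the expressions that were plotted. A sketch of that zoom-in sequence, with simulated data standing in for the real `batch` object (the exponential distribution is an assumption used only to get a skewed sample):

```r
set.seed(1)
# Hypothetical stand-in for the batch data: heavily skewed CPU hours
batch <- data.frame(cpu.hrs = rexp(5000, rate = 200))

pdf_file <- tempfile(fileext = ".pdf")
pdf(pdf_file, width = 9, height = 4)
par(mfrow = c(1, 3))                           # three panels side by side
hist(batch$cpu.hrs)                            # everything
hist(batch$cpu.hrs[batch$cpu.hrs < 0.03])      # zoom in on the small jobs
hist(batch$cpu.hrs[batch$cpu.hrs < 0.005])     # zoom in further
dev.off()
```

Subsetting with a logical condition inside the call is what makes this kind of interactive drilling down quick; each default title is the expression passed to `hist`.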

21 Summarize by Prod & Dev (3rd character)
[Figure: Excel spreadsheet, pivot table, and chart from the pivot table showing cpu seconds by month from 5/1/2007 to 6/1/2008 for DEV and PROD]

22 Summarize by Prod & Dev Using R
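The R code shown on this slide did not survive transcription. One way to sketch the idea in base R is to take the third character of the job name with `substr` and summarize by month. The job names, and the assumption that "P" means PROD and "D" means DEV, are hypothetical.

```r
# Hypothetical batch records
batch <- data.frame(
  job     = c("ABP001", "ABD002", "XYP003", "XYD004", "ABP005"),
  date    = as.Date(c("2007-05-03", "2007-05-10", "2007-06-01",
                      "2007-06-15", "2007-06-20")),
  cpu.sec = c(100, 20, 300, 40, 50)
)

# Assumption: 3rd character of the job name is "P" for PROD, "D" for DEV
code <- substr(batch$job, 3, 3)
batch$env   <- ifelse(code == "P", "PROD", ifelse(code == "D", "DEV", NA))
batch$month <- format(batch$date, "%Y-%m")

# Rows = month, columns = DEV/PROD, cells = total CPU seconds
totals <- tapply(batch$cpu.sec, list(batch$month, batch$env), sum)
print(totals)
```

A matrix like `totals` can be passed to `barplot(t(totals), beside = TRUE, legend.text = TRUE)` to get a grouped bar chart per month.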

23 Chart from R
[Figure: Total CPU Seconds for DEV and PROD]

24 Pivot Table Summary
- R & Excel (and other products) can produce summaries that are equivalent to pivot tables
- In R it is easy to automate the scripts, run through a set of files and quickly produce output in various formats: PDF, PNG for web pages, WMF for inclusion in Word/PowerPoint documents, ...
- The interactive nature of R makes it easy to do EDA (exploratory data analysis) on your data
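A sketch of the automation point: loop over a directory of CSV files and write one chart per file. The directory, file names and columns are invented; the example creates its own input files so it can be run as-is.

```r
# Create a scratch directory with two hypothetical input files
work_dir <- tempfile("batch_")
dir.create(work_dir)
for (sys in c("sysA", "sysB")) {
  write.csv(data.frame(hour = 0:23, cpu = (0:23) * 2),
            file.path(work_dir, paste0(sys, ".csv")), row.names = FALSE)
}

# Process every CSV file found and write a PDF chart next to it
csv_files <- list.files(work_dir, pattern = "\\.csv$", full.names = TRUE)
for (f in csv_files) {
  dat <- read.csv(f)
  out <- sub("\\.csv$", ".pdf", f)
  pdf(out)
  plot(dat$hour, dat$cpu, type = "l", main = basename(f),
       xlab = "hour", ylab = "cpu")
  dev.off()
}

pdf_files <- list.files(work_dir, pattern = "\\.pdf$")
print(pdf_files)
```

Swapping `pdf()` for `png()` gives images for web pages; on Windows, `win.metafile()` produces WMF output. Which devices are available depends on how R was built and the platform.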

25 Sparklines
- Invented by Edward Tufte, a well-known expert on data visualization
- ... for more examples

26 Sparklines from vmstat data
- A script on production systems logs the vmstat data to a file every 30 seconds
- This is used to create the daily and monthly utilization charts for a system
- Data used to create sparklines of 19 variables in the log file
[Figure: vmstat log file]
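The sparkline code from the talk is not included in the transcript. A minimal base-graphics sketch of the technique is a stack of small, axis-free line plots, one per variable. The function name `sparklines`, the three column names and the simulated values are all assumptions.

```r
set.seed(7)
n <- 120   # one hour of 30-second samples
# Hypothetical subset of vmstat columns; a real log has many more
vm <- data.frame(us = runif(n, 0, 60), sy = runif(n, 0, 20), id = runif(n, 20, 100))

# sparklines() is a hypothetical helper: one tiny line plot per column
sparklines <- function(df, file) {
  pdf(file, width = 6, height = 0.8 * ncol(df))
  on.exit(dev.off())
  par(mfrow = c(ncol(df), 1), mar = c(0.3, 4, 0.3, 4))
  for (nm in names(df)) {
    y <- df[[nm]]
    plot(y, type = "l", axes = FALSE, xlab = "", ylab = "")   # no axes or box
    points(which.max(y), max(y), pch = 19, cex = 0.5, col = "red")  # mark the peak
    mtext(nm, side = 2, las = 1, line = 0.5, cex = 0.7)       # name on the left
    mtext(sprintf("%.0f", y[length(y)]), side = 4, las = 1,   # last value on the right
          line = 0.5, cex = 0.7)
  }
  invisible(file)
}

out <- sparklines(vm, tempfile(fileext = ".pdf"))
```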

28 Monthly Data
- I have used levelplot to show 3D data: day of the month on the y-axis, time of day on the x-axis, and color to represent the value of the z-axis, which is the CPU utilization
- Sparklines for the month's performance of the system were plotted next to the levelplot for comparison
- Both presentation methods allow you to look for patterns. Which do you find easier to see patterns in?
- Sparklines would make an interesting presentation of yearly data. The example just duplicates the monthly data to provide an idea of what it might look like
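As a sketch of the day-by-time layout described above, the example below builds an hour-by-day matrix with `tapply` and draws it with base R's `image()`. This is a substitute for the lattice `levelplot` used in the talk, chosen so the example needs no packages; the simulated utilization values are assumptions.

```r
set.seed(3)
# Hypothetical hourly CPU utilization for a 30-day month
util <- expand.grid(hour = 0:23, day = 1:30)
busy <- util$hour >= 8 & util$hour < 17           # assumed busy period
util$cpu <- pmin(100, 30 + 40 * busy + runif(nrow(util), 0, 20))

# Matrix with time of day in rows (x) and day of month in columns (y)
z <- tapply(util$cpu, list(util$hour, util$day), mean)

out <- tempfile(fileext = ".pdf")
pdf(out)
image(x = 0:23, y = 1:30, z = z, col = heat.colors(20),
      xlab = "time of day (hour)", ylab = "day of month",
      main = "CPU utilization")
dev.off()
```

With the lattice package loaded, a call along the lines of `levelplot(cpu ~ hour * day, data = util)` should give the equivalent plot with a color key.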

29 levelplot and sparklines of the same monthly utilization data.

31 Transaction Data
- Consolidated ~79K transactions into 10 transaction groups and 10 user pools to make the reports easier to read
- Data has the user, transaction name, start and end time; response time was calculated
- Look at this data with some stacked bar charts and mosaic plots
[Figure: Pivot Table of User/Transaction Counts]
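A small sketch of the two steps mentioned above, computing response time from start and end times and tabulating counts by user and transaction. The six records and their column names are invented.

```r
# Hypothetical transaction log
trans <- data.frame(
  user  = c("User.01", "User.01", "User.02", "User.02", "User.02", "User.03"),
  tran  = c("Tran.01", "Tran.02", "Tran.01", "Tran.01", "Tran.03", "Tran.02"),
  start = as.POSIXct(c("2008-10-01 09:00:00", "2008-10-01 09:00:05",
                       "2008-10-01 09:01:00", "2008-10-01 09:02:00",
                       "2008-10-01 09:02:30", "2008-10-01 09:03:00"), tz = "UTC")
)
trans$end <- trans$start + c(1.5, 0.5, 2, 1, 4, 0.5)   # end = start + seconds

# Response time in seconds
trans$response <- as.numeric(difftime(trans$end, trans$start, units = "secs"))

# Pivot table of counts: users down the side, transactions across the top
counts <- table(user = trans$user, tran = trans$tran)
print(counts)

# Mean response time per transaction
mean_resp <- tapply(trans$response, trans$tran, mean)
print(mean_resp)
```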

32 Stacked Bar Chart of Transaction Count by User
[Figure: stacked bars, one per user (User.01 to User.10); x-axis: User, y-axis: Total Transactions; legend: Tran.01 to Tran.10]

33 Stacked Bar Chart/Mosaic Chart
- The stacked bar chart lets you see who the busy users are in terms of number of transactions
- A mosaic chart shows the same data, but the area of the boxes is proportional to the counts
- The y-axis range is the same for all data elements
- It is sometimes easier to see the ratios (mix) between the use of transactions for a user; a different mix may denote a different role for that user
- User.06 (lowest count) has a higher ratio of Trans.06, Trans.09 and Trans.10 than User.08 (highest count)
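To illustrate the comparison, the sketch below draws both chart types from the same count matrix using base R's `barplot` and `mosaicplot`. The counts are invented and only three users and three transactions are shown.

```r
# Hypothetical counts: rows = users, columns = transactions
counts <- matrix(c( 50, 30, 20,
                    10, 10, 10,
                   200, 40, 10),
                 nrow = 3, byrow = TRUE,
                 dimnames = list(user = c("User.01", "User.02", "User.03"),
                                 tran = c("Tran.01", "Tran.02", "Tran.03")))

# Per-user transaction mix: each row sums to 1
mix <- prop.table(counts, margin = 1)
print(round(mix, 2))

out <- tempfile(fileext = ".pdf")
pdf(out, width = 10, height = 5)
par(mfrow = c(1, 2))
# barplot stacks the rows of its matrix, so transpose to get one bar per user
barplot(t(counts), legend.text = TRUE,
        main = "Transaction Count by User", ylab = "Total Transactions")
# mosaic: box areas are proportional to the counts
mosaicplot(counts, color = TRUE, main = "Transactions by User")
dev.off()
```

In this made-up data, User.02 has the fewest transactions but a larger share of Tran.03 than User.03, which is the kind of difference in mix that the mosaic plot makes visible.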

34 Mosaic Plot of the Number of Transactions by User - Area Proportional to Count
[Figure: mosaic plot; x-axis: User (User.01 to User.10), y-axis: Tran (Trans.01 to Trans.10)]

35 References
[1] J. Van Wagenen, "Pivot Tables/Charts Magic Beans Without Living in a Fairy Tale", CMG 2008
[2] R. Kaminski, "Automating Process Pathology Detection Rule Engine Design Hints", CMG 2008
[3] R Development Core Team, "R: A Language and Environment for Statistical Computing"
[4] J. Holtman, "Using R for System Performance Analysis", CMG 2004
[5] J. Holtman, "Visualization Techniques for Analyzing Patterns in System Performance Data", CMG 2005
[6] N. J. Gunther, "Guerrilla Capacity Planning", Springer-Verlag, Heidelberg, Germany, 2007
[7] H. Wickham, "Reshaping data with the reshape package", Journal of Statistical Software, 21(12), 2007
[8] W. N. Venables and B. D. Ripley, "Modern Applied Statistics with S", Fourth Edition, Springer, 2002
[9] E. Tufte, "Beautiful Evidence", Graphics Press, 2006
[10] P. Spector, "Data Manipulation with R" (Use R), Springer

36 Questions?


More information

A Guide for Energy Efficiency Program Sponsors Leveraging ENERGY STAR Product Finder Advanced View Last Updated June 18, 2014

A Guide for Energy Efficiency Program Sponsors Leveraging ENERGY STAR Product Finder Advanced View Last Updated June 18, 2014 A Guide for Energy Efficiency Program Sponsors Leveraging ENERGY STAR Product Finder Advanced View Last Updated June 18, 2014 Introduction It is now easier than ever to customize ENERGY STAR-certified

More information

A Comparison of SAS versus Microsoft Excel and Access s Inbuilt VBA Functionality

A Comparison of SAS versus Microsoft Excel and Access s Inbuilt VBA Functionality A Comparison of SAS versus Microsoft Excel and Access s Inbuilt VBA Functionality Jozef Tarrant, Amadeus Software Ltd. Copyright 2011 Amadeus Software Ltd. 1 Overview What is VBA? VBA Essentials: Modules

More information

Data Management, Analysis Tools, and Analysis Mechanics

Data Management, Analysis Tools, and Analysis Mechanics Chapter 2 Data Management, Analysis Tools, and Analysis Mechanics This chapter explores different tools and techniques for handling data for research purposes. This chapter assumes that a research problem

More information

CE 504 Computational Hydrology Computational Environments and Tools Fritz R. Fiedler

CE 504 Computational Hydrology Computational Environments and Tools Fritz R. Fiedler CE 504 Computational Hydrology Computational Environments and Tools Fritz R. Fiedler 1) Operating systems a) Windows b) Unix and Linux c) Macintosh 2) Data manipulation tools a) Text Editors b) Spreadsheets

More information

Exploratory Data Analysis with R

Exploratory Data Analysis with R Exploratory Data Analysis with R Roger D. Peng This book is for sale at http://leanpub.com/exdata This version was published on 2015-11-12 This is a Leanpub book. Leanpub empowers authors and publishers

More information

Principles of Data Visualization for Exploratory Data Analysis. Renee M. P. Teate. SYS 6023 Cognitive Systems Engineering April 28, 2015

Principles of Data Visualization for Exploratory Data Analysis. Renee M. P. Teate. SYS 6023 Cognitive Systems Engineering April 28, 2015 Principles of Data Visualization for Exploratory Data Analysis Renee M. P. Teate SYS 6023 Cognitive Systems Engineering April 28, 2015 Introduction Exploratory Data Analysis (EDA) is the phase of analysis

More information

The Big Picture. Describing Data: Categorical and Quantitative Variables Population. Descriptive Statistics. Community Coalitions (n = 175)

The Big Picture. Describing Data: Categorical and Quantitative Variables Population. Descriptive Statistics. Community Coalitions (n = 175) Describing Data: Categorical and Quantitative Variables Population The Big Picture Sampling Statistical Inference Sample Exploratory Data Analysis Descriptive Statistics In order to make sense of data,

More information

LabVIEW Day 6: Saving Files and Making Sub vis

LabVIEW Day 6: Saving Files and Making Sub vis LabVIEW Day 6: Saving Files and Making Sub vis Vern Lindberg You have written various vis that do computations, make 1D and 2D arrays, and plot graphs. In practice we also want to save that data. We will

More information

THEME: T-ACCOUNTS. By John W. Day, MBA. ACCOUNTING TERM: T-Account

THEME: T-ACCOUNTS. By John W. Day, MBA. ACCOUNTING TERM: T-Account THEME: T-ACCOUNTS By John W. Day, MBA ACCOUNTING TERM: T-Account A T-Account is a template or format shaped like a T that represents a particular general ledger account. Debit entries are recorded on the

More information

Introduction to Matlab

Introduction to Matlab Introduction to Matlab Social Science Research Lab American University, Washington, D.C. Web. www.american.edu/provost/ctrl/pclabs.cfm Tel. x3862 Email. SSRL@American.edu Course Objective This course provides

More information

Programming Exercise 3: Multi-class Classification and Neural Networks

Programming Exercise 3: Multi-class Classification and Neural Networks Programming Exercise 3: Multi-class Classification and Neural Networks Machine Learning November 4, 2011 Introduction In this exercise, you will implement one-vs-all logistic regression and neural networks

More information

Importing and Exporting With SPSS for Windows 17 TUT 117

Importing and Exporting With SPSS for Windows 17 TUT 117 Information Systems Services Importing and Exporting With TUT 117 Version 2.0 (Nov 2009) Contents 1. Introduction... 3 1.1 Aim of this Document... 3 2. Importing Data from Other Sources... 3 2.1 Reading

More information

Criteria for Evaluating Visual EDA Tools

Criteria for Evaluating Visual EDA Tools Criteria for Evaluating Visual EDA Tools Stephen Few, Perceptual Edge Visual Business Intelligence Newsletter April/May/June 2012 We visualize data for various purposes. Specific purposes direct us to

More information

Introduction to SPSS 16.0

Introduction to SPSS 16.0 Introduction to SPSS 16.0 Edited by Emily Blumenthal Center for Social Science Computation and Research 110 Savery Hall University of Washington Seattle, WA 98195 USA (206) 543-8110 November 2010 http://julius.csscr.washington.edu/pdf/spss.pdf

More information

Lecture 2 Mathcad Basics

Lecture 2 Mathcad Basics Operators Lecture 2 Mathcad Basics + Addition, - Subtraction, * Multiplication, / Division, ^ Power ( ) Specify evaluation order Order of Operations ( ) ^ highest level, first priority * / next priority

More information

Data Warehouse Center Administration Guide

Data Warehouse Center Administration Guide IBM DB2 Universal Database Data Warehouse Center Administration Guide Version 8 SC27-1123-00 IBM DB2 Universal Database Data Warehouse Center Administration Guide Version 8 SC27-1123-00 Before using this

More information

Introduction to Microsoft Excel 2007/2010

Introduction to Microsoft Excel 2007/2010 to Microsoft Excel 2007/2010 Abstract: Microsoft Excel is one of the most powerful and widely used spreadsheet applications available today. Excel's functionality and popularity have made it an essential

More information

GGobi meets R: an extensible environment for interactive dynamic data visualization

GGobi meets R: an extensible environment for interactive dynamic data visualization New URL: http://www.r-project.org/conferences/dsc-2001/ DSC 2001 Proceedings of the 2nd International Workshop on Distributed Statistical Computing March 15-17, Vienna, Austria http://www.ci.tuwien.ac.at/conferences/dsc-2001

More information

Scientific Graphing in Excel 2010

Scientific Graphing in Excel 2010 Scientific Graphing in Excel 2010 When you start Excel, you will see the screen below. Various parts of the display are labelled in red, with arrows, to define the terms used in the remainder of this overview.

More information

AP Computer Science Java Mr. Clausen Program 9A, 9B

AP Computer Science Java Mr. Clausen Program 9A, 9B AP Computer Science Java Mr. Clausen Program 9A, 9B PROGRAM 9A I m_sort_of_searching (20 points now, 60 points when all parts are finished) The purpose of this project is to set up a program that will

More information

Computational Statistics: A Crash Course using R for Biologists (and Their Friends)

Computational Statistics: A Crash Course using R for Biologists (and Their Friends) Computational Statistics: A Crash Course using R for Biologists (and Their Friends) Randall Pruim Calvin College Michigan NExT 2011 Computational Statistics Using R Why Use R? Statistics + Computation

More information

Our Raison d'être. Identify major choice decision points. Leverage Analytical Tools and Techniques to solve problems hindering these decision points

Our Raison d'être. Identify major choice decision points. Leverage Analytical Tools and Techniques to solve problems hindering these decision points Analytic 360 Our Raison d'être Identify major choice decision points Leverage Analytical Tools and Techniques to solve problems hindering these decision points Empowerment through Intelligence Our Suite

More information

Oracle Database 12c: Introduction to SQL Ed 1.1

Oracle Database 12c: Introduction to SQL Ed 1.1 Oracle University Contact Us: 1.800.529.0165 Oracle Database 12c: Introduction to SQL Ed 1.1 Duration: 5 Days What you will learn This Oracle Database: Introduction to SQL training helps you write subqueries,

More information

Toad Data Modeler - Features Matrix

Toad Data Modeler - Features Matrix Toad Data Modeler - Features Matrix Functionality Commercial Trial Freeware Notes General Features Physical Model (database specific) Universal Model (generic physical model) Logical Model (support for

More information

Charts for SharePoint

Charts for SharePoint KWizCom Corporation Charts for SharePoint Admin Guide Copyright 2005-2015 KWizCom Corporation. All rights reserved. Company Headquarters 95 Mural Street, Suite 600 Richmond Hill, ON L4B 3G2 Canada E-mail:

More information

Dream Report vs MS SQL Reporting. 10 Key Advantages for Dream Report

Dream Report vs MS SQL Reporting. 10 Key Advantages for Dream Report Dream Report vs MS SQL Reporting 10 Key Advantages for Dream Report Page 2 of 15 Table of Contents INTRODUCTION 3 PROFESSIONAL SOFTWARE FOR NON-PROGRAMMING USERS 4 CONSIDERABLE DIFFERENCE IN ENGINEERING

More information

Figure 1: Graphical example of a mergesort 1.

Figure 1: Graphical example of a mergesort 1. CSE 30321 Computer Architecture I Fall 2011 Lab 02: Procedure Calls in MIPS Assembly Programming and Performance Total Points: 100 points due to its complexity, this lab will weight more heavily in your

More information

Transforming Time Series Data into Capacity Planning Information

Transforming Time Series Data into Capacity Planning Information Transforming Time Series Data into Capacity Planning Information James F Brady Capacity Planner for the State of Nevada jfbrady@admin.nv.gov Often an analyst has time series data available from performance

More information

MySQL for Beginners Ed 3

MySQL for Beginners Ed 3 Oracle University Contact Us: 1.800.529.0165 MySQL for Beginners Ed 3 Duration: 4 Days What you will learn The MySQL for Beginners course helps you learn about the world's most popular open source database.

More information

VISUAL GUIDE to. RX Scripting. for Roulette Xtreme - System Designer 2.0

VISUAL GUIDE to. RX Scripting. for Roulette Xtreme - System Designer 2.0 VISUAL GUIDE to RX Scripting for Roulette Xtreme - System Designer 2.0 UX Software - 2009 TABLE OF CONTENTS INTRODUCTION... ii What is this book about?... iii How to use this book... iii Time to start...

More information