Automatic Generation of Accumulated Data Matrices in a Tabulating Process

Size: px
Start display at page:

Download "Automatic Generation of Accumulated Data Matrices in a Tabulating Process"

Transcription

1 _.~ Automatic Generation of Accumulated Data Matrices in a Tabulating Process CASTILLO, Jesus CASTRO, Alejandro de SANTOS, Angel Departamento de Estadistica - Comunidad de Madrid Informatica Comunidad de Madrid Abstract The SAS system offers great possibilities in order to get information in a tabular form. The procedure PROC TABULATE is the most adecuated for these kind of tasks. Its syntax is simple and powerful. With a few lines of code, tables with a complex structure can be obtained. However, the output produced by this procedure is not a data set but a report. Its manipulation for another SAS procedure is difficult. The work presented in SEUGI'94 has the purpose to develop an automatized procedure that allows to get an accumulated data matrix as a SAS data set starting from a table definition. Introduction A basic aspect in the difussion policy of an organism that produces statistics information is the format in which is distributed. The possibility of manipulating the data has a great importance. A priority purpose elaborating the tables of the Census of 1991 at "Departamento de Estadistica de la Comunidad de Madrid" was to get data matrices. PROC TABULATE does not include an option that allows the output is a SAS data set. This forced to write SAS programs for each table in order to get a SAS data set. This method allowed to solve some problems that appeared in the tables: the use of different where clauses in the same table, and the accumulation of a subset of the values of a variable. Figure 1 shows the second case. 121._" _~ i_.~'~ -~'_~;_ ~ - ",.- ''''''-~ -'-'.

2 AGE <20 I J >40 I I I >80 t t t t t t Figure 1. Accumulation of a subset of the values of a variable. None of these cases is solved using PROC TABULATE. How to Get Accumulated Data Matrices The experience obtained in the development of SAS programs to get accumulated data matrices allowed to elaborate a systematic for its realization. The study of the different kinds of tables that the Department generates, allowed to identify those parts that are necessary to get in a separated way using accumulation procedures. Basically, the characteristics that lead to this conclusion are two: If in a statistics table different where clauses are applied, it is essential to execute different accumulating procedures. If the table incorporates concatenation in some of its dimension: page, row or column, is convenient but not necessary to execute different accumulating procedures. For instance, if we have a data set with information about the population of "Comunidad de Madrid", it is possible to propose the following table: CITY Acebeda, La Ajalvir SEX AGE Female I Male ~20 I I I I >80 A B Madrid DISTRICT QUARTER Centro Palacio Arganzuela Acacias C D Figure 2. Table with different where clauses and concatenation. 122

3 ~., i-?h-:0~~""'~:t~~;;;''':.1-' h,-:~-~-::&:l~;:~of':.':;:~~:;-':-?-:''" :"-~.~- - <-', ~. In the table of figure 2, four parts can be identified: A: population classified by CITY and SEX. B: population classified by CITY and AGE. c: population of the City of Madrid classified by DISTRICT, QUARTER and SEX. D: population of the City of Madrid classified by DISTRICT, QUARTER and AGE. Different where clauses are applied in the row dimension: crossing DISTRICT and QUARTER is only for the City of Madrid. Two variables: SEX and AGE appear in a concatenated way in the column dimension. The process to get the accumulated data matrix is based on obtaining separately every part of the table, and their composition in a unique data matrix. The accumulation procedure used is PROC SUMMARY, that allows to get accumulated data sets. Strategy of Development Programming using SAS macro language has allowed to develop a tool that starting from a table definition can obtain an accumulated data matrix. The basic element is the definition of the table whose matrix is obtained. The structure of the page, row and column dimensions is defined by a syntax designed for this purpose. The definition grammar "is more simple than the PROC TABULATE one, nevertheless it allows to define most of the tables that are elaborated at the Statistics Department. This syntax allows to apply more than a where clause in the same table, and the accumulation of a subset of the values of a variable. We could not renounce to these advantages in the definition of a table. The structure of a table is defined by the identification of the different groups that compose the page, row and column dimensions. One group is defined by the combination of class variables. Each group is able to have a where clause associated that restricts the information that is manipulated. Naturally, it is possible to require different statistics for each group

4 Let's see an example with different groups that compose a table: CITY SEX AGE All 1 Female I Male ~20 I I >40 I I I >80 Acebeda, La t t t t t t Ajalvir Madrid DISTRICT QUARTER Centro Palacio Arganzuela Acacias Figure 3. Identification of groups in a table. The row dimension is composed by two groups: Cities of "Comunidad de Madrid". The syntax of its definition is very simple: CITY Districts and quarters for the city of Madrid. It is only necessary to process the information referred to Madrid. This group has a where clause associated. The syntax of its definition is as follows: DISTRICT QUARTER FILTRO: CITY='Madrid' The structure of the column dimension is different. At first sight, two groups appear. One refers to the sex and the other one to the age: Accumulation of the population according to the sex. An ALL is required in the group. The syntax of definition is as follows: TOTAL SEX The columns that refer to the age, have some problems. The list of values of the AGE variable is as follows:, \. 1 : ~ 20 2 : :

5 4 : : > 80 A column appears in the table that really is the accumulation of a subset of the values of the AGE variable: : > 40 Two groups can be defined to solve this problem: Accumulation of the populations according to the age, till 40 years. AGE FILTRO: AGE < = 2 Accumulation of the population according to the age, starting from 40 years. An ALL is required (> 40). TOTAL AGE FILTRO: AGE > = 3 The macro that obtains the data matrix is as follows: %mda ( sasuser.data, sasuser. matrix, I * input data set *1 1* data matrix to get *1 1* there is no page dimension *1 1* groups that compose the row dimension *1 CITY + DISTRICT QUARTER FILTRO: CITY = 'Madrid', 1* groups that compose the column dimension *1 TOTAL SEX + AGE TOTAL AGE FILTRO: AGE < = 2 + FILTRO: AGE> = 3) ; The table is defined by 5 groups: 2 in the row dimension and three in the column dimension. It is necessary to realize 2 x 3 = 6 accumulation procedures. Every one is solved by a PROC SUMMARY that where clauses are applied on associated to every zone that composes the table. Then, the required observations are selected (the _TYPE_variable identifies the different accumulation levels). Next, using a PROC TRANSPOSE, a data matrix of every zone is obtained. With the composition of the data matrices associated to every zone, a data matrix of the required table is obtained. The algorithm definition development of a table structure has been really important. The definition grammar of the page, row and column dimensions is designed by graphs that _ ",." ~ ~ ~ ~._. J '".. '.. _...

6 express the recognized process behaviour. These graphs are simi~ar to the ones used in compilers theory to define a grammar. Its programming has been very simple. The changes in the definition syntax do not involve difficult modifications in the programs. Figure 4 shows the graph that defines the page dimension syntax. o A 7 \t ~ -+-Hn --'$"-- ~ ~(-TOT.--'-$ _:AL_> FUTRO: ~ S Y O~~O f1i!l"ro: 0~( --=-Hn_ o Figure 4. Graph that defines the page dimension syntax How the Statistics Technician Defines Tables The statistics technician that wishes to defme a table to get its data matrix, can use the macro directly. However, this is not usual. The tables definition is realized by a PC application., \. This product allows not only to defme tables, but also to manipulate the definitions: to group tables according to study areas, add, erase or update definitions, to define data sets, etc.. This application writes the SAS code that call the macro. This interface is not developed using SAS. It was important that the application could 126

7 be installed in a lot of PC's without licensed software problems. The table execution can be realized in the same computer where the application is installed, or in another computer. Even in another operating system. If SAS is not available in the same computer than the definition interface, the code generated is transferred to the computer where SAS is available. This application includes other possibilities to facilitate the work to the user. It is possible to work directly with SAS data sets, DBF and ASCII files. The data matrix can be obtained in any of these types. They are options that facilitate the work very much to an user that does not know how to programme in SAS. To a Second Version Most tables that Statistics Department need are obtained using this macro. However, there are some aspects that nowadays are not included. It is sometimes necessary to manipulate the information before or after the macro is executed. The macros that nowadays are developed, compose the main core of a product that continues incorporating new options. To get ready a second version, the following aspects are being considered: Definition of a general where clause associated to the table. Its incorporation is very simple. Possibility of defining formats. We can distinguish three kind of formats: referred to description, that associate labels to each value of a variable. about grouping, that allow to group several values of a variable using the same description. edition formats, that define the ch~lfacteristics length, number of decimals, etc.. of the cells of a table, such as its Information about the list of values of a variable contributes with a lot of information to the system. It will be likely to get tables where all the possible combinations will take place. PRINTMISS option in a TABLE statement in PROC TABULATE works in a similar way. If we have two variables, A and B define as follows: A = 1,2,3 B = 1,2 and a data set with four observations: 127

8 A B the statement TABLE A * B; in a PROC TABULATE produces the following columns: A=l and B=l A=l and B=2 A=3 and B=l If option PRINTMISS is specified, a new column is added: A=3 and B=2.;~ 1 with missing values in every cell. This combination did not appear in the data set. However, there is no column for A = 2. It will be likely to get all the possible combinations of class variables, although the information in the data set does not allow to deduce these cases. Two new columns will take place: A=2andB=1 A=2 and B=2 with missing values in all the cells. An aspect that has not been considered is the variable generation. It is a key point to solve. The accumulating information process is based on PROC SUMMARY. The information cannot be obtained if it is not supplied directly by this procedure, such as the mean, the minimum, etc.. It is not possible to get percentages, addings, etc.. DATA step, PROC SQL and PROC COMPUTAB in SAS/ETS will be the key to solve this problem. The final aim of the developed tool is to get accumulated data matrices as SAS data sets. To obtain the table as a report is the following step. PROC REPORT will generate it, acceding to the formats. The definition syntax of a table using a TABLE statement in a PROC TABULATE allows to define tables with a complex structure. If the tables to develop recommend this in the future, it will be necessary to modify the present definition grammar to make it similar to the TABLE statement syntax. 128

9 "'''''O~'''''''''=", ;;;'''''~C<,~5".~'o.,~."t.;:':'X':ct~:""""",, Conclusions The key point of this work is the development of an automatized procedure to obtain accumulated data matrices. It is much more important to generate a data set with accumulated information than a report. This data set can be transferred directly to a data base or spreadsheet. It is accesible with any SAS procedure. Any program can manipulate these data. The important is that the generated information is a SAS data set. From this point, everything is much easier. The effort is condensed in the table design, not in their development. As yet, the situation was the opposite. The maintenance cost and the developing time decrease enormously, and the analysis capacity of the information that is distributed is much greater on the part of the user. References SAS and SAS/ETS are registered trademarks of SAS Institute Inc., Cary, NC, USA. Departamento de Estadistica - Comunidad de Madrid Informatica Comunidad de Madrid Principe de Vergara, 132-6a Madrid Spain Tfn Fax , \.

Counting the Ways to Count in SAS. Imelda C. Go, South Carolina Department of Education, Columbia, SC

Counting the Ways to Count in SAS. Imelda C. Go, South Carolina Department of Education, Columbia, SC Paper CC 14 Counting the Ways to Count in SAS Imelda C. Go, South Carolina Department of Education, Columbia, SC ABSTRACT This paper first takes the reader through a progression of ways to count in SAS.

More information

Paper TU_09. Proc SQL Tips and Techniques - How to get the most out of your queries

Paper TU_09. Proc SQL Tips and Techniques - How to get the most out of your queries Paper TU_09 Proc SQL Tips and Techniques - How to get the most out of your queries Kevin McGowan, Constella Group, Durham, NC Brian Spruell, Constella Group, Durham, NC Abstract: Proc SQL is a powerful

More information

Guido s Guide to PROC FREQ A Tutorial for Beginners Using the SAS System Joseph J. Guido, University of Rochester Medical Center, Rochester, NY

Guido s Guide to PROC FREQ A Tutorial for Beginners Using the SAS System Joseph J. Guido, University of Rochester Medical Center, Rochester, NY Guido s Guide to PROC FREQ A Tutorial for Beginners Using the SAS System Joseph J. Guido, University of Rochester Medical Center, Rochester, NY ABSTRACT PROC FREQ is an essential procedure within BASE

More information

Producing Structured Clinical Trial Reports Using SAS: A Company Solution

Producing Structured Clinical Trial Reports Using SAS: A Company Solution Producing Structured Clinical Trial Reports Using SAS: A Company Solution By Andy Lawton, Helen Dewberry and Michael Pearce, Boehringer Ingelheim UK Ltd INTRODUCTION Boehringer Ingelheim (BI), like all

More information

IBM SPSS Statistics 20 Part 1: Descriptive Statistics

IBM SPSS Statistics 20 Part 1: Descriptive Statistics CALIFORNIA STATE UNIVERSITY, LOS ANGELES INFORMATION TECHNOLOGY SERVICES IBM SPSS Statistics 20 Part 1: Descriptive Statistics Summer 2013, Version 2.0 Table of Contents Introduction...2 Downloading the

More information

Chapter 2 The Data Table. Chapter Table of Contents

Chapter 2 The Data Table. Chapter Table of Contents Chapter 2 The Data Table Chapter Table of Contents Introduction... 21 Bringing in Data... 22 OpeningLocalFiles... 22 OpeningSASFiles... 27 UsingtheQueryWindow... 28 Modifying Tables... 31 Viewing and Editing

More information

Data Presentation. Paper 126-27. Using SAS Macros to Create Automated Excel Reports Containing Tables, Charts and Graphs

Data Presentation. Paper 126-27. Using SAS Macros to Create Automated Excel Reports Containing Tables, Charts and Graphs Paper 126-27 Using SAS Macros to Create Automated Excel Reports Containing Tables, Charts and Graphs Tugluke Abdurazak Abt Associates Inc. 1110 Vermont Avenue N.W. Suite 610 Washington D.C. 20005-3522

More information

EXST SAS Lab Lab #4: Data input and dataset modifications

EXST SAS Lab Lab #4: Data input and dataset modifications EXST SAS Lab Lab #4: Data input and dataset modifications Objectives 1. Import an EXCEL dataset. 2. Infile an external dataset (CSV file) 3. Concatenate two datasets into one 4. The PLOT statement will

More information

Storing and Using a List of Values in a Macro Variable

Storing and Using a List of Values in a Macro Variable Storing and Using a List of Values in a Macro Variable Arthur L. Carpenter California Occidental Consultants, Oceanside, California ABSTRACT When using the macro language it is not at all unusual to need

More information

From The Little SAS Book, Fifth Edition. Full book available for purchase here.

From The Little SAS Book, Fifth Edition. Full book available for purchase here. From The Little SAS Book, Fifth Edition. Full book available for purchase here. Acknowledgments ix Introducing SAS Software About This Book xi What s New xiv x Chapter 1 Getting Started Using SAS Software

More information

Improving Maintenance and Performance of SQL queries

Improving Maintenance and Performance of SQL queries PaperCC06 Improving Maintenance and Performance of SQL queries Bas van Bakel, OCS Consulting, Rosmalen, The Netherlands Rick Pagie, OCS Consulting, Rosmalen, The Netherlands ABSTRACT Almost all programmers

More information

There are six different windows that can be opened when using SPSS. The following will give a description of each of them.

There are six different windows that can be opened when using SPSS. The following will give a description of each of them. SPSS Basics Tutorial 1: SPSS Windows There are six different windows that can be opened when using SPSS. The following will give a description of each of them. The Data Editor The Data Editor is a spreadsheet

More information

SPSS The Basics. Jennifer Thach RHS Assessment Office March 3 rd, 2014

SPSS The Basics. Jennifer Thach RHS Assessment Office March 3 rd, 2014 SPSS The Basics Jennifer Thach RHS Assessment Office March 3 rd, 2014 Why use SPSS? - Used heavily in the Social Science & Business world - Ability to perform basic to high-level statistical analysis (i.e.

More information

The Art of Designing HOLAP Databases Mark Moorman, SAS Institute Inc., Cary NC

The Art of Designing HOLAP Databases Mark Moorman, SAS Institute Inc., Cary NC Paper 139 The Art of Designing HOLAP Databases Mark Moorman, SAS Institute Inc., Cary NC ABSTRACT While OLAP applications offer users fast access to information across business dimensions, it can also

More information

Business Process Management. Prof. Corrado Cerruti General Management Course

Business Process Management. Prof. Corrado Cerruti General Management Course Business Process Management General Management Course Summary Business Process Management definition Business Process Management Life Cycle ARIS approach to BPM Business Process Identification; Designing

More information

Beginning Tutorials. Web Publishing in SAS Software. Prepared by. International SAS Training and Consulting A SAS Institute Quality Partner

Beginning Tutorials. Web Publishing in SAS Software. Prepared by. International SAS Training and Consulting A SAS Institute Quality Partner Web Publishing in SAS Software Prepared by International SAS Training and Consulting A SAS Institute Quality Partner 100 Great Meadow Rd, Suite 601 Wethersfield, CT 06109-2379 Phone: (860) 721-1684 1-800-7TRAINING

More information

Simulate PRELOADFMT Option in PROC FREQ Ajay Gupta, PPD, Morrisville, NC

Simulate PRELOADFMT Option in PROC FREQ Ajay Gupta, PPD, Morrisville, NC ABSTRACT PharmaSUG 2015 - Paper QT33 Simulate PRELOADFMT Option in PROC FREQ Ajay Gupta, PPD, Morrisville, NC In Pharmaceuticals/CRO industries, table programing is often started when only partial data

More information

9.2 User s Guide SAS/STAT. Introduction. (Book Excerpt) SAS Documentation

9.2 User s Guide SAS/STAT. Introduction. (Book Excerpt) SAS Documentation SAS/STAT Introduction (Book Excerpt) 9.2 User s Guide SAS Documentation This document is an individual chapter from SAS/STAT 9.2 User s Guide. The correct bibliographic citation for the complete manual

More information

C H A P T E R 1 Introducing Data Relationships, Techniques for Data Manipulation, and Access Methods

C H A P T E R 1 Introducing Data Relationships, Techniques for Data Manipulation, and Access Methods C H A P T E R 1 Introducing Data Relationships, Techniques for Data Manipulation, and Access Methods Overview 1 Determining Data Relationships 1 Understanding the Methods for Combining SAS Data Sets 3

More information

Competent Data Management - a key component

Competent Data Management - a key component Competent Data Management - a key component Part II Illustrating the data entry application using CS-Pro April 2009 University of Reading Statistical Services Centre Data Management Support to RIU Projects

More information

Streamlining Reports: A Look into Ad Hoc and Standardized Processes James Jenson, US Bancorp, Saint Paul, MN

Streamlining Reports: A Look into Ad Hoc and Standardized Processes James Jenson, US Bancorp, Saint Paul, MN Working Paper 138-2010 Streamlining Reports: A Look into Ad Hoc and Standardized Processes James Jenson, US Bancorp, Saint Paul, MN Abstract: This paper provides a conceptual framework for quantitative

More information

SPSS (Statistical Package for the Social Sciences)

SPSS (Statistical Package for the Social Sciences) SPSS (Statistical Package for the Social Sciences) What is SPSS? SPSS stands for Statistical Package for the Social Sciences The SPSS home-page is: www.spss.com 2 What can you do with SPSS? Run Frequencies

More information

Defining a Validation Process for End-user (Data Manager / Statisticians) SAS Programs

Defining a Validation Process for End-user (Data Manager / Statisticians) SAS Programs Defining a Validation Process for End-user (Data Manager / Statisticians) SAS Programs Andy Lawton, Boehringer Ingelheim UK Ltd., Berkshire, England INTRODUCTION The requirements for validating end-user

More information

TECHNIQUES FOR BUILDING A SUCCESSFUL WEB ENABLED APPLICATION USING SAS/INTRNET SOFTWARE

TECHNIQUES FOR BUILDING A SUCCESSFUL WEB ENABLED APPLICATION USING SAS/INTRNET SOFTWARE TECHNIQUES FOR BUILDING A SUCCESSFUL WEB ENABLED APPLICATION USING SAS/INTRNET SOFTWARE Mary Singelais, Bell Atlantic, Merrimack, NH ABSTRACT (This paper is based on a presentation given in March 1998

More information

SAS Programming Tips, Tricks, and Techniques

SAS Programming Tips, Tricks, and Techniques SAS Programming Tips, Tricks, and Techniques A presentation by Kirk Paul Lafler Copyright 2001-2012 by Kirk Paul Lafler, Software Intelligence Corporation All rights reserved. SAS is the registered trademark

More information

Scatter Chart. Segmented Bar Chart. Overlay Chart

Scatter Chart. Segmented Bar Chart. Overlay Chart Data Visualization Using Java and VRML Lingxiao Li, Art Barnes, SAS Institute Inc., Cary, NC ABSTRACT Java and VRML (Virtual Reality Modeling Language) are tools with tremendous potential for creating

More information

SPSS: Getting Started. For Windows

SPSS: Getting Started. For Windows For Windows Updated: August 2012 Table of Contents Section 1: Overview... 3 1.1 Introduction to SPSS Tutorials... 3 1.2 Introduction to SPSS... 3 1.3 Overview of SPSS for Windows... 3 Section 2: Entering

More information

Chapter 2 Introduction to SPSS

Chapter 2 Introduction to SPSS Chapter 2 Introduction to SPSS Abstract This chapter introduces several basic SPSS procedures that are used in the analysis of a data set. The chapter explains the structure of SPSS data files, how to

More information

Quantrix & Excel: 3 Key Differences A QUANTRIX WHITE PAPER

Quantrix & Excel: 3 Key Differences A QUANTRIX WHITE PAPER Quantrix & Excel: 3 Key Differences A QUANTRIX WHITE PAPER Abstract This whitepaper is designed to educate spreadsheet users about three key conceptual and practical differences between Quantrix Modeler

More information

Charting LibQUAL+(TM) Data. Jeff Stark Training & Development Services Texas A&M University Libraries Texas A&M University

Charting LibQUAL+(TM) Data. Jeff Stark Training & Development Services Texas A&M University Libraries Texas A&M University Charting LibQUAL+(TM) Data Jeff Stark Training & Development Services Texas A&M University Libraries Texas A&M University Revised March 2004 The directions in this handout are written to be used with SPSS

More information

SAS Analyst for Windows Tutorial

SAS Analyst for Windows Tutorial Updated: August 2012 Table of Contents Section 1: Introduction... 3 1.1 About this Document... 3 1.2 Introduction to Version 8 of SAS... 3 Section 2: An Overview of SAS V.8 for Windows... 3 2.1 Navigating

More information

Supplementary Materials for Chapter 15 - Analysing Data

Supplementary Materials for Chapter 15 - Analysing Data Supplementary Materials for Chapter 15 - Introduction Analysing Data This resource supplements the discussions in Chapter 15 of the book - Analysing Data (pp. 291-293) under the heading Analysis of Quantitative

More information

EXCEL SOLVER TUTORIAL

EXCEL SOLVER TUTORIAL ENGR62/MS&E111 Autumn 2003 2004 Prof. Ben Van Roy October 1, 2003 EXCEL SOLVER TUTORIAL This tutorial will introduce you to some essential features of Excel and its plug-in, Solver, that we will be using

More information

Text Analytics Illustrated with a Simple Data Set

Text Analytics Illustrated with a Simple Data Set CSC 594 Text Mining More on SAS Enterprise Miner Text Analytics Illustrated with a Simple Data Set This demonstration illustrates some text analytic results using a simple data set that is designed to

More information

Query Optimization Approach in SQL to prepare Data Sets for Data Mining Analysis

Query Optimization Approach in SQL to prepare Data Sets for Data Mining Analysis Query Optimization Approach in SQL to prepare Data Sets for Data Mining Analysis Rajesh Reddy Muley 1, Sravani Achanta 2, Prof.S.V.Achutha Rao 3 1 pursuing M.Tech(CSE), Vikas College of Engineering and

More information

WHO STEPS Surveillance Support Materials. STEPS Epi Info Training Guide

WHO STEPS Surveillance Support Materials. STEPS Epi Info Training Guide STEPS Epi Info Training Guide Department of Chronic Diseases and Health Promotion World Health Organization 20 Avenue Appia, 1211 Geneva 27, Switzerland For further information: www.who.int/chp/steps WHO

More information

STEP TWO: Highlight the data set, then select DATA PIVOT TABLE

STEP TWO: Highlight the data set, then select DATA PIVOT TABLE STEP ONE: Enter the data into a database format, with the first row being the variable names, and each row thereafter being one completed survey. For this tutorial, highlight this table, copy and paste

More information

Release 2.1 of SAS Add-In for Microsoft Office Bringing Microsoft PowerPoint into the Mix ABSTRACT INTRODUCTION Data Access

Release 2.1 of SAS Add-In for Microsoft Office Bringing Microsoft PowerPoint into the Mix ABSTRACT INTRODUCTION Data Access Release 2.1 of SAS Add-In for Microsoft Office Bringing Microsoft PowerPoint into the Mix Jennifer Clegg, SAS Institute Inc., Cary, NC Eric Hill, SAS Institute Inc., Cary, NC ABSTRACT Release 2.1 of SAS

More information

Horizontal Aggregations In SQL To Generate Data Sets For Data Mining Analysis In An Optimized Manner

Horizontal Aggregations In SQL To Generate Data Sets For Data Mining Analysis In An Optimized Manner 24 Horizontal Aggregations In SQL To Generate Data Sets For Data Mining Analysis In An Optimized Manner Rekha S. Nyaykhor M. Tech, Dept. Of CSE, Priyadarshini Bhagwati College of Engineering, Nagpur, India

More information

Parallel Data Preparation with the DS2 Programming Language

Parallel Data Preparation with the DS2 Programming Language ABSTRACT Paper SAS329-2014 Parallel Data Preparation with the DS2 Programming Language Jason Secosky and Robert Ray, SAS Institute Inc., Cary, NC and Greg Otto, Teradata Corporation, Dayton, OH A time-consuming

More information

Salary. Cumulative Frequency

Salary. Cumulative Frequency HW01 Answering the Right Question with the Right PROC Carrie Mariner, Afton-Royal Training & Consulting, Richmond, VA ABSTRACT When your boss comes to you and says "I need this report by tomorrow!" do

More information

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm

More information

How To Merge Multiple Reports In Jonas With Excel

How To Merge Multiple Reports In Jonas With Excel New Report Tool Across Modules 12.1 Detailed Agenda History of Reporting in Jonas Merging Multiple Reports Using the Power of Excel Report Samples Simple Dashboard - Technician Hours and Work Order Productivity

More information

As noted in previous chapters, crime analysis relies heavily on computer

As noted in previous chapters, crime analysis relies heavily on computer 07-Boba-4723.qxd 6/9/2005 3:43 PM Page 101 7 Crime Analysis Technology As noted in previous chapters, crime analysis relies heavily on computer technology, and over the past 15 years significant improvements

More information

Chapter 9 Joining Data from Multiple Tables. Oracle 10g: SQL

Chapter 9 Joining Data from Multiple Tables. Oracle 10g: SQL Chapter 9 Joining Data from Multiple Tables Oracle 10g: SQL Objectives Identify a Cartesian join Create an equality join using the WHERE clause Create an equality join using the JOIN keyword Create a non-equality

More information

Making the Output Delivery System (ODS) Work for You William Fehlner, SAS Institute (Canada) Inc., Toronto, Ontario

Making the Output Delivery System (ODS) Work for You William Fehlner, SAS Institute (Canada) Inc., Toronto, Ontario Making the Output Delivery System (ODS) Work for You William Fehlner, SAS Institute (Canada) Inc, Toronto, Ontario ABSTRACT Over the years, a variety of options have been offered in order to give a SAS

More information

A Closer Look at PROC SQL s FEEDBACK Option Kenneth W. Borowiak, PPD, Inc., Morrisville, NC

A Closer Look at PROC SQL s FEEDBACK Option Kenneth W. Borowiak, PPD, Inc., Morrisville, NC A Closer Look at PROC SQL s FEEDBACK Option Kenneth W. Borowiak, PPD, Inc., Morrisville, NC SESUG 2012 ABSTRACT The FEEDBACK option on the PROC SQL statement controls whether an expanded or transformed

More information

9.1 SAS. SQL Query Window. User s Guide

9.1 SAS. SQL Query Window. User s Guide SAS 9.1 SQL Query Window User s Guide The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2004. SAS 9.1 SQL Query Window User s Guide. Cary, NC: SAS Institute Inc. SAS

More information

Taming the PROC TRANSPOSE

Taming the PROC TRANSPOSE Taming the PROC TRANSPOSE Matt Taylor, Carolina Analytical Consulting, LLC ABSTRACT The PROC TRANSPOSE is often misunderstood and seldom used. SAS users are unsure of the results it will give and curious

More information

DESCRIPTIVE STATISTICS & DATA PRESENTATION*

DESCRIPTIVE STATISTICS & DATA PRESENTATION* Level 1 Level 2 Level 3 Level 4 0 0 0 0 evel 1 evel 2 evel 3 Level 4 DESCRIPTIVE STATISTICS & DATA PRESENTATION* Created for Psychology 41, Research Methods by Barbara Sommer, PhD Psychology Department

More information

Anyone Can Learn PROC TABULATE

Anyone Can Learn PROC TABULATE Paper 60-27 Anyone Can Learn PROC TABULATE Lauren Haworth, Genentech, Inc., South San Francisco, CA ABSTRACT SAS Software provides hundreds of ways you can analyze your data. You can use the DATA step

More information

LabVIEW Day 6: Saving Files and Making Sub vis

LabVIEW Day 6: Saving Files and Making Sub vis LabVIEW Day 6: Saving Files and Making Sub vis Vern Lindberg You have written various vis that do computations, make 1D and 2D arrays, and plot graphs. In practice we also want to save that data. We will

More information

SUGI 29 Systems Architecture. Paper 223-29

SUGI 29 Systems Architecture. Paper 223-29 Paper 223-29 SAS Add-In for Microsoft Office Leveraging SAS Throughout the Organization from Microsoft Office Jennifer Clegg, SAS Institute Inc., Cary, NC Stephen McDaniel, SAS Institute Inc., Cary, NC

More information

Modifying Colors and Symbols in ArcMap

Modifying Colors and Symbols in ArcMap Modifying Colors and Symbols in ArcMap Contents Introduction... 1 Displaying Categorical Data... 3 Creating New Categories... 5 Displaying Numeric Data... 6 Graduated Colors... 6 Graduated Symbols... 9

More information

New Tricks for an Old Tool: Using Custom Formats for Data Validation and Program Efficiency

New Tricks for an Old Tool: Using Custom Formats for Data Validation and Program Efficiency New Tricks for an Old Tool: Using Custom Formats for Data Validation and Program Efficiency S. David Riba, JADE Tech, Inc., Clearwater, FL ABSTRACT PROC FORMAT is one of the old standards among SAS Procedures,

More information

Using Excel for Statistics Tips and Warnings

Using Excel for Statistics Tips and Warnings Using Excel for Statistics Tips and Warnings November 2000 University of Reading Statistical Services Centre Biometrics Advisory and Support Service to DFID Contents 1. Introduction 3 1.1 Data Entry and

More information

How to Use SDTM Definition and ADaM Specifications Documents. to Facilitate SAS Programming

How to Use SDTM Definition and ADaM Specifications Documents. to Facilitate SAS Programming How to Use SDTM Definition and ADaM Specifications Documents to Facilitate SAS Programming Yan Liu Sanofi Pasteur ABSTRCT SDTM and ADaM implementation guides set strict requirements for SDTM and ADaM variable

More information

G563 Quantitative Paleontology. SQL databases. An introduction. Department of Geological Sciences Indiana University. (c) 2012, P.

G563 Quantitative Paleontology. SQL databases. An introduction. Department of Geological Sciences Indiana University. (c) 2012, P. SQL databases An introduction AMP: Apache, mysql, PHP This installations installs the Apache webserver, the PHP scripting language, and the mysql database on your computer: Apache: runs in the background

More information

Utilizing Clinical SAS Report Templates Sunil Kumar Gupta Gupta Programming, Thousand Oaks, CA

Utilizing Clinical SAS Report Templates Sunil Kumar Gupta Gupta Programming, Thousand Oaks, CA Utilizing Clinical SAS Report Templates Sunil Kumar Gupta Gupta Programming, Thousand Oaks, CA ABSTRACT SAS programmers often have the responsibility of supporting the reporting needs of the Clinical Affairs

More information

An automatic predictive datamining tool. Data Preparation Propensity to Buy v1.05

An automatic predictive datamining tool. Data Preparation Propensity to Buy v1.05 An automatic predictive datamining tool Data Preparation Propensity to Buy v1.05 Januray 2011 Page 2 of 11 Data preparation - Introduction If you are using The Intelligent Mining Machine (TIMi) inside

More information

Big Data, Fast Processing Speeds Kevin McGowan SAS Solutions on Demand, Cary NC

Big Data, Fast Processing Speeds Kevin McGowan SAS Solutions on Demand, Cary NC Big Data, Fast Processing Speeds Kevin McGowan SAS Solutions on Demand, Cary NC ABSTRACT As data sets continue to grow, it is important for programs to be written very efficiently to make sure no time

More information

Development Period 1 2 3 4 5 6 7 8 9 Observed Payments

Development Period 1 2 3 4 5 6 7 8 9 Observed Payments Pricing and reserving in the general insurance industry Solutions developed in The SAS System John Hansen & Christian Larsen, Larsen & Partners Ltd 1. Introduction The two business solutions presented

More information

Is it statistically significant? The chi-square test

Is it statistically significant? The chi-square test UAS Conference Series 2013/14 Is it statistically significant? The chi-square test Dr Gosia Turner Student Data Management and Analysis 14 September 2010 Page 1 Why chi-square? Tests whether two categorical

More information

The Query Builder: The Swiss Army Knife of SAS Enterprise Guide

The Query Builder: The Swiss Army Knife of SAS Enterprise Guide Paper 1557-2014 The Query Builder: The Swiss Army Knife of SAS Enterprise Guide ABSTRACT Jennifer First-Kluge and Steven First, Systems Seminar Consultants, Inc. The SAS Enterprise Guide Query Builder

More information

Using SPSS, Chapter 2: Descriptive Statistics

Using SPSS, Chapter 2: Descriptive Statistics 1 Using SPSS, Chapter 2: Descriptive Statistics Chapters 2.1 & 2.2 Descriptive Statistics 2 Mean, Standard Deviation, Variance, Range, Minimum, Maximum 2 Mean, Median, Mode, Standard Deviation, Variance,

More information

Statistics and Analysis. Quality Control: How to Analyze and Verify Financial Data

Statistics and Analysis. Quality Control: How to Analyze and Verify Financial Data Abstract Quality Control: How to Analyze and Verify Financial Data Michelle Duan, Wharton Research Data Services, Philadelphia, PA As SAS programmers dealing with massive financial data from a variety

More information

Performance Test Suite Results for SAS 9.1 Foundation on the IBM zseries Mainframe

Performance Test Suite Results for SAS 9.1 Foundation on the IBM zseries Mainframe Performance Test Suite Results for SAS 9.1 Foundation on the IBM zseries Mainframe A SAS White Paper Table of Contents The SAS and IBM Relationship... 1 Introduction...1 Customer Jobs Test Suite... 1

More information

Generating Randomization Schedules Using SAS Programming Chunqin Deng and Julia Graz, PPD, Inc., Research Triangle Park, North Carolina

Generating Randomization Schedules Using SAS Programming Chunqin Deng and Julia Graz, PPD, Inc., Research Triangle Park, North Carolina Paper 267-27 Generating Randomization Schedules Using SAS Programming Chunqin Deng and Julia Graz, PPD, Inc., Research Triangle Park, North Carolina ABSTRACT Randomization as a method of experimental control

More information

Chapter 9 Creating Reports in Excel

Chapter 9 Creating Reports in Excel Chapter 9 Creating Reports in Excel One of the most powerful features of Standard & Poor s Research Insight is its ability to communicate with Microsoft Excel through Active-X Technology. Excel requests

More information

This book serves as a guide for those interested in using IBM SPSS

This book serves as a guide for those interested in using IBM SPSS 1 Overview This book serves as a guide for those interested in using IBM SPSS Statistics software to assist in statistical data analysis whether as a companion to a statistics or research methods course,

More information

Creating Word Tables using PROC REPORT and ODS RTF

Creating Word Tables using PROC REPORT and ODS RTF Paper TT02 Creating Word Tables using PROC REPORT and ODS RTF Carey G. Smoak,, Pleasanton, CA ABSTRACT With the introduction of the ODS RTF destination, programmers now have the ability to create Word

More information

CHAPTER 1 Overview of SAS/ACCESS Interface to Relational Databases

CHAPTER 1 Overview of SAS/ACCESS Interface to Relational Databases 3 CHAPTER 1 Overview of SAS/ACCESS Interface to Relational Databases About This Document 3 Methods for Accessing Relational Database Data 4 Selecting a SAS/ACCESS Method 4 Methods for Accessing DBMS Tables

More information

Section 1 Spreadsheet Design

Section 1 Spreadsheet Design Section 1 Spreadsheet Design Level 6 Spreadsheet 6N4089 Contents 1. Assess the suitability of using a spreadsheet to achieve a given requirement from a given specification... 1 Advantages of using Spreadsheet

More information

03 The full syllabus. 03 The full syllabus continued. For more information visit www.cimaglobal.com PAPER C03 FUNDAMENTALS OF BUSINESS MATHEMATICS

03 The full syllabus. 03 The full syllabus continued. For more information visit www.cimaglobal.com PAPER C03 FUNDAMENTALS OF BUSINESS MATHEMATICS 0 The full syllabus 0 The full syllabus continued PAPER C0 FUNDAMENTALS OF BUSINESS MATHEMATICS Syllabus overview This paper primarily deals with the tools and techniques to understand the mathematics

More information

Downloading Your Financial Statements to Excel

Downloading Your Financial Statements to Excel Downloading Your Financial Statements to Excel Downloading Data from CU*BASE to PC INTRODUCTION How can I get my favorite financial statement from CU*BASE into my Excel worksheet? How can I get this data

More information

Listings and Patient Summaries in Excel (SAS and Excel, an excellent partnership)

Listings and Patient Summaries in Excel (SAS and Excel, an excellent partnership) Paper TS01 Listings and Patient Summaries in Excel (SAS and Excel, an excellent partnership) Xavier Passera, Detour Solutions Ltd., United Kingdom ABSTRACT The purpose of this paper is to explain how SAS

More information

PharmaSUG 2015 - Paper QT26

PharmaSUG 2015 - Paper QT26 PharmaSUG 2015 - Paper QT26 Keyboard Macros - The most magical tool you may have never heard of - You will never program the same again (It's that amazing!) Steven Black, Agility-Clinical Inc., Carlsbad,

More information

Extending the Metadata Security Audit Reporting Capabilities of the Audit and Performance Measurement Package October 2010

Extending the Metadata Security Audit Reporting Capabilities of the Audit and Performance Measurement Package October 2010 Extending the Metadata Security Audit Reporting Capabilities of the Audit and Performance Measurement Package October 2010 ENTERPRISE EXCELLENCE CENTER Table of Contents 1 Introduction... 1 2 Metadata

More information

SAS PROGRAM EFFICIENCY FOR BEGINNERS. Bruce Gilsen, Federal Reserve Board

SAS PROGRAM EFFICIENCY FOR BEGINNERS. Bruce Gilsen, Federal Reserve Board SAS PROGRAM EFFICIENCY FOR BEGINNERS Bruce Gilsen, Federal Reserve Board INTRODUCTION This paper presents simple efficiency techniques that can benefit inexperienced SAS software users on all platforms.

More information

Effective Use of SQL in SAS Programming

Effective Use of SQL in SAS Programming INTRODUCTION Effective Use of SQL in SAS Programming Yi Zhao Merck & Co. Inc., Upper Gwynedd, Pennsylvania Structured Query Language (SQL) is a data manipulation tool of which many SAS programmers are

More information

Excel Tutorial. Bio 150B Excel Tutorial 1

Excel Tutorial. Bio 150B Excel Tutorial 1 Bio 15B Excel Tutorial 1 Excel Tutorial As part of your laboratory write-ups and reports during this semester you will be required to collect and present data in an appropriate format. To organize and

More information

Experiences in Using Academic Data for BI Dashboard Development

Experiences in Using Academic Data for BI Dashboard Development Paper RIV09 Experiences in Using Academic Data for BI Dashboard Development Evangeline Collado, University of Central Florida; Michelle Parente, University of Central Florida ABSTRACT Business Intelligence

More information

Simulating Chi-Square Test Using Excel

Simulating Chi-Square Test Using Excel Simulating Chi-Square Test Using Excel Leslie Chandrakantha John Jay College of Criminal Justice of CUNY Mathematics and Computer Science Department 524 West 59 th Street, New York, NY 10019 lchandra@jjay.cuny.edu

More information

Step 3: Go to Column C. Use the function AVERAGE to calculate the mean values of n = 5. Column C is the column of the means.

Step 3: Go to Column C. Use the function AVERAGE to calculate the mean values of n = 5. Column C is the column of the means. EXAMPLES - SAMPLING DISTRIBUTION EXCEL INSTRUCTIONS This exercise illustrates the process of the sampling distribution as stated in the Central Limit Theorem. Enter the actual data in Column A in MICROSOFT

More information

Dataset Preparation and Indexing for Data Mining Analysis Using Horizontal Aggregations

Dataset Preparation and Indexing for Data Mining Analysis Using Horizontal Aggregations Dataset Preparation and Indexing for Data Mining Analysis Using Horizontal Aggregations Binomol George, Ambily Balaram Abstract To analyze data efficiently, data mining systems are widely using datasets

More information

Intro to Longitudinal Data: A Grad Student How-To Paper Elisa L. Priest 1,2, Ashley W. Collinsworth 1,3 1

Intro to Longitudinal Data: A Grad Student How-To Paper Elisa L. Priest 1,2, Ashley W. Collinsworth 1,3 1 Intro to Longitudinal Data: A Grad Student How-To Paper Elisa L. Priest 1,2, Ashley W. Collinsworth 1,3 1 Institute for Health Care Research and Improvement, Baylor Health Care System 2 University of North

More information

Paper 70-27 An Introduction to SAS PROC SQL Timothy J Harrington, Venturi Partners Consulting, Waukegan, Illinois

Paper 70-27 An Introduction to SAS PROC SQL Timothy J Harrington, Venturi Partners Consulting, Waukegan, Illinois Paper 70-27 An Introduction to SAS PROC SQL Timothy J Harrington, Venturi Partners Consulting, Waukegan, Illinois Abstract This paper introduces SAS users with at least a basic understanding of SAS data

More information

Learning Objectives. Definition of OLAP Data cubes OLAP operations MDX OLAP servers

Learning Objectives. Definition of OLAP Data cubes OLAP operations MDX OLAP servers OLAP Learning Objectives Definition of OLAP Data cubes OLAP operations MDX OLAP servers 2 What is OLAP? OLAP has two immediate consequences: online part requires the answers of queries to be fast, the

More information

ODS for PRINT, REPORT and TABULATE

ODS for PRINT, REPORT and TABULATE Paper 3-26 ODS for PRINT, REPORT and TABULATE Lauren Haworth, Genentech, Inc., San Francisco ABSTRACT For most procedures in the SAS system, the only way to change the appearance of the output is to change

More information

Utilizing Clinical SAS Report Templates with ODS Sunil Kumar Gupta, Gupta Programming, Simi Valley, CA

Utilizing Clinical SAS Report Templates with ODS Sunil Kumar Gupta, Gupta Programming, Simi Valley, CA Utilizing Clinical SAS Report Templates with ODS Sunil Kumar Gupta, Gupta Programming, Simi Valley, CA ABSTRACT SAS progrannners often have the responsibility of supporting the reporting needs of the Clinical

More information

Database Programming with PL/SQL: Learning Objectives

Database Programming with PL/SQL: Learning Objectives Database Programming with PL/SQL: Learning Objectives This course covers PL/SQL, a procedural language extension to SQL. Through an innovative project-based approach, students learn procedural logic constructs

More information

Spreadsheet software for linear regression analysis

Spreadsheet software for linear regression analysis Spreadsheet software for linear regression analysis Robert Nau Fuqua School of Business, Duke University Copies of these slides together with individual Excel files that demonstrate each program are available

More information

New York State Department of Financial Services

New York State Department of Financial Services New York State Department of Financial Services Instructions for Preparation of Structured Settlement Annuity and Immediate Annuity Files for Year-End Reserve Reporting ann_inst05.doc 10/31/05 Intro -

More information

Linear Algebra and TI 89

Linear Algebra and TI 89 Linear Algebra and TI 89 Abdul Hassen and Jay Schiffman This short manual is a quick guide to the use of TI89 for Linear Algebra. We do this in two sections. In the first section, we will go over the editing

More information

a presentation by Kirk Paul Lafler SAS Consultant, Author, and Trainer E-mail: KirkLafler@cs.com

a presentation by Kirk Paul Lafler SAS Consultant, Author, and Trainer E-mail: KirkLafler@cs.com a presentation by Kirk Paul Lafler SAS Consultant, Author, and Trainer E-mail: KirkLafler@cs.com 1 Copyright Kirk Paul Lafler, 1992-2010. All rights reserved. SAS is the registered trademark of SAS Institute

More information

Graphing Parabolas With Microsoft Excel

Graphing Parabolas With Microsoft Excel Graphing Parabolas With Microsoft Excel Mr. Clausen Algebra 2 California State Standard for Algebra 2 #10.0: Students graph quadratic functions and determine the maxima, minima, and zeros of the function.

More information

An Introduction to SAS/SHARE, By Example

An Introduction to SAS/SHARE, By Example Paper 020-29 An Introduction to SAS/SHARE, By Example Larry Altmayer, U.S. Census Bureau, Washington, DC ABSTRACT SAS/SHARE software is a useful tool for allowing several users to simultaneously access

More information

A terminology model approach for defining and managing statistical metadata

A terminology model approach for defining and managing statistical metadata A terminology model approach for defining and managing statistical metadata Comments to : R. Karge (49) 30-6576 2791 mail reinhard.karge@run-software.com Content 1 Introduction... 4 2 Knowledge presentation...

More information

Excel & Visual Basic for Applications (VBA)

Excel & Visual Basic for Applications (VBA) Excel & Visual Basic for Applications (VBA) The VBA Programming Environment Recording Macros Working with the Visual Basic Editor (VBE) 1 Why get involved with this programming business? If you can't program,

More information