1 SAS PROGRAM EFFICIENCY FOR BEGINNERS Bruce Gilsen, Federal Reserve Board INTRODUCTION This paper presents simple efficiency techniques that can benefit inexperienced SAS software users on all platforms. Efficiency techniques are frequently documented as follows.! Describe an efficiency technique.! Demonstrate the technique with examples. The drawback to this approach is that it can be difficult for SAS software users to determine when to apply the techniques. This paper takes an alternate approach, as follows.! Describe an application or data set.! Present simple efficiency techniques for the application or data set. This approach is designed to make it easier for SAS software users to determine when to apply the techniques to their application or data set. IF, WHERE, DO WHILE, or DO UNTIL statements, use AND operators to test if all of a group of conditions are true. 5. Select observations from a SAS data set with a WHERE statement. 6. In a DATA step, read a SAS data set with many variables to create a new SAS data set. Only a few of the variables are needed in the DATA step or the new SAS data set. 7. Create a new SAS data set containing all observations from two existing SAS data sets. The variables in the two data sets have the same length and type. 8. Process a SAS data set in a DATA step when no output SAS data set is needed. This could occur when a DATA step is used to write reports with PUT statements, examine a data set's attributes, or generate macro variables with the CALL SYMPUT statement. 9. Execute a SAS DATA step in which the denominator of a division operation could be zero. SUMMARY OF PROGRAMMING TASKS This paper presents efficiency techniques for the following programming tasks. 1. Create a SAS data set by reading long records from a flat file with an INPUT statement. Keep selected records based on the values of only a few incoming variables. 2. Create a new SAS data set by reading an existing SAS data set with a SET statement. Keep selected observations based on the values of only a few incoming variables. 3. Select only some observations from a SAS data set. The selected data are used as input to a SAS procedure, but are not otherwise needed. 4. In IF, WHERE, DO WHILE, or DO UNTIL statements, use OR operators or an IN operator to test if at least one of a group of conditions is true. In 1. Read long records from a flat file, keeping only selected records Create a SAS data set by reading long records from a flat file with an INPUT statement. Keep selected records based on the values of only a few incoming variables. First, read only the variables needed to determine if the record should be kept. Test the values of the variables, and only read the rest of the record if necessary. Read 2000 byte records from a flat file. Keep the record (include it in the resulting SAS data set) if NETINC is greater than 100, and otherwise discard it.
2 Method 1, less efficient. data income; infile incdata; 0001 bank 0009 netinc 0017 nextvar 1993 lastvar 8.; if netinc > 100; Method 2, more efficient. data income; infile incdata; 0009 netinc if netinc > 100; 0001 bank 0017 nextvar 1993 lastvar 8.; In method 1, all 2000 bytes are read, and the current record is kept if NETINC is greater than 100. In method 2, the first INPUT statement reads only the variable NETINC. If NETINC is greater than 100, the second INPUT statement reads the rest of the current record, and the current record is kept. The at the end of the first INPUT statement ensures that the current record is available to re-read. If NETINC is not greater than 100, the current record is discarded, and only 8 bytes are read instead of The following statement is an example of a subsetting IF statement. if netinc > 100; Subsetting IF statements test a condition. If the condition is true, the SAS system continues to process the current observation. Otherwise, the SAS system discards the observation and begins processing the next observation. Subsetting IF statements can be distinguished from IF-THEN/ELSE statements because they do not contain a THEN clause. The following statements are equivalent. if netinc > 100; if not (netinc > 100) then delete; if netinc <= 100 then delete; 2. If all 2000 bytes have the same informat, then the following code is equivalent to method 2. Moving the informat to the end of the second INPUT statement makes the code easier to read. data income; infile incdata; 0009 netinc if netinc > 100; 0001 bank 0017 (nextvar. lastvar) (8.) ; 2. Subset a SAS data set to create a new SAS data set Create a new SAS data set by reading an existing SAS data set with a SET statement. Keep selected observations based on the values of only a few incoming variables. Use a WHERE statement instead of a subsetting IF statement. A WHERE statement and a subsetting IF statement both test a condition to determine if the SAS system should process an observation. They differ as follows.! A WHERE statement tests the condition before an observation is read into the SAS program data vector (PDV). If the condition is true, the observation is read into the PDV and processed. Otherwise, the observation is not read into the PDV, and processing continues with the next observation.! A subsetting IF statement tests the condition after an observation is read into the PDV. If the condition is true, the SAS system continues processing the current observation. Otherwise, the observation is discarded, and processing continues with the next observation. Create SAS data set TWO from SAS data set ONE. Keep only observations where GNP is greater than 10 and CON is not equal to zero.
3 Method 1, less efficient. if gnp > 10 and con ne 0 ; Method 2, more efficient. where gnp > 10 and con ne 0 ; In method 1, all observations are read, and observations not meeting the selection criteria are discarded. In method 2, the WHERE statement ensures that observations not meeting the selection criteria are not read. 1. In Release 6.06 of the SAS system for MVS, WHERE statements performed less efficiently than a subsetting IF statement in a few cases. In subsequent releases, WHERE statements can perform less efficiently if they include SAS functions and in a few other cases. 2. Because WHERE statements process data before they are read into the PDV, they cannot include variables that are not part of the incoming data set, such as the following.! Variables you create in the current DATA step.! Variables automatically created by the SAS system in the DATA step, such as FIRST. variables, LAST. variables, and _N_. If data set ONE does not include the variable TAX, the following DATA step generates an error. tax = income / 2 ; where tax > 5 ; To prevent this error, use a subsetting IF statement instead of a WHERE statement, as follows. if tax > 5 ; 3. Prior to Release 6.07 of the SAS system for MVS, Release 6.07 of the SAS system for UNIX, and Release 6.07 of the SAS system for PCs, WHERE statements could not contain SAS functions. 4. To improve efficiency for large data sets, experienced users can combine WHERE statements with indexes. 3. Subset a SAS data set for use in a PROC step Select only some observations from a SAS data set. The selected data are used as input to a SAS procedure, but are not otherwise needed. Use a WHERE statement in the PROC step to select observations, and eliminate the DATA step. Execute PROC PRINT on observations in data set ONE in which GNP is greater than 0. Method 1, less efficient. if gnp > 0 ; proc print data = two; Method 2, less efficient. where gnp > 0 ; proc print data = two; Method 3, more efficient. proc print data = one; where gnp > 0 ; In method 1, the IF statement could be less efficient than a WHERE statement, and an intermediate data set is created. In method 2, an intermediate data set is created. In method 3, no unnecessary data sets are created.
4 1. The following statement is equivalent to the statements in method 3. proc print data=one(where=(gnp>0)); 4. Test multiple conditions with IN, OR, or AND operators In IF, WHERE, DO WHILE, or DO UNTIL statements, use OR operators or an IN operator to test whether at least one of a group of conditions is true. In IF, WHERE, DO WHILE, or DO UNTIL statements, use AND operators to test whether all of a group of conditions are true. Order the conditions to minimize the number of comparisons required by the SAS system, as follows. For OR operators or an IN operator, order the conditions in descending order of likelihood. Put the condition most likely to be true first, the condition second most likely to be true second, and so on. For AND operators, order the conditions in ascending order of likelihood. Put the condition most likely to be false first, the condition second most likely to be false second, and so on. When the SAS system processes an IF, WHERE, DO WHILE, or DO UNTIL statement, it tests the minimum number of conditions needed to determine if the statement is true or false. This technique is known as Boolean short circuiting. Prior information is often available about data being processed. Use this information to improve program efficiency. Order the conditions in IF, WHERE, DO WHILE, and DO UNTIL statements to take advantage of Boolean short circuiting. An IN operator and a series of OR operators are equally efficient, though the IN operator can make programs easier to read. s The examples in this section use SAS data set ONE, which has 50,000 observations. In data set ONE, CTRY is expected to be 'czech' in about 20,000 observations, 'hungary' in about 5000 observations, 'belgium' in about 2000 observations, and 'slovakia' in about 1000 observations. INV is expected to be greater than 900 in about 10,000 observations, and TAX is expected to be greater than 500 in about 1000 observations. 1. Use an IF-THEN statement to set GNP to 10,000 if CTRY is equal to one of four values. The following two examples are efficient because they order the values of CTRY from most likely to least likely. The two IF statements are equally efficient. Using an IN operator. if ctry in('czech','hungary','belgium', 'slovakia') then gnp = ; Using OR operators. if ctry = 'czech' or ctry = 'hungary' or ctry= 'belgium' or ctry= 'slovakia' then gnp = ; 2. Use a WHERE statement to select observations in which CTRY is equal to 'czech' or 'slovakia'. The following two examples are efficient because they order the values of CTRY from most likely to least likely. The two WHERE statements are equally efficient. Using an IN operator. where ctry in('czech','slovakia') ; Using an OR operator. where ctry = 'czech' or ctry = 'slovakia' ; 3. Use a WHERE statement to select observations in which CTRY is equal to 'czech', INV is greater than 900, and TAX is greater than 500. The following example is efficient because it orders the conditions from most likely to be false to least likely to be false. where tax > 500 and inv > 900 and ctry = 'czech' ;
5 1. The following equally efficient statements set GNP to 10,000 if CODE is equal to one of several values. Boolean short circuiting for the IF statement was implemented in Release 6.08 of the SAS system for MVS, Release 6.09 of the SAS system for UNIX, and Release 6.10 of the SAS system for PCs. In earlier versions of SAS software, only the IN operator used Boolean short circuiting, so the first statement was more efficient. if code in(100, 200, 300, 500) then gnp = ; if code = 100 or code = 200 or code = 300 or code = 500 then gnp = ; 2. If prior information about the data is not available, PROC FREQ can provide information about the frequency of the conditions being tested. 5. Subset a SAS data set with WHERE statement operators Select observations from a SAS data set with a WHERE statement. Previous sections of this paper demonstrated the WHERE statement. This section describes WHERE statement operators, several useful operators that can be used only in WHERE statements BETWEEN-AND operator. The BETWEEN-AND operator is used to test whether the value of a variable falls in an inclusive range. This operator provides a convenient syntax but no additional functionality. The following three statements are equivalent. where salary between 4000 and 5000 ; where salary >= 4000 and salary <= 5000 ; where 4000 <= salary <= 5000 ; 5.2. CONTAINS or question mark (?) operator. The CONTAINS operator is used to test whether a character variable contains a specified string. CONTAINS and? are equivalent. The CONTAINS operator is case sensitive; upper and lower case characters are not equivalent. In the following DATA step, NAME is a character variable in data set ONE. Observations in which NAME contains the characters 'JO' are selected. For example, JONES, JOHN, HOJO are selected, but JAO or John are not selected. The CONTAINS operator is somewhat comparable to the SAS functions INDEX and INDEXC. where name contains 'JO' ; The following statements are equivalent. where name? 'JO' ; where name contains 'JO' ; 5.3. IS MISSING or IS NULL operator. The IS MISSING operator is used to test whether a character or numeric variable is missing. IS MISSING and IS NULL are equivalent. 1. Select observations in which the value of the numeric variable GNP is missing. The following three statements are equivalent. where gnp is null ; where gnp is missing ; where gnp =. ; 2. Select observations in which the value of the character variable NAME is not missing. The following three statements are equivalent. where name is not null ; where name is not missing ; where name ne "" ; The IS MISSING operator allows you to test whether a variable is missing without knowing if the variable is numeric or character, preventing the errors generated by the following statements.! This statement generates an error if NAME is a character variable. where name =. ;! This statement generates an error if GNP is a numeric variable. where gnp = "" ;
6 A numeric variable whose value is a special missing value (a-z or an underscore) is recognized as missing by the IS MISSING operator LIKE operator. The LIKE operator is used to test whether a character variable contains a specified pattern. A pattern consists of any combination of valid characters and the following wild card characters.! The percent sign (%) represents any number of characters (0 or more).! The underscore (_) represents any single character. The LIKE operator provides some of the functionality of the UNIX command grep. It is much more powerful than the SAS functions INDEX or INDEXC, which must be used multiple times to search for complex character patterns. The LIKE operator is case sensitive; upper and lower case characters are not equivalent. 1. Select observations in which the variable NAME begins with the character 'J', is followed by 0 or more characters, and ends with the character 'n'. John, Jan, Jn, and Johanson are selected, but Jonas, JAN, and AJohn are not selected. where name like 'J%n' ; 2. Select observations in which the variable NAME begins with the character 'J', is followed by any single character, and ends with the character 'n'. Jan is selected, but John, Jonas, Jn, Johanson, JAN, and AJohn are not selected. where name like 'J_n'; 3. Select observations in which the variable NAME contains the character 'J'. Jan, John, Jn, Johanson, Jonas, JAN, AJohn, TAJ, and J are selected, but j, Elijah, and jan are not selected. where name like '%J%'; 5.5. Sounds-like (=*) operator. The Sounds-like (=*) operator uses the Soundex algorithm to test whether a character variable contains a spelling variation of a word. This operator can be used to perform edit checks on character data by checking for small typing mistakes, and will uncover some but not all of the mistakes. In the following example, the WHERE statement keeps all but the last two input records. data one; input bankname $1-20; length bankname $20; cards; new york bank new york bank. NEW YORK bank new york bnk new yrk bank ne york bank neww york bank nnew york bank ew york bank new york mets ; where bankname=* 'new york bank' ; To identify cases where leading characters are omitted from a character variable, use the Sounds-like operator multiple times in a WHERE statement, removing one additional character in each clause of the WHERE statement, as follows. where bankname=* 'new york bank' or bankname=* 'ew york bank' or bankname=* 'w york bank' or bankname=* 'york bank' ; 5.6. SAME-AND operator. The SAME-AND operator is used to add more clauses to a previous WHERE statement without reentering the original clauses. The SAME-AND operator need not be in the same PROC or DATA step as the previous WHERE statement. The SAME-AND operator reduces keystrokes during an interactive session, but can make programs harder to understand. It is not recommended for large programs or applications that have code stored in multiple files.
7 . The WHERE statement for data set FOUR selects observations in which GNP is greater than 100, INV is less than 10, and CON is less than 20. where gnp > 100 and inv < 10 ; data four ; set three ; where same-and con < 20 ; The following statement is equivalent to the WHERE statement for data set FOUR. where gnp > 100 and inv < 10 and con< 20 ; 1. See pages in the "SAS Language Reference, Version 6, First Edition," for more information about WHERE statement operators. 6. Read a SAS data set, but need only a few of many variables In a DATA step, read a SAS data set with many variables to create a new SAS data set. Only a few of the variables are needed in the DATA step or the new SAS data set. Use the KEEP= or DROP= option in the SET statement to prevent unnecessary variables from being read into the SAS program data vector (PDV) or the new data set. Create SAS data set TWO from SAS data set ONE. Data set ONE contains variables A and X1-X1000. The only variables needed in data set TWO are A and B. Method 1, less efficient. b = a*1000 ; Method 2, less efficient. keep a b ; b = a*1000 ; Method 3, incorrect result. data two (keep = b) ; set one (keep = a) ; b = a*1000 ; Method 4, more efficient. set one (keep = a) ; b = a*1000 ; In method 1, 1001 variables are read into the PDV from data set ONE, and 1002 variables are written to data set TWO of the variables are not needed. In method 2, 1001 variables are read into the PDV from data set ONE, and two variables are written to data set TWO. The KEEP statement specifies that only A and B are written to data set TWO. Variables X1-X1000 are not written to data set TWO, but are still read unnecessarily into the PDV from data set ONE. In method 3, one variable, A, is read into the PDV from data set ONE, and one variable, B, is written to data set TWO. In method 4, one variable, A, is read into the PDV from data set ONE, and two variables, A and B, are written to data set TWO. This is the most efficient method. 1. If variables X1-X1000 are needed during the execution of DATA step TWO (for example, if they are used to calculate B), but are not needed in the output data set, then use method If A is the only variable needed during the execution of DATA step TWO, and B is the only variable needed in the output data set, then use method In method 3, the following statement is equivalent to (KEEP = A);. (drop = x1-x1000);
8 4. In method 2, either of the following statements are equivalent to KEEP A B ;. drop x1-x1000; data two(keep = a b); 7. Concatenate (append) one SAS data set to another Create a new SAS data set containing all observations from two existing SAS data sets. The variables in the two data sets have the same length and type. Use PROC APPEND instead of a SET statement. The SET statement reads both data sets. PROC APPEND reads the data set to be concatenated, but does not read the other data set, known as the BASE data set. Concatenate SAS data set TWO to SAS data set ONE. Method 1, less efficient. data one; set one two ; Method 2, more efficient. proc append base = one data = two ; In method 1, the SAS system reads all observations in both data sets. In method 2, the SAS system reads only the observations in data set TWO. 1. Since PROC APPEND reads only the second data set, set BASE= to the larger data set if, as in the following example, the order of the data sets does not matter. proc append base = one data = two ; proc sort data = one ; by var1 ; run ; variables or variables with the same name but different lengths or types (character versus numeric). 8. Execute a DATA step, but no output SAS data set is needed Process a SAS data set in a DATA step when no output SAS data set is needed. This could occur when a DATA step is used to write reports with PUT statements, examine a data set's attributes, or generate macro variables with the CALL SYMPUT statement. Name the data set the reserved name "_NULL_" to avoid creating an output SAS data set. Use PUT statements to write information from data set ONE. No output data set is needed. Method 1, less efficient. var2 ; more PUT and printing statements Method 2, less efficient. data one; var2 ; more PUT and printing statements Method 3, more efficient. data _null_; var2 ; more PUT and printing statements In method 1, the SAS system creates an unnecessary data set, TWO. In method 2, data set ONE is unnecessarily recreated. In method 3, an output data set is not created. 2. The documentation for PROC APPEND in the "SAS Procedures Guide, Version 6, Third Edition," describes how to concatenate data sets that contain different
9 1. Using the statement DATA; instead of DATA _NULL_; causes the creation of an output data set called DATAn, where n is 1,2,3,... (the first such data set is called DATA1, the second is called DATA2, and so on). 9. Execute a SAS DATA step in which the denominator of a division operation could be zero Execute a SAS DATA step in which the denominator of a division operation could be zero. Test the value of the denominator, and divide only if the denominator is not zero. Programs execute substantially faster if you prevent the SAS system from attempting to divide by zero. Execute a simple DATA step. Set the variable QUOTIENT to the value NUM/DENOM. Input data set for this example. data one; input num denom ; cards; ; Method 1, less efficient. quotient = num/denom; Method 2, more efficient. if denom ne 0 then quotient = num/denom; else quotient =.; Method 1 is less efficient because the SAS system attempts to divide by zero. Since attempting to divide by zero results in a missing value, both methods generate the same output data set, TWO, as follows. OBS NUM DENOM QUOTIENT When the SAS system attempts to divide by zero, a note similar to the following is printed to the SAS log. NOTE: Division by zero detected at line 329 column 11. NUM=10 DENOM=0 QUOTIENT=. _ERROR_=1 _N_=3 NOTE: Division by zero detected at line 329 column 11. NUM=9 DENOM=0 QUOTIENT=. _ERROR_=1 _N_=5 NOTE: Mathematical operations could not be performed at the following places. The results of the operations have been set to missing values. Each place is given by: (Number of times) at (Line):(Column). 2 at 329:11 2. An example of a tremendous performance improvement from preventing the SAS system from attempting to divide by zero is as follows. In Release 6.06 of the SAS system for MVS at the Federal Reserve Board, a program processed a SAS data set with 200,000 observations and 16 variables, performing 8 divisions per observation, of which approximately half were attempts to divide by zero. Recoding the program to test for division by zero reduced the CPU time from 4 minutes to 7 seconds. CONCLUSION This paper presented simple efficiency techniques that can benefit inexperienced SAS software users on all platforms. Each section of the paper began with a description of an application or data set, followed by efficiency techniques for that application or data set. The author of this paper hopes that presenting the techniques this way will make it easier for users to determine when to apply these techniques. For more information, contact Bruce Gilsen Federal Reserve Board, Mail Stop 157 Washington, DC
10 REFERENCES Polzin, Jeffrey A, (1994), "DATA Step Efficiency and Performance," in the Proceedings of the Nineteenth Annual SAS Users Group International Conference, 19, SAS Institute Inc. (1990), "SAS Language Reference, Version 6, First Edition," Cary, NC: SAS Institute Inc. SAS Institute Inc. (1990), "SAS Procedures Guide, Version 6, Third Edition," Cary, NC: SAS Institute Inc. SAS Institute Inc. (1990), "SAS Programming Tips: A Guide to Efficient SAS Processing," Cary, NC: SAS Institute Inc. ACKNOWLEDGMENTS The following people contributed extensively to the development of this paper: Donna Hill, Julia Meredith, and Steve Schacht at the Federal Reserve Board, Mike Bradicich at Vistech, and Peter Sorock. Their support is greatly appreciated. TRADEMARK INFORMATION SAS is a registered trademark or trademark of SAS Institute Inc. in the USA and other countries. indicates USA registration. Other brand and product names are registered trademarks or trademarks of their respective companies.
More Tales from the Help Desk: Solutions for Simple SAS Mistakes Bruce Gilsen, Federal Reserve Board INTRODUCTION In 20 years as a SAS consultant at the Federal Reserve Board, I have seen SAS users make
Paper 074-29 Tales from the Help Desk: Solutions for Simple SAS Mistakes Bruce Gilsen, Federal Reserve Board INTRODUCTION In 19 years as a SAS consultant at the Federal Reserve Board, I have seen SAS users
Tales from the Help Desk 3: More Solutions for Simple SAS Mistakes Bruce Gilsen, Federal Reserve Board INTRODUCTION In 20 years as a SAS consultant at the Federal Reserve Board, I have seen SAS users make
Paper 109-25 Merges and Joins Timothy J Harrington, Trilogy Consulting Corporation Abstract This paper discusses methods of joining SAS data sets. The different methods and the reasons for choosing a particular
SAS ARRAYS: A BASIC TUTORIAL Bruce Gilsen, Federal Reserve Board INTRODUCTION SAS arrays provide a simple, convenient way to process a group of variables in a SAS DATA step. As a SAS software consultant
The SET Statement and Beyond: Uses and Abuses of the SET Statement S. David Riba, JADE Tech, Inc., Clearwater, FL ABSTRACT The SET statement is one of the most frequently used statements in the SAS System.
Simple Rules to Remember When Working with Indexes Kirk Paul Lafler, Software Intelligence Corporation, Spring Valley, California Abstract SAS users are always interested in learning techniques related
A PROC SQL Primer Matt Taylor, Carolina Analytical Consulting, LLC, Charlotte, NC ABSTRACT Most SAS programmers utilize the power of the DATA step to manipulate their datasets. However, unless they pull
Paper 1440-2014 What You re Missing About Missing Values Christopher J. Bost, MDRC, New York, NY ABSTRACT Do you know everything you need to know about missing values? Do you know how to assign a missing
THE POWER OF PROC FORMAT Jonas V. Bilenas, Chase Manhattan Bank, New York, NY ABSTRACT The FORMAT procedure in SAS is a very powerful and productive tool. Yet many beginning programmers rarely make use
Taming the PROC TRANSPOSE Matt Taylor, Carolina Analytical Consulting, LLC ABSTRACT The PROC TRANSPOSE is often misunderstood and seldom used. SAS users are unsure of the results it will give and curious
How to Reduce the Disk Space Required by a SAS Data Set Selvaratnam Sridharma, U.S. Census Bureau, Washington, DC ABSTRACT SAS datasets can be large and disk space can often be at a premium. In this paper,
Paper 2917 Creating Variables: Traps and Pitfalls Olena Galligan, Clinops LLC, San Francisco, CA ABSTRACT Creation of variables is one of the most common SAS programming tasks. However, sometimes it produces
Programming Idioms Using the SET Statement Jack E. Fuller, Trilogy Consulting Corporation, Kalamazoo, MI ABSTRACT While virtually every programmer of base SAS uses the SET statement, surprisingly few programmers
Paper AD39 Top Ten SAS Performance Tuning Techniques Kirk Paul Lafler, Software Intelligence Corporation, Spring Valley, California Abstract The Base-SAS software provides users with many choices for accessing,
Paper 70-27 An Introduction to SAS PROC SQL Timothy J Harrington, Venturi Partners Consulting, Waukegan, Illinois Abstract This paper introduces SAS users with at least a basic understanding of SAS data
CC13 Transferring vs. Transporting Between SAS Operating Environments Mimi Lou, Medical College of Georgia, Augusta, GA ABSTRACT Prior to SAS version 8, permanent SAS data sets cannot be moved directly
Preparing your data for analysis using SAS Landon Sego 24 April 2003 Department of Statistics UW-Madison Assumptions That you have used SAS at least a few times. It doesn t matter whether you run SAS in
Tips, Tricks, and Techniques from the Experts Presented by Katie Ronk 2997 Yarmouth Greenway Drive, Madison, WI 53711 Phone: (608) 278-9964 Web: www.sys-seminar.com Systems Seminar Consultants, Inc www.sys-seminar.com
A Method for Cleaning Clinical Trial Analysis Data Sets Carol R. Vaughn, Bridgewater Crossings, NJ ABSTRACT This paper presents a method for using SAS software to search SAS programs in selected directories
3 CHAPTER 1 Overview of SAS/ACCESS Interface to Relational Databases About This Document 3 Methods for Accessing Relational Database Data 4 Selecting a SAS/ACCESS Method 4 Methods for Accessing DBMS Tables
PharmaSUG 2012 Paper PO11 SAS UNIX-Space Analyzer A handy tool for UNIX SAS Administrators Airaha Chelvakkanthan Manickam, Cognizant Technology Solutions, Teaneck, NJ ABSTRACT: In the fast growing area
Handling Missing Values in the SQL Procedure Danbo Yi, Abt Associates Inc., Cambridge, MA Lei Zhang, Domain Solutions Corp., Cambridge, MA ABSTRACT PROC SQL as a powerful database management tool provides
SAS Programming Tips, Tricks, and Techniques A presentation by Kirk Paul Lafler Copyright 2001-2012 by Kirk Paul Lafler, Software Intelligence Corporation All rights reserved. SAS is the registered trademark
Tips for Constructing a Data Warehouse Part 2 Curtis A. Smith, Defense Contract Audit Agency, La Mirada, CA ABSTRACT Ah, yes, data warehousing. The subject of much discussion and excitement. Within the
SESUG 2012 Paper CT-11 An Introduction to Criteria-based Deduplication of Records Elizabeth Heath RTI International, RTP, NC Priya Suresh RTI International, RTP, NC ABSTRACT When survey respondents are
Paper 020-29 An Introduction to SAS/SHARE, By Example Larry Altmayer, U.S. Census Bureau, Washington, DC ABSTRACT SAS/SHARE software is a useful tool for allowing several users to simultaneously access
Paper 23-27 Programming Tricks For Reducing Storage And Work Space Curtis A. Smith, Defense Contract Audit Agency, La Mirada, CA. ABSTRACT Have you ever had trouble getting a SAS job to complete, although
Paper AA600 SAS and UNIX: Techniques for Developing Your Toolbox Joe Novotny, GlaxoSmithKline Pharmaceuticals, Inc., Collegeville, PA ABSTRACT How many times have you had to write and run short SAS programs
New Tricks for an Old Tool: Using Custom Formats for Data Validation and Program Efficiency S. David Riba, JADE Tech, Inc., Clearwater, FL ABSTRACT PROC FORMAT is one of the old standards among SAS Procedures,
Swap the DATA Step for PROC TRANSPOSE Katie Joseph, U.S. Office of Personnel Management, Washington, DC ABSTRACT Anyone who has spent an unspeakable amount of time using arrays and do loops in a DATA step
Paper 073-29 Using Edit-Distance Functions to Identify Similar E-Mail Addresses Howard Schreier, U.S. Dept. of Commerce, Washington DC ABSTRACT Version 9 of SAS software has added functions which can efficiently
From The Little SAS Book, Fifth Edition. Full book available for purchase here. Acknowledgments ix Introducing SAS Software About This Book xi What s New xiv x Chapter 1 Getting Started Using SAS Software
Source Code Revision Control Systems and Auto-Documenting Headers for SAS Programs on a UNIX or PC Multiuser Environment Terek Peterson, Alliance Consulting Group, Philadelphia, PA Max Cherny, Alliance
Paper 8-25 Integrity Constraints and Audit Trails Working Together Gary Franklin, SAS Institute Inc., Austin, TX Art Jensen, SAS Institute Inc., Englewood, CO ABSTRACT New features in Version 7 and Version
Paper FF-007 Labels, Labels, and More Labels Stephanie R. Thompson, Rochester Institute of Technology, Rochester, NY ABSTRACT SAS datasets include labels as optional variable attributes in the descriptor
Paper 11-27 Table Lookup: Techniques Beyond the Obvious Nancy Croonen, CC Training Services, Belgium ir. Henri Theuwissen, SOLID Partners, Belgium ABSTRACT Table lookup operations are often the most time
Chapter 1 Overview of the SQL Procedure 1.1 Features of PROC SQL...1-3 1.2 Selecting Columns and Rows...1-6 1.3 Presenting and Summarizing Data...1-17 1.4 Joining Tables...1-27 1-2 Chapter 1 Overview of
Paper 243-29 SAS Macro Programming for Beginners Susan J. Slaughter, Avocet Solutions, Davis, CA Lora D. Delwiche, Delwiche Consulting, Winters, CA ABSTRACT Macro programming is generally considered an
Emailing Automated Notification of Errors in a Batch SAS Program Julie Kilburn, City of Hope, Duarte, CA Rebecca Ottesen, City of Hope, Duarte, CA ABSTRACT With multiple programmers contributing to a batch
CC14 Dup, Dedup, DUPOUT - New in PROC SORT Heidi Markovitz, Federal Reserve Board of Governors, Washington, DC ABSTRACT This paper presents the new DUPOUT option of PROC SORT and discusses the art of identifying
Managing Tables in Microsoft SQL Server using SAS Jason Chen, Kaiser Permanente, San Diego, CA Jon Javines, Kaiser Permanente, San Diego, CA Alan L Schepps, M.S., Kaiser Permanente, San Diego, CA Yuexin
Oracle For Beginners Page : 1 3.GETTING STARTED WITH ORACLE8i Creating a table Datatypes Displaying table definition using DESCRIBE Inserting rows into a table Selecting rows from a table Editing SQL buffer
Search and Replace in SAS Data Sets thru GUI Edmond Cheng, Bureau of Labor Statistics, Washington, DC ABSTRACT In managing data with SAS /BASE software, performing a search and replace is not a straight
How and When to Use WHERE J. Meimei Ma, Quintiles, Research Triangle Park, NC Sandra Schlotzhauer, Schlotzhauer Consulting, Chapel Hill, NC INTRODUCTION This tutorial explores the various types of WHERE
For Windows Updated: August 2012 Table of Contents Section 1: Overview... 3 1.1 Introduction to SPSS Tutorials... 3 1.2 Introduction to SPSS... 3 1.3 Overview of SPSS for Windows... 3 Section 2: Entering
EXST SAS Lab Lab #4: Data input and dataset modifications Objectives 1. Import an EXCEL dataset. 2. Infile an external dataset (CSV file) 3. Concatenate two datasets into one 4. The PLOT statement will
Paper SC-008 An Approach to Creating Archives That Minimizes Storage Requirements Ruben Chiflikyan, RTI International, Research Triangle Park, NC Mila Chiflikyan, RTI International, Research Triangle Park,
Chapter 6 Working with SAS Data Sets Chapter Table of Contents OVERVIEW... 79 OPENING A SAS DATA SET... 80 MAKING A SAS DATA SET CURRENT... 81 DISPLAYING SAS DATA SET INFORMATION... 82 REFERRING TO A SAS
Magic Tutorial #S-1: The scheme command-line interpreter Rajit Manohar Department of Computer Science California Institute of Technology Pasadena, CA 91125 This tutorial corresponds to Magic version 7.
UNIX Comes to the Rescue: A Comparison between UNIX SAS and PC SAS Chii-Dean Lin, San Diego State University, San Diego, CA Ming Ji, San Diego State University, San Diego, CA ABSTRACT Running SAS under
ABSTRACT PharmaSUG 2012 - Paper TF03 Simplifying Effective Data Transformation Via PROC TRANSPOSE Arthur X. Li, City of Hope Comprehensive Cancer Center, Duarte, CA You can store data with repeated measures
THE FUNDAMENTALS OF DATA STEP PROGRAMMING I: THE ESSENCE OF DATA STEP PROGRAMMING Arthur Li, City of Hope Comprehensive Cancer Center, Duarte, CA 1.1. COMPILATION AND EXECUTION PHASES A DATA step is processed
SQL Case Expression: An Alternative to DATA Step IF/THEN Statements John Q. Zhang LCS Industries, Inc. Clifton NJ ABSTRACT This paper will show that case-expression utilized in PROC SQL can be as efficient
Subsetting Observations from Large SAS Data Sets Christopher J. Bost, MDRC, New York, NY ABSTRACT This paper reviews four techniques to subset observations from large SAS data sets: MERGE, PROC SQL, user-defined
Restricting and Sorting Data Objectives After completing this lesson, you should be able to do the following: Limit the rows that are retrieved by a query Sort the rows that are retrieved by a query Use
Beyond SQL Essentials: The Need for Speed When Accessing SAS Data Transcript Beyond SQL Essentials: The Need for Speed When Accessing SAS Data Transcript was developed by Mark Jordan and Linda Mitterling.
Optimizing System Performance by Monitoring UNIX Server with SAS Sam Mao, Quintiles, Inc., Kansas City, MO Jay Zhou, Quintiles, Inc., Kansas City, MO ABSTRACT To optimize system performance and maximize
Paper 197 Project Request and Tracking Using SAS/IntrNet Software Steven Beakley, LabOne, Inc., Lenexa, Kansas ABSTRACT The following paper describes a project request and tracking system that has been
PharmaSUG2013 Paper AD11 Let SAS Set Up and Track Your Project Tom Santopoli, Octagon, now part of Accenture Wayne Zhong, Octagon, now part of Accenture ABSTRACT When managing the programming activities
PharmaSUG 2015 - Paper QT06 PROC SQL for SQL Die-hards Jessica Bennett, Advance America, Spartanburg, SC Barbara Ross, Flexshopper LLC, Boca Raton, FL ABSTRACT Inspired by Christianna William s paper on
PROC LOGISTIC: Traps for the unwary Peter L. Flom, Independent statistical consultant, New York, NY ABSTRACT Keywords: Logistic. INTRODUCTION This paper covers some gotchas in SAS R PROC LOGISTIC. A gotcha
SESUG 2012 Paper CT-28 Encoding the Password A low maintenance way to secure your data access Leanne Tang, National Agriculture Statistics Services USDA, Washington DC ABSTRACT When users access data in
AN INTRODUCTION TO MACRO VARIABLES AND MACRO PROGRAMS Mike S. Zdeb, New York State Department of Health INTRODUCTION There are a number of SAS tools that you may never have to use. Why? The main reason
The Essentials of Finding the Distinct, Unique, and Duplicate Values in Your Data Carter Sevick MS, DoD Center for Deployment Health Research, San Diego, CA ABSTRACT Whether by design or by error there
Data Cleaning 101 Ronald Cody, Ed.D., Robert Wood Johnson Medical School, Piscataway, NJ INTRODUCTION One of the first and most important steps in any data processing task is to verify that your data values
SAS Comments How Straightforward Are They? Jacksen Lou, Merck & Co.,, Blue Bell, PA 19422 ABSTRACT SAS comment statements typically use conventional symbols, *, %*, /* */. Most programmers regard SAS commenting
Paper 55-28 Describing and Retrieving Data with SAS formats David Johnson, DKV-J Consultancies, Holmeswood, England ABSTRACT For many business users, the format procedure might be their favourite SAS procedure.
Checking and Tracking SAS Programs Using SAS Software Keith M. Gregg, Ph.D., SCIREX Corporation, Chicago, IL Yefim Gershteyn, Ph.D., SCIREX Corporation, Chicago, IL ABSTRACT Various checks on consistency
Paper TU_09 Proc SQL Tips and Techniques - How to get the most out of your queries Kevin McGowan, Constella Group, Durham, NC Brian Spruell, Constella Group, Durham, NC Abstract: Proc SQL is a powerful
177 CHAPTER 8 Enhancements for SAS Users under Windows NT Overview 177 NT Event Log 177 Sending Messages to the NT Event Log Using a User-Written Function 178 Examples of Using the User-Written Function
Microsoft Access 3: Understanding and Creating Queries In Access Level 2, we learned how to perform basic data retrievals by using Search & Replace functions and Sort & Filter functions. For more complex
SAS 9.4 SPD Engine: Storing Data in the Hadoop Distributed File System Third Edition SAS Documentation The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2015. SAS 9.4
You have got SASMAIL! Rajbir Chadha, Cognizant Technology Solutions, Wilmington, DE ABSTRACT As SAS software programs become complex, processing times increase. Sitting in front of the computer, waiting
ABSTRACT Paper 10240-2016 Instant Interactive SAS Log Window Analyzer Palanisamy Mohan, ICON Clinical Research India Pvt Ltd Amarnath Vijayarangan, Emmes Services Pvt Ltd, India An interactive SAS environment
Mike Zdeb (402-6479, email@example.com) #121 (11) COMBINING SAS DATA SETS There are a number of ways to combine SAS data sets: # concatenate - stack data sets (place one after another) # interleave - stack
Guide to Operating SAS IT Resource Management 3.5 without a Middle Tier SAS Documentation The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2014. Guide to Operating SAS
Choosing the Best Method to Create an Excel Report Romain Miralles, Clinovo, Sunnyvale, CA ABSTRACT PROC EXPORT, LIBNAME, DDE or excelxp tagset? Many techniques exist to create an excel file using SAS.
Paper BB-07-2014 Using Arrays to Quickly Perform Fuzzy Merge Look-ups: Case Studies in Efficiency Arthur L. Carpenter California Occidental Consultants, Anchorage, AK ABSTRACT Merging two data sets when
Your consent to our cookies if you continue to use this website.