Using the New Excel Import Wizard in JMP 11 for Windows JMP Discovery Conference 2013 Brian Corcoran SAS Institute

Similar documents
Accessing Your Database with JMP 10 JMP Discovery Conference 2012 Brian Corcoran SAS Institute

Microsoft Excel 2013: Macro to apply Custom Margins, Titles, Gridlines, Autofit Width & Add Macro to Quick Access Toolbar & How to Delete a Macro.

Microsoft Excel 2013: Using a Data Entry Form

Advanced Excel 10/20/2011 1

Learn how to create web enabled (browser) forms in InfoPath 2013 and publish them in SharePoint InfoPath 2013 Web Enabled (Browser) forms

Mail Merge. Course Description. Objectives: Design a Main Document. Design a Data Source. Directories. Merge the Main Document with the Data

Overview of sharing and collaborating on Excel data

Microsoft Office. Mail Merge in Microsoft Word

About PivotTable reports

Mail Merge Creating Mailing Labels 3/23/2011

Microsoft Excel: Pivot Tables

An Introduction to Excel Pivot Tables

Microsoft Office Access 2007 Basics

Pivot Tables & Pivot Charts

Using Excel for Business Analysis: A Guide to Financial Modelling Fundamentals

Introduction to Microsoft Access 2003

Sample- for evaluation purposes only! Advanced Excel. TeachUcomp, Inc. A Presentation of TeachUcomp Incorporated. Copyright TeachUcomp, Inc.

Excel 2007 A Beginners Guide

Microsoft Excel Understanding the Basics

SECTION 5: Finalizing Your Workbook

Designed by Jason Wagner, Course Web Programmer, Office of e-learning ZPRELIMINARY INFORMATION... 1 LOADING THE INITIAL REPORT... 1 OUR EXAMPLE...

Mail Merge Microsoft Word and Excel Queries Scott Kern Senior Consultant

Migrating to Excel 2010 from Excel Excel - Microsoft Office 1 of 1

Excel 2003 A Beginners Guide

Microsoft Excel 2010 Part 3: Advanced Excel

Word 2010: Mail Merge to with Attachments

Working together with Word, Excel and PowerPoint 2013

Excel Intermediate Session 2: Charts and Tables

Analyzing Excel Data Using Pivot Tables

Create a PivotTable or PivotChart report

ECDL. European Computer Driving Licence. Spreadsheet Software BCS ITQ Level 2. Syllabus Version 5.0

Excel Using Pivot Tables

ACCESS Importing and Exporting Data Files. Information Technology. MS Access 2007 Users Guide. IT Training & Development (818)

Excel macros made easy

Microsoft Office 2010

Excel 2003 PivotTables Summarizing, Analyzing, and Presenting Your Data

Avery Wizard: Using the wizard with Microsoft Word. This is a simple step-by-step guide showing how to use the Avery wizard in word

Explore commands on the ribbon Each ribbon tab has groups, and each group has a set of related commands.

Financial Reporting Using Microsoft Excel. Presented By: Jim Lee

Excel Using Pivot Tables

The first thing to do is choose if you are creating a mail merge for printing or an merge for distribution over .

Excel Pivot Tables. Blue Pecan Computer Training Ltd - Onsite Training Provider :: :: info@bluepecan.co.

Using Pivot Tables in Microsoft Excel 2003

Microsoft Excel 2013 Splitting Windows and Fixing Panes (Level 3)

Microsoft Word 2010 Mail Merge (Level 3)

Introduction to Pivot Tables in Excel 2003

Creating an Excel Spreadsheet for Mail Merge. Excel Spreadsheet Mail Merge. 1 of 9 Design & Print Offline: Mail Merge

Intro to Mail Merge. Contents: David Diskin for the University of the Pacific Center for Professional and Continuing Education. Word Mail Merge Wizard

EXCEL PIVOT TABLE David Geffen School of Medicine, UCLA Dean s Office Oct 2002

NEXT-ANALYTICS lets you specify more than one profile View in a single query.

Toad for Data Analysts, Tips n Tricks

bbc Creating a Purchase Order Form Adobe LiveCycle Designer ES2 November 2009 Version 9

Jump Start: Aspen Simulation Workbook in Aspen HYSYS V8

Microsoft Excel 2007 Consolidate Data & Analyze with Pivot Table Windows XP

ITS Training Class Charts and PivotTables Using Excel 2007

About Entering Donations in QuickBooks

An Introduction to Excel s Pivot Table

MICROSOFT WORD: MAIL MERGE

Excel Database Management Microsoft Excel 2003

Computer Training Centre University College Cork. Excel 2013 Level 1

Visual Studio.NET Database Projects

Personal Geodatabase 101

Produced by Flinders University Centre for Educational ICT. PivotTables Excel 2010

Creating tables of contents and figures in Word 2013

BlueJ Teamwork Tutorial

Six Steps to Completing a Mail-Merge

In This Issue: Excel Sorting with Text and Numbers

Working with Excel in Origin

NAVIGATION TIPS. Special Tabs

WHAT S NEW IN MS EXCEL 2013

To add a data form to excel - you need to have the insert form table active - to make it active and add it to excel do the following:

CHAPTER 6: ANALYZE MICROSOFT DYNAMICS NAV 5.0 DATA IN MICROSOFT EXCEL

Tips and Tricks for Printing an Excel Spreadsheet

Create Charts in Excel

What is a Mail Merge?

SPSS: Getting Started. For Windows

Microsoft Office 2007

Excel for Data Cleaning and Management

What s new in Excel 2013

Access 2007 Creating Forms Table of Contents

Training Needs Analysis

Microsoft Excel 2007 and Tools for Statistical Analysis David R. Anderson, Dennis J. Sweeney, and Thomas A. Williams

Microsoft Access 2010 handout

New Orleans 2007 Workshop Tips For Using Microsoft Excel to Analyze EMSC Data and Generate Reports Pivot Tables and Other Goodies

FileMaker Pro and Microsoft Office Integration

Release 2.1 of SAS Add-In for Microsoft Office Bringing Microsoft PowerPoint into the Mix ABSTRACT INTRODUCTION Data Access

Mail Merges, Labels and Message Merges in Word 2007 Contents

SENDING S WITH MAIL MERGE

Chapter 4 Displaying and Describing Categorical Data

Coupling Microsoft Excel with NI Requirements Gateway

Microsoft Access Basics

Microsoft Excel 2010 Training

How To Create A Report In Excel

Detail Report Excel Guide for High Schools

Moving Data Between Access and Excel

Excel 2007 Basic knowledge

TABLE OF CONTENTS I. MAIL MERGING...1 II. MAIL MERGE WIZARD...1 III. MAIL MERGE TOOLBAR...4 PRINTING LABELS AND ENVELOPES...4

Microsoft Excel 2010 Pivot Tables

Sample- for evaluation purposes only. Advanced Crystal Reports. TeachUcomp, Inc.

EXAMPLE WITH NO NAME EXAMPLE WITH A NAME

Transcription:

Using the New Excel Import Wizard in JMP 11 for Windows JMP Discovery Conference 2013 Brian Corcoran SAS Institute There is a large body of legacy data in Excel 97 (xls) and Excel 2003 (xlsx) formats. The layout of this data often provides a challenge when importing it into JMP, and can lengthen the process of creating an analysis. The new Excel Import Wizard in JMP 11 for Windows will hopefully greatly simplify the import process and reduce the time to get to the exploratory phase with your data. This paper examines the features and operation of the Excel Import Wizard. The Excel Import Wizard is for the Windows version of JMP only. There is underlying technology to read the Excel files that depends on Microsoft s.net libraries being present on the machine. This makes it incompatible with the Mac. The JMP team will continue to look for a Mac equivalent for future editions of the product. The Excel Import Wizard is the default method of opening any Excel file in JMP 11. When an Excel file is selected in the File Open dialog, or off of the JMP Home Window Recent Files list, the Wizard will start. This behavior can be customized in the General Preferences pane. Basic Operation The initial default view for the Wizard is shown below. There is a preview pane, a worksheet settings pane that contains settings that can be customized for each Worksheet in the Workbook that is being opened, and then some general settings that apply to all Worksheets. The default settings are that column headers start on the first line that is found with content, and that the data starts on the second line. If there are several Worksheets within the Workbook, the same settings will be applied to all the Worksheets. The example below has one sheet called Worldprod.

We can see by inspecting the preview that the first line isn t really desirable for column headers, and that it would be better to select the second line. Also, the data starts three lines down from the headers. At this point, we can just quickly click the spin buttons for Column headers start and Data starts to the values of two and five and the preview looks much better (below). These are numbers that you would see in Excel for the Worksheet, not the preview. The preview will refresh every time a setting is changed. At this point, we can press Import to open the table in JMP. The Excel Wizard will remember the dialog settings across invocations of the dialog, and even the JMP session, if the number of Worksheets in the Workbook is the same between invocations. If the number of Worksheets is different, the Wizard assumes that you are using a different workbook and that you need to specify new settings for that Workbook. If, at any time, you wish to return to the default settings of one for Column headers, two for Data starts along with all of the other dialog settings, simply press the Restore Default Settings button. Multiple Worksheets within a Workbook The Wizard also allows for individual Worksheet settings to be specified in Workbooks that contain multiple sheets. The settings will be retained across invocations if the number of Worksheets remains the same. To specify settings for each Worksheet, first uncheck the Use for all worksheets checkbox. At this point, the settings for each Worksheet will be the current Wizard settings. An example is shown next, where there are three Worksheets in the Workbook. To preview any Worksheet in the Workbook, simply select the Worksheet name in the list at the top right of the dialog.

In this example, Team Results needs to have the column headers start on line three. The Data starts on row edit box will increment automatically when you press the spin button for Column Headers start on row. If the start of the data is more than one line down from the start of the headers, you can use the Data starts edit box to alter that setting, like we did with the Potato Production example. For this example, there is also a Worksheet called Planets that requires a change. When you change the settings for the Worksheet, a checkbox in the Worksheets list will be selected to indicate that you are using settings that are different from the settings that were in use before you unchecked Use for all worksheets.

Source Script Whenever you open a Worksheet, a table script called Source is creating in the resulting JMP data table. This script allows you to rerun the import operation with the same settings that were specified in the Wizard, but without invoking the Wizard again. So, if the data has changed and you would like a fresh copy, you can just run the table script and open up a new copy of the data within JMP. The picture below shows all of the settings that are retained in the script. Additional Options There is a second pane to the Wizard that is reached by pressing the Next button. It contains options that are less commonly used. Replicate data in spanned rows takes data from Excel merge cells, say Cells A1..A3, and makes sure that each row value within JMP contains the data in the merge cell. In JMP, that might mean Column 1, rows one to three.

The Suppress hidden options will not import rows or columns if they are hidden within the Excel table. For this to work, the entire row or column must be hidden within the Excel Worksheet. If individual cells are hidden, the results can vary. Most of the time the individual cell will be shown. The Data ends edit fields allow you to specify a termination row or column for your data. This is useful when you have extraneous information at the end of the file, or you want to truncate the number of observations that you import. Treat multiple column header lines as hierarchies will be described later. When you are finished with the settings on this page, you can either select Import to open the file immediately, or Back to return to the first page settings. Concatenating Worksheets It is common to have Workbooks that contain multiple Worksheets with data that is of a similar format. Financial data with each Worksheet containing the data for a particular month is a frequently seen example. In prior versions of JMP, you would need to open all of this data into individual JMP tables, and then clean and concatenate it after import. The Excel Wizard simplifies this task, particularly if the format is consistent. This next illustration shows a typical example. The Workbook has twelve Worksheets, each containing some revenue data by country. There is an extraneous TOTAL that would be better left off of the final imported table. First, the Column headers need to be changed to three. This will change Data starts to row four, which is fine. Next, the Data ends on row needs to be set to twelve to omit the TOTAL row. Finally, the Concatenate worksheets and try to match columns checkbox needs to be checked. This will combine all twelve Worksheets. To make it easier to identify where each observation originated, we can check the Create column with

worksheet name when concatenating checkbox. This will produce the preview below, which will match what you will see in JMP when you import the table. Data with Hierarchies Sometimes data is organized in an Excel Worksheet such that the data contains both row and column headers with important information regarding the data. Suppose you had rainfall data for San Antonio, Texas, USA. The data might be organized by year and month in a format like this: 2011 Q1 Q2 State County City Jan Feb March April May TX Bexar San 2.66.49.01.03.84 Antonio TX Travis Austin 2.92.48.09.27 3.65 Within JMP, if you wanted to use Table Summary, it would be important to have both the location and date data for each observation. The shape of this data makes it awkward to use in JMP as is. The Treat multiple column header lines as hierarchies setting can help with this. It tells the import mechanism to create a stacked table containing all of the elements of the column hierarchy. The row data for location is imported as usual. It is often useful to also make sure Replicate data in spanned rows is checked as well. In this case, we want any data for merge cells to be replicated in the stack table

result. In the next picture, you can see an example where the Treat multiple column header lines as hierarchies was specified on page two of the Wizard. The more important thing to notice in this case is that the Number of rows with column headers was set to three. This tells the importer to use three lines in the creation of the hierarchy data. All of the data in those three lines of column headings will be replicated as row data for the relevant observations. It is now a simple matter to import the data and rename the column headers for the hierarchy entries to Year, Quarter and Month. JSL All of the import settings that are available in the Wizard are available through JSL. The Source script that is created when you open the Excel worksheet into JMP uses the same syntax that you would use in JSL. This was shown previously. The easiest way to specify the options within JSL is to open the file using the Wizard one time, open the resulting Source script in the JMP data table, and copy the Open command syntax into your JSL script. Once you become familiar with the syntax and layout, you can just write your own JSL scripts without using the Wizard first. Two notable differences between the Source script and full JSL programming involve the use of the Worksheets command and the Sheet Settings command. A Source table created from an Excel worksheet will have just the settings necessary to open that Worksheet again. In JSL, you can have JMP open multiple Worksheets at one time, and you can specify different settings for each Worksheet that you will open. An example of the JSL to do that is:

dt = open("c:\excel Wizard Demo\MultipleDiffSettings.xlsx", Worksheets({"Planets", "Team Results"}), Use for all sheets( 0 ), Concatenate Worksheets( 0 ), Create Concatenation Column( 0 ), Worksheet Settings( 1, Has Column Headers( 1 ), Number of Rows in Headers( 1 ), Headers Start on Row( 2 ), Data Starts on Row( 3 ), Data Starts on Column( 1 ), Data Ends on Row( 0 ), Data Ends on Column( 0 ), Replicated Spanned Rows( 1 ), Suppress Hidden Rows( 1 ), Suppress Hidden Columns( 1 ), Treat as Hierarchy( 0 ) ), Worksheet Settings( 2, Has Column Headers( 1 ), Number of Rows in Headers( 1 ), Headers Start on Row( 3 ), Data Starts on Row( 4 ), Data Starts on Column( 1 ), Data Ends on Row( 0 ), Data Ends on Column( 0 ), Replicated Spanned Rows( 1 ), Suppress Hidden Rows( 1 ), Suppress Hidden Columns( 1 ), Treat as Hierarchy( 0 ) ) ); The Worksheet Settings command requires the index of the table for which the settings pertain, followed by the settings that you would like to change that are different from the default settings. In this case, the Planets table has headers that start on row two, and data that starts on row three. Team Results has headers that start on row three, and data that starts on row four. Conclusion The Excel Wizard in JMP 11 for Windows provides a variety of options to specify the shape of the data within the Excel Worksheet, and to quickly import it into JMP. For cases where the data is always in the same format, retained settings make this even simpler. Options to concatenate Worksheets and to use column headers to specify hierarchies simplify the task of getting the data into a format that JMP can quickly utilize for powerful analysis and visualizations.