Make Better Decisions with Optimization
|
|
|
- Franklin Harmon
- 10 years ago
- Views:
Transcription
1 ABSTRACT Paper SAS Make Better Decisions with Optimization David R. Duling, SAS Institute Inc. Automated decision making systems are now found everywhere, from your bank to your government to your home. For example, when you inquire for a loan through a website, a complex decision process likely runs combinations of statistical models and business rules to make sure you are offered a set of options for tantalizing terms and conditions. To make that happen, analysts diligently strive to encode their complex business logic into these systems. But how do you know if you are making the best possible decisions? How do you know if your decisions conform to your business constraints? You might want to maximize the number of loans that you provide while balancing the risk among different customer categories. Welcome to the world of optimization. SAS Business Rules Manager and SAS/OR software can be used together to manage and optimize decisions. This presentation demonstrates how to build business rules and then optimize the rule parameters to maximize the effectiveness of those rules. The end result is more confidence that you are delivering an effective decision making process. INTRODUCTION In his 2012 book, Decision Management Systems, James Taylor identifies three types of decisions: strategic decisions, tactical decisions, and operational decisions. Strategic decisions are high value, lowvolume decisions that guide the overall direction of the company, such as deciding what products to produce or what markets to enter. Tactical decisions are those focused on management and control, such as staffing and budgeting. Operational decisions are those of lower individual value and typically relate to a single customer or a single transaction, such as processing sales records, guiding call centers, delivering offers and recommendations, managing intelligent factories, and handling credit applications. (Taylor, 2012) This paper focuses on optimization of operational decisions. Due to their high frequency, operational decisions can generate enough data to enable an optimization process. Small improvements in the decisions can lead to large improvements in the business objectives. These improvements are limited only by the imagination of the analyst. In our working example we will follow the process of optimizing a simple decision on accepting a credit application for further processing. The general workflow is shown in Figure 1. Business analysts create business rules based on both analysis of data and implementation of business policies. These rules are deployed to production execution systems to make operational decisions. The results of these decisions generate new data on business performance such as revenue, recommendations, complaints, growth, and so on. This data is analyzed and the same analysts will then revise policies and update rules. Figure 1. Business Workflow 1
2 The rules evolve over time during periods of different conditions and strategies. However, economic conditions and business policies can change rapidly. If inventory is increasing, we might need to decrease prices, reduce production, or improve deliveries. But, how do we know which actions to take? How do you know that your rules are producing the best results possible? In a complex system there can be many different possible actions and the right combination is not always obvious. In these cases, where the rules need to adapt to changes, or opportunities for improvement exist, optimization of business rules can be an excellent strategy. BUSINESS RULES Most businesses encode many of their operational decisions as business rules. Typically, many analysts collaborate on creating and maintaining rules that implement business strategy, regulations, taxation, logistics, and other factors into a comprehensive database. The main benefit of a rules system over a collection of code files is the ability to search, index, save and recall versions, enable lineage, and modify components of rules without editing code files. This makes the system more approachable to mainstream business analysts and more transparent to regulation and governance. A rule set is a collection of similar rules that are used together and share a common set of input and output columns. The input columns are typically named conditions and the output columns are typically named actions. In this example rule set, we show a subset of rules needed to process credit applications. The most common display of a complex rules set is the decision table form as shown in Table 1. Conditions Actions Age Income Rating Accept > 30 > > 35 > > 30 > > 40 > Table 1. Business Rules Decision Table. When deployed to an operational system, the rule set might be expressed as SAS code: ACCEPT=0; IF AGE > 30 and INCOME > and RATING = 1 THEN ACCEPT=1; IF AGE > 35 and INCOME > and RATING = 2 THEN ACCEPT=1; IF AGE > 30 and INCOME > and RATING = 2 THEN ACCEPT=1; IF AGE > 40 and INCOME > and RATING = 3 THEN ACCEPT=1; In this example we see that we have several decision points for approving the application. Condition Terms represent all data needed to implement the business strategy. The Rating term is derived from analytical risk modeling processes and represents another large set of possible data points. Action Terms are the output of the rules and used by the operational application processing system to automate decisions. A typical operational rules system can contain hundreds to thousands of these rules, referring to hundreds to thousands of terms. In addition, these rules might have evolved over time and reflect many generations of strategy and policy. With such a large number of rules and changing dynamics, it is not surprising that the combination of many simple rules produces a complex system that might become stale, contain inconsistencies and inefficiencies, and produce surprising or unintended results. SAS Business Rules Manager provides a platform for creating, managing, and deploying business rules into operational systems. (SAS Institute Inc., 2014) 2
3 OPTIMIZATION Mathematical optimization can be used to enforce constraints and find optimal combinations of parameters in well-defined systems of equations. Logistics systems typically use optimization to map out the most efficient routes for delivery of a large number packages to a large number of destinations using a minimal amount of human and capital resources. There are several requirements needed to implement an effective optimization. The first requirement is an objective function that represents the business metric such as loan approvals that should be optimized. The objective function is represented as SAS code. The next requirement is a set of parameters that can be adjusted to produce the best objective function value. The final requirement is a set of constraints. The constraints are rules that represent limits on the values of the parameters, or on combinations of values of parameters such as limits on age ranges and income ranges. In our case, we want to optimize the rules for age and income. The first thing we need to do is to create parameters that can be used in the optimization. Table 2 shows the introduction of AGE1 and INC1 as new parameters and their values will be determined by the optimization process. Conditions Actions Age Income Rating Accept > AGE1 > INC1 1 1 > AGE1+5 > INC1 2 1 > AGE1 > INC > AGE1+10 > INC Table 2. Business Rules Decision Table with Parameter Substitution. We also need to establish some boundary constraints on our parameters. These are determined by the current business strategy and economic conditions. In our case, we want the minimum AGE threshold for application acceptance to be no less than 20 and no more than 30. And, for INCOME to be no less than and no more than Table 3 shows this parameterization. Parameter Min Max AGE INC Table 3. Optimization Parameters and Boundaries The business strategy also tells us that we must establish constraints on how many applications we should accept with certain attributes. We want to ensure that no more than 50% of the applications that we accept have a credit RATING greater than 2. We also want to ensure that no more than 50% of the applications that we accept have a current MORTDUE value greater than $100,000. MORTDUE is the amount due on the applicant s current mortgage. Even though MORTDUE does not appear in the rule set, it can be part of the constraints and thus influence the rules. Table 4 shows this parameterization. Parameter Values Limit RATING > 2 50% MORTDUE > $100,000 50% Table 4. Nonlinear Constraints Configuration 3
4 We need to establish an objective function. This is described in Table 5. Parameter Function Objective ACCEPT SUM MAX Table 5. Objective Function Definition Finally, we need to select an optimization procedure for this task. The OPTLSO procedure (SAS Institute Inc., 2014) implements a derivative free optimizer (Griffin, 2010) that can operate on a wide variety of problems without extensive tuning. Business rules can be encoded in simple SAS DATA step code and can be included in both the objective function and constraint functions. Lower and upper bounds on variables and constraint functions are encoded in simple tables that can be easily modified and scripted. The optimization process combines properties of a Genetic Algorithm to search a wide area and a pattern search algorithm to accurately search small areas. This is known as a local search optimization technique and is not guaranteed to find a global optimal solution even if one exists. Fortunately, for many business problems where design of experiments is useful, and the notion of best is well-defined, local search optimization can be used to iteratively improve the solution. In our simple example, it is not feasible to encode the business rules and the non-linear constraints into a problem that can be used with a general purpose global search technique. In such cases, the OPTLSO procedure is very appropriate for the problem. The OPTLSO procedure is part of SAS Operations Research (OR). IMPLEMENTATION The implementation is done entirely in SAS 9.4 code and the complete code listing is available directly from the author. The code is organized into 5 sections. 1. CREATE SAMPLE DATA This code simply creates the data set named APPDATA used to guide the optimization. The code creates these variables: AGE, INCOME, MORTDUE, and RATING. The values of the first three variables are distributed continuously, and the value of the RATING variable is distributed as an integer ranging from 1 to 5, where 1 is the best rating and 5 is the worst. The objective function and the constraint functions used in the optimization are computed for each row of the data. 2. CREATE BUSINESS RULES This code places the business rules into a macro for use both inside the optimization s objective function and directly inside a DATA step. The DATA step is used to deploy the rules into operational production systems. These rules could have been exported from the SAS Business Rules Manager system and then prepared for input to both the DATA step and optimization. The code used for optimization and operationalization has the parameters AGE1 and INC1 substituted for the original values: %MACRO RULES; ACCEPT=0; IF AGE > AGE1 and INCOME > INC1 and RATING = 1 THEN ACCEPT=1; IF AGE > AGE1+10 and INCOME > INC1 and RATING = 2 THEN ACCEPT=1; IF AGE > AGE1 and INCOME > INC and RATING = 2 THEN ACCEPT=1; IF AGE > AGE1+10 and INCOME > INC and RATING = 3 THEN ACCEPT=1; %MEND; A DATA step can be used to test the initial values of the rules and produce output for comparison to the final values after optimization. 4
5 DATA SCORES; SET APPDATA; AGE1= 30; INC1= 30000; %RULES; END; 3. CONFIGURE THE OPTIMIZATION The OPTLSO procedure requires the creation of functions using the SAS Function Compiler (FCMP) ) procedure. We can reuse the %RULES macro we defined earlier in the acceptance function definition. The OPTLSO procedure calls this function many times during the optimization process for each row of the data. Notice that the function signature includes both the optimization parameters AGE1 and INC1, and also the values that are read from the data RATING, AGE, and INCOME. For more information about the FCMP procedure refer to SAS Customer Support at This snippet shows creation of the function for executing the business rules: OPTIONS CMPLIB = WORK.BRM; PROC FCMP OUTLIB=WORK.BRM.OPT; FUNCTION ACCEPTANCE (AGE1, INC1, RATING, AGE, INCOME); %RULES; RETURN (ACCEPT); ENDSUB; QUIT; We also need to define the objective and constraint functions called by the OPTLSO procedure. That code is similar to the code shown above and is available directly from the author. Finally, we need to create tables that tell the procedure what to do. These data sets implement the business policies listed above. The OBJDATA table tells the procedure about the objective function that is used during the optimization. The VARDATA table tells the procedure that the solution should attempt to be constrained to the region where 20<AGE1<30 and < INC1 < The CONDATA table contains more sophisticated nonlinear constraints that are encoded as functions named RATINGC constraint and the MORTDUEC constraint: DATA VARDATA; INPUT _ID_ $ _LB UB_; DATALINES; AGE INC ; RUN; DATA CONDATA; INPUT _ID_ $ _LB UB_; DATALINES; RATINGC MORTDUEC ; RUN; DATA OBJDATA; INPUT _ID_ $ _FUNCTION_ $ _SENSE_ $; DATALINES; ACCEPTS OBJFUNC MAX RUN; Now, whenever we want to change the optimization scenario, we only need to edit the code that creates those tables. To change the acceptable range of the AGE threshold, we edit the code that manages the VARDATA table. To change the allowable ratio of loans with poor ratings, we edit the code that manages the CONDATA table. This code makes it very easy to run many different scenarios quickly by editing or scripting the values in these tables. 5
6 4. RUN THE OPTIMIZATION Now we can run the optimization using the OPTLSO procedure by executing the following code. The initial values of AGE1 and INC1 are generated within the range defined by the VARDATA boundaries. The final values of AGE1 and INC1 are determined by the optimization. Generations of possible solutions are managed by a custom Genetic Algorithm. At each one of these generations, a custom pattern search evaluates dozens of candidate solutions. The objective and constraint functions are evaluated multiple times for each candidate solution. The OPTLSO procedure contains a sophisticated scheme for scheduling the function evaluations in parallel in order to minimize run time. PROC OPTLSO MAXGEN=100 LASTGEN=LASTGEN VARIABLES = VARDATA OBJECTIVE = OBJDATA NLINCON = CONDATA PRIMALOUT = SOLUTION FEASTOL = ; READARRAY APPDATA; PERFORMANCE NTHREADS = 8; RUN; The procedure outputs a Solution Summary table that is shown in Figure 2. The output of the procedure shows that the final solution status indicates that the procedure terminated when the objective function converged to a single value without improvement. The solution ran 23 steps of the Genetic Algorithm and 1242 overall objective function evaluations. Infeasibility is the maximum amount of deviation of any constraint from its boundaries. A value of zero indicates that none of the final constraint evaluations returned a value outside the bounds previously established in the CONDATA table; thus indicating that our solution satisfies our business policies. Solution Summary Solution Status Function convergence Objective 4002 Infeasibility 0 Iterations 23 Evaluations 1242 Cached Evaluations 36 Global Searches 1 Population Size 80 Seed 1 Figure 2. Final Optimization Details Printing the solution data set shows the final values of the objective function and the constraints in Figure 3. The objective function is defined as the sum of the accepted applications and this output shows that the number is The optimal value of AGE1 is years and the optimal value of INC1 is $37, The final value of the RATINGC constraint indicates that 50% of the acceptances have a credit rating of 1 or 2. The final value of the MORTDUEC constraint indicates that only 24% of the acceptances have a mortgage due value greater than $100,000. 6
7 Name Value _obj_ _inf_ 0.00 AGE INC ACCEPTS RATINGC 0.50 MORTDUEC 0.24 Figure 3. Final Optimization Values 5. REPORT ON RESULTS The final section of code prepares a report on the effectiveness of the optimization. The code scores the original data using the original business rules with the new values of AGE1 and INC1. The output table SCORES are run through steps to create plots and comparisons between the original naïve results and the optimized results. This snippet of code shows the substitution of the optimized parameters into the scoring step: PROC SQL noprint; select _value_ into :AGE1 from solution where _id_ eq 'AGE1' ; select _value_ into :INC1 from solution where _id_ eq 'INC1' ; select _value_ into :RAT from solution where _id_ eq 'RATINGC' ; select _value_ into :MORT from solution where _id_ eq 'MORTDUEC' ; QUIT; DATA SCORES ; set APPDATA ; AGE1= &AGE1. ; INC1= &INC1. ; %RULES; RUN; RESULTS The original naïve solution sets AGE1 to 30 and INC1 to $40,000. The optimization process shows that we can move those values to and $37, For business purposes, we can round those values to their nearest integers. Figure 4 shows the distributions of the input variables versus the accept output as shown in the four plots below. The histogram of AGE shows vertical reference lines at the optimal rule thresholds based on AGE1 (27, 32, and 37). The histogram of INCOME shows reference lines at the optimal rule threshold values based on INC1 ($37,244, $47,234, and $77,244). The histogram of MORTDUE shows a reference line at the constraint value, and the bar plot of ratings shows the number accepted in each bin. We can see a large number of loans both accepted and denied across all age groups above the value of AGE1. However, most loans above the INC1 threshold were accepted. Combined with the credit RATING, it is more important to have a higher income than to have a higher age. We can also see that most denials have a relatively low MORTDUE value, indicating that applicants with higher mortgages also tended to have higher risk. 7
8 Figure 4. Plots of Input Variables versus Accepts after Optimization Figure 5 shows the number of applications that were accepted across each credit rating category before and after the optimization. The number of accepted applications increases in categories one, two, and three after the optimization. The business rules do not permit any applications from customers with a credit rating of four or five. The total number accepted applications rose from 3643 to Accepting 359 more loans with no increased risk can have a large impact on profitability. In our case, the optimization is a success. Accept Rating Before After change Figure 5. Accepted Applications before and after Optimization 8
9 Now, look at the tradeoff between objective function and infeasibility. Figure 6 shows a point for all 1242 solutions that were attempted during the optimization. Infeasibility is plotted on the X axis and objective function is plotted on the Y axis. Each solution generates new values of AGE1 and INC1 and then computes new values of the objective function and the constraints. The objective function is the number of applications accepted. As the solution becomes more infeasible, the objective value goes up. Our final solution shows the one that generates the greatest objective function (4002) with zero infeasibility, meaning that all constraints are satisfied. We can see how much we can improve the objective function if we change our business policy to relax some constraints. The optimization algorithm only generates candidate values of AGE1 and INC1 within their bounds. Thus, the infeasibility violations are in the nonlinear constraints RATINGC and MORTDUEC. Expanding either the age and income ranges, or the acceptable rate of high credit ratings, or the ratio or definition of high mortgage due values can lead to a higher objective function. Figure 6 Objective Function (_obj_) versus infeasibility (_inf_) CONCERNS While the optimization in this example was a rousing success, not all optimizations will achieve that goal. Even when our initial objectives are not reached we can still use the analysis to improve our business process. Two of these scenarios are described below. Not all problems reach a feasible solution. The OPTLSO procedure output shows this as a significant value in the infeasibility measure (_inf_). If this value had been significantly greater than zero or the number of Accepts had been disappointing, then we need to reconsider our business policies as expressed in the business rules, or reconsider the wisdom of our constraints. This is not a bad result. Infeasibility shows us that our business problem is not realistic and needs to be re-examined. Not all problems yield improvement in the objective function from the original expectation. This can happen when the original expectation is not feasible and the optimization process finds a feasible solution with a lower objective function value. This is not a bad result. Moving a business rule set from an infeasible solution to feasible solution is extremely beneficial and produces a more robust objective function estimate that is more useful in business planning. 9
10 These concerns highlight an important point. The process of optimization can yield important knowledge about your decision making systems even if the optimization does not reach a satisfying result. CONCLUSION Business Rules are often the product of a mixture of data driven systems and expert systems. While optimization is a mathematical approach to improving processes. The two dissimilar systems can work together to improve operational decision making systems. SAS Business Rules Manager provides a system for analysts to collaborate on business rule development and create sequential rule sets that execute in the SAS system. SAS OR provides local search optimization that is well suited for the problems described by business rules. Analysts who understand the business needs and are able to quantify both objectives and constraints are now able to analyze and improve their processes using sophisticated optimization tools. REFERENCES Taylor, J. (2012). Decision Management Systems. Boston, MA: IBM Press. SAS Institute Inc. (2014). SAS Business Rules Manager 2.2: User's Guide. Cary, NC: SAS Institute Inc. SAS Institute Inc. (2014). SAS/OR 13.2 User s Guide: Local Search Optimization. Cary, N.C.: SAS Institue Inc. Griffin, J. D, et al. (2010). Derivative-Free Optimization Via Evolutionary Algorithms Guiding Local Search. Retrieved from Sandia National Labs: ACKNOWLEDGMENTS The author wishes to acknowledge the kind help from Joshua Griffin and Michelle Opp of SAS Institute. CONTACT INFORMATION Your comments and questions are valued and encouraged. You can also request the complete code file that implements this analysis. Contact the author at: David R. Duling SAS Institute Inc. Cary, North Carolina, [email protected] Phone: SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in the USA and other countries. indicates USA registration. Other brand and product names are trademarks of their respective companies. 10
Why is SAS/OR important? For whom is SAS/OR designed?
Fact Sheet What does SAS/OR software do? SAS/OR software provides a powerful array of optimization, simulation and project scheduling techniques to identify the actions that will produce the best results,
Dynamic Decision-Making Web Services Using SAS Stored Processes and SAS Business Rules Manager
Paper SAS1787-2015 Dynamic Decision-Making Web Services Using SAS Stored Processes and SAS Business Rules Manager Chris Upton and Lori Small, SAS Institute Inc. ABSTRACT With the latest release of SAS
Portfolio Construction with OPTMODEL
ABSTRACT Paper 3225-2015 Portfolio Construction with OPTMODEL Robert Spatz, University of Chicago; Taras Zlupko, University of Chicago Investment portfolios and investable indexes determine their holdings
Paper CI-06-2013. Marketing Campaign (Message sent) Contact Recorded in SAS System Tables. Response Recorded in SAS System Tables
Paper CI-06-2013 SAS Treatments: One to One Marketing with a Customized Treatment Process Dave Gribbin, SAS Institute Inc., Cary, NC Amy Glassman, SAS Institute Inc., Cary, NC ABSTRACT The importance of
Counting the Ways to Count in SAS. Imelda C. Go, South Carolina Department of Education, Columbia, SC
Paper CC 14 Counting the Ways to Count in SAS Imelda C. Go, South Carolina Department of Education, Columbia, SC ABSTRACT This paper first takes the reader through a progression of ways to count in SAS.
IBM SPSS Direct Marketing 23
IBM SPSS Direct Marketing 23 Note Before using this information and the product it supports, read the information in Notices on page 25. Product Information This edition applies to version 23, release
IBM SPSS Direct Marketing 22
IBM SPSS Direct Marketing 22 Note Before using this information and the product it supports, read the information in Notices on page 25. Product Information This edition applies to version 22, release
Maximizing the ROI Of Visual Rules
Table of Contents Introduction... 3 Decision Management... 3 Decision Discovery... 4 Decision Services... 6 Decision Analysis... 11 Conclusion... 12 About Decision Management Solutions... 12 Acknowledgements
SAS Marketing Automation 5.1. User s Guide
SAS Marketing Automation 5.1 User s Guide The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2007. SAS Marketing Automation 5.1: User s Guide. Cary, NC: SAS Institute
Model, Analyze and Optimize the Supply Chain
Model, Analyze and Optimize the Supply Chain Optimize networks Improve product flow Right-size inventory Simulate service Balance production Optimize routes The Leading Supply Chain Design and Analysis
Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole
Paper BB-01 Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole ABSTRACT Stephen Overton, Overton Technologies, LLC, Raleigh, NC Business information can be consumed many
STATISTICA. Clustering Techniques. Case Study: Defining Clusters of Shopping Center Patrons. and
Clustering Techniques and STATISTICA Case Study: Defining Clusters of Shopping Center Patrons STATISTICA Solutions for Business Intelligence, Data Mining, Quality Control, and Web-based Analytics Table
Prescriptive Analytics. A business guide
Prescriptive Analytics A business guide May 2014 Contents 3 The Business Value of Prescriptive Analytics 4 What is Prescriptive Analytics? 6 Prescriptive Analytics Methods 7 Integration 8 Business Applications
Strategic Online Advertising: Modeling Internet User Behavior with
2 Strategic Online Advertising: Modeling Internet User Behavior with Patrick Johnston, Nicholas Kristoff, Heather McGinness, Phuong Vu, Nathaniel Wong, Jason Wright with William T. Scherer and Matthew
Chapter 32 Histograms and Bar Charts. Chapter Table of Contents VARIABLES...470 METHOD...471 OUTPUT...472 REFERENCES...474
Chapter 32 Histograms and Bar Charts Chapter Table of Contents VARIABLES...470 METHOD...471 OUTPUT...472 REFERENCES...474 467 Part 3. Introduction 468 Chapter 32 Histograms and Bar Charts Bar charts are
7 Steps to Successful Data Blending for Excel
COOKBOOK SERIES 7 Steps to Successful Data Blending for Excel What is Data Blending? The evolution of self-service analytics is upon us. What started out as a means to an end for a data analyst who dealt
Technical Paper. Performance of SAS In-Memory Statistics for Hadoop. A Benchmark Study. Allison Jennifer Ames Xiangxiang Meng Wayne Thompson
Technical Paper Performance of SAS In-Memory Statistics for Hadoop A Benchmark Study Allison Jennifer Ames Xiangxiang Meng Wayne Thompson Release Information Content Version: 1.0 May 20, 2014 Trademarks
Advanced Analytics for Audit Case Selection
Advanced Analytics for Audit Case Selection New York State Capitol Building Albany, New York Presented by: Tim Gardinier, Manager, Data Processing Services Audit Case Selection Putting a scientific approach
White Paper. Thirsting for Insight? Quench It With 5 Data Management for Analytics Best Practices.
White Paper Thirsting for Insight? Quench It With 5 Data Management for Analytics Best Practices. Contents Data Management: Why It s So Essential... 1 The Basics of Data Preparation... 1 1: Simplify Access
Everything you need to know about flash storage performance
Everything you need to know about flash storage performance The unique characteristics of flash make performance validation testing immensely challenging and critically important; follow these best practices
Operationalizing Data Governance through Data Policy Management
Operationalizing Data Governance through Data Policy Management Prepared for alido by: David Loshin nowledge Integrity, Inc. June, 2010 2010 nowledge Integrity, Inc. Page 1 Introduction The increasing
Three steps to put Predictive Analytics to Work
Three steps to put Predictive Analytics to Work The most powerful examples of analytic success use Decision Management to deploy analytic insight in day to day operations helping organizations make more
SAS Rule-Based Codebook Generation for Exploratory Data Analysis Ross Bettinger, Senior Analytical Consultant, Seattle, WA
SAS Rule-Based Codebook Generation for Exploratory Data Analysis Ross Bettinger, Senior Analytical Consultant, Seattle, WA ABSTRACT A codebook is a summary of a collection of data that reports significant
Manage the Analytical Life Cycle for Continuous Innovation
Manage the Analytical Life Cycle for Continuous Innovation From Data to Decision WHITE PAPER SAS White Paper Table of Contents Introduction.... 1 The Complexity of Managing the Analytical Life Cycle....
Scheduling Algorithm with Optimization of Employee Satisfaction
Washington University in St. Louis Scheduling Algorithm with Optimization of Employee Satisfaction by Philip I. Thomas Senior Design Project http : //students.cec.wustl.edu/ pit1/ Advised By Associate
Innovative Techniques and Tools to Detect Data Quality Problems
Paper DM05 Innovative Techniques and Tools to Detect Data Quality Problems Hong Qi and Allan Glaser Merck & Co., Inc., Upper Gwynnedd, PA ABSTRACT High quality data are essential for accurate and meaningful
Tools for Managing and Measuring the Value of Big Data Projects
Tools for Managing and Measuring the Value of Big Data Projects Abstract Big Data and analytics focused projects have undetermined scope and changing requirements at their core. There is high risk of loss
Social Media Implementations
SEM Experience Analytics Social Media Implementations SEM Experience Analytics delivers real sentiment, meaning and trends within social media for many of the world s leading consumer brand companies.
Alex Vidras, David Tysinger. Merkle Inc.
Using PROC LOGISTIC, SAS MACROS and ODS Output to evaluate the consistency of independent variables during the development of logistic regression models. An example from the retail banking industry ABSTRACT
STATISTICA Formula Guide: Logistic Regression. Table of Contents
: Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary
WHITEPAPER. How to Credit Score with Predictive Analytics
WHITEPAPER How to Credit Score with Predictive Analytics Managing Credit Risk Credit scoring and automated rule-based decisioning are the most important tools used by financial services and credit lending
OPRE 6201 : 2. Simplex Method
OPRE 6201 : 2. Simplex Method 1 The Graphical Method: An Example Consider the following linear program: Max 4x 1 +3x 2 Subject to: 2x 1 +3x 2 6 (1) 3x 1 +2x 2 3 (2) 2x 2 5 (3) 2x 1 +x 2 4 (4) x 1, x 2
Fundamentals of Measurements
Objective Software Project Measurements Slide 1 Fundamentals of Measurements Educational Objective: To review the fundamentals of software measurement, to illustrate that measurement plays a central role
What's New in SAS Data Management
Paper SAS034-2014 What's New in SAS Data Management Nancy Rausch, SAS Institute Inc., Cary, NC; Mike Frost, SAS Institute Inc., Cary, NC, Mike Ames, SAS Institute Inc., Cary ABSTRACT The latest releases
Automation can dramatically increase product quality, leading to lower field service, product support and
QA Automation for Testing Medical Device Software Benefits, Myths and Requirements Automation can dramatically increase product quality, leading to lower field service, product support and liability cost.
SAS Add-In 2.1 for Microsoft Office: Getting Started with Data Analysis
SAS Add-In 2.1 for Microsoft Office: Getting Started with Data Analysis The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2007. SAS Add-In 2.1 for Microsoft Office: Getting
Big Data Optimization at SAS
Big Data Optimization at SAS Imre Pólik et al. SAS Institute Cary, NC, USA Edinburgh, 2013 Outline 1 Optimization at SAS 2 Big Data Optimization at SAS The SAS HPA architecture Support vector machines
IBM SPSS Direct Marketing 19
IBM SPSS Direct Marketing 19 Note: Before using this information and the product it supports, read the general information under Notices on p. 105. This document contains proprietary information of SPSS
SAS Programming Tips, Tricks, and Techniques
SAS Programming Tips, Tricks, and Techniques A presentation by Kirk Paul Lafler Copyright 2001-2012 by Kirk Paul Lafler, Software Intelligence Corporation All rights reserved. SAS is the registered trademark
The Predictive Data Mining Revolution in Scorecards:
January 13, 2013 StatSoft White Paper The Predictive Data Mining Revolution in Scorecards: Accurate Risk Scoring via Ensemble Models Summary Predictive modeling methods, based on machine learning algorithms
Chapter 6: The Information Function 129. CHAPTER 7 Test Calibration
Chapter 6: The Information Function 129 CHAPTER 7 Test Calibration 130 Chapter 7: Test Calibration CHAPTER 7 Test Calibration For didactic purposes, all of the preceding chapters have assumed that the
Oracle Real Time Decisions
A Product Review James Taylor CEO CONTENTS Introducing Decision Management Systems Oracle Real Time Decisions Product Architecture Key Features Availability Conclusion Oracle Real Time Decisions (RTD)
The Total Economic Impact Of SAS Customer Intelligence Solutions Real-Time Decision Manager
A Forrester Total Economic Impact Study Commissioned By SAS Project Director: Dean Davison May 2014 The Total Economic Impact Of SAS Customer Intelligence Solutions Real-Time Decision Manager Table Of
Drawing a histogram using Excel
Drawing a histogram using Excel STEP 1: Examine the data to decide how many class intervals you need and what the class boundaries should be. (In an assignment you may be told what class boundaries to
Data processing goes big
Test report: Integration Big Data Edition Data processing goes big Dr. Götz Güttich Integration is a powerful set of tools to access, transform, move and synchronize data. With more than 450 connectors,
Data Analysis. Using Excel. Jeffrey L. Rummel. BBA Seminar. Data in Excel. Excel Calculations of Descriptive Statistics. Single Variable Graphs
Using Excel Jeffrey L. Rummel Emory University Goizueta Business School BBA Seminar Jeffrey L. Rummel BBA Seminar 1 / 54 Excel Calculations of Descriptive Statistics Single Variable Graphs Relationships
BUSINESS RULES AND GAP ANALYSIS
Leading the Evolution WHITE PAPER BUSINESS RULES AND GAP ANALYSIS Discovery and management of business rules avoids business disruptions WHITE PAPER BUSINESS RULES AND GAP ANALYSIS Business Situation More
Compliance. Technology. Process. Using Automated Decisioning and Business Rules to Improve Real-time Risk Management
Technology Process Compliance Using Automated Decisioning and Business Rules to Improve Real-time Risk Management Sandeep Gupta, Equifax James Taylor, Smart (enough) Systems August 2008 Equifax is a registered
Data Visualization Techniques
Data Visualization Techniques From Basics to Big Data with SAS Visual Analytics WHITE PAPER SAS White Paper Table of Contents Introduction.... 1 Generating the Best Visualizations for Your Data... 2 The
Predictive Coding Defensibility
Predictive Coding Defensibility Who should read this paper The Veritas ediscovery Platform facilitates a quality control workflow that incorporates statistically sound sampling practices developed in conjunction
Implementing a Data Warehouse with Microsoft SQL Server 2012
Implementing a Data Warehouse with Microsoft SQL Server 2012 Course ID MSS300 Course Description Ace your preparation for Microsoft Certification Exam 70-463 with this course Maximize your performance
Leveraging Ensemble Models in SAS Enterprise Miner
ABSTRACT Paper SAS133-2014 Leveraging Ensemble Models in SAS Enterprise Miner Miguel Maldonado, Jared Dean, Wendy Czika, and Susan Haller SAS Institute Inc. Ensemble models combine two or more models to
IBM SPSS Modeler Social Network Analysis 15 User Guide
IBM SPSS Modeler Social Network Analysis 15 User Guide Note: Before using this information and the product it supports, read the general information under Notices on p. 25. This edition applies to IBM
Getting Correct Results from PROC REG
Getting Correct Results from PROC REG Nathaniel Derby, Statis Pro Data Analytics, Seattle, WA ABSTRACT PROC REG, SAS s implementation of linear regression, is often used to fit a line without checking
The Rolling Stock Recovery Problem. Literature review. Julie Jespersen Groth *α, Jesper Larsen β and Jens Clausen *γ
The Rolling Stock Recovery Problem Julie Jespersen Groth *α, Jesper Larsen β and Jens Clausen *γ DTU Management Engineering, The Technical University of Denmark, Produktionstorvet, DTU Building 424, 2800
A Survey of Shared File Systems
Technical Paper A Survey of Shared File Systems Determining the Best Choice for your Distributed Applications A Survey of Shared File Systems A Survey of Shared File Systems Table of Contents Introduction...
Improving the Performance of Data Mining Models with Data Preparation Using SAS Enterprise Miner Ricardo Galante, SAS Institute Brasil, São Paulo, SP
Improving the Performance of Data Mining Models with Data Preparation Using SAS Enterprise Miner Ricardo Galante, SAS Institute Brasil, São Paulo, SP ABSTRACT In data mining modelling, data preparation
Cost Models for Vehicle Routing Problems. 8850 Stanford Boulevard, Suite 260 R. H. Smith School of Business
0-7695-1435-9/02 $17.00 (c) 2002 IEEE 1 Cost Models for Vehicle Routing Problems John Sniezek Lawerence Bodin RouteSmart Technologies Decision and Information Technologies 8850 Stanford Boulevard, Suite
1 Solving LPs: The Simplex Algorithm of George Dantzig
Solving LPs: The Simplex Algorithm of George Dantzig. Simplex Pivoting: Dictionary Format We illustrate a general solution procedure, called the simplex algorithm, by implementing it on a very simple example.
A Constraint Programming based Column Generation Approach to Nurse Rostering Problems
Abstract A Constraint Programming based Column Generation Approach to Nurse Rostering Problems Fang He and Rong Qu The Automated Scheduling, Optimisation and Planning (ASAP) Group School of Computer Science,
The Science and Art of Market Segmentation Using PROC FASTCLUS Mark E. Thompson, Forefront Economics Inc, Beaverton, Oregon
The Science and Art of Market Segmentation Using PROC FASTCLUS Mark E. Thompson, Forefront Economics Inc, Beaverton, Oregon ABSTRACT Effective business development strategies often begin with market segmentation,
IMPROVING PRODUCTIVITY USING STANDARD MATHEMATICAL PROGRAMMING SOFTWARE
IMPROVING PRODUCTIVITY USING STANDARD MATHEMATICAL PROGRAMMING SOFTWARE $QWRQýLåPDQ 1, Samo Cerc 2, Andrej Pajenk 3 1 University of Maribor, Fakulty of Organizational Sciences.UDQM.LGULþHYDD(PDLODQWRQFL]PDQ#IRYXQLPEVL
9.2 User s Guide SAS/STAT. Introduction. (Book Excerpt) SAS Documentation
SAS/STAT Introduction (Book Excerpt) 9.2 User s Guide SAS Documentation This document is an individual chapter from SAS/STAT 9.2 User s Guide. The correct bibliographic citation for the complete manual
SharePoint 2013 Migration Readiness
SharePoint 2013 Migration Readiness Decision Points around Migrating to SharePoint 2013 MARK ECKERT Contents Purpose... 1 SharePoint 2013 Implementation Options... 1 On-premise installation... 1 Cloud...
How To: Choosing the Right Catalog for Software License Management
Software License Management Guide How To: Choosing the Right Catalog for Software License Management Software License Management tools all rely on a catalog to reference and validate data. In this guide
Model-based Parameter Optimization of an Engine Control Unit using Genetic Algorithms
Symposium on Automotive/Avionics Avionics Systems Engineering (SAASE) 2009, UC San Diego Model-based Parameter Optimization of an Engine Control Unit using Genetic Algorithms Dipl.-Inform. Malte Lochau
USE OF ARIMA TIME SERIES AND REGRESSORS TO FORECAST THE SALE OF ELECTRICITY
Paper PO10 USE OF ARIMA TIME SERIES AND REGRESSORS TO FORECAST THE SALE OF ELECTRICITY Beatrice Ugiliweneza, University of Louisville, Louisville, KY ABSTRACT Objectives: To forecast the sales made by
Automate Data Integration Processes for Pharmaceutical Data Warehouse
Paper AD01 Automate Data Integration Processes for Pharmaceutical Data Warehouse Sandy Lei, Johnson & Johnson Pharmaceutical Research and Development, L.L.C, Titusville, NJ Kwang-Shi Shu, Johnson & Johnson
PROC LOGISTIC: Traps for the unwary Peter L. Flom, Independent statistical consultant, New York, NY
PROC LOGISTIC: Traps for the unwary Peter L. Flom, Independent statistical consultant, New York, NY ABSTRACT Keywords: Logistic. INTRODUCTION This paper covers some gotchas in SAS R PROC LOGISTIC. A gotcha
Harnessing the power of advanced analytics with IBM Netezza
IBM Software Information Management White Paper Harnessing the power of advanced analytics with IBM Netezza How an appliance approach simplifies the use of advanced analytics Harnessing the power of advanced
KnowledgeSTUDIO HIGH-PERFORMANCE PREDICTIVE ANALYTICS USING ADVANCED MODELING TECHNIQUES
HIGH-PERFORMANCE PREDICTIVE ANALYTICS USING ADVANCED MODELING TECHNIQUES Translating data into business value requires the right data mining and modeling techniques which uncover important patterns within
NETWORK DESIGN & ANALYSIS PTIIK - 2012
NETWORK DESIGN & ANALYSIS PTIIK - 2012 Networking at a glance Discrete Mathematics Operating Systems Computer Networks Network Security Advance Networking Network Analysis Multimedia Networking Network
ENERGY EFFICIENT CONTROL OF VIRTUAL MACHINE CONSOLIDATION UNDER UNCERTAIN INPUT PARAMETERS FOR THE CLOUD
ENERGY EFFICIENT CONTROL OF VIRTUAL MACHINE CONSOLIDATION UNDER UNCERTAIN INPUT PARAMETERS FOR THE CLOUD ENRICA ZOLA, KARLSTAD UNIVERSITY @IEEE.ORG ENGINEERING AND CONTROL FOR RELIABLE CLOUD SERVICES,
A Property & Casualty Insurance Predictive Modeling Process in SAS
Paper AA-02-2015 A Property & Casualty Insurance Predictive Modeling Process in SAS 1.0 ABSTRACT Mei Najim, Sedgwick Claim Management Services, Chicago, Illinois Predictive analytics has been developing
Multiple Linear Regression in Data Mining
Multiple Linear Regression in Data Mining Contents 2.1. A Review of Multiple Linear Regression 2.2. Illustration of the Regression Process 2.3. Subset Selection in Linear Regression 1 2 Chap. 2 Multiple
CHAPTER 2 Estimating Probabilities
CHAPTER 2 Estimating Probabilities Machine Learning Copyright c 2016. Tom M. Mitchell. All rights reserved. *DRAFT OF January 24, 2016* *PLEASE DO NOT DISTRIBUTE WITHOUT AUTHOR S PERMISSION* This is a
Step-by-Step Guide to Bi-Parental Linkage Mapping WHITE PAPER
Step-by-Step Guide to Bi-Parental Linkage Mapping WHITE PAPER JMP Genomics Step-by-Step Guide to Bi-Parental Linkage Mapping Introduction JMP Genomics offers several tools for the creation of linkage maps
WHITE PAPER. Harnessing the Power of Advanced Analytics How an appliance approach simplifies the use of advanced analytics
WHITE PAPER Harnessing the Power of Advanced How an appliance approach simplifies the use of advanced analytics Introduction The Netezza TwinFin i-class advanced analytics appliance pushes the limits of
Using Adaptive Random Trees (ART) for optimal scorecard segmentation
A FAIR ISAAC WHITE PAPER Using Adaptive Random Trees (ART) for optimal scorecard segmentation By Chris Ralph Analytic Science Director April 2006 Summary Segmented systems of models are widely recognized
E x c e l 2 0 1 0 : Data Analysis Tools Student Manual
E x c e l 2 0 1 0 : Data Analysis Tools Student Manual Excel 2010: Data Analysis Tools Chief Executive Officer, Axzo Press: Series Designer and COO: Vice President, Operations: Director of Publishing Systems
SALES AND OPERATIONS PLANNING BLUEPRINT BUSINESS VALUE GUIDE
Business Value Guide SALES AND OPERATIONS PLANNING BLUEPRINT BUSINESS VALUE GUIDE INTRODUCTION What if it were possible to tightly link sales, marketing, supply chain, manufacturing and finance, so that
INTEGER PROGRAMMING. Integer Programming. Prototype example. BIP model. BIP models
Integer Programming INTEGER PROGRAMMING In many problems the decision variables must have integer values. Example: assign people, machines, and vehicles to activities in integer quantities. If this is
Forschungskolleg Data Analytics Methods and Techniques
Forschungskolleg Data Analytics Methods and Techniques Martin Hahmann, Gunnar Schröder, Phillip Grosse Prof. Dr.-Ing. Wolfgang Lehner Why do we need it? We are drowning in data, but starving for knowledge!
IBM Cognos 8 Business Intelligence Analysis Discover the factors driving business performance
Data Sheet IBM Cognos 8 Business Intelligence Analysis Discover the factors driving business performance Overview Multidimensional analysis is a powerful means of extracting maximum value from your corporate
Branch-and-Price Approach to the Vehicle Routing Problem with Time Windows
TECHNISCHE UNIVERSITEIT EINDHOVEN Branch-and-Price Approach to the Vehicle Routing Problem with Time Windows Lloyd A. Fasting May 2014 Supervisors: dr. M. Firat dr.ir. M.A.A. Boon J. van Twist MSc. Contents
One problem > Multiple solutions; various ways of removing duplicates from dataset using SAS Jaya Dhillon, Louisiana State University
One problem > Multiple solutions; various ways of removing duplicates from dataset using SAS Jaya Dhillon, Louisiana State University ABSTRACT In real world, analysts seldom come across data which is in
Optimization Modeling for Mining Engineers
Optimization Modeling for Mining Engineers Alexandra M. Newman Division of Economics and Business Slide 1 Colorado School of Mines Seminar Outline Linear Programming Integer Linear Programming Slide 2
Data-Driven Decisions: Role of Operations Research in Business Analytics
Data-Driven Decisions: Role of Operations Research in Business Analytics Dr. Radhika Kulkarni Vice President, Advanced Analytics R&D SAS Institute April 11, 2011 Welcome to the World of Analytics! Lessons
Experiences in Using Academic Data for BI Dashboard Development
Paper RIV09 Experiences in Using Academic Data for BI Dashboard Development Evangeline Collado, University of Central Florida; Michelle Parente, University of Central Florida ABSTRACT Business Intelligence
Building a Data Quality Scorecard for Operational Data Governance
Building a Data Quality Scorecard for Operational Data Governance A White Paper by David Loshin WHITE PAPER Table of Contents Introduction.... 1 Establishing Business Objectives.... 1 Business Drivers...
WHITEPAPER. Creating and Deploying Predictive Strategies that Drive Customer Value in Marketing, Sales and Risk
WHITEPAPER Creating and Deploying Predictive Strategies that Drive Customer Value in Marketing, Sales and Risk Overview Angoss is helping its clients achieve significant revenue growth and measurable return
Automating Software License Management
Automating Software License Management Automating license management saves time, resources, and costs. It also consistently produces high quality data and a documentable method for mapping software licenses
Lecture 3. Linear Programming. 3B1B Optimization Michaelmas 2015 A. Zisserman. Extreme solutions. Simplex method. Interior point method
Lecture 3 3B1B Optimization Michaelmas 2015 A. Zisserman Linear Programming Extreme solutions Simplex method Interior point method Integer programming and relaxation The Optimization Tree Linear Programming
Profit maximization in different market structures
Profit maximization in different market structures In the cappuccino problem as well in your team project, demand is clearly downward sloping if the store wants to sell more drink, it has to lower the
