Risk management systems: using data mining in developing countries' customs administrations
World Customs Journal, Volume 5, Number 1 (International Network of Customs Universities)

Bertrand Laporte

Abstract

Limiting intrusive customs inspections is recommended under the revised Kyoto Convention, and is also a proposal discussed as part of World Trade Organization (WTO) trade facilitation negotiations. To limit such inspections, the more modern administrations intervene at all stages of the customs chain, using electronic data exchange and risk analysis and focusing their resources on a posteriori controls. Customs administrations of developing countries are slow to move in that direction. Risk analysis would therefore seem to be a priority for modernising the customs systems in developing countries. The most effective risk management systems use statistical scoring techniques. Several simple statistical techniques are tested in this article. They all show a good capability to predict and detect declarations that contain infractions. They can easily be implemented in developing countries' customs administrations and replace the rather inefficient methods of selectivity that result in high rates of control and very low rates of recorded infractions.

Introduction

Limiting intrusive customs examinations is recommended under the revised Kyoto Convention. It is also a proposal discussed in the context of World Trade Organization (WTO) trade facilitation negotiations. To limit these intrusive examinations, the more modern governments now intervene at all stages of the customs chain, using electronic data exchange and risk analysis, and focusing their resources on a posteriori inspection (Revised Kyoto Convention 1999; Keen 2003; eds De Wulf & Sokol 2004). Developing countries' customs authorities are slow to move in this direction and to implement the latest risk analysis and management techniques (Geourjon & Laporte 2005; Geourjon, Laporte & Rota Graziosi 2010).
These techniques are used in many areas that face the risk of fraud, for example, in insurance, credit, banking, and so on (for a review, see Bolton & Hand 2002; Phua et al. 2005). Yet risk analysis is a priority for modernising customs in developing countries. Indeed, it is a powerful lever for conducting a comprehensive operational reform, in particular because it calls for closer cooperation between the different departments in charge of information management and also because it allows for the redeployment of agents to a posteriori inspection. Risk analysis should be accompanied by a reform in human resource management, with recruitment on the basis of job profiles and specific skills.

Most developing countries have outsourced risk management systems to private inspection companies when implementing pre-shipment inspection programs and/or scanning services. The systems offered by these companies work only for imports/exports that fall within their contractually defined scope of intervention. Their effectiveness is often compromised by a limited exchange of information with customs authorities (Johnson 2001). For imports/exports falling under customs intervention, risk management systems are based on simple criteria of selectivity, most often in the form of blacklists focusing on the goods, the origin of the goods and the importer, plus a random target. This risk management system
has not proven to be particularly effective. It requires customs authorities to inspect intrusively a large number of containers which, as identified in unpublished technical reports, frequently results in recorded offence rates of less than 3 per cent in the cases of Benin, Côte d'Ivoire, Mali and Senegal.

Selectivity as a risk management method was forced on many countries as a result of using the integrated customs clearance management system, ASYCUDA (Automated SYstem for CUstoms DAta), with 80 countries using the system today. Up to its latest version (ASYCUDA World), this system was closed and did not allow customs authorities to develop efficient risk management applications. ASYCUDA's risk management module was forced on Customs and depended on the option of being able to apply and combine simple selection criteria (lists of importers, origins, etc.). The new version of ASYCUDA is more open and allows the development of country-specific applications. Côte d'Ivoire, Mali and Senegal (the latter country uses its own computer system for customs clearance, GAINDE) have therefore undertaken to develop their own risk management applications, notably with the technical assistance of the International Monetary Fund's (IMF) West Africa Regional Technical Assistance Centre (West AFRITAC).

The aim of this article is to present and compare simple statistical techniques that contribute to the modernisation of customs administration systems by enabling efficient targeting of the declarations to be inspected. These techniques can be developed by most developing countries' customs authorities, which are increasingly recruiting officers with the necessary statistics and mathematics skills.

From descriptive statistics to decision-making statistics

Whatever the sector of activity, statistical targeting techniques use available data.
In Customs, data come from declarations, from the results of first- and second-line inspections, and from private inspection companies via the verification certificates that they issue. The information obtained from these various sources makes up the customs information system which, among other things, allows a risk management system to be constructed and declarations to be directed to the different customs clearance channels.

Unfortunately, customs authorities exploit only a small part of the wealth of information that flows through the customs clearance process and their control activities. Indeed, information processing is essentially qualitative in terms of the approach to selectivity. The selectivity criteria used are often few in number and the analysis for each criterion is most often binary (whether or not it is listed). Thus, if origin X is in the list of risk sources, as soon as a declaration gives that origin, an intrusive examination will be triggered even if the declaration is made by a known and serious importer. There is thus no gradation in risk perception and this reduces targeting performance. This leads to high rates of intrusive inspections in exchange for low rates of reported infractions; that is, a relatively inefficient risk management system. This low targeting efficiency stems from a defective statistical analysis of risk that does not leverage the available customs information in an optimal manner.

Descriptive statistics: data mining

To succeed in accurately targeting declarations that present a risk of infraction, it is necessary to carry out prior work on data analysis, on descriptive statistics. This work requires Customs to identify the characteristics of declarations that, in a preceding period (for example, over the previous twelve months), have resulted in an infraction, and then to deduce the statistical regularities in those infractions. For this reason, all available information is used, that is, the contents of verification certificates, detailed declarations, and the results of inspections during a reference period.
These statistical regularities enable risk profiles to be established.
Indeed, while the information is essentially qualitative in its use of selectivity criteria, statistical analysis makes it possible to establish a quantitative risk scale. Let us take the example of importers: to measure the quality of importers, the frequency of infractions is calculated for each importer (this is the ratio between the number of declarations made by an importer that involved a customs infraction and the total number of declarations made by that importer during the period in question). Thus, importers are rated on a scale from 0 to 1 (or 0 to 100), where 0 is for importers that represent no risk and 1 for importers that represent a high risk. This type of calculation can be done for all potential risk criteria: origin, HS position, billing currency, freight agents, and so on. These calculations enable the establishment of risk profiles for each criterion (see Figure 1).

Decision-making statistics: referring to a customs clearance channel

Next, we need to combine these risk profiles to facilitate the right decision with regard to referring the declaration to a particular customs clearance channel. The combination of criteria may be simple (statistical average) or more elaborate (econometric analysis). In both cases, the objective is to assign a score to each new declaration, obtained by combining the frequencies of infraction for the different criteria (risk profiles). At best, this score should reflect the risk of infraction (or even the probability of an infraction occurring). Referral to one of the customs clearance channels is based on the score and on thresholds previously determined through statistical analysis (see Figure 1).

With the simplest system, the score for the declaration is obtained by applying a simple or weighted average of the infraction frequency for the different criteria, or by taking only the value of the highest frequency from among the criteria used (other combinations can be thought of).
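The frequency calculation behind a risk profile can be sketched in a few lines of Python. This is a minimal illustration, not the Senegalese system: the function name `risk_profiles`, the field names (`importer`, `origin`, `infraction`) and the six toy declarations are all assumptions made for the example.

```python
from collections import defaultdict

def risk_profiles(declarations, criteria):
    """Return {criterion: {value: infraction frequency}} for a reference period.

    Each declaration is a dict holding one value per criterion and a 0/1
    'infraction' flag (hypothetical field names, chosen for this sketch).
    """
    totals = {c: defaultdict(int) for c in criteria}
    flagged = {c: defaultdict(int) for c in criteria}
    for d in declarations:
        for c in criteria:
            totals[c][d[c]] += 1                 # declarations seen for this value
            flagged[c][d[c]] += d["infraction"]  # of which found in infraction
    return {
        c: {value: flagged[c][value] / totals[c][value] for value in totals[c]}
        for c in criteria
    }

# Toy reference period (invented data).
history = [
    {"importer": "A", "origin": "X", "infraction": 1},
    {"importer": "A", "origin": "Y", "infraction": 0},
    {"importer": "A", "origin": "X", "infraction": 0},
    {"importer": "A", "origin": "Y", "infraction": 0},
    {"importer": "B", "origin": "X", "infraction": 1},
    {"importer": "B", "origin": "Y", "infraction": 0},
]

profiles = risk_profiles(history, ["importer", "origin"])
print(profiles["importer"])  # importer A: 1 of 4 declarations, importer B: 1 of 2
```

Each value in `profiles` lies between 0 and 1, which gives the graded risk scale that a blacklist cannot provide.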
Prior to this, the most significant criteria will have been determined ad hoc by customs officers responsible for control activities. The most common and significant criteria are importer, freight agents, HS position and origin.

A more elaborate system uses statistical distribution properties to effectively combine customs information. Econometric models (combining statistical and mathematical approaches) enable (1) the determination of the risk criteria relevant in accounting for an infraction; and (2) the calculation of the probability of infraction for each new declaration introduced in the customs clearance system. This probability is the calculated score for the declaration. For this purpose, it is first necessary to estimate the following econometric equation on the background history of the declarations:

$$\Pr(\text{Infraction}_{ij} = 1) = \alpha + \beta_1\,\text{fq\_criterion1}_{ij} + \beta_2\,\text{fq\_criterion2}_{ij} + \dots + \beta_N\,\text{fq\_criterionN}_{ij} + \varepsilon_{ij}$$

where Pr is probability; Infraction_ij is the binary 0/1 variable for declaration i and product j (1 if there is an infraction, 0 if there is none); fq_criterion_ij is the frequency of customs infractions for each risk criterion associated with declaration i and product j; ε is the error term (which is not explained by the criteria used in the equation); and α and β are the parameters of the equation to be estimated.

The use of background history involves looking over all the declarations for a reference period, giving a mark of 1 to those that have been found in infraction and 0 to those that have not. The binary variable 0/1 is thus constructed (the "explained" or "dependent" variable). This variable is then explained by risk criteria (the "explanatory" or "independent" variables), the values of which are continuous between 0 and 1. The estimate can be drawn from a linear probability model, a PROBIT model or a LOGIT model. The last two of these models are the most appropriate for estimating a model with a binary explained variable. Indeed, some stochastic assumptions are violated in linear regression.
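For reference, the three candidate specifications can be written side by side in their standard textbook form (this is not reproduced from the article). The shorthand $z_{ij}$ for the linear index and $f_{k,ij}$ for the infraction frequency of criterion $k$ are notations introduced here for convenience; $\Phi$ is the standard normal cumulative distribution function.

$$z_{ij} = \alpha + \sum_{k=1}^{N} \beta_k\, f_{k,ij}$$

$$\text{Linear probability model:}\quad \Pr(\text{Infraction}_{ij} = 1) = z_{ij}$$

$$\text{LOGIT:}\quad \Pr(\text{Infraction}_{ij} = 1) = \frac{1}{1 + e^{-z_{ij}}}$$

$$\text{PROBIT:}\quad \Pr(\text{Infraction}_{ij} = 1) = \Phi(z_{ij})$$

The LOGIT and PROBIT forms map any value of $z_{ij}$ into the interval between 0 and 1, whereas the linear form does not.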
The error term is heteroskedastic by construction and does not follow a normal distribution. Furthermore, the predicted value cannot be interpreted as a probability of infraction since it does not necessarily belong to the [0, 1] interval. Although the LOGIT and PROBIT models are in theory the most appropriate, the end goal is to find a model
capable of better targeting those declarations that present a real risk of infraction. The three models can therefore be tested and the one presenting the best results should be accepted.

From theory to practice: results from some empirical tests

The tests presented use a database created by Senegalese customs authorities. Indeed, in 2011, the Senegalese customs authorities propose to incorporate a risk management module into their customs clearance system, GAINDE [1]. The confidentiality of customs data and the effectiveness of the system itself preclude the specific presentation of the criteria used, but this does not impede comparison of the results according to the different methods proposed.

The database used covers twelve months. Data come from detailed declarations and monthly statements of customs infractions for the two main offices in Dakar. Verifications certified by the inspection company are not taken into account. Risk profiles (based on frequency of infraction) were calculated for six different criteria: importer, freight agents, HS position, origin, provenance, and customs regime. Only declarations from operators with an identification number were taken into account in these tests.

Six different scoring calculations are done and their performance is compared for the purposes of targeting. The effectiveness of targeting is measured by comparing, for each declaration from the period in question, the occurrence of an infraction with the calculated score. If the calculated score is high, that is, if the estimated risk of infraction is high, the declaration should have been found in infraction (binary variable = 1). If this is the case, the system enables the targeting of risky declarations. On the other hand, if the calculated score is high and the declaration was not found in infraction (binary variable = 0), the calculated score does not enable effective targeting. Similarly, if the calculated score is low, the declaration should not have been found in infraction (binary variable = 0). If this is the case, the system enables good targeting.
If the opposite is the case, it does not enable good targeting. This method of measuring effectiveness is applied regardless of the score calculation method.

Simple methods for score calculation

Among the six criteria adopted a priori, three were identified as very important by the Senegalese customs authorities. Thus, for all the declarations stored in the database, the score of each declaration was calculated based on the risk profiles for these three criteria in accordance with three different methods: (1) a simple average of the frequency of infraction; (2) a weighted average of the frequency of infraction, with 0.5, 0.3 and 0.2 weighting; and (3) by accepting only the maximum value for the frequency of infraction for the three criteria adopted.

To facilitate the analysis of the results, declarations are grouped into ranges of scores (those that have a calculated score of between 0 and 0.01, then between 0.01 and 0.02, and so on). The results can be seen in the following tables. The choice of intervals is important because it will determine the thresholds that refer declarations to the various customs clearance channels. Ten intervals have been chosen here to make it easier to read the tables.
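The three simple scoring methods and the grouping into score intervals can be sketched as follows. The function names, the example frequencies, and the convention that each interval includes its lower bound are assumptions made for illustration; the article does not say which criterion receives which weight, so the order of the weights here is arbitrary.

```python
from bisect import bisect_right

def simple_average(freqs):
    """Method (1): unweighted mean of the infraction frequencies."""
    return sum(freqs) / len(freqs)

def weighted_average(freqs, weights=(0.5, 0.3, 0.2)):
    """Method (2): weighted mean; the weight order is an assumption."""
    return sum(w * f for w, f in zip(weights, freqs))

def maximum_value(freqs):
    """Method (3): keep only the highest infraction frequency."""
    return max(freqs)

# Bounds of the ten score intervals used in the tables.
BOUNDS = [0.0, 0.01, 0.02, 0.03, 0.04, 0.05, 0.06, 0.07, 0.1, 0.5, 1.0]

def score_interval(score):
    """Return (lower, upper) for the interval holding the score.

    Assumption: intervals include their lower bound; a score of 1.0 is
    placed in the last interval.
    """
    i = min(bisect_right(BOUNDS, score) - 1, len(BOUNDS) - 2)
    return BOUNDS[i], BOUNDS[i + 1]

# Hypothetical frequencies for the three criteria of one declaration.
freqs = (0.10, 0.02, 0.00)
scores = {
    "simple": simple_average(freqs),
    "weighted": weighted_average(freqs),
    "maximum": maximum_value(freqs),
}
for method, score in scores.items():
    print(method, round(score, 4), score_interval(score))
```

The same declaration lands in a different interval depending on the method, which is why the choice of method changes the share of declarations sent to inspection at a given threshold.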
Table 1: Effectiveness of targeting, simple average (1)
Columns: (1) to (8). Score intervals (column 1): [0.5 : 1]; [0.1 : 0.5]; [0.07 : 0.1]; [0.06 : 0.07]; [0.05 : 0.06]; [0.04 : 0.05]; [0.03 : 0.04]; [0.02 : 0.03]; [0.01 : 0.02]; [0 : 0.01]; Total.

Table 1 is read as follows:
Column 1 shows the selected intervals.
Column 2 shows the number of declarations with a calculated score in the range: 20,292 declarations have a calculated score of between 0.01 and 0.02.
Column 3 shows the cumulative number of declarations: there are 30,644 declarations with scores between 0.01 and 1.
Column 4 shows these cumulative declarations as percentages: the declarations that have a calculated score of between 0.01 and 1 (that is, greater than 0.01) represent 29.3% of all declarations.
Column 5 shows the number of declarations showing an infraction by interval: among the 20,292 declarations with a calculated score of between 0.01 and 0.02, 81 have actually been found in infraction.
Column 6 shows the cumulative number of declarations that have had an infraction: there are 1,105 declarations that involved an infraction, an infraction rate of 1.06% (column 7).
Column 8 shows the cumulative percentage of declarations showing infractions: 96.3% of declarations showing infractions have a calculated score of between 0.01 and 1 (that is, above 0.01).

Table 2: Effectiveness of targeting, weighted average (2)
Columns: (1) to (8). Score intervals (column 1): [0.5 : 1]; [0.1 : 0.5]; [0.07 : 0.1]; [0.06 : 0.07]; [0.05 : 0.06]; [0.04 : 0.05]; [0.03 : 0.04]; [0.02 : 0.03]; [0.01 : 0.02]; [0 : 0.01]; Total.
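The cumulative reading of these tables (share of declarations targeted against share of infractions captured) can be sketched as below. The function name and the ten scored declarations are invented for illustration, and treating a score equal to the threshold as targeted is an assumption.

```python
def targeting_effectiveness(scored, threshold):
    """Summarise targeting at a given score threshold.

    scored: list of (score, infraction) pairs, infraction being 0 or 1.
    Returns (share of declarations targeted,
             share of all infractions captured,
             infraction rate among targeted declarations).
    Assumption: a score equal to the threshold counts as targeted.
    """
    targeted = [(s, y) for s, y in scored if s >= threshold]
    total_infractions = sum(y for _, y in scored)
    caught = sum(y for _, y in targeted)
    inspected_share = len(targeted) / len(scored)
    capture_share = caught / total_infractions if total_infractions else 0.0
    hit_rate = caught / len(targeted) if targeted else 0.0
    return inspected_share, capture_share, hit_rate

# Invented scores and outcomes for ten declarations.
scored = [
    (0.30, 1), (0.12, 1), (0.05, 0), (0.015, 1), (0.012, 0),
    (0.008, 0), (0.006, 0), (0.004, 0), (0.002, 0), (0.001, 0),
]

for threshold in (0.02, 0.01):
    print(threshold, targeting_effectiveness(scored, threshold))
```

Lowering the threshold raises both the inspection rate and the capture rate, which is the trade-off customs authorities arbitrate when they set thresholds for each clearance channel.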
How to analyse these results. By targeting (inspecting) all declarations that have a calculated score above 0.02, that is, 9.9% of declarations, the system captures 89% of declarations that have been found in infraction. If the threshold is lowered to 0.01, this rises to 96.3% of declarations that have been found in infraction, captured by inspecting only 29.3% of declarations.

How to use these results. If customs authorities set a 0.01 threshold for intrusive examinations, any new declaration recorded in the customs clearance system that shows a calculated score higher than 0.01 will be referred to an intrusive examination channel; that is, approximately 30% of declarations. The threshold can be adjusted according to the customs authorities' objectives.

Score calculation based on weighted average improves targeting effectiveness. By targeting all declarations with a score higher than 0.01 (that is, 20.6% of declarations), the system captures 96.6% of infractions, that is, with an inspection rate much lower than is currently practised in many developing countries.

Table 3: Effectiveness of targeting, maximum value (3)
Columns: (1) to (8). Score intervals (column 1): [0.5 : 1]; [0.1 : 0.5]; [0.07 : 0.1]; [0.06 : 0.07]; [0.05 : 0.06]; [0.04 : 0.05]; [0.03 : 0.04]; [0.02 : 0.03]; [0.01 : 0.02]; [0 : 0.01]; Total.

Calculating the score according to maximum value is the least effective, since 18.5% of declarations need to be targeted in order to capture 90.2% of infractions.

Econometric methods for score calculation

Three estimations were used: (4) estimation by a linear probability model; (5) estimation using a LOGIT model, which uses the logistic distribution; and (6) estimation using a PROBIT model, based on the normal distribution.

The linear probability model

The equation is estimated using the ordinary least squares method, corrected for heteroskedasticity. The explained variable is the binary (0/1) variable. The explanatory variables are the risk criteria. Four of these variables show a coefficient that is significantly different from zero.
The adjusted R² is [...]. The failure to respect the residual normality hypothesis means the usual econometric tests cannot be run. Estimating the variable coefficients in the equation enables calculation of the score for each declaration and conduct of the targeting effectiveness test.
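A linear probability score can be sketched with ordinary least squares solved through the normal equations. This is an illustrative toy in plain Python: the data are invented, only two criteria are used, and the heteroskedasticity correction mentioned above (which affects standard errors, not the coefficients) is not implemented.

```python
def ols(X, y):
    """Ordinary least squares: solve (X'X) b = X'y by Gauss-Jordan elimination."""
    k = len(X[0])
    # Augmented matrix [X'X | X'y].
    A = [
        [sum(r[i] * r[j] for r in X) for j in range(k)]
        + [sum(r[i] * t for r, t in zip(X, y))]
        for i in range(k)
    ]
    for col in range(k):
        # Partial pivoting: bring the largest remaining entry onto the diagonal.
        pivot = max(range(col, k), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        for r in range(k):
            if r != col:
                factor = A[r][col] / A[col][col]
                A[r] = [a - factor * b for a, b in zip(A[r], A[col])]
    return [A[i][k] / A[i][i] for i in range(k)]

# Invented history: (importer frequency, origin frequency, infraction 0/1).
rows = [
    (0.30, 0.10, 1), (0.25, 0.02, 1), (0.05, 0.08, 0), (0.02, 0.01, 0),
    (0.10, 0.05, 0), (0.01, 0.03, 0), (0.20, 0.06, 1), (0.03, 0.02, 0),
]
X = [[1.0, f_importer, f_origin] for f_importer, f_origin, _ in rows]
y = [infraction for _, _, infraction in rows]

coef = ols(X, y)  # [alpha, beta_importer, beta_origin]
scores = [sum(b * x for b, x in zip(coef, row)) for row in X]
residuals = [t - s for t, s in zip(y, scores)]
print([round(s, 3) for s in scores])
```

Nothing constrains the fitted scores to lie between 0 and 1, which is the limitation of the linear model noted earlier; they can still be used to rank declarations.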
Table 4: Effectiveness of targeting, linear probability (4)
Columns: (1) to (8). Score intervals (column 1): [0.5 : 1]; [0.1 : 0.5]; [0.07 : 0.1]; [0.06 : 0.07]; [0.05 : 0.06]; [0.04 : 0.05]; [0.03 : 0.04]; [0.02 : 0.03]; [0.01 : 0.02]; [0 : 0.01]; Total.

Although econometric estimation is biased by construction, its result in terms of targeting is good and slightly better than that of the simple methods. By targeting all declarations with a calculated score higher than 0.01, that is, 17.8% of declarations, the system captures 95.9% of infractions.

The PROBIT and LOGIT models

These two nonlinear models enable an estimate of the probability that a declaration may contain an infraction. The variables used are the same as for the linear probability model. The estimates are adjusted for the heteroskedasticity problem. The estimate of the equation based on the LOGIT model has a Pseudo-R² of 0.32 and that based on the PROBIT model results in a Pseudo-R² of [...]. The six explanatory variables are significant. Both estimates enable the calculation of the probability that a declaration may contain infractions.

Table 5: Effectiveness of targeting, probability calculated with a LOGIT model (5)
Columns: (1) to (8). Score intervals (column 1): [0.5 : 1]; [0.1 : 0.5]; [0.07 : 0.1]; [0.06 : 0.07]; [0.05 : 0.06]; [0.04 : 0.05]; [0.03 : 0.04]; [0.02 : 0.03]; [0.01 : 0.02]; [0 : 0.01]; Total.
Table 6: Effectiveness of targeting, probability calculated with a PROBIT model (6)
Columns: (1) to (8). Score intervals (column 1): [0.5 : 1]; [0.1 : 0.5]; [0.07 : 0.1]; [0.06 : 0.07]; [0.05 : 0.06]; [0.04 : 0.05]; [0.03 : 0.04]; [0.02 : 0.03]; [0.01 : 0.02]; [0 : 0.01]; Total.

Although the method is statistically more rigorous, estimates on the basis of a LOGIT or PROBIT model do not enable improvement of targeting effectiveness. Indeed, by targeting declarations that have a calculated score above 0.01 (that is, between 10% and 12% of declarations, depending on the model), the system captures only 82-84% of declarations containing infractions, depending on the model, that is, a rate lower than that of the other calculation methods.

Why do we get these results? The quality of the database is certainly the most plausible explanation. Indeed, infractions are rare (about 1% of the declarations in this database), which the LOGIT and PROBIT models find hard to accommodate. The reality is certainly different. Good targeting is a factor in improving customs information systems and, hence, the database that feeds the system. Indeed, by controlling less and in a different manner (second-line controls that substitute for first-line controls), services are led to control better. Over time, database quality improvement should restore the advantage to the PROBIT and LOGIT models, which are especially well suited to scoring.

Some lessons for customs authorities of developing countries

Streamlining customs controls is one of the keys to modernising customs administration in developing countries. Using statistical techniques can substantially improve the efficiency of targeting declarations to be inspected. The technical nature of these statistical methods is no obstacle to customs authorities developing an effective risk management system. The simple methods proposed are accessible to all staff trained to a master's degree level in economics or statistics. The proposed tests show that statistical techniques allow effective targeting of declarations.
They have a good ability to predict and detect declarations containing infractions. Thus, a developing country's customs authorities can build, stage by stage, a risk management system at the pace at which it acquires staff skills and practice, while serving as the driving force behind the modernisation of the first stage. A period of two to three years is sufficient to develop such a system in three main stages.

The first stage is to establish risk profiles by criterion and simply combine them (in a weighted average, for example). This stage is already possible in most developing countries' customs authorities; countries such as Côte d'Ivoire, Senegal and Mali have managed this fast. This system can effectively replace the current, rather inefficient systems of selectivity, without risk to either tax revenues or the country's security.
Once the database is fed by more reliable data, the second stage is to apply econometric scoring techniques, which are usually more effective than averaging because they are based on statistical distributions. Binomial models are the easiest to implement. The information analysed on the background history of declarations is binary: has the declaration been found in infraction, yes or no? The information returned is binary too: does the new declaration present an increased risk of infraction, yes or no? The model says nothing about the nature of the infraction. The second stage comes into play when the customs information system is not sufficiently informed about the nature of the infractions confirmed. The information revealed is the presumption of infraction, whereby the control officers must define the nature of the infraction.

The third stage entails setting up a comprehensive and accurate customs information system on recorded infractions. Multinomial models are then used to provide information about the nature of the infraction that is being considered.

These statistical methods are just one component of the risk management system. Random controls, selectivity on new fraud forms detected by the intelligence services, and selectivity from the moment one characteristic of the declaration is not entered in the database are complementary to the statistical analysis. The combination of all these elements creates an effective risk management system that helps reduce the number of intrusive inspections without risk to the country. Reducing the number of first-line inspections makes it possible to improve the inspections performed as well as to develop a posteriori inspection. Developing a posteriori inspections should be an opportunity to bring together Customs and tax authorities, thereby contributing to greater efficiency in total government revenue. Trade facilitation and improving revenue collection are therefore perfectly compatible.
External technical assistance regarding risk management systems enables developing countries to invest more quickly and solidly in modernising their customs authorities. This external technical assistance, however, is beneficial only if the customs authorities adopt an up-to-date management of their human resources: staff in charge of these technical matters should be recruited on the basis of well-defined job descriptions, and personnel assigned to risk management services should be appointed for the long term.

References

Bolton, R-J & Hand, D-J 2002, Statistical fraud detection: a review, Statistical Science, vol. 17, no. 3, pp.
De Wulf, L & Sokol, J-B (eds) 2004, Customs modernization handbook, World Bank, Washington, DC.
Geourjon, A-M & Laporte, B 2005, Risk management for targeting customs controls in developing countries: a risky venture for revenue performance?, Public Administration and Development, vol. 25, no. 2, pp.
Geourjon, A-M, Laporte, B & Rota Graziosi, G 2010, How to modernise risk analysis and the selectivity of customs controls in developing countries?, WCO News, no. 62, pp.
Johnson, ND 2001, Committing to Civil Service Reform: the performance of pre-shipment inspection under different institutional regimes, World Bank Policy Research Working Paper, No. 2594, Washington, DC.
Keen, M (ed.) 2003, Changing Customs: challenges and strategies for the reform of customs administration, International Monetary Fund, Washington, DC.
Phua, C, Lee, V, Smith-Miles, K & Gayler, R 2005, A comprehensive survey of data mining-based fraud detection research, working paper, Computing Research Repository, abs/
World Customs Organization (WCO) 1999, International Convention on the Harmonization and Simplification of Customs Procedures (as amended), Revised Kyoto Convention, WCO, Brussels, content.html.

Endnote

1 GAINDE is the French acronym for Automated Management of Customs and Economic Information.

Bertrand Laporte

Professor Bertrand Laporte is a member of CERDI (Centre for Study and Research into International Development) and maître de conférences at the Université d'Auvergne. He is co-director of the Public Finance in Developing and Transition Countries master's course, and has spent almost 20 years working on international trade, tax and customs administration issues. He is a member of the panel of experts of the International Monetary Fund (IMF) Fiscal Affairs Department and works for a number of international institutions.
Regression with a Binary Dependent Variable
Regression with a Binary Dependent Variable Chapter 9 Michael Ash CPPA Lecture 22 Course Notes Endgame Take-home final Distributed Friday 19 May Due Tuesday 23 May (Paper or emailed PDF ok; no Word, Excel,
ECON 523 Applied Econometrics I /Masters Level American University, Spring 2008. Description of the course
ECON 523 Applied Econometrics I /Masters Level American University, Spring 2008 Instructor: Maria Heracleous Lectures: M 8:10-10:40 p.m. WARD 202 Office: 221 Roper Phone: 202-885-3758 Office Hours: M W
Analyzing Intervention Effects: Multilevel & Other Approaches. Simplest Intervention Design. Better Design: Have Pretest
Analyzing Intervention Effects: Multilevel & Other Approaches Joop Hox Methodology & Statistics, Utrecht Simplest Intervention Design R X Y E Random assignment Experimental + Control group Analysis: t
A Short review of steel demand forecasting methods
A Short review of steel demand forecasting methods Fujio John M. Tanaka This paper undertakes the present and past review of steel demand forecasting to study what methods should be used in any future
Power and sample size in multilevel modeling
Snijders, Tom A.B. Power and Sample Size in Multilevel Linear Models. In: B.S. Everitt and D.C. Howell (eds.), Encyclopedia of Statistics in Behavioral Science. Volume 3, 1570 1573. Chicester (etc.): Wiley,
HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION
HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HOD 2990 10 November 2010 Lecture Background This is a lightning speed summary of introductory statistical methods for senior undergraduate
Benefits of the Revised Kyoto Convention
WCO Research Paper No. 6 Benefits of the Revised Kyoto Convention (February 2010) Tadashi Yasui 1 Abstract This paper aims primarily to summarize the benefits of both acceding to and implementing the WCO
IDENTIFYING BANK FRAUDS USING CRISP-DM AND DECISION TREES
IDENTIFYING BANK FRAUDS USING CRISP-DM AND DECISION TREES Bruno Carneiro da Rocha 1,2 and Rafael Timóteo de Sousa Júnior 2 1 Bank of Brazil, Brasília-DF, Brazil [email protected] 2 Network Engineering
FDI as a source of finance in imperfect capital markets Firm-Level Evidence from Argentina
FDI as a source of finance in imperfect capital markets Firm-Level Evidence from Argentina Paula Bustos CREI and Universitat Pompeu Fabra September 2007 Abstract In this paper I analyze the financing and
FORECASTING DEPOSIT GROWTH: Forecasting BIF and SAIF Assessable and Insured Deposits
Technical Paper Series Congressional Budget Office Washington, DC FORECASTING DEPOSIT GROWTH: Forecasting BIF and SAIF Assessable and Insured Deposits Albert D. Metz Microeconomic and Financial Studies
Predicting Successful Completion of the Nursing Program: An Analysis of Prerequisites and Demographic Variables
Predicting Successful Completion of the Nursing Program: An Analysis of Prerequisites and Demographic Variables Introduction In the summer of 2002, a research study commissioned by the Center for Student
What explains modes of engagement in international trade? Conference paper for Kiel International Economics Papers 2011. Current Version: June 2011
What explains modes of engagement in international trade? Conference paper for Kiel International Economics Papers 2011 Current Version: June 2011 Natalia Trofimenko Kiel Institute for World Economy Hindenburgufer
Asymmetry and the Cost of Capital
Asymmetry and the Cost of Capital Javier García Sánchez, IAE Business School Lorenzo Preve, IAE Business School Virginia Sarria Allende, IAE Business School Abstract The expected cost of capital is a crucial
Financial Trading System using Combination of Textual and Numerical Data
Financial Trading System using Combination of Textual and Numerical Data Shital N. Dange Computer Science Department, Walchand Institute of Rajesh V. Argiddi Assistant Prof. Computer Science Department,
Mortgage Loan Approvals and Government Intervention Policy
Mortgage Loan Approvals and Government Intervention Policy Dr. William Chow 18 March, 214 Executive Summary This paper introduces an empirical framework to explore the impact of the government s various
Module 2: Introduction to Quantitative Data Analysis
Module 2: Introduction to Quantitative Data Analysis Contents Antony Fielding 1 University of Birmingham & Centre for Multilevel Modelling Rebecca Pillinger Centre for Multilevel Modelling Introduction...
Trade risk management: a global approach
World Customs Journal Trade risk management: a global approach Abstract Lorraine Trapani This article discusses IBM s global approach to managing risk associated with importing product into more than 170
Is the Forward Exchange Rate a Useful Indicator of the Future Exchange Rate?
Is the Forward Exchange Rate a Useful Indicator of the Future Exchange Rate? Emily Polito, Trinity College In the past two decades, there have been many empirical studies both in support of and opposing
MASTER OF APPLIED ECONOMICS
The Master of Applied Economics is intended for students with strong academic background in economics, statistics, mathematics and econometrics. This Master offers a two-year graduate study program providing
Master Program Applied Economics
The English version of the curriculum for the Master Program in Applied Economics is not legally binding and is for informational purposes only. The legal basis is regulated in the curriculum published
Data Mining with Qualitative and Quantitative Data
Data Mining with Qualitative and Quantitative Data John F. Elder IV, Ph.D. CEO, Elder Research, IIA Faculty S e p t e m b e r, 2010 www.iianalytics.com www.iianalytics.com John F. Elder IV, PhD Elder Research,
Improving sales effectiveness in the quote-to-cash process
IBM Software Industry Solutions Management Improving sales effectiveness in the quote-to-cash process Improving sales effectiveness in the quote-to-cash process Contents 2 Executive summary 2 Effective
Advisory Guidelines of the Financial Supervisory Authority. Requirements regarding the arrangement of operational risk management
Advisory Guidelines of the Financial Supervisory Authority Requirements regarding the arrangement of operational risk management These Advisory Guidelines have established by resolution no. 63 of the Management
Chapter 6: The Information Function 129. CHAPTER 7 Test Calibration
Chapter 6: The Information Function 129 CHAPTER 7 Test Calibration 130 Chapter 7: Test Calibration CHAPTER 7 Test Calibration For didactic purposes, all of the preceding chapters have assumed that the
Yew May Martin Maureen Maclachlan Tom Karmel Higher Education Division, Department of Education, Training and Youth Affairs.
How is Australia s Higher Education Performing? An analysis of completion rates of a cohort of Australian Post Graduate Research Students in the 1990s. Yew May Martin Maureen Maclachlan Tom Karmel Higher
Getting Correct Results from PROC REG
Getting Correct Results from PROC REG Nathaniel Derby, Statis Pro Data Analytics, Seattle, WA ABSTRACT PROC REG, SAS s implementation of linear regression, is often used to fit a line without checking
Introduction to Regression and Data Analysis
Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it
Mixing internal and external data for managing operational risk
Mixing internal and external data for managing operational risk Antoine Frachot and Thierry Roncalli Groupe de Recherche Opérationnelle, Crédit Lyonnais, France This version: January 29, 2002 Introduction
Quality of Education in India: A Case Study of Primary Schools of Assam
Bangladesh e-journal of Sociology. Volume 10 Number 1, January 2013. 71 Quality of Education in India: A Case Study of Primary Schools of Assam Sahidul Ahmed * Abstract: The global community is giving
Correlation. What Is Correlation? Perfect Correlation. Perfect Correlation. Greg C Elvers
Correlation Greg C Elvers What Is Correlation? Correlation is a descriptive statistic that tells you if two variables are related to each other E.g. Is your related to how much you study? When two variables
Latent Class Regression Part II
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
ASYCUDA objectives and functions
Rev 2 January 2011 UNCTAD Trust Fund for Trade Facilitation Negotiations Technical Note No. 21 ASYCUDA Background UNCTAD s Automated System for Customs Data (ASYCUDA) is an automated Customs data management
Microfinance and the Role of Policies and Procedures in Saturated Markets and During Periods of Fast Growth
Microfinance and the Role of Policies and Procedures in Saturated Markets and During Periods of Fast Growth Microfinance Information Exchange & Planet Rating An evaluation of the role that lending methodologies,
Attrition in Online and Campus Degree Programs
Attrition in Online and Campus Degree Programs Belinda Patterson East Carolina University [email protected] Cheryl McFadden East Carolina University [email protected] Abstract The purpose of this study
DECISION TREE ANALYSIS: PREDICTION OF SERIOUS TRAFFIC OFFENDING
DECISION TREE ANALYSIS: PREDICTION OF SERIOUS TRAFFIC OFFENDING ABSTRACT The objective was to predict whether an offender would commit a traffic offence involving death, using decision tree analysis. Four
Practical. I conometrics. data collection, analysis, and application. Christiana E. Hilmer. Michael J. Hilmer San Diego State University
Practical I conometrics data collection, analysis, and application Christiana E. Hilmer Michael J. Hilmer San Diego State University Mi Table of Contents PART ONE THE BASICS 1 Chapter 1 An Introduction
Comparing Features of Convenient Estimators for Binary Choice Models With Endogenous Regressors
Comparing Features of Convenient Estimators for Binary Choice Models With Endogenous Regressors Arthur Lewbel, Yingying Dong, and Thomas Tao Yang Boston College, University of California Irvine, and Boston
13. Poisson Regression Analysis
136 Poisson Regression Analysis 13. Poisson Regression Analysis We have so far considered situations where the outcome variable is numeric and Normally distributed, or binary. In clinical work one often
STATISTICAL ANALYSIS OF UBC FACULTY SALARIES: INVESTIGATION OF
STATISTICAL ANALYSIS OF UBC FACULTY SALARIES: INVESTIGATION OF DIFFERENCES DUE TO SEX OR VISIBLE MINORITY STATUS. Oxana Marmer and Walter Sudmant, UBC Planning and Institutional Research SUMMARY This paper
SAS Software to Fit the Generalized Linear Model
SAS Software to Fit the Generalized Linear Model Gordon Johnston, SAS Institute Inc., Cary, NC Abstract In recent years, the class of generalized linear models has gained popularity as a statistical modeling
Insurance Analytics - analýza dat a prediktivní modelování v pojišťovnictví. Pavel Kříž. Seminář z aktuárských věd MFF 4.
Insurance Analytics - analýza dat a prediktivní modelování v pojišťovnictví Pavel Kříž Seminář z aktuárských věd MFF 4. dubna 2014 Summary 1. Application areas of Insurance Analytics 2. Insurance Analytics
10 Shenton Way MAS Building Singapore 079117 Telephone: (65) 6225 5577 Facsimile: (65) 6229 9229
10 Shenton Way MAS Building Singapore 079117 Telephone: (65) 6225 5577 Facsimile: (65) 6229 9229 Circular No. CMI 03/2015 28 October 2015 To: Holders of a Capital Markets Services Licence for conducting
26. Determinants I. 1. Prehistory
26. Determinants I 26.1 Prehistory 26.2 Definitions 26.3 Uniqueness and other properties 26.4 Existence Both as a careful review of a more pedestrian viewpoint, and as a transition to a coordinate-independent
CHARACTERISTICS OF SUCCESSFUL MEGAPROJECTS
Chapter Six CHARACTERISTICS OF SUCCESSFUL MEGAPROJECTS Bibliographic Information: Committee to Assess the Policies and Practices of the Department of Energy to Design, Manage, and Procure Environmental
Optimization of the physical distribution of furniture. Sergey Victorovich Noskov
Optimization of the physical distribution of furniture Sergey Victorovich Noskov Samara State University of Economics, Soviet Army Street, 141, Samara, 443090, Russian Federation Abstract. Revealed a significant
2. Linear regression with multiple regressors
2. Linear regression with multiple regressors Aim of this section: Introduction of the multiple regression model OLS estimation in multiple regression Measures-of-fit in multiple regression Assumptions
Measuring BDC s impact on its clients
From BDC s Economic Research and Analysis Team July 213 INCLUDED IN THIS REPORT This report is based on a Statistics Canada analysis that assessed the impact of the Business Development Bank of Canada
Keep It Simple: Easy Ways To Estimate Choice Models For Single Consumers
Keep It Simple: Easy Ways To Estimate Choice Models For Single Consumers Christine Ebling, University of Technology Sydney, [email protected] Bart Frischknecht, University of Technology Sydney,
Masters in Financial Economics (MFE)
Masters in Financial Economics (MFE) Admission Requirements Candidates must submit the following to the Office of Admissions and Registration: 1. Official Transcripts of previous academic record 2. Two
Gerry Hobbs, Department of Statistics, West Virginia University
Decision Trees as a Predictive Modeling Method Gerry Hobbs, Department of Statistics, West Virginia University Abstract Predictive modeling has become an important area of interest in tasks such as credit
AML Topics Using analytics to get the most from your transaction monitoring system
www.pwc.com AML Topics Using analytics to get the most from your transaction monitoring system March 2011 Contents Components of the AML Compliance Program... 1 Transaction Monitoring... 1 Transaction
How To Monitor A Project
Module 4: Monitoring and Reporting 4-1 Module 4: Monitoring and Reporting 4-2 Module 4: Monitoring and Reporting TABLE OF CONTENTS 1. MONITORING... 3 1.1. WHY MONITOR?... 3 1.2. OPERATIONAL MONITORING...
Logistic Regression. Jia Li. Department of Statistics The Pennsylvania State University. Logistic Regression
Logistic Regression Department of Statistics The Pennsylvania State University Email: [email protected] Logistic Regression Preserve linear classification boundaries. By the Bayes rule: Ĝ(x) = arg max
Week 7 - Game Theory and Industrial Organisation
Week 7 - Game Theory and Industrial Organisation The Cournot and Bertrand models are the two basic templates for models of oligopoly; industry structures with a small number of firms. There are a number
Effectiveness of online teaching of Accounting at University level
Effectiveness of online teaching of Accounting at University level Abstract: In recent years, online education has opened new educational environments and brought new opportunities and significant challenges
COAG National Legal Profession Reform Discussion Paper: Trust money and trust accounting
COAG National Legal Profession Reform Discussion Paper: Trust money and trust accounting Purpose The purpose of this Paper is to outline the Taskforce s preferred approach to regulation of trust money
What s New in Econometrics? Lecture 8 Cluster and Stratified Sampling
What s New in Econometrics? Lecture 8 Cluster and Stratified Sampling Jeff Wooldridge NBER Summer Institute, 2007 1. The Linear Model with Cluster Effects 2. Estimation with a Small Number of Groups and
WHITE PAPER. Payment Integrity Trends: What s A Code Worth. A White Paper by Equian
WHITE PAPER Payment Integrity Trends: What s A Code Worth A White Paper by Equian June 2014 To install or not install a pre-payment code edit, that is the question. Not all standard coding rules and edits
A framework to plan monitoring and evaluation
6. A framework to plan monitoring and evaluation Key ideas Monitoring and evaluation should be an integral part of (and therefore affect) all stages of the project cycle. As mentioned earlier, you need
SURVEY ON HOW COMMERCIAL BANKS DETERMINE LENDING INTEREST RATES IN ZAMBIA
BANK Of ZAMBIA SURVEY ON HOW COMMERCIAL BANKS DETERMINE LENDING INTEREST RATES IN ZAMBIA September 10 1 1.0 Introduction 1.1 As Government has indicated its intention to shift monetary policy away from
Classification Problems
Classification Read Chapter 4 in the text by Bishop, except omit Sections 4.1.6, 4.1.7, 4.2.4, 4.3.3, 4.3.5, 4.3.6, 4.4, and 4.5. Also, review sections 1.5.1, 1.5.2, 1.5.3, and 1.5.4. Classification Problems
Global Trends in Life Insurance: Claims
What you need to know LIFE INSURANCE Global Trends in Life Insurance: Claims Key claims trends and their implications for the life insurance industry Contents 1 Highlights 3 2 Introduction 4 3 Emerging
Statement of Guidance
Statement of Guidance Internal Audit Unrestricted Trust Companies 1. Statement of Objectives 1.1. To provide specific guidance on Internal Audit Functions as called for in section 3.6 of the Statement
