Descriptive Statistics on Companies in the Forest Products Industry

Size: px
Start display at page:

Download "Descriptive Statistics on Companies in the Forest Products Industry"

From this document you will learn the answers to the following questions:

  • Who is the author of the paper Descriptive Statistics on Companies in the Forest products Industry?

  • What ratio exceeded the boundaries for normal distributions?

  • What is the main reason the paper is used?

Transcription

1 Descriptive Statistics on Companies in the Forest Products Industry Kristoffer Öström IAMSR, Åbo Akademi University Lemminkäineng. 14 B, 252 Åbo, Finland Barbro Back IAMSR, Åbo Akademi University Lemminkäineng. 14 B, 252 Åbo, Finland Hannu Vanharanta Pori School of Technology and Economics Pohjoisrantak. 11, P.O. Box 33, 281 Pori, Finland Ari Visa Tampere University of Technology P.O. Box 553, 3311 Tampere, Finland Turku Centre for Computer Science TUCS Technical Report No 33 February 2 ISBN

2 ABSTRACT A prerequisite of many statistical methods is that the underlying data material is normally or approximately normally distributed. If the data material violates this assumption, the results of an analysis might be misleading or incorrect. The overall objective with this paper is to provide the reader with descriptive statistics over two separate sets of empirical data and test these sets against the assumption of normality. The data material consists of financial ratios calculated from the international forest products industry. This paper is part of a larger study where we want to analyse the forest products companies financial performance world-wide. For that study it is of outmost importance that we know whether the variables in each of our sets of data are normally distributed. Three different test statistics: skewness, kurtosis and Kolmogorov- Smirnov with Lilliefors correction are used to test the data material against normality. Results achieved show that the Kolmogorov-Smirnov test statistic rejected normality for all variables. The skewness ratio for all variables fell within the specified range of approximately normal distributions while half of the kurtosis ratios exceeded the boundaries for approximately normally distributed data. Key words: Normality, Skewness, Kurtosis, Kolmogorov-Smirnov TUCS Research Group Computational Intelligence in Business

3 1. Introduction The most widely recognised distribution in the world is perhaps the normal distribution: the well-known bell-shaped curve. Textbooks in statistics often goes into great depth when describing how various statistical methods can be used in modelling large sets of data. The examples used in these books are often constructed on clean sets of data and both the techniques as well as the outcome work well. This situation, the textbook one, is very rare in the real world. It is the nature of real world data to be skewed, poorly distributed or otherwise incorrect. These issues, if not properly addressed, can undermine the even most robust statistical method to yield reliable and stable results. There seems to be a tendency that current research reporting ignores the shape of a distribution [Hopkins et al., 199]. All too often sets of data are dumped in statistical software packages without giving a thought on the distribution of the data material and only hoping for the best outcome. In this paper six financial ratios will be investigated and tested against the assumption of normality. The ratios will be extracted from two empirical databases with information about forest products companies world-wide. The first database encloses data from the time period 1985 to 1989 while the other database includes financial information from the years 1996 to The ratios will be inspected in terms of identifying anomalous values, pre-processed with appropriate statistical methods and tested for normality. Financial ratios are used because they are good indicators on a company s overall performance. As the financial reports of a business contains a wealth of financial information it is, of course, these ratios that are essential when financially measuring a company or even an entire industrial sector. Moreover, financial ratios are especially exposed to distortions in their distributions since they are a result of human management activities to measure and control natural business processes. Many studies have shown that financial data, and from them extracted financial ratios, do not tend to be normally distributed. For example, Tam et al. (1992) tested 19 financial ratios for normality and the tests indicated that 15 out of the 19 ratios were not normally distributed. Dorfman (1993) again proved that a large portion of agricultural economic data is inconsistent with the assumption of normality. This paper is part of a larger study where we want to analyse the forest products companies financial performance world-wide. For that study it is of outmost importance that we know whether the variables in each of our sets of data are normally distributed or not. The questions to be answered in this paper are therefore: 1) do financial ratios in the forest products industry diverge from normality and 2) should normal multivariate statistical methods be applied or are instead non-

4 parametrical methods such as neural networks to prefer when analysing real world, empirical sets of data. The rest of the paper is organised as follows: Section 2 describes the sets of data and the applied procedure for selecting the financial ratios. Section 3 presents the preliminary data inspection and used statistical methods. In section 4 the results are presented while the conclusions of this study are found in the final section The Database and Selection of Ratios In this paper two sets of data are used for the statistical calculations. The Green Gold Financial Reports database [Salonen et al., 199a, 199b, 1991] is used as a source of information to acquire the 1985 to 1989 data while the set of data collected in [Öström, 1999] is used to provide information from the years 1996 and The Green Gold database consists of standardised income statements, balance sheets and cash flow statements of 16 companies in the international forest products industry. The database also consists of specific financial ratios calculated from the standardised reports as well as general company information concerning production volumes, products etc. The latter database includes financial information collected from companies www pages through Internet. This database consists of 34 major forest products companies from the Nordic countries and North America. The selection process of ratios is one of the most important decisions an analyst makes when preparing a statistical analysis [Stein, 1993]. In the first database there were more than 5 different key ratios that represented the financial situation of each company and in the latter enough information to calculate several important financial ratios. The selection of appropriate financial ratios in this paper is based on a paper conducted by Vanharanta et al. (1995). In that paper a cognitive approach was used to select the most important variables that would adequately represent a company s overall performance. Ten corporate analysts with long experience in the field of capital investments and accounting were asked to choose the most important variables from the Green Gold Financial Reports database for a performance analysis task. This approach resulted in nine different financial key ratios. From those nine ratios six were selected to represent the points of reference in this paper. The reduction from nine to six ratios was done in order to compile the two sets of data and make comparative studies possible.

5 The financial ratios retrieved from the databases and used in this paper are: 1. Operating profit / sales 2. Profit after financial items / sales 3. Return on equity 4. Solidity 5. Current ratio 6. Funds from operations / sales We note that there are three profitability measures (1-3), one that measures capital structure (4), one liquidity measure (5), and one cash flow measure (6). Certainly, including more financial ratios measuring the capital structure, working capital or indebtedness may further improve the behavioural picture of this industry sector. However, the limited quantity of available financial information in the set of data resulted in a trade-off where the added benefits of using more financial ratios would not have exceeded the hardship of obtaining them. The selected ratios are therefore considered sufficient in numbers for normality testing in the international forest products industry. 3. Data Preparation and Test Methods The financial ratios collected from the databases were first processed and cleaned from anomalous outliers i.e. from erroneous data entries that can have a very disruptive influence on any analysis. For this purpose boxplots and frequency histograms of individual variables were used. The boxplot is a box summarising the distribution of a set of data values where anomalous data entries appear as individual plots far outside the box. In Figure 1 the frequency histogram of Return on equity is presented. Figure 1 reveals that the data material contains at least two erroneous data entries. These anomalous values are discarded when assured that they in fact are actual errors and not divergent behaviours of some of the companies in the database. Figure 1 distribution of Return on equity

6 Each financial ratio was examined in the same manner and anomalies were identified. These values, a total of 14, were then excluded. The final set of data contained a total of 3177 observations for the years 1985 to 1989, distributed on 525 to 531 observations per financial ratio and a total of 355 observations from the years 1996 to 1997, distributed on 57 to 6 observations per financial ratio. In Table 1 the minimum and maximum values for both databases are presented. The companies included in the study are listed in Table 2 and Table 3. Financial ratios data sets data sets unit min max min max Operating profit / sales % Profit after financial items / sales % Return on equity % Solidity ratio Current ratio ratio Funds from operations / sales % Table 1 Range of the financial ratios We note that the range for the period is considerably smaller compared to that for the period The minimum and maximum values for each variable are also to be found in the set of data except for solidity, which has its minimum located in the latter set of data. As earlier mentioned, natural processes that possess similar features may well produce distributions that are approximately normal when there is an appreciable number of observations, the socalled law of large numbers [Feller, 1968]. However, when analysing real-world processes and problems normally distributed data is rarely available [Pryzdek, 1995]. A reason to this is that the objective of most human management activity is to measure and control natural processes. Other distortions, which obstruct normality, also occur when people try to measure the results from different activities. In this paper three main methods: skewness, kurtosis and Kolmogorov-Smirnov with Lilliefors correction are used to test the variables for normality. Skewness is a method for determining the symmetry of a distribution. The skewness gives the answer on whether the function is shaped like a bell, half-bell, dented i.e. the asymmetry of the distribution around its mean. The normal distribution is symmetric, and has a skewness value of zero. In a normal distribution the mean and the median will also have the same value. Data from a positively skewed distribution i.e. skewed to the right will have values grouped together below the mean but also a long tail of values above the mean. Negative skewed distributions have values grouped above the mean with tails extending toward more negative values. If the distribution is skewed to the right the mean of the distribution will be larger than the median and vice versa. A skewness value greater than 2. [SPSS, 1999] indicates a distribution that differs significantly from a normal distribution.

7 Sweden AB Statens Skogsindustrier Graningeverkens AB Korsnäs AB Mo och Domsjö AB Munkedals AB Munksjö AB Norrlands Skogsägarens Cellulosa AB Norrsundets Bruks AB Obbola Linerboard AB Rottneros Bruk AB Svenska Cellulosa AB (SCA) Stora Kopparbergs Bergslags AB Södra Skogsägarna AB Finland A. Ahlström Oy Enso-Gutzeit Oy Kemi Oy Kymmene Oy Metsä-Serla Oy Rauma-Repola Oy Sunila Oy Oy Tampella Ab Oy Veitsiluoto Ab Norway Norske Skogindustrier A.S. A/S Union Austria Laakirchen Lenzig Leykam Nettingsdorfer Steyrermuhl Zellstoff Germany Europa Carton Feldmuhle Haindl Papier Hannover MD Papier Schwöbishe Waldhof-A. Zanders Italy Saffa Cartiera Holland Berghuizer Buchmann Crown van Gelder Gelderse Koninklijke N.V. Papierfabriek Parenco Portugal Portucel-Empresa Group Caima Celbi Soporcel UK Associated Bowater Inrdustries BPB Industries David S. Smith France Arjomari Aussedat Beghin-Say Kaysersberg La Cellulose Du. La Rochette Sibylle Spain Empresa Nacion. La Papelera Swiss Attisholz Biber Holding Holstoff Industrieholding Papierfab. Perlen James Cropper Canada Abitibi-Price Inc. Canfor Corporation Cascades Inc. Canadian Pacific Forest Products Ltd. Crestbrook Forest Industries Ltd. Doman Industries Ltd. Domtar Inc. Donohue Inc. Fletcher Challenge Canada Ltd. International Forest Products Ltd. MacMillan Bloedel Ltd. Noranda Inc. Noranda Forest Inc. Perkins Papers Ltd. Repap Enterprises Corporation Inc. Rolland Inc. Scott Paper Ltd. Tembec Inc. USA Boise Cascade Corporation Champion International Corporation Chesapeake Corporation Consolidated Papers Inc. Dennison Manufacturing Company The Dexter Corporation Federal Paper Board Company Gaylord Container Corporation Georgia-Pacific Corporation P.H. Glatfelter Company Great Northern Nekoosa Corporation International Paper Company James River Corporation Kimberly-Clark Corporation Longview Fibre Company Louisiana-Pacific Corporation Mead Corporation Mosinee Paper Corporation Pentair Inc. Potlatch Corporation The Procter & Gamble Company Scott Paper Company Sonoco Products Company Stone Container Corporation Tambrands Inc. Temple-Inland Inc. Union Camp Corporation Wausau Paper Mills Company Westvaco Corporation Weyerhaeuser Company WTD Industries Inc. Sweden MoDo AB Munksjö AB SCA AB Södra AB Finland ENSO OY Metsä-Serla OY UPM-Kymmene OY Norway Hunsfoss Fabrikker Norske Skog A.S. USA Boise Cascade Bowater Champion International Consolidated Papers Fort James Georgia-Pacific Group International Paper Jefferson-Smurfit Corp. Kimberly-Clark Mead Potlatch Corp. Stone Container Union Camp Westvaco Weyerhaeuser Willamette Industries Canada Alliance Canfor Cascades Inc. Crestbrook Forest Ind. Ltd. Doman Industries Domtar Inc. MacMillan Bloedel Sonoco Inc. Tembec inc. Table 2 Companies included in the set of data Table 3 Companies included in the set of data

8 The second test for normality measures the kurtosis. While skewness indicates the asymmetry of the distribution around its mean, kurtosis measures the extent to which observations cluster around a central point. The value of the kurtosis coefficient is for normally distributed data. Positive kurtosis denotes a relatively peaked distribution while negative kurtosis indicates a relatively flat distribution. A kurtosis value greater than ±2. indicate, in the same way as skewness, a distribution that differs significantly from the normal distribution. The third test is the Kolmogorov-Smirnov test statistic. The Kolmogorov-Smirnov test is defined as the maximum vertical distance between an empirical distribution (the data sample) and a specified distribution function (the normal distribution function). In the Kolmogorov-Smirnov method the mean and/or variance are not specified beforehand. This implies a risk that the method becomes less powerful i.e. the probability of rejecting a null hypothesis when it is in fact false increases, and therefore a Lilliefors correction [Lilliefors, 1969] is added to the method. The Lilliefors test for normality adjusts the Kolmogorov-Smirnov test specifically for testing for normality when the mean and variance are unknown. Further information on the Kolmogorov-Smirnov test statistic can be found in books devoted exclusively to non-parametric statistics, such as Conover (198). Most statistical packages and spreadsheet programs provide some form of the skewness and kurtosis functions. Well worth to remember is that several different measures of skewness and kurtosis have been proposed [Groeneveld et al., 1984]. As each method differs slightly from the other they also yield slightly different results. Therefore, it is important to determine which formula is being used in the actual software where the analysis is being carried out. In this paper some basic statistical measures such as the mean, median and standard deviation are also used.

9 4. Results In this chapter the analysis of the databases will be carried out. The analysis was performed using the SPSS version 9. statistical program package [SPSS, 1999]. The numerical data was presented to the program after outliers and distributional anomalies were removed. Each variable was processed separately and in Table 4 summarised statistics for the financial ratios of both databases are presented. Financial ratios Unit Mean Median Std.Dev. Mean Median Std.Dev. Operating profit / sales % Profit after financial items / sales % Return on equity (ROE) % Solidity ratio Current ratio ratio Funds from operations / sales % Table 4 Summarised descriptive statistics for the data sets A breakdown of the table shows that the average Operating profit was, for example, 14,9 % during the eighties, which is much higher than 7,5 % recorded under recent years. The Return on equity was also twice as large in the former database. The other financial ratios follow the same tendency, which is to be somewhat higher during the years though, not as much as the Operating profit and the Return on equity. Figure 2 presents the distribution for each of the six ratios during and in a frequency histogram format. From the picture it is possible to graphically compare the distribution of the variables with their theoretical normal distributions generated from the mean and the standard deviation of each sample. The histograms created from the set of data are presented to the left while the histograms to the right is from the set of data. The values on the X-axis, the size of bins and the number of bins are same for each ratio. Endpoints on the X- axis are chosen to adequately represent the entire range of data.

10 Operating profit / sales Operating profit / sales Profit after financial items / sales 14 Profit after financial items / sales Return on equity (ROE) Return on equity (ROE) Solidity Solidity

11 Current ratio Current ratio Funds from operations / sales 12 Funds from operations / sales Figure 2 histograms of the distributions. The statistical tests for normality indicated that none of the ratios were precisely normally distributed. Results of the Kolmogorov-Smirnov test are presented in Table 5. This test (at a significance level of α =.5) rejected the normality for all financial ratios in both the former as well as the latter database. Financial ratios Statistic Significance Statistic Significance Operating profit / sales * Profit after financial items / sales * Return on equity (ROE) Solidity Current ratio Funds from operations / sales * This is a lower bound of the true significance. Table 5 Kolmogorov-Smirnov with Lilliefors significance correction

12 The skewness ratio for all financial ratios fell within the specified range of [-2,2]. However, a breakdown of the skewness ratios in Table 6 gives us the information that most of the values that diverge from zero are positive, which indicates slight asymmetric distributions with tails extending more towards positive values i.e. there are somewhat more companies performing below the industry average. The kurtosis was not as consistent with the normality as skewness. In the first database two values, and in the latter four values exceeded the right boundary of the interval [-2,2] indicating somewhat peaked distributions when compared to the normal distribution. Financial ratios Skewness Kurtosis Skewness Kurtosis Operating profit / sales Profit after financial items / sales Return on equity (ROE) Solidity Current ratio Funds from operations / sales Table 6 Skewness and kurtosis for the databases To investigate the cause of the non-normal results further we can re-examine the frequency histograms. It is well worth combining the statistical analysis with visual examination of the frequency histograms since no measure of normality is conclusive. In this case, again we notice that the shapes of most histograms are quite peaked, further strengthening the hypothesis that the underlying data material is not normal. 5. Conclusions In this paper two sets of data from the international forest products industry were tested against the assumption of normality. The data material consisted of six financial ratios: three profitability measures, one capital structure measure, one liquidity measure and one ratio measuring the cash flow. These ratios were considered to give an adequate description of a company as well as of the entire forest products industry. When testing the data material for normality anomalous values in the data sets were first identified and removed. Three statistical measures: skewness, kurtosis and Kolmogorov-Smirnov with Lilliefors correction were used to test the cleaned data material against the assumption of normality. Apart from the normality tests basic statistical measures such as the mean, median and standard deviation were also used in order to give a descriptive picture of the data material.

13 It comes as no surprise that some of the variables in the two sets of data could not be considered to fill the requirements of normally distributed data. The Kolmogorov- Smirnov test statistic rejected normality for all variables. The skewness ratio for all variables fell within the specified range of approximately normal distributions. Two ratios in the database and four in the set of data exceeded the kurtosis boundaries for approximately normally distributed data. From these results it is apparent that normally distributed data in empirical databases are all but common also in the international forest products industry. It is therefore worth to emphasise the importance of proper normality testing before performing any kind of statistical analysis, and to consider using non-parametric methods for further analysis of the data material.

14 References Conover, W. J. (198). Practical Nonparametric Statistics. 2nd ed. New York: Wiley & Sons Dorfman, J. H. (1993). Should normality be a normal assumption? Economics Letters, 42, pp Elsevier Science Publ. Feller, W. (1968). Laws of Large Numbers. An Introduction to Probability Theory and Its Applications, Vol. 1, 3rd ed., pp , New York: Wiley & Sons Groeneveld, R. A., G. Meeden (1984). Measuring Skewness and Kurtosis. The Statistician, 33, pp Hopkins, K., D. Weeks (199). Tests for normality and measures of skewness: Their place in research reporting. Educational & Psychological Measurement, Vol. 5. Issue 4, pp Sage Publ. Inc. Lilliefors, H. W. (1969). On the Kolmogorov-Smirnov test for normality with mean and variance unknown. Journal of the American Statistical Association, 64, pp Pryzdek, T. (1995). Why Normal Distributions Aren't [All That Normal]. Quality Engineering, Vol. 7, No. 4, pp Salonen, H., H. Vanharanta (199a). Financial Analysis World Pulp and Paper Companies , Nordic Countries. Green Gold Financial Reports. Vol. 1., Finland: Ekono OY Salonen, H., H. Vanharanta (199b). Financial Analysis World Pulp and Paper Companies , North America. Green Gold Financial Reports. Vol. 2., Finland: Ekono OY Salonen, H., H. Vanharanta (1991). Financial Analysis World Pulp and Paper Companies , Europe. Green Gold Financial Reports. Vol. 3., Finland: Ekono OY SPSS Incorporated (1999). SPSS for Windows. Release Stein, R. (1993). Selecting Data for Neural Networks. AI Expert. February issue, pp Tam, K. Y., M. Kiang (1992). Managerial Applications of Neural Networks: The Case of Bank Failure Predictions. Management Science. Vol. 38. No. 7, pp Vanharanta, H., T. Käkölä, B. Back (1995). Validity and utility of a Hyperknowledge- Based financial Benchmarking System. Proceedings of the Twenty-Eight Annual Hawaii International Conference on Systems Science. Vol. 3, pp IEEE Computer Society Press Öström, K. (1999). Addressing Benchmarking Complexity with Neural Networks and Self-organising Maps. Master s thesis, Åbo Akademi University, Turku

UNDERSTANDING THE INDEPENDENT-SAMPLES t TEST

UNDERSTANDING THE INDEPENDENT-SAMPLES t TEST UNDERSTANDING The independent-samples t test evaluates the difference between the means of two independent or unrelated groups. That is, we evaluate whether the means for two independent groups are significantly

More information

Projects Involving Statistics (& SPSS)

Projects Involving Statistics (& SPSS) Projects Involving Statistics (& SPSS) Academic Skills Advice Starting a project which involves using statistics can feel confusing as there seems to be many different things you can do (charts, graphs,

More information

How To Check For Differences In The One Way Anova

How To Check For Differences In The One Way Anova MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way

More information

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) As with other parametric statistics, we begin the one-way ANOVA with a test of the underlying assumptions. Our first assumption is the assumption of

More information

Non Parametric Inference

Non Parametric Inference Maura Department of Economics and Finance Università Tor Vergata Outline 1 2 3 Inverse distribution function Theorem: Let U be a uniform random variable on (0, 1). Let X be a continuous random variable

More information

MEASURES OF LOCATION AND SPREAD

MEASURES OF LOCATION AND SPREAD Paper TU04 An Overview of Non-parametric Tests in SAS : When, Why, and How Paul A. Pappas and Venita DePuy Durham, North Carolina, USA ABSTRACT Most commonly used statistical procedures are based on the

More information

II. DISTRIBUTIONS distribution normal distribution. standard scores

II. DISTRIBUTIONS distribution normal distribution. standard scores Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,

More information

Lecture Notes Module 1

Lecture Notes Module 1 Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific

More information

Normality Testing in Excel

Normality Testing in Excel Normality Testing in Excel By Mark Harmon Copyright 2011 Mark Harmon No part of this publication may be reproduced or distributed without the express permission of the author. mark@excelmasterseries.com

More information

Simple linear regression

Simple linear regression Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between

More information

Tutorial 5: Hypothesis Testing

Tutorial 5: Hypothesis Testing Tutorial 5: Hypothesis Testing Rob Nicholls nicholls@mrc-lmb.cam.ac.uk MRC LMB Statistics Course 2014 Contents 1 Introduction................................ 1 2 Testing distributional assumptions....................

More information

HYPOTHESIS TESTING WITH SPSS:

HYPOTHESIS TESTING WITH SPSS: HYPOTHESIS TESTING WITH SPSS: A NON-STATISTICIAN S GUIDE & TUTORIAL by Dr. Jim Mirabella SPSS 14.0 screenshots reprinted with permission from SPSS Inc. Published June 2006 Copyright Dr. Jim Mirabella CHAPTER

More information

UNDERSTANDING THE TWO-WAY ANOVA

UNDERSTANDING THE TWO-WAY ANOVA UNDERSTANDING THE e have seen how the one-way ANOVA can be used to compare two or more sample means in studies involving a single independent variable. This can be extended to two independent variables

More information

Skewness and Kurtosis in Function of Selection of Network Traffic Distribution

Skewness and Kurtosis in Function of Selection of Network Traffic Distribution Acta Polytechnica Hungarica Vol. 7, No., Skewness and Kurtosis in Function of Selection of Network Traffic Distribution Petar Čisar Telekom Srbija, Subotica, Serbia, petarc@telekom.rs Sanja Maravić Čisar

More information

A MULTIVARIATE OUTLIER DETECTION METHOD

A MULTIVARIATE OUTLIER DETECTION METHOD A MULTIVARIATE OUTLIER DETECTION METHOD P. Filzmoser Department of Statistics and Probability Theory Vienna, AUSTRIA e-mail: P.Filzmoser@tuwien.ac.at Abstract A method for the detection of multivariate

More information

How To Test For Significance On A Data Set

How To Test For Significance On A Data Set Non-Parametric Univariate Tests: 1 Sample Sign Test 1 1 SAMPLE SIGN TEST A non-parametric equivalent of the 1 SAMPLE T-TEST. ASSUMPTIONS: Data is non-normally distributed, even after log transforming.

More information

Descriptive Statistics

Descriptive Statistics Y520 Robert S Michael Goal: Learn to calculate indicators and construct graphs that summarize and describe a large quantity of values. Using the textbook readings and other resources listed on the web

More information

Permutation Tests for Comparing Two Populations

Permutation Tests for Comparing Two Populations Permutation Tests for Comparing Two Populations Ferry Butar Butar, Ph.D. Jae-Wan Park Abstract Permutation tests for comparing two populations could be widely used in practice because of flexibility of

More information

T O P I C 1 2 Techniques and tools for data analysis Preview Introduction In chapter 3 of Statistics In A Day different combinations of numbers and types of variables are presented. We go through these

More information

Descriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion

Descriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion Descriptive Statistics Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion Statistics as a Tool for LIS Research Importance of statistics in research

More information

Impact of Skewness on Statistical Power

Impact of Skewness on Statistical Power Modern Applied Science; Vol. 7, No. 8; 013 ISSN 1913-1844 E-ISSN 1913-185 Published by Canadian Center of Science and Education Impact of Skewness on Statistical Power Ötüken Senger 1 1 Kafkas University,

More information

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1) CORRELATION AND REGRESSION / 47 CHAPTER EIGHT CORRELATION AND REGRESSION Correlation and regression are statistical methods that are commonly used in the medical literature to compare two or more variables.

More information

Walk the Line Written by: Maryann Huey Drake University Maryann.Huey@drake.edu

Walk the Line Written by: Maryann Huey Drake University Maryann.Huey@drake.edu Walk the Line Written by: Maryann Huey Drake University Maryann.Huey@drake.edu Overview of Lesson In this activity, students will conduct an investigation to collect data to determine how far students

More information

THE IMPACT OF DAILY TRADE VOLUME ON THE DAY-OF-THE- WEEK EFFECT IN EMERGING STOCK MARKETS

THE IMPACT OF DAILY TRADE VOLUME ON THE DAY-OF-THE- WEEK EFFECT IN EMERGING STOCK MARKETS ISSN 1392 124X INFORMATION TECHNOLOGY AND CONTROL, 2007, Vol.36, No.1A THE IMPACT OF DAILY TRADE VOLUME ON THE DAY-OF-THE- WEEK EFFECT IN EMERGING STOCK MARKETS Virgilijus Sakalauskas, Dalia Krikščiūnienė

More information

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm

More information

THE KRUSKAL WALLLIS TEST

THE KRUSKAL WALLLIS TEST THE KRUSKAL WALLLIS TEST TEODORA H. MEHOTCHEVA Wednesday, 23 rd April 08 THE KRUSKAL-WALLIS TEST: The non-parametric alternative to ANOVA: testing for difference between several independent groups 2 NON

More information

Chapter 7 Section 7.1: Inference for the Mean of a Population

Chapter 7 Section 7.1: Inference for the Mean of a Population Chapter 7 Section 7.1: Inference for the Mean of a Population Now let s look at a similar situation Take an SRS of size n Normal Population : N(, ). Both and are unknown parameters. Unlike what we used

More information

MODIFIED PARAMETRIC BOOTSTRAP: A ROBUST ALTERNATIVE TO CLASSICAL TEST

MODIFIED PARAMETRIC BOOTSTRAP: A ROBUST ALTERNATIVE TO CLASSICAL TEST MODIFIED PARAMETRIC BOOTSTRAP: A ROBUST ALTERNATIVE TO CLASSICAL TEST Zahayu Md Yusof, Nurul Hanis Harun, Sharipah Sooad Syed Yahaya & Suhaida Abdullah School of Quantitative Sciences College of Arts and

More information

6.4 Normal Distribution

6.4 Normal Distribution Contents 6.4 Normal Distribution....................... 381 6.4.1 Characteristics of the Normal Distribution....... 381 6.4.2 The Standardized Normal Distribution......... 385 6.4.3 Meaning of Areas under

More information

PROPERTIES OF THE SAMPLE CORRELATION OF THE BIVARIATE LOGNORMAL DISTRIBUTION

PROPERTIES OF THE SAMPLE CORRELATION OF THE BIVARIATE LOGNORMAL DISTRIBUTION PROPERTIES OF THE SAMPLE CORRELATION OF THE BIVARIATE LOGNORMAL DISTRIBUTION Chin-Diew Lai, Department of Statistics, Massey University, New Zealand John C W Rayner, School of Mathematics and Applied Statistics,

More information

Data Transforms: Natural Logarithms and Square Roots

Data Transforms: Natural Logarithms and Square Roots Data Transforms: atural Log and Square Roots 1 Data Transforms: atural Logarithms and Square Roots Parametric statistics in general are more powerful than non-parametric statistics as the former are based

More information

NCSS Statistical Software

NCSS Statistical Software Chapter 06 Introduction This procedure provides several reports for the comparison of two distributions, including confidence intervals for the difference in means, two-sample t-tests, the z-test, the

More information

Nonparametric Two-Sample Tests. Nonparametric Tests. Sign Test

Nonparametric Two-Sample Tests. Nonparametric Tests. Sign Test Nonparametric Two-Sample Tests Sign test Mann-Whitney U-test (a.k.a. Wilcoxon two-sample test) Kolmogorov-Smirnov Test Wilcoxon Signed-Rank Test Tukey-Duckworth Test 1 Nonparametric Tests Recall, nonparametric

More information

MBA 611 STATISTICS AND QUANTITATIVE METHODS

MBA 611 STATISTICS AND QUANTITATIVE METHODS MBA 611 STATISTICS AND QUANTITATIVE METHODS Part I. Review of Basic Statistics (Chapters 1-11) A. Introduction (Chapter 1) Uncertainty: Decisions are often based on incomplete information from uncertain

More information

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY 1. Introduction Besides arriving at an appropriate expression of an average or consensus value for observations of a population, it is important to

More information

DATA INTERPRETATION AND STATISTICS

DATA INTERPRETATION AND STATISTICS PholC60 September 001 DATA INTERPRETATION AND STATISTICS Books A easy and systematic introductory text is Essentials of Medical Statistics by Betty Kirkwood, published by Blackwell at about 14. DESCRIPTIVE

More information

4. Continuous Random Variables, the Pareto and Normal Distributions

4. Continuous Random Variables, the Pareto and Normal Distributions 4. Continuous Random Variables, the Pareto and Normal Distributions A continuous random variable X can take any value in a given range (e.g. height, weight, age). The distribution of a continuous random

More information

12: Analysis of Variance. Introduction

12: Analysis of Variance. Introduction 1: Analysis of Variance Introduction EDA Hypothesis Test Introduction In Chapter 8 and again in Chapter 11 we compared means from two independent groups. In this chapter we extend the procedure to consider

More information

T test as a parametric statistic

T test as a parametric statistic KJA Statistical Round pissn 2005-619 eissn 2005-7563 T test as a parametric statistic Korean Journal of Anesthesiology Department of Anesthesia and Pain Medicine, Pusan National University School of Medicine,

More information

1.5 Oneway Analysis of Variance

1.5 Oneway Analysis of Variance Statistics: Rosie Cornish. 200. 1.5 Oneway Analysis of Variance 1 Introduction Oneway analysis of variance (ANOVA) is used to compare several means. This method is often used in scientific or medical experiments

More information

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4) Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume

More information

Module 4: Data Exploration

Module 4: Data Exploration Module 4: Data Exploration Now that you have your data downloaded from the Streams Project database, the detective work can begin! Before computing any advanced statistics, we will first use descriptive

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Suppose following data have been collected (heights of 99 five-year-old boys) 117.9 11.2 112.9 115.9 18. 14.6 17.1 117.9 111.8 16.3 111. 1.4 112.1 19.2 11. 15.4 99.4 11.1 13.3 16.9

More information

Week 1. Exploratory Data Analysis

Week 1. Exploratory Data Analysis Week 1 Exploratory Data Analysis Practicalities This course ST903 has students from both the MSc in Financial Mathematics and the MSc in Statistics. Two lectures and one seminar/tutorial per week. Exam

More information

DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS

DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS SEEMA JAGGI Indian Agricultural Statistics Research Institute Library Avenue, New Delhi - 110 012 seema@iasri.res.in 1. Descriptive Statistics Statistics

More information

Statistical tests for SPSS

Statistical tests for SPSS Statistical tests for SPSS Paolo Coletti A.Y. 2010/11 Free University of Bolzano Bozen Premise This book is a very quick, rough and fast description of statistical tests and their usage. It is explicitly

More information

Center: Finding the Median. Median. Spread: Home on the Range. Center: Finding the Median (cont.)

Center: Finding the Median. Median. Spread: Home on the Range. Center: Finding the Median (cont.) Center: Finding the Median When we think of a typical value, we usually look for the center of the distribution. For a unimodal, symmetric distribution, it s easy to find the center it s just the center

More information

Exploratory data analysis (Chapter 2) Fall 2011

Exploratory data analysis (Chapter 2) Fall 2011 Exploratory data analysis (Chapter 2) Fall 2011 Data Examples Example 1: Survey Data 1 Data collected from a Stat 371 class in Fall 2005 2 They answered questions about their: gender, major, year in school,

More information

Lesson 4 Measures of Central Tendency

Lesson 4 Measures of Central Tendency Outline Measures of a distribution s shape -modality and skewness -the normal distribution Measures of central tendency -mean, median, and mode Skewness and Central Tendency Lesson 4 Measures of Central

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize

More information

Diagrams and Graphs of Statistical Data

Diagrams and Graphs of Statistical Data Diagrams and Graphs of Statistical Data One of the most effective and interesting alternative way in which a statistical data may be presented is through diagrams and graphs. There are several ways in

More information

CA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction

CA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction CA200 Quantitative Analysis for Business Decisions File name: CA200_Section_04A_StatisticsIntroduction Table of Contents 4. Introduction to Statistics... 1 4.1 Overview... 3 4.2 Discrete or continuous

More information

SKEWNESS. Measure of Dispersion tells us about the variation of the data set. Skewness tells us about the direction of variation of the data set.

SKEWNESS. Measure of Dispersion tells us about the variation of the data set. Skewness tells us about the direction of variation of the data set. SKEWNESS All about Skewness: Aim Definition Types of Skewness Measure of Skewness Example A fundamental task in many statistical analyses is to characterize the location and variability of a data set.

More information

Lecture 1: Review and Exploratory Data Analysis (EDA)

Lecture 1: Review and Exploratory Data Analysis (EDA) Lecture 1: Review and Exploratory Data Analysis (EDA) Sandy Eckel seckel@jhsph.edu Department of Biostatistics, The Johns Hopkins University, Baltimore USA 21 April 2008 1 / 40 Course Information I Course

More information

How To Write A Data Analysis

How To Write A Data Analysis Mathematics Probability and Statistics Curriculum Guide Revised 2010 This page is intentionally left blank. Introduction The Mathematics Curriculum Guide serves as a guide for teachers when planning instruction

More information

CHAPTER THREE COMMON DESCRIPTIVE STATISTICS COMMON DESCRIPTIVE STATISTICS / 13

CHAPTER THREE COMMON DESCRIPTIVE STATISTICS COMMON DESCRIPTIVE STATISTICS / 13 COMMON DESCRIPTIVE STATISTICS / 13 CHAPTER THREE COMMON DESCRIPTIVE STATISTICS The analysis of data begins with descriptive statistics such as the mean, median, mode, range, standard deviation, variance,

More information

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written

More information

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.

More information

Java Modules for Time Series Analysis

Java Modules for Time Series Analysis Java Modules for Time Series Analysis Agenda Clustering Non-normal distributions Multifactor modeling Implied ratings Time series prediction 1. Clustering + Cluster 1 Synthetic Clustering + Time series

More information

What Does the Normal Distribution Sound Like?

What Does the Normal Distribution Sound Like? What Does the Normal Distribution Sound Like? Ananda Jayawardhana Pittsburg State University ananda@pittstate.edu Published: June 2013 Overview of Lesson In this activity, students conduct an investigation

More information

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE STATISTICS The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE VS. INFERENTIAL STATISTICS Descriptive To organize,

More information

INTRODUCING THE NORMAL DISTRIBUTION IN A DATA ANALYSIS COURSE: SPECIFIC MEANING CONTRIBUTED BY THE USE OF COMPUTERS

INTRODUCING THE NORMAL DISTRIBUTION IN A DATA ANALYSIS COURSE: SPECIFIC MEANING CONTRIBUTED BY THE USE OF COMPUTERS INTRODUCING THE NORMAL DISTRIBUTION IN A DATA ANALYSIS COURSE: SPECIFIC MEANING CONTRIBUTED BY THE USE OF COMPUTERS Liliana Tauber Universidad Nacional del Litoral Argentina Victoria Sánchez Universidad

More information

Using Excel for inferential statistics

Using Excel for inferential statistics FACT SHEET Using Excel for inferential statistics Introduction When you collect data, you expect a certain amount of variation, just caused by chance. A wide variety of statistical tests can be applied

More information

A Picture Really Is Worth a Thousand Words

A Picture Really Is Worth a Thousand Words 4 A Picture Really Is Worth a Thousand Words Difficulty Scale (pretty easy, but not a cinch) What you ll learn about in this chapter Why a picture is really worth a thousand words How to create a histogram

More information

Evaluating System Suitability CE, GC, LC and A/D ChemStation Revisions: A.03.0x- A.08.0x

Evaluating System Suitability CE, GC, LC and A/D ChemStation Revisions: A.03.0x- A.08.0x CE, GC, LC and A/D ChemStation Revisions: A.03.0x- A.08.0x This document is believed to be accurate and up-to-date. However, Agilent Technologies, Inc. cannot assume responsibility for the use of this

More information

WebFOCUS RStat. RStat. Predict the Future and Make Effective Decisions Today. WebFOCUS RStat

WebFOCUS RStat. RStat. Predict the Future and Make Effective Decisions Today. WebFOCUS RStat Information Builders enables agile information solutions with business intelligence (BI) and integration technologies. WebFOCUS the most widely utilized business intelligence platform connects to any enterprise

More information

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur Module No. #01 Lecture No. #15 Special Distributions-VI Today, I am going to introduce

More information

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number 1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x - x) B. x 3 x C. 3x - x D. x - 3x 2) Write the following as an algebraic expression

More information

Wealth inequality: Britain in international perspective. Frank Cowell: Wealth Seminar June 2012

Wealth inequality: Britain in international perspective. Frank Cowell: Wealth Seminar June 2012 Wealth inequality: Britain in international perspective Frank Cowell: Wealth Seminar June 2012 Questions What does UK wealth inequality look like in context? What is role of inequality among the rich?

More information

Variables Control Charts

Variables Control Charts MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. Variables

More information

START Selected Topics in Assurance

START Selected Topics in Assurance START Selected Topics in Assurance Related Technologies Table of Contents Introduction Some Statistical Background Fitting a Normal Using the Anderson Darling GoF Test Fitting a Weibull Using the Anderson

More information

Interpreting Data in Normal Distributions

Interpreting Data in Normal Distributions Interpreting Data in Normal Distributions This curve is kind of a big deal. It shows the distribution of a set of test scores, the results of rolling a die a million times, the heights of people on Earth,

More information

EPS 625 INTERMEDIATE STATISTICS FRIEDMAN TEST

EPS 625 INTERMEDIATE STATISTICS FRIEDMAN TEST EPS 625 INTERMEDIATE STATISTICS The Friedman test is an extension of the Wilcoxon test. The Wilcoxon test can be applied to repeated-measures data if participants are assessed on two occasions or conditions

More information

NCSS Statistical Software

NCSS Statistical Software Chapter 06 Introduction This procedure provides several reports for the comparison of two distributions, including confidence intervals for the difference in means, two-sample t-tests, the z-test, the

More information

Lecture 2: Descriptive Statistics and Exploratory Data Analysis

Lecture 2: Descriptive Statistics and Exploratory Data Analysis Lecture 2: Descriptive Statistics and Exploratory Data Analysis Further Thoughts on Experimental Design 16 Individuals (8 each from two populations) with replicates Pop 1 Pop 2 Randomly sample 4 individuals

More information

determining relationships among the explanatory variables, and

determining relationships among the explanatory variables, and Chapter 4 Exploratory Data Analysis A first look at the data. As mentioned in Chapter 1, exploratory data analysis or EDA is a critical first step in analyzing the data from an experiment. Here are the

More information

Exercise 1.12 (Pg. 22-23)

Exercise 1.12 (Pg. 22-23) Individuals: The objects that are described by a set of data. They may be people, animals, things, etc. (Also referred to as Cases or Records) Variables: The characteristics recorded about each individual.

More information

The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median

The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median CONDENSED LESSON 2.1 Box Plots In this lesson you will create and interpret box plots for sets of data use the interquartile range (IQR) to identify potential outliers and graph them on a modified box

More information

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r), Chapter 0 Key Ideas Correlation, Correlation Coefficient (r), Section 0-: Overview We have already explored the basics of describing single variable data sets. However, when two quantitative variables

More information

Descriptive Analysis

Descriptive Analysis Research Methods William G. Zikmund Basic Data Analysis: Descriptive Statistics Descriptive Analysis The transformation of raw data into a form that will make them easy to understand and interpret; rearranging,

More information

UNDERSTANDING THE DEPENDENT-SAMPLES t TEST

UNDERSTANDING THE DEPENDENT-SAMPLES t TEST UNDERSTANDING THE DEPENDENT-SAMPLES t TEST A dependent-samples t test (a.k.a. matched or paired-samples, matched-pairs, samples, or subjects, simple repeated-measures or within-groups, or correlated groups)

More information

Chapter 7. One-way ANOVA

Chapter 7. One-way ANOVA Chapter 7 One-way ANOVA One-way ANOVA examines equality of population means for a quantitative outcome and a single categorical explanatory variable with any number of levels. The t-test of Chapter 6 looks

More information

Mathematics within the Psychology Curriculum

Mathematics within the Psychology Curriculum Mathematics within the Psychology Curriculum Statistical Theory and Data Handling Statistical theory and data handling as studied on the GCSE Mathematics syllabus You may have learnt about statistics and

More information

8. THE NORMAL DISTRIBUTION

8. THE NORMAL DISTRIBUTION 8. THE NORMAL DISTRIBUTION The normal distribution with mean μ and variance σ 2 has the following density function: The normal distribution is sometimes called a Gaussian Distribution, after its inventor,

More information

Data Exploration Data Visualization

Data Exploration Data Visualization Data Exploration Data Visualization What is data exploration? A preliminary exploration of the data to better understand its characteristics. Key motivations of data exploration include Helping to select

More information

INTERNATIONAL COMPARISONS OF PART-TIME WORK

INTERNATIONAL COMPARISONS OF PART-TIME WORK OECD Economic Studies No. 29, 1997/II INTERNATIONAL COMPARISONS OF PART-TIME WORK Georges Lemaitre, Pascal Marianna and Alois van Bastelaer TABLE OF CONTENTS Introduction... 140 International definitions

More information

STATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI

STATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI STATS8: Introduction to Biostatistics Data Exploration Babak Shahbaba Department of Statistics, UCI Introduction After clearly defining the scientific problem, selecting a set of representative members

More information

Study Guide for the Final Exam

Study Guide for the Final Exam Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make

More information

CHI-SQUARE: TESTING FOR GOODNESS OF FIT

CHI-SQUARE: TESTING FOR GOODNESS OF FIT CHI-SQUARE: TESTING FOR GOODNESS OF FIT In the previous chapter we discussed procedures for fitting a hypothesized function to a set of experimental data points. Such procedures involve minimizing a quantity

More information

AP * Statistics Review. Descriptive Statistics

AP * Statistics Review. Descriptive Statistics AP * Statistics Review Descriptive Statistics Teacher Packet Advanced Placement and AP are registered trademark of the College Entrance Examination Board. The College Board was not involved in the production

More information

The Standard Normal distribution

The Standard Normal distribution The Standard Normal distribution 21.2 Introduction Mass-produced items should conform to a specification. Usually, a mean is aimed for but due to random errors in the production process we set a tolerance

More information

Stat 5102 Notes: Nonparametric Tests and. confidence interval

Stat 5102 Notes: Nonparametric Tests and. confidence interval Stat 510 Notes: Nonparametric Tests and Confidence Intervals Charles J. Geyer April 13, 003 This handout gives a brief introduction to nonparametrics, which is what you do when you don t believe the assumptions

More information

The labour market, I: real wages, productivity and unemployment 7.1 INTRODUCTION

The labour market, I: real wages, productivity and unemployment 7.1 INTRODUCTION 7 The labour market, I: real wages, productivity and unemployment 7.1 INTRODUCTION Since the 1970s one of the major issues in macroeconomics has been the extent to which low output and high unemployment

More information

Lecture 2. Summarizing the Sample

Lecture 2. Summarizing the Sample Lecture 2 Summarizing the Sample WARNING: Today s lecture may bore some of you It s (sort of) not my fault I m required to teach you about what we re going to cover today. I ll try to make it as exciting

More information

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp. 394-398, 404-408, 410-420

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp. 394-398, 404-408, 410-420 BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp. 394-398, 404-408, 410-420 1. Which of the following will increase the value of the power in a statistical test

More information

Convex Hull Probability Depth: first results

Convex Hull Probability Depth: first results Conve Hull Probability Depth: first results Giovanni C. Porzio and Giancarlo Ragozini Abstract In this work, we present a new depth function, the conve hull probability depth, that is based on the conve

More information

Module 3: Correlation and Covariance

Module 3: Correlation and Covariance Using Statistical Data to Make Decisions Module 3: Correlation and Covariance Tom Ilvento Dr. Mugdim Pašiƒ University of Delaware Sarajevo Graduate School of Business O ften our interest in data analysis

More information

Validating Market Risk Models: A Practical Approach

Validating Market Risk Models: A Practical Approach Validating Market Risk Models: A Practical Approach Doug Gardner Wells Fargo November 2010 The views expressed in this presentation are those of the author and do not necessarily reflect the position of

More information

Quantitative Methods for Finance

Quantitative Methods for Finance Quantitative Methods for Finance Module 1: The Time Value of Money 1 Learning how to interpret interest rates as required rates of return, discount rates, or opportunity costs. 2 Learning how to explain

More information