A Brief Study about Nonparametric Adherence Tests
Vinicius R. Domingues, Luan C. S. M. Ozelim

Vinicius R. Domingues and Luan C. S. M. Ozelim are with the Department of Civil and Environmental Engineering, University of Brasilia, Brasilia, Brazil (viicius.rdomigues@gmail.com, luaoz@gmail.com).

Abstract

Statistical study has become indispensable in various fields of knowledge. Geotechnics is no different: the study of probabilistic and statistical methods has gained strength there because of its use in characterizing the uncertainties inherent in soil properties. One situation that engineers constantly face is the definition of a probability distribution that adequately represents the sampled data. Goodness-of-fit tests are necessary to discard unsuitable distributions. In this paper, three nonparametric goodness-of-fit tests are applied to computationally generated data sets to test how well a series of known distributions fit them. It is shown that the use of the normal distribution does not always provide satisfactory results regarding the physical and behavioral representation of the modeled parameters.

Keywords: Smirnov, Anderson-Darling, Cramér-von Mises, nonparametric adherence tests.

I. INTRODUCTION

As in most branches of science, the use of probabilistic and statistical methods has become extremely important in the development of modern Geotechnics. In particular, the concept of reliability of a venture has attracted considerable attention in recent years, which has boosted the study of statistics for engineers. It is known that one of the key points in the correct modeling of geotechnical data is the choice of a probability distribution that represents the behavior of the data under analysis. In general, the normal distribution has been the default choice of the vast majority of engineers. This can be explained by the wide applicability of this distribution; however, there are cases where the use of such a random variable does not preserve the meaning and the physical behavior of the variables under consideration. Then arises the question of which distribution best fits a given sample of the target population. Adherence tests are used to answer this question.

II. ADHERENCE TESTS

Adherence tests are statistical tests used to measure how well a given probability distribution is able to model the data set being analyzed. Adherence tests are also commonly called goodness-of-fit tests [1]. In general, this type of testing is based on a hypothesis test in which the null hypothesis, H0, is that the data follow a given test distribution, while the alternative hypothesis, H1, is that the data do not follow that distribution. There are two large groups of adherence tests with respect to prior knowledge of the parameters and distribution of the data, namely parametric and nonparametric adherence tests.

A. Parametric Adherence Tests

Parametric adherence tests are those in which the distribution of the studied population is known or selected in some way and is not called into question, so that the hypotheses to be tested only involve population parameters. In the case of an ANOVA, for example, the rigid assumptions made about the variables under comparison (normal variables, for example) mean that this type of test gives no answer on how well other distributions fit the data.

B. Nonparametric Adherence Tests

Nonparametric adherence tests, on the other hand, are valid for a broad range of distributions, so that they can be used to evaluate the hypothesis that a given random variable follows a distribution other than the Normal. This type of adherence test also provides the means to ascertain the distribution that best fits the data under analysis. This article explores the use of three known adherence tests in the evaluation of the probability distribution that best fits a set of computationally generated data. The three tests to be explored are the Smirnov test, the Anderson-Darling test and the Cramér-von Mises test.

III. CONSIDERED ADHERENCE TESTS

The three adherence tests considered belong to the class of tests that use the empirical distribution function (EDF) in the calculation of their statistics. Thus, it is necessary to define this concept first.

A. Empirical Function

Consider the ordered data set {x_1, x_2, x_3, ..., x_N}, whose empirical cumulative distribution function, F_N(x), one wants to calculate. Mathematically, the EDF may be given by:

$$F_N(x) = \frac{1}{N} \sum_{i=1}^{N} \left\lceil \frac{1 + \operatorname{sign}(x - x_i)}{2} \right\rceil \qquad (1)$$

where ⌈·⌉ denotes the ceiling function, which returns the smallest integer greater than or equal to its argument, and sign(x) is the sign function, equal to +1 if x is positive, 0 if x is zero and -1 if x is negative. Each term of the sum therefore equals 1 when x ≥ x_i and 0 otherwise. In other words, the EDF ranges from 0 to 1 and increases by 1/N each time x passes a value in the ordered data set. With the EDF defined, the adherence tests of interest can be studied.

B. Smirnov Test

The Smirnov test belongs to the supremum class of EDF-based statistics, since it works with the largest difference between the empirical distribution F_N(x) and the hypothetical distribution F(x). The hypothetical cumulative distribution comes from the test distribution, on which the null hypothesis rests. Thus, in the case of the Smirnov test, the null hypothesis is that the data are distributed according to F(x) and the alternative hypothesis is that the data do not follow such a distribution. Mathematically, the KS statistic of the Smirnov test may be defined as:

$$KS = \sup_{x} \left| F_N(x) - F(x) \right| \qquad (2)$$

To accept or reject the null hypothesis, the value of KS and its distribution should be assessed. Intuitively, if the test distribution approximates the empirical distribution, the value of KS should be small. When KS is large, on the other hand, it indicates that the test distribution does not characterize the variable of interest. In other words, H0 is rejected if KS is greater than or equal to a limit value, KS_max, which depends on the adopted significance level α. Another important concept that arises from this evaluation process is the p-value, which gives the probability of obtaining a statistic at least as extreme as the one calculated, assuming that the null hypothesis is true. Thus, H0 is not rejected if the associated p-value is greater than the significance level. In the present paper, the software Mathematica is used to evaluate the statistics and p-values of the three tests considered.

C. Cramér-von Mises Test

Unlike the Smirnov test, the Cramér-von Mises test is a quadratic test of the empirical distribution function. This designation stems from the fact that the test works with the squared differences between the empirical and the hypothetical distributions [2]. This test has been applied to a wide variety of problems in science. In [3], a Cramér-von Mises type test based on the local time of a switching diffusion process was studied.
In [4], in turn, a comparison between the Cramér-von Mises test and adaptive tests was performed. In the present paper, we restrict our attention to the traditional test. Mathematically, with F_N(x) denoting the empirical distribution function and F(x) the hypothetical one, the CM statistic of the Cramér-von Mises test may be defined as:

$$CM = N \int_{-\infty}^{\infty} \left[ F_N(x) - F(x) \right]^2 \, dF(x) \qquad (3)$$

Once the data and the target distribution are known, (3) can be applied directly. As in the case of the Smirnov test, the Cramér-von Mises test indicates whether or not H0 should be rejected, according to the statistical distribution of CM or according to the p-value.

D. Anderson-Darling Test

Like the Cramér-von Mises test, the Anderson-Darling test is a quadratic test of the empirical distribution function. Unlike what happens in (3), however, a weight is given to each observation inside the integral [5]. This test has been modified and applied to several distributions, such as power-law types [6] and extreme-value distributions [7]. As in the case of the CM test, we restrict our analysis to the classical Anderson-Darling test. Mathematically, the AD statistic of the Anderson-Darling test may be defined as:

$$AD = N \int_{-\infty}^{\infty} \frac{\left[ F_N(x) - F(x) \right]^2}{F(x)\left[ 1 - F(x) \right]} \, dF(x) \qquad (4)$$

The main difference between the Anderson-Darling and Cramér-von Mises tests is that the former gives greater weight to the data coming from the tails of the distribution. As with the other tests already mentioned, H0 is rejected when the statistic exceeds a critical value. With the adherence tests defined, the next step is to generate random data with the software Mathematica and then apply the adherence tests.

IV. RANDOMIZED SAMPLES GENERATION AND ADHERENCE TESTS APPLICATION

In the present paper, three randomized samples of 10^4 elements are considered. To evaluate the applicability of the adherence tests, each of the randomized samples displays a tendency that can be found in the practice of geotechnical engineering.
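Before the samples are described, it may help to see how the statistics (2) to (4) are evaluated on a finite sample. The sketch below is written in Python rather than in the Mathematica environment used in the paper, and it relies on the standard computing forms of the three statistics over the ordered sample. The function names `edf_statistics` and `normal_cdf` are hypothetical helpers introduced only for illustration; they are not part of the authors' workflow.

```python
import math


def edf_statistics(sample, cdf):
    """Return (KS, CM, AD) for `sample` against the hypothesized CDF `cdf`.

    Assumes 0 < cdf(x) < 1 for every data point, otherwise the
    Anderson-Darling logarithms are undefined.
    """
    xs = sorted(sample)
    n = len(xs)
    u = [cdf(x) for x in xs]  # hypothesized CDF at the ordered data

    # Smirnov statistic: largest gap between the EDF steps and F,
    # checked just after and just before each jump of the EDF.
    ks = max(max((i + 1) / n - u[i], u[i] - i / n) for i in range(n))

    # Cramer-von Mises statistic: finite-sum form of N * int (F_N - F)^2 dF.
    cm = 1.0 / (12 * n) + sum(
        (u[i] - (2 * i + 1) / (2 * n)) ** 2 for i in range(n)
    )

    # Anderson-Darling statistic: same integral weighted by 1 / (F (1 - F)).
    s = sum(
        (2 * i + 1) * (math.log(u[i]) + math.log(1.0 - u[n - 1 - i]))
        for i in range(n)
    )
    ad = -n - s / n
    return ks, cm, ad


def normal_cdf(x, mu=0.0, sigma=1.0):
    """CDF of a normal variable with mean mu and standard deviation sigma."""
    return 0.5 * (1.0 + math.erf((x - mu) / (sigma * math.sqrt(2.0))))


# Toy check against the uniform CDF F(x) = x on (0, 1):
# for this sample KS is 0.25 and CM is 1/24.
ks, cm, ad = edf_statistics([0.75, 0.25, 0.5], lambda x: x)
print(ks, cm, ad)
```

Only the statistics are computed here; turning them into p-values requires the null distribution of each statistic, which the paper obtains from Mathematica.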
The considered tendencies are: variables distributed according to a normal distribution; variables with an asymmetric distribution; and symmetric variables with a long tail.

A. Normal Sample (D1)

The RandomVariate function of the software Mathematica was used to generate the sample distributed according to a normal distribution. The histogram of the generated data can be found in Fig. 1. It was generated from a normal distribution with a mean of [...] and a standard deviation of 3. Mathematically, the probability density function of a normal variable with mean μ and standard deviation σ is given by:

$$f(x) = \frac{1}{\sigma\sqrt{2\pi}} \, e^{-\frac{(x-\mu)^2}{2\sigma^2}} \qquad (5)$$
Fig. 1 Random sample from a normal distribution with 10^4 elements

B. Asymmetric Sample (D2)

The RandomVariate function of the software Mathematica was also used to generate the sample that follows a skewed distribution. The data were taken to follow a Lévy distribution. The histogram of the generated data can be found in Fig. 2. In this case, a Lévy distribution with location parameter 4 and dispersion parameter [...] was used as the source.

Fig. 2 Random sample from a Lévy distribution with 10^4 elements

Mathematically, the probability density function of a Lévy variable with location parameter μ and dispersion parameter σ is given by (6):

$$f(x) = \sqrt{\frac{\sigma}{2\pi}} \, \frac{e^{-\frac{\sigma}{2(x-\mu)}}}{(x-\mu)^{3/2}}, \quad x > \mu \qquad (6)$$

C. Sample with Long Tail (D3)

Finally, the RandomVariate function of the software Mathematica was used again to generate the sample distributed according to a long-tailed distribution. The histogram of the generated data can be found in Fig. 3. To generate the data presented in Fig. 3, a Student's t distribution with location parameter 0, scale parameter 3 and 4 degrees of freedom was used.

Fig. 3 Random sample from a Student's t distribution with 10^4 elements

Mathematically, the probability density function of a Student's t variable with location parameter μ, scale parameter σ and ν degrees of freedom is given by (7):

$$f(x) = \frac{1}{\sigma\sqrt{\nu}\,\mathrm{Beta}\left[\frac{\nu}{2}, \frac{1}{2}\right]} \left[ 1 + \frac{(x-\mu)^2}{\nu\sigma^2} \right]^{-\frac{\nu+1}{2}} \qquad (7)$$

where Beta[x, y] is the special function defined by the following integral:

$$\mathrm{Beta}[x, y] = \int_{0}^{1} t^{x-1} (1-t)^{y-1} \, dt \qquad (8)$$

With the samples and their source distributions set, it is possible to proceed to the data analysis and the application of the adherence tests.

D. Basic Data Analysis

From the generated data it is possible to calculate the average, standard deviation and skewness of each of the considered sets. Table I contains this information for each data group.

TABLE I BASIC DATA ANALYSIS
Data    Average    Standard deviation    Skewness
D1      [...]      [...]                 [...]
D2      [...]      [...]                 [...]
D3      [...]      [...]                 [...]

As expected, the D1 and D3 data are practically symmetric, while the D2 data are extremely asymmetric. Moreover, the averages and standard deviations are compatible with the distributions used to generate the data. After the basic data analysis, the next step is to determine which distributions are going to be tested for goodness of fit.
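The generation and summary steps above can be sketched as follows. This is a Python stand-in for the Mathematica RandomVariate calls, not the authors' code. It uses two standard constructions: a Lévy variate is μ + σ/Z² with Z standard normal, and a location-scale Student's t variate is μ + σ Z / sqrt(χ²_ν / ν). The normal mean (2.0) and the Lévy dispersion (2.0) used below are assumed placeholders, because those values do not survive in this transcription; the helper names are hypothetical.

```python
import math
import random
import statistics


def levy_variate(rng, mu, c):
    """Levy(mu, c) draw, built as mu + c / Z**2 with Z standard normal."""
    z = rng.gauss(0.0, 1.0)
    while z == 0.0:  # guard against a division by zero
        z = rng.gauss(0.0, 1.0)
    return mu + c / (z * z)


def student_t_variate(rng, mu, sigma, nu):
    """Location-scale Student's t draw: mu + sigma * Z / sqrt(chi2_nu / nu)."""
    z = rng.gauss(0.0, 1.0)
    chi2 = rng.gammavariate(nu / 2.0, 2.0)  # chi-square with nu degrees of freedom
    return mu + sigma * z / math.sqrt(chi2 / nu)


def describe(data):
    """Return (average, population standard deviation, skewness)."""
    mean = statistics.fmean(data)
    std = statistics.pstdev(data, mu=mean)
    skew = sum(((x - mean) / std) ** 3 for x in data) / len(data)
    return mean, std, skew


rng = random.Random(0)  # fixed seed so the run is repeatable
n = 10**4
d1 = [rng.gauss(2.0, 3.0) for _ in range(n)]          # mean 2.0 is an assumption
d2 = [levy_variate(rng, 4.0, 2.0) for _ in range(n)]  # dispersion 2.0 is an assumption
d3 = [student_t_variate(rng, 0.0, 3.0, 4) for _ in range(n)]

for name, data in (("D1", d1), ("D2", d2), ("D3", d3)):
    mean, std, skew = describe(data)
    print(f"{name}: average={mean:.3f}, std={std:.3f}, skewness={skew:.3f}")
```

The numbers printed by this sketch will not reproduce Table I, since the seed, the random generator and two of the parameters differ from the original experiment.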
E. Distributions to Be Tested

As previously argued, the test distributions must be determined before the adherence tests can be used. For simplicity, the classes of test distributions are the same as those used to generate the data, that is, the Normal, Lévy and Student's t distributions. Each data set is tested against three distributions whose parameters were previously determined through the maximum likelihood estimation provided in the Mathematica software. The parameters of each distribution to be tested are shown in Table II.

TABLE II PARAMETERS AND DISTRIBUTIONS TO BE TESTED
Data    Normal (μ, σ)    Lévy (μ, σ)     Student's t (μ, σ, ν)
D1      ([...], 3)       (-9.9, 10.8)    ([...], 3, 41)
D2      (6037, [...])    (4, [...])      (10 11, 10 11, 7368)
D3      (1, 4.)          (-9, 9)         (1, 3, 4)

F. Adherence Tests

Applying the adherence tests discussed in this article to the generated data, with the test distributions presented in Table II, yields Tables III-V.

TABLE III GOODNESS-OF-FIT TEST RESULTS (P-VALUES) FOR THE D1 DATA
               Smirnov    Cramér-von Mises    Anderson-Darling
Normal         [...]      [...]               [...]
Lévy           [...]      [...]               [...]
Student's t    [...]      [...]               [...]

For the D1 data set, the adherence tests clearly show that only the Normal and Student's t distributions can represent the sample. Both distributions fit the data well because the Normal distribution is the limiting case of the t distribution as the number of degrees of freedom becomes large. Thus, the adherence tests used accurately characterize the distribution from which the D1 data set was drawn.

TABLE IV GOODNESS-OF-FIT TEST RESULTS (P-VALUES) FOR THE D2 DATA
               Smirnov    Cramér-von Mises    Anderson-Darling
Normal         [...]      [...]               [...]
Lévy           [...]      [...]               [...]
Student's t    [...]      [...]               [...]

Table IV shows that all tests suggest the Lévy distribution as the best distribution for the data. In fact, the p-value is 0 for the other distributions, implying the rejection of H0 for the Normal and Student's t distributions. Therefore, all the tests are effective in characterizing the D2 data set.
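The fit-then-test procedure behind Tables III and IV can be illustrated with a reduced sketch. It is written in Python, not Mathematica, and covers only the Normal candidate (whose maximum likelihood estimates have a closed form) and only the Smirnov statistic, with the large-sample Kolmogorov series as an approximate p-value. All function names are hypothetical, and the normal mean 2.0 and Lévy dispersion 2.0 are assumed placeholders. Because the parameters are estimated from the same sample that is tested, this p-value is only approximate and tends to be too large.

```python
import math
import random
import statistics


def normal_cdf(x, mu, sigma):
    return 0.5 * (1.0 + math.erf((x - mu) / (sigma * math.sqrt(2.0))))


def ks_statistic(sample, cdf):
    """Smirnov statistic: sup |F_N(x) - F(x)| over the ordered sample."""
    xs = sorted(sample)
    n = len(xs)
    u = [cdf(x) for x in xs]
    return max(max((i + 1) / n - u[i], u[i] - i / n) for i in range(n))


def ks_pvalue_asymptotic(d, n):
    """Large-sample Kolmogorov approximation of P(KS >= d) under H0."""
    lam = math.sqrt(n) * d
    if lam < 0.2:  # the series equals 1 to machine precision in this range
        return 1.0
    total = 0.0
    for k in range(1, 101):
        term = math.exp(-2.0 * (k * lam) ** 2)
        total += term if k % 2 == 1 else -term
        if term < 1e-16:
            break
    return min(1.0, max(0.0, 2.0 * total))


def fit_and_test_normal(sample):
    """Fit a Normal by maximum likelihood, then apply the Smirnov test."""
    mu = statistics.fmean(sample)
    sigma = statistics.pstdev(sample, mu=mu)  # MLE uses the 1/N variance
    d = ks_statistic(sample, lambda x: normal_cdf(x, mu, sigma))
    return mu, sigma, d, ks_pvalue_asymptotic(d, len(sample))


def levy_variate(rng, mu, c):
    """Levy(mu, c) draw, built as mu + c / Z**2 with Z standard normal."""
    z = rng.gauss(0.0, 1.0)
    while z == 0.0:
        z = rng.gauss(0.0, 1.0)
    return mu + c / (z * z)


rng = random.Random(1)
n = 10**4
samples = {
    "normal data": [rng.gauss(2.0, 3.0) for _ in range(n)],        # assumed mean
    "Levy data": [levy_variate(rng, 4.0, 2.0) for _ in range(n)],  # assumed dispersion
}
for name, data in samples.items():
    mu, sigma, d, p = fit_and_test_normal(data)
    print(f"{name}: fitted Normal({mu:.2f}, {sigma:.2f}), KS={d:.4f}, p~{p:.4f}")
```

A full reproduction of the tables would also need maximum likelihood fits of the Lévy and Student's t candidates and p-values for the Cramér-von Mises and Anderson-Darling statistics, which this sketch deliberately leaves out.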
TABLE V GOODNESS-OF-FIT TEST RESULTS (P-VALUES) FOR THE D3 DATA
               Smirnov    Cramér-von Mises    Anderson-Darling
Normal         [...]      [...]               [...]
Lévy           [...]      [...]               [...]
Student's t    [...]      [...]               [...]

Table V shows that the adherence tests indicate that the D3 data are distributed according to a Student's t distribution. This shows, once again, that the considered goodness-of-fit tests are effective in determining the probability distribution of a data set.

V. CONCLUSIONS

The role of statistics in the exact sciences has received growing attention over time. In geotechnical engineering in particular, the treatment of the inherent variability of soil parameters finds a great ally in this area of knowledge. One of the most common tasks an engineer faces when analyzing a data set is the proper choice of the probability distribution that represents it. This choice must be as reliable as possible, so that the physical behavior of the modeled variable is not lost. Adherence tests are powerful allies in determining which distribution best fits the data. A thorough knowledge of them is therefore essential for a successful statistical modeling process. One of the aims of this paper was to discuss some of the features of three of the best-known nonparametric adherence tests, namely the Smirnov, Cramér-von Mises and Anderson-Darling tests. Computational experiments showed how these tests can be used to determine the probability distribution that best fits the data under analysis. In this way, the paper is expected to contribute to the diffusion of traditional statistical tools in the field of geotechnical engineering.

ACKNOWLEDGMENTS

The authors would like to thank FAP-DF for the financial support, provided as grants (number /015) to present the paper at the conference.

REFERENCES

[1] Torman, V. B. L.; Coster, R.; Riboldi, J. [...]. Normalidade de variáveis: métodos de verificação e comparação de alguns testes não paramétricos por simulação. Rev. HCPA, Porto Alegre.
[2] Anderson, T. W. and Darling, D. A. A Test of Goodness-of-Fit. Journal of the American Statistical Association, 49.
[3] Gassem, A. On Cramér-von Mises type test based on local time of switching diffusion process. Journal of Statistical Planning and Inference, Vol. 141 (4).
[4] Inglot, T. and Ledwina, T. On consistent minimax distinguishability and intermediate efficiency of Cramér-von Mises test. Journal of Statistical Planning and Inference.
[5] D'Agostino, R. B. and Stephens, M. A. Goodness-of-Fit Techniques. New York: Marcel Dekker.
[6] Coronel-Brizio, H. F. and Hernández-Montoya, A. R. The Anderson-Darling test of fit for the power-law distribution from left-censored samples. Physica A: Statistical Mechanics and its Applications, Vol. 389 (17).
[7] Heo, J-H., Shin, H., Nam, W., Om, J., Jeong, C. Approximation of modified Anderson-Darling test statistics for extreme value distributions with unknown shape parameter. Journal of Hydrology, Vol. 499 (30).
More informationSystems Design Project: Indoor Location of Wireless Devices
Systems Desig Project: Idoor Locatio of Wireless Devices Prepared By: Bria Murphy Seior Systems Sciece ad Egieerig Washigto Uiversity i St. Louis Phoe: (805) 698-5295 Email: bcm1@cec.wustl.edu Supervised
More informationSoving Recurrence Relations
Sovig Recurrece Relatios Part 1. Homogeeous liear 2d degree relatios with costat coefficiets. Cosider the recurrece relatio ( ) T () + at ( 1) + bt ( 2) = 0 This is called a homogeeous liear 2d degree
More informationDAME - Microsoft Excel add-in for solving multicriteria decision problems with scenarios Radomir Perzina 1, Jaroslav Ramik 2
Itroductio DAME - Microsoft Excel add-i for solvig multicriteria decisio problems with scearios Radomir Perzia, Jaroslav Ramik 2 Abstract. The mai goal of every ecoomic aget is to make a good decisio,
More informationAn Efficient Polynomial Approximation of the Normal Distribution Function & Its Inverse Function
A Efficiet Polyomial Approximatio of the Normal Distributio Fuctio & Its Iverse Fuctio Wisto A. Richards, 1 Robi Atoie, * 1 Asho Sahai, ad 3 M. Raghuadh Acharya 1 Departmet of Mathematics & Computer Sciece;
More informationDepartment of Computer Science, University of Otago
Departmet of Computer Sciece, Uiversity of Otago Techical Report OUCS-2006-09 Permutatios Cotaiig May Patters Authors: M.H. Albert Departmet of Computer Sciece, Uiversity of Otago Micah Colema, Rya Fly
More informationEstimating Probability Distributions by Observing Betting Practices
5th Iteratioal Symposium o Imprecise Probability: Theories ad Applicatios, Prague, Czech Republic, 007 Estimatig Probability Distributios by Observig Bettig Practices Dr C Lych Natioal Uiversity of Irelad,
More informationCentral Limit Theorem and Its Applications to Baseball
Cetral Limit Theorem ad Its Applicatios to Baseball by Nicole Aderso A project submitted to the Departmet of Mathematical Scieces i coformity with the requiremets for Math 4301 (Hoours Semiar) Lakehead
More informationMath C067 Sampling Distributions
Math C067 Samplig Distributios Sample Mea ad Sample Proportio Richard Beigel Some time betwee April 16, 2007 ad April 16, 2007 Examples of Samplig A pollster may try to estimate the proportio of voters
More informationA probabilistic proof of a binomial identity
A probabilistic proof of a biomial idetity Joatho Peterso Abstract We give a elemetary probabilistic proof of a biomial idetity. The proof is obtaied by computig the probability of a certai evet i two
More informationBaan Service Master Data Management
Baa Service Master Data Maagemet Module Procedure UP069A US Documetiformatio Documet Documet code : UP069A US Documet group : User Documetatio Documet title : Master Data Maagemet Applicatio/Package :
More informationSequences and Series
CHAPTER 9 Sequeces ad Series 9.. Covergece: Defiitio ad Examples Sequeces The purpose of this chapter is to itroduce a particular way of geeratig algorithms for fidig the values of fuctios defied by their
More informationActuarial Models for Valuation of Critical Illness Insurance Products
INTERNATIONAL JOURNAL OF MATHEMATICAL MODELS AND METHODS IN APPLIED SCIENCES Volume 9, 015 Actuarial Models for Valuatio of Critical Illess Isurace Products P. Jidrová, V. Pacáková Abstract Critical illess
More informationHow To Solve The Homewor Problem Beautifully
Egieerig 33 eautiful Homewor et 3 of 7 Kuszmar roblem.5.5 large departmet store sells sport shirts i three sizes small, medium, ad large, three patters plaid, prit, ad stripe, ad two sleeve legths log
More informationA Mathematical Perspective on Gambling
A Mathematical Perspective o Gamblig Molly Maxwell Abstract. This paper presets some basic topics i probability ad statistics, icludig sample spaces, probabilistic evets, expectatios, the biomial ad ormal
More informationCHAPTER 3 THE TIME VALUE OF MONEY
CHAPTER 3 THE TIME VALUE OF MONEY OVERVIEW A dollar i the had today is worth more tha a dollar to be received i the future because, if you had it ow, you could ivest that dollar ad ear iterest. Of all
More informationGCSE STATISTICS. 4) How to calculate the range: The difference between the biggest number and the smallest number.
GCSE STATISTICS You should kow: 1) How to draw a frequecy diagram: e.g. NUMBER TALLY FREQUENCY 1 3 5 ) How to draw a bar chart, a pictogram, ad a pie chart. 3) How to use averages: a) Mea - add up all
More informationThe Stable Marriage Problem
The Stable Marriage Problem William Hut Lae Departmet of Computer Sciece ad Electrical Egieerig, West Virgiia Uiversity, Morgatow, WV William.Hut@mail.wvu.edu 1 Itroductio Imagie you are a matchmaker,
More informationTopic 5: Confidence Intervals (Chapter 9)
Topic 5: Cofidece Iterval (Chapter 9) 1. Itroductio The two geeral area of tatitical iferece are: 1) etimatio of parameter(), ch. 9 ) hypothei tetig of parameter(), ch. 10 Let X be ome radom variable with
More information15.075 Exam 3. Instructor: Cynthia Rudin TA: Dimitrios Bisias. November 22, 2011
15.075 Exam 3 Istructor: Cythia Rudi TA: Dimitrios Bisias November 22, 2011 Gradig is based o demostratio of coceptual uderstadig, so you eed to show all of your work. Problem 1 A compay makes high-defiitio
More informationDesigning Incentives for Online Question and Answer Forums
Desigig Icetives for Olie Questio ad Aswer Forums Shaili Jai School of Egieerig ad Applied Scieces Harvard Uiversity Cambridge, MA 0238 USA shailij@eecs.harvard.edu Yilig Che School of Egieerig ad Applied
More informationProject Deliverables. CS 361, Lecture 28. Outline. Project Deliverables. Administrative. Project Comments
Project Deliverables CS 361, Lecture 28 Jared Saia Uiversity of New Mexico Each Group should tur i oe group project cosistig of: About 6-12 pages of text (ca be loger with appedix) 6-12 figures (please
More informationIncremental calculation of weighted mean and variance
Icremetal calculatio of weighted mea ad variace Toy Fich faf@cam.ac.uk dot@dotat.at Uiversity of Cambridge Computig Service February 009 Abstract I these otes I eplai how to derive formulae for umerically
More informationA Faster Clause-Shortening Algorithm for SAT with No Restriction on Clause Length
Joural o Satisfiability, Boolea Modelig ad Computatio 1 2005) 49-60 A Faster Clause-Shorteig Algorithm for SAT with No Restrictio o Clause Legth Evgey Datsi Alexader Wolpert Departmet of Computer Sciece
More informationNATIONAL SENIOR CERTIFICATE GRADE 12
NATIONAL SENIOR CERTIFICATE GRADE MATHEMATICS P EXEMPLAR 04 MARKS: 50 TIME: 3 hours This questio paper cosists of 8 pages ad iformatio sheet. Please tur over Mathematics/P DBE/04 NSC Grade Eemplar INSTRUCTIONS
More informationVladimir N. Burkov, Dmitri A. Novikov MODELS AND METHODS OF MULTIPROJECTS MANAGEMENT
Keywords: project maagemet, resource allocatio, etwork plaig Vladimir N Burkov, Dmitri A Novikov MODELS AND METHODS OF MULTIPROJECTS MANAGEMENT The paper deals with the problems of resource allocatio betwee
More informationNow here is the important step
LINEST i Excel The Excel spreadsheet fuctio "liest" is a complete liear least squares curve fittig routie that produces ucertaity estimates for the fit values. There are two ways to access the "liest"
More informationSampling Distribution And Central Limit Theorem
() Samplig Distributio & Cetral Limit Samplig Distributio Ad Cetral Limit Samplig distributio of the sample mea If we sample a umber of samples (say k samples where k is very large umber) each of size,
More information7. Concepts in Probability, Statistics and Stochastic Modelling
7. Cocepts i Probability, Statistics ad Stochastic Modellig 1. Itroductio 169. Probability Cocepts ad Methods 170.1. Radom Variables ad Distributios 170.. Expectatio 173.3. Quatiles, Momets ad Their Estimators
More informationPresent Value Factor To bring one dollar in the future back to present, one uses the Present Value Factor (PVF): Concept 9: Present Value
Cocept 9: Preset Value Is the value of a dollar received today the same as received a year from today? A dollar today is worth more tha a dollar tomorrow because of iflatio, opportuity cost, ad risk Brigig
More informationYour organization has a Class B IP address of 166.144.0.0 Before you implement subnetting, the Network ID and Host ID are divided as follows:
Subettig Subettig is used to subdivide a sigle class of etwork i to multiple smaller etworks. Example: Your orgaizatio has a Class B IP address of 166.144.0.0 Before you implemet subettig, the Network
More informationPredictive Modeling Data. in the ACT Electronic Student Record
Predictive Modelig Data i the ACT Electroic Studet Record overview Predictive Modelig Data Added to the ACT Electroic Studet Record With the release of studet records i September 2012, predictive modelig
More informationNon-life insurance mathematics. Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring
No-life isurace mathematics Nils F. Haavardsso, Uiversity of Oslo ad DNB Skadeforsikrig Mai issues so far Why does isurace work? How is risk premium defied ad why is it importat? How ca claim frequecy
More information7.1 Finding Rational Solutions of Polynomial Equations
4 Locker LESSON 7. Fidig Ratioal Solutios of Polyomial Equatios Name Class Date 7. Fidig Ratioal Solutios of Polyomial Equatios Essetial Questio: How do you fid the ratioal roots of a polyomial equatio?
More informationResearch Article Sign Data Derivative Recovery
Iteratioal Scholarly Research Network ISRN Applied Mathematics Volume 0, Article ID 63070, 7 pages doi:0.540/0/63070 Research Article Sig Data Derivative Recovery L. M. Housto, G. A. Glass, ad A. D. Dymikov
More informationAnalyzing Longitudinal Data from Complex Surveys Using SUDAAN
Aalyzig Logitudial Data from Complex Surveys Usig SUDAAN Darryl Creel Statistics ad Epidemiology, RTI Iteratioal, 312 Trotter Farm Drive, Rockville, MD, 20850 Abstract SUDAAN: Software for the Statistical
More informationINVESTMENT PERFORMANCE COUNCIL (IPC)
INVESTMENT PEFOMANCE COUNCIL (IPC) INVITATION TO COMMENT: Global Ivestmet Performace Stadards (GIPS ) Guidace Statemet o Calculatio Methodology The Associatio for Ivestmet Maagemet ad esearch (AIM) seeks
More informationTHE HEIGHT OF q-binary SEARCH TREES
THE HEIGHT OF q-binary SEARCH TREES MICHAEL DRMOTA AND HELMUT PRODINGER Abstract. q biary search trees are obtaied from words, equipped with the geometric distributio istead of permutatios. The average
More informationPlug-in martingales for testing exchangeability on-line
Plug-i martigales for testig exchageability o-lie Valetia Fedorova, Alex Gammerma, Ilia Nouretdiov, ad Vladimir Vovk Computer Learig Research Cetre Royal Holloway, Uiversity of Lodo, UK {valetia,ilia,alex,vovk}@cs.rhul.ac.uk
More informationWHEN IS THE (CO)SINE OF A RATIONAL ANGLE EQUAL TO A RATIONAL NUMBER?
WHEN IS THE (CO)SINE OF A RATIONAL ANGLE EQUAL TO A RATIONAL NUMBER? JÖRG JAHNEL 1. My Motivatio Some Sort of a Itroductio Last term I tought Topological Groups at the Göttige Georg August Uiversity. This
More information