Basic Data Analysis Principles. Acknowledgments

Size: px
Start display at page:

Download "Basic Data Analysis Principles. Acknowledgments"

Transcription

1 CEB - Basic Data Aalysis Priciples Basic Data Aalysis Priciples What to do oce you get the data Whe we reaso about quatitative evidece, certai methods for displayig ad aalyzig data are better tha others. Superior methods are more likely to produce truthful, credible, ad precise fidigs. The differece betwee a excellet aalysis ad a faulty oe ca sometimes have mometous cosequeces. -Edward R. Tufte, Visual ad Statistical Thikig: Displays of Evidece for Makig Decisios Visual Explaatios, Edward R. Tufte, Graphics Press, Uit III - Module 6 1 Ackowledgmets ICEAA is idebted to TASC, Ic., for the developmet ad maiteace of the Cost Estimatig Body of Kowledge (CEBoK ) ICEAA is also idebted to Techomics, Ic., for the idepedet review ad maiteace of CEBoK ICEAA is also idebted to the followig idividuals who have made sigificat cotributios to the developmet, review, ad maiteace of CostPROF ad CEBoK Module 6 Basic Data Aalysis Priciples Lead authors: Mega E. Damero, Bethia L. Cullis, Mauree L. Tedford Seior reviewers: Richard L. Colema, Jessica R. Summerville, Joh S. Smuck, Fred K. Blackbur Reviewers: Samuel B. Toas, Kevi Cicotta, Matthew J. Pitlyk, Bria A. Welsh Maagig editor: Peter J. Braxto Uit III - Module 6 1

2 CEB - Basic Data Aalysis Priciples Uit Idex Uit I Cost Estimatig Uit II Cost Aalysis Techiques Uit III Aalytical Methods 6. Basic Data Aalysis Priciples 7. Learig Curve Aalysis. Regressio Aalysis 9. Cost ad Schedule Risk Aalysis.Probability ad Statistics Uit IV Specialized Costig Uit V Maagemet Applicatios Uit III - Module 6 3 Data Aalysis Overview Key Ideas Visual Display of Iformatio Cetral Tedecy of Data Dispersio (Spread) of Data Data accumulatio Outliers Aalytical Costructs Descriptive statistics Mea, media, mode Variace, std deviatio, CV Fuctioal forms Practical Applicatios Makig sese of your data Related Topics Parametrics Distributios Normal, Chi, t, F Probability ad Statistics 3 Uit III - Module 6 4

3 Frequecy Frequecy Frequecy CEB - Basic Data Aalysis Priciples Past Uderstadig your historical data Data Aalysis Withi The Cost Estimatig Framework Preset Developig estimatig tools Future Estimatig the ew system 3 Mothly Gas Bill 3 Mothly Gas Bill 3 Mothly Gas Bill More $ More $ More $ Historical data Mea = $34.19 Average cost Cofidece Iterval = +/-$.76 Cofidece Itervals Uit III - Module 6 Data Aalysis Outlie Core Kowledge Types of Data Uivariate Data Aalysis Scatter Plots Variables Axes ad Fuctio Types Data Validatio Descriptive Statistics Outliers Rules of Thumbs Two Cautioary Tales Summary Resources Related ad Advaced Topics Uit III - Module 6 6 3

4 Cost Frequecy CEB - Basic Data Aalysis Priciples Types of Data Uivariate Bivariate Multivariate Time Series Uit III - Module Uivariate Types of Data Sigle variable Use descriptive ad iferetial statistics Bivariate Oe idepedet variable ad oe depedet variable (i.e., y is a fuctio of x) Use descriptive ad iferetial statistics Multivariate Liear Tred Several idepedet variables ad oe depedet variable (i.e., y is a fuctio of x 1, x, ad x 3 ) Use descriptive ad iferetial statistics Mothly Gas Bill Weight More $ Tip: Uivariate data plus a Nomial variable is really bivariate S1 Uit III - Module 6 4

5 CEB - Basic Data Aalysis Priciples Types of Data Time Series Time as the idepedet variable Iterval matters! Make sure you use a XY (Scatter) ad ot a Lie Chart i Excel uless itervals are equally spaced Smooth treds are rarely foud i time series Possible rare exceptios (e.g., corrosio over time) Stadard treds such as ivestmet ad iflatio Look for paradigm shifts, cycles, autocorrelatio Use movig averages, divide data ito groups ad 11 compare descriptive statistics Regressio is ofte ot useful as it oly picks up smooth treds uless AR1/ARIMA.4. ANOVA ad mea comparisos are more useful Uit III - Module Uivariate Data Aalysis Visual Display of Iformatio Histogram, stem-ad-leaf, box plot Measures of Cetral Tedecy Mea (or media or mode) Measures of Variability Stadard deviatio (or variace), coefficiet of variatio (CV) Measures of Ucertaity Cofidece Iterval (CI) Statistical Tests Tip: This aalysis framework is mirrored i bivariate ad multivariate aalysis. Uit III - Module 6 How precise are you? What does it look like? t test, chi square test, Kolmogorov-Smirov (K-S) test What s your best guess? How ca you be sure? How much remais uexplaied?

6 Frequecy Frequecy Frequecy CEB - Basic Data Aalysis Priciples Visual Display - Histograms 6 Histograms should be used to give a idea of the distributio of the data Mothly Gas Bill More $ Warig: Results of macros do ot update if your data chage! Excel Data Aalysis Add-I Histogram. Tip: Create histogram maually usig Chart type Colum so that results do update whe data chage! Skew-right distributio, possibly Expoetial, Triagular, or Logormal Uit III - Module 6 11 Histograms Bis It is importat to carefully cosider the umber of bis used i a histogram Experimet with itervals to be sure you uderstad the data Mothly Gas Bill Warig: Default bis i Excel histograms may ot be optimal! $ This histogram allows Excel to choose the bis More Which is clearer? Which sets a trap? Uit III - Module 6 1 Mothly Gas Bill More $ This histogram specifies the bis. 6 Warig: Histograms ca be maipulated! 6

7 CEB - Basic Data Aalysis Priciples Cetral Tedecy - Mea 6 The sample mea of the data set {x 1, x,, x } is calculated as: x xi 1 x1 x... x i I Excel, use the AVERAGE( ) fuctio Meas of example data sets: Gas bill (74 moths), $6. Therms used (74 moths), 14. The mea is the Expected Value of a radom variable Uit III - Module 6 13 Cetral Tedecy - Media The sample media is the middle data poit, with % of the remaiig observatios fallig uder that poit, ad % above If a data set has a odd umber of poits, the middle value is the media The media of the data set {,,7,9,} is 7 AKA th Percetile If a data set has a eve umber of poits, the two middle values are averaged The media of the set {3, 6,, 11, 13, 3} is 9. (average of ad 11) I geeral, the k th percetile is the poit with k% of the data below ad (-k)% of the data above Quartiles (,, 7), deciles (,,,, 9), icosatiles (,, 1,, 9) Whe there are extreme data poits, the media may be more represetative tha the mea because robust outliers impact the mea more tha the media Represetative is a descriptive term, ot a mathematical term There are may mathematical reasos to prefer mea over media Uit III - Module 6 Leged Red = Extreme poits 14 Blue = Middle poits 7

8 Frequecy CEB - Basic Data Aalysis Priciples Mea, Media, ad Skew The mea ad the media are equal if the distributio is symmetric Uequal meas ad medias are a idicatio of skewess Logormal Distributio Media = Mea Symmetric. Beta Distributio Normal Distributio Media < Mea Skew(ed) Right Media > Mea Skew(ed) Left Uit III - Module 6 1 Cetral Tedecy - Mode The sample mode is the most frequet poit to occur i a data set The mode of a distributio is its peak Value with the greatest probability mass (or desity) The mode of the set {,4,4,7,9,9,9} is 9 The mode is a descriptive metric aswerig the questio what happes most frequetly? X Value It ca help give a visual idea of what the distributio looks like Most useful i discrete data A histogram shows that the value 9 occurs most ofte this is the mode Uit III - Module

9 CEB - Basic Data Aalysis Priciples Variability Variace / Stadard Deviatio The sample variace measures the deviatio of the data poits from their mea easy to remember s i1 ( x i x) 1 i1 I Excel, use the VAR( ) fuctio xi i1 1 The sample stadard deviatio is simply The stadard deviatio is expressed i the same uits as the origial data I Excel, use the STDEV( ) fuctio x i Uit III - Module 6 17 Tip: Low variace idicates less dispersio, i.e., tighter data easy to calculate s s Tip: s is the estimator for the populatio parameter σ Variability - Coefficiet of Variatio 13 The Coefficiet of Variatio (CV) expresses the stadard deviatio as a percet of the mea CV Large CVs idicate that the mea is a poor estimator Cosider regressio o cost drivers Examie data for multiple populatios (outliers) CVs of example data sets: Gas bill, 74.4% (69.%) Therms used, 4.% (.%) s X Uit III - Module 6 1 Tip: Low CV idicates less dispersio, i.e., tighter data. 1% or less is desired Note that sums ad averages ted to have smaller variaces 9

10 Frequecy Frequecy CEB - Basic Data Aalysis Priciples Dispersio ad CV These two data sets have the same mea, but differet stadard deviatios 1 Lower CV This data has a higher CV (3%) ad has more dispersio More Bi This data has a lower CV (17%) ad is more tightly distributed Higher CV More Bi Uit III - Module 6 19 Cofidece Iterval Illustratio A cofidece iterval (CI) suggests to us that we are (1-a)*% cofidet that the true parameter value is cotaied withi the calculated rage* x t s, x ta / a /, 1,, 1 s * Note this statemet provides a geeral sese of what a cofidece iterval does for us i cocise laguage, for ease of uderstadig. The specific statistical iterpretatio is that if may idepedet samples are take where the levels of the predictor variable are the same as i the data set, ad a (1-a) % cofidece iterval is costructed for each sample, the (1-a) % of the itervals will cotai the true value of the parameter. a/ a/ 1 - a critical values Uit III - Module 6

11 CEB - Basic Data Aalysis Priciples Sample Sizes - Sufficietly Large 6 4 I geeral, we prefer to be large how large is a fuctio of our tolerace for error The 6.3% CI for the mea is roughly CV/ So, for CVs ragig aroud 3%, we get the followig 6.3% Cofidece Iterval with : +/- 4 1% 9 % 16 % 6% 36 % Tip: 3 is ot a magic umber of data poits If we would like to be able to make judgmets withi about % poits with a CV of 3%, we eed 36 We may have o choice but to deal with small I ay case, we ca calculate the rage of estimated mea Uit III - Module 6 1 Predictio Itervals The previous cofidece iterval illustratio gives the true average cost withi a certai rage If we wat to kow the predicted cost of a ew item withi a certai rage, we eed a predictio iterval The PI suggests to us that we are (1-a)*% cofidet that the ext observatio will be cotaied withi the calculated rage The larger stadard error i the PI accouts for both the ucertaity i the mea (captured by the CI) ad the ucertaity i idividual observatios x t s 1 1, y 1, x t /, 1s a /, 1 a 1 1 Uit III - Module 6 11

12 CEB - Basic Data Aalysis Priciples Statistical Tests 17 t test for mea Is the Cost Growth Factor (CGF) for NAVAIR programs differet tha 1.? Chi square test for variace Is 3% a reasoable CV to use for this variable? Should t test for equal meas assume equal variaces? Chi square test for distributio Are Lie-Replaceable Uit (LRU) failures uiform across all deployed uits? Kolmogorov-Smirov test for distributio Is the ormal distributio appropriate for modelig ucertaity i desig weight? Uit III - Module 6 3 Scatter Plots Variables Axes Fuctio Types Uit III - Module 6 4 1

13 Light Ship Displacemet CEB - Basic Data Aalysis Priciples Scatter Plots A picture is worth a thousad words! A scatter plot ca reveal a wealth of iformatio about relatioships preset i the data Create scatter plots i Excel by usig the Chart Wizard XY (Scatter) Add a tred lie i Excel by right clickig the plotted data ad choosig Add Tred lie Helps lik graph ad equatio Look at iferetial statistics later Tip: Scatter plots are the sigle most useful tool i all of aalysis they are the gift of sight to the aalyst 1,,, 6, 4,, - y = 7.9x R = Year Uit III - Module 6 1 Scatter Plots Variables Plot cost (or other variable of iterest, e.g., hours) as the depedet variable Look at a variety of differet idepedet variables Techical parameters such as weight, lies of code, etc. Performace parameters such as speed, accuracy, etc. Operatioal parameters such as crew size, flyig hours, etc. Cost of aother elemet Thik about which variables you believe should drive cost ad collect that data! Uit III - Module

14 Cost Cost Cost CEB - Basic Data Aalysis Priciples Scatter Plots Cost Drivers Scatter plots ca help idetify cost drivers R iterpretatio: % of variatio i y explaied (liearly) by variatio i x 13 1 y =.37x R² = y =.7x + 7. R² = Warig: R is just Sigificat correlatio potetial cost driver Variable a idicator, cosult t ad F statistics! y = -.9x + 1. R² = Variable Variable Weak correlatio Ucorrelated Uit III - Module 6 7 Scatter Plots Uit Space Data should first be plotted i uit space* x is plotted o the horizotal axis (x-axis) ad y is plotted o the vertical axis (y-axis) If the data have a o-liear relatioship whe plotted i uit space, ivestigate how the data ca be made liear No-liear relatioships ca ofte be trasformed to appear liear through the use of atural logs Trasformed data ca the be regressed liearly Before the widespread use of computers, oliear data was graphed o semi-log or log-log paper * Uit space refers to the origial, utrasformed data. Uit III - Module 6 14

15 Cost CEB - Basic Data Aalysis Priciples Scatter Plots Liear Fuctio The most commo relatioships are liear Of the form y = mx + b [m = slope, b = y-it.] Plotted i uit space y =.7461x +.9 R =.3 Liear Tred Weight Tip: Liear models are also the best approximatios to o-liear models by which we mea, they take you least far afield if you guessed wrog. Uit III - Module 6 9 Scatter Plots Power Fuctio Power fuctios are of the form y = ax b Ca be trasformed ito liear fuctios Takig the atural log of both sides gives l(y) = l(a) + b l(x) Plot l(x) o the horizotal axis ad l(y) o the vertical axis ad look for a liear tred This trasformatio is show graphically o the ext slide Uit III - Module 6 3 1

16 l (Cost) Cost CEB - Basic Data Aalysis Priciples 3 Scatter Plots Power Fuctio This fuctio is most commoly used for learig curves, but ca also be used for CERs Cost =.394Weight.14 R =.99 Power Tred Weight Tip: Aother virtue of tred lies is that they ca act as a Rosetta Stoe for the values of a curve fit o trasformed variables. Uit III - Module 6 31 Slope o log-log graph is the expoet of the power equatio Power Tred o Log-Log Axes l (Cost) =.14l (Weight) R = l (Weight) A alterative is to use Format Axis Logarithmic scale Scatter Plots Expoetial Fuctio Expoetial fuctios are of the form y = ae bx = a(e b ) x = ak x Models of this form ca be trasformed ad made to be liear Takig the atural log (l) of both sides gives l(y) = l(a) + bx The atural log (l) is the iverse fuctio of the expoetial: y = e x x = l(y) Tip: Expoetial fuctios are seldom ecoutered i cost estimatio outside of iflatio Uit III - Module

17 l (Cost) Cost CEB - Basic Data Aalysis Priciples Scatter Plots Expoetial Fuctio The, x is plotted o the horizotal axis ad l(y) is plotted o the vertical axis This trasformatio is show graphically below Expoetial Tred Cost =.4e 1.19Weight R² = Weight Slope o semi-log graph is the coefficiet of x i the expoetial equatio Expoetial Tred o Semi-Log Axes l(cost) = 1.19Weight R² = Weight Uit III - Module 6 33 Scatter Plots Costat Terms Geeralized power ad expoetial equatios are of the form: y = ax b + c, y = ae bx + c Power ad expoetial models usually assume a costat term of c = However, c = is more commo i theory tha i practice If c = does ot fit the data, cosider usig a model with c Use the Excel Add-i Solver (or aother, more robust optimizatio tool) to fit a curve to the data, where a, b, c are chose simultaeously (GERM) Miimize SSE or maximize uit-space R To b or Not to b The y-itercept i Cost Estimatio, R. L. Colema, J. R. Summerville, P. J. Braxto, B. L. Cullis, E. R. Druker, SCEA, 7. Warig: Excel forces power ad expoetial tredlies to have c =! Uit III - Module

18 CEB - Basic Data Aalysis Priciples Data Validatio Scatter plottig gives you a idea of the relatioships preset i the data What s ext? Look at descriptive statistics Look for outliers Compare to historical studies, idustry stadards, or rules of thumb Uit III - Module 6 3 Descriptive Statistics Calculate descriptive statistics for each data group Sample size Raw mea Stadard deviatio Coefficiet of variatio (CV) Warig: Results of macros do ot update if your data chage! Tip: Create formulae maually so that results do update whe data chage! 14 Weighted averages (e.g., dollar-weighted) Movig averages (for time series data) I Excel, Tools Data Aalysis Descriptive Statistics will easily calculate the most importat descriptive statistics 11 Uit III - Module

19 $ Wtd DE CGF CEB - Basic Data Aalysis Priciples Descriptive Statistics - Bar Charts Bar charts ca be used to compare the descriptive statistics for differet groups Y-error bars ca be added to show the stadard deviatio Tip: Stadard deviatios are useful, but predictio itervals would be better, capturig the iteractio of quatity ad dispersio more 3. succictly ad i a iferetially better way. Be sure 3.to label which they are. RDT&E Programs by Compay (SAR Programs with EMD oly) Co. 1 Co. Co. 3 Co. 4 Co. = 3 = 7 = 4 = 4 = 9 Uit III - Module 6 37 Bar Charts i Excel Bar charts Excel Chart Wizard Colum Chart Y-error bars Format Data Series Y-error bars (3) Chart Tools Layout Aalysis Error Bars (7) Histogram Excel Data Aalysis Add-I - Histogram Tip: It is recommeded that you create your ow dyamic histograms with flexible bi spacig usig COUNTIF() ad Colum Charts. Uit III - Module

20 Cost CEB - Basic Data Aalysis Priciples Outliers 6 Outliers are data poits that fall far away from the ceter of the data ad are ot represetative of the populatio you are tryig to model For ormally distributed data sets, about 9.4% of the data should fall withi two stadard deviatios of the mea So, we d expect 4.% to be outside two stadard deviatios 99.7% of the data should be withi three stadard deviatios of the mea If a data poit is more tha three stadard deviatios from the mea, it is a potetial outlier Tip: The ormal distributio is a good first approximatio, but if your data are sigificatly skewed, these rules of thumb should ot be used to idetify potetial outliers. Uit III - Module Outliers ad Tred Lies Outliers may bias the regressio lie y Cost vs. Weight All Data ad Potetial Outlier Removed y = 1.31x R² =.166 y = 1.793x R² =.76 1 Weight Possible outlier, 4.4 stadard deviatios from the mea Tip: If usig two graphs, do ot chage scale of axes whe comparig! Uit III - Module 6 4 Without the possible outlier, the slope of the regressio lie is steeper ad the R is higher

21 CEB - Basic Data Aalysis Priciples Removig Outliers Do ot remove a outlier from the data without a good reaso! Doig so removes some of the variatio preset i history Doig so ca be a form of cookig the data Good reasos for removig a outlier: Program was restructured or divided Oe of these is ot like the others e.g., a helo i a set of missile data Bad reasos for removig a outlier: Too high stadard deviatios away from the mea [!] Uit III - Module 6 41 Tip: Outlier treatmet separates the aalysts from the spi meisters 4 Rules of Thumb Compare your descriptive statistics to historical rules of thumb NCCA Stadard Factors hadbook, for example Saity check! Tip: Compariso to history ad cross checks separates the thorough from the sloppy Uit III - Module 6 4 1

22 CEB - Basic Data Aalysis Priciples Two Cautioary Tales Expert s Eyeball Descriptive Statistics ad Visual Displays Techical Huch Outliers Uit III - Module 6 43 Egieerig Judgmets 14 Suppose we are give a estimate that has egieerig judgmet as its basis Egieerig judgmets should ever be accepted without validatio! The aalyst must fid out if the guess is correct, or at least i the ballpark Experts ofte possess isight or ituitio regardig systems that bears o cost, but it is the aalyst s job to make the estimate explicit ad reproducible Uit III - Module 6 44

23 Percet of First Ship CEB - Basic Data Aalysis Priciples Example: Expert s Eyeball Follow Ship Support Hull FF DDG 37 Average 7.1% 4.%.9% 3 9.4%.1% 7.% 4 9.% 4.3% 6.7% % Is the average a good idea? Is the th ship guess right? Uit III - Module 6 4 Example: Expert s Eyeball The average is a good umber! 3 1 Decrease i Follow-Ship Support Whe the average lie is extrapolated, it looks like th ship should be about 6% Tip: Graphic esures cost estimate credibility! FF DDG 37 Average 3 4 Hull The th ship guess of 4% looks too low! Uit III - Module

24 CEB - Basic Data Aalysis Priciples Example: Techical Huch I this real-life example, we will look at the importace of correctly ivestigatig outliers Scatter plots ca be extremely useful i idetifyig potetial outliers Uit III - Module 6 47 Example: Techical Huch Shakedow Hull Hours/To DD DD 9.4 DD 93.3 DD 96.1 DD 963 is too low for a first ship Uit III - Module 6 4 4

25 Hours per To CEB - Basic Data Aalysis Priciples Wrog Outlier Rejected! Istead of DD 963, look ito DD 9 That s the potetial outlier! This lie produces a more reasoable th ship estimate. Hours/To Hull Hours/To The expert s curve is urealistic at the th ship! Uit III - Module 6 49 Data Aalysis Summary Steps of basic data aalysis 1. Scatter plot visual depictio of the relatioships i the data. Descriptive statistics calculate the meas ad CVs If the CV is uder 1%, the average may be a sufficiet predictor, focus more attetio o elemets with higher CVs If the CV is over 1%, focus o this elemet usig regressio aalysis to look for a better predictor tha the average (CER developmet) 3. Look for outliers (data quality check) 4. Compare to history Uit III - Module 6

26 CEB - Basic Data Aalysis Priciples Resources A Itroductio to Mathematical Statistics ad Its Applicatios, 3 rd ed., Richard J. Larse ad Morris L. Marx, Pretice Hall, Probability ad Statistics for Egieerig ad the Scieces, th ed., Jay L. Devore, Brooks/Cole Publishig, 1999 Calculus: Sigle Variable, Deborah Hughes-Hallett ad Adrew Gleaso, Joh Wiley & Sos, 199. How to Lie with Statistics, Darrel Huff, W.W. Norto & Compay, 194 The Visual Display of Quatitative Iformatio, Edward R. Tufte, Graphics Press, 193 Evisioig Iformatio, Edward R. Tufte, Graphics Press, 199 Visual Explaatios, Edward R. Tufte, Graphics Press, 1997 Beautiful Evidece, Edward R. Tufte, Graphics Press, 6 Uit III - Module 6 1 Related ad Advaced Topics Visual Display of Iformatio Additioal Graph Types for Uivariate Stem-ad-Leaf Boxplots Bi Width ad Number Rules Mea - Metal Math Trick Sample Sizes Cofidece Itervals CI Simplified Sufficietly Large Rules of Thumb Outlier Idetificatio Rules Uit III - Module 6 6

27 Hours CEB - Basic Data Aalysis Priciples Visual Display of Iformatio Poor visual displays of iformatio hider uderstadig Excel s default scatter plot is ot a oe-size-fitsall iformatio display Quick fixes esure a graph ca truly give the gift of sight Use evocative colors to your advatage Size matters Make sure the graph fills the space the data is the mai evet! Check the scale Choose a fot size Check the placemet of the leged Two possible displays follow Uit III - Module 6 3 Visual Display of Iformatio Excel Default Visual Disply Example Uit Series1 Series Series3 Series4 Uit III - Module 6 4 7

28 Hours- Thousads CEB - Basic Data Aalysis Priciples Visual Display of Iformatio Aother Display Visual Display Example Series1 Series Series3 Series4 Uit Uit III - Module 6 Stem-ad-Leaf Plots Similar to a histogram Horizotal umbers istead of vertical bars Example: Therms of atural gas used Uit III - Module 6 6 Mode = 4 therms

29 CEB - Basic Data Aalysis Priciples Box Plots 9 Lower Fourth Media Upper Fourth Data poit betwee 1. f s ad 3 f s from the Upper Fourth Potetial Outlier: Data poit more tha 3 f s from the Upper Fourth Lowest data poit withi 1. f s of the Lower Fourth f s = Upper Fourth Lower Fourth = Iterquartile Distace Highest data poit withi 1. f s of the Upper Fourth Uit III - Module 6 7 Box Plots Applicatio Box plots ca be used to: Show the ceter, spread, ad symmetry of the data Idetify outliers A sample box plot is show o the previous slide, ad a real-world oe below: Media Bill = $16. Mothly Gas Bill ($) Uit III - Module 6 9

30 CEB - Basic Data Aalysis Priciples Bi Widths ad Number Rules Various Rules for Bi Width (h) or Number of Bis (k) based o Number of Data Poits (), Sample Stadard Deviatio (s), ad Iterquartile Rage (IQR) Bi Width (h) Number of Bis (k) Assumptios Commets Square Root Rule Max x i Mi k x Sturges Rule Maxx i Mi xi k i Scott s Rule 3.s Maxx i Mix i 3 h Freedma Diacois Rule IQR Maxx i Mix i 3 h - 3 log 1 Normal distributio Normal distributio Used i Excel Data Aalysis Histogram tool Used by DAU Reasoable default if data ot too skewed Modifies Scott s Rule by focusig o IQR istead of s NEW! Uit III - Module 6 9 Mea Metal Math Trick X The mea ca also be a arbitrary umber plus the average of the deviatios from that umber: X i i1 i1 X X i * X * * * X i X X i X * X i1 * i1 X Mothly average therms used data: {37, 6, 13, 3, 3, 3, 3, 3, 4, 7, 1, 4} Average = + ( )/1 = + 43/1 = 13.6 Uit III - Module 6 6 3

31 CEB - Basic Data Aalysis Priciples Sample Sizes Cofidece Iterval How big a sample size do we eed so that a 6.3% Cofidece Iterval (oe stadard deviatio) about the estimate is +/-% of the estimate? i.e., there is 6.3% probability that the populatio mea is withi % of our estimated mea. Cosider the cofidece iterval for the mea of a ormal distributio x t s x t s Note that the size of the rage aroud the estimate of the mea is a fuctio of: the variability, captured by stadard deviatio, s, or coefficiet of variatio, CV the sample size, a /, 1,, a /, 1 Uit III - Module 6 61 Note: we are assumig a ormal distributio for simplicity Sample Sizes CI Simplified Istead of workig with stadard deviatios, we would like to shift to CVs CVs are uit-less ad more ituitive (expressed i percets) So, divide the rage by x s x ta /, 1 x t x a /, 1 1 ta /, 1 x CV s Uit III - Module 6 6 This shifts the rage ito percets. The rage is relative to % of the estimate t a /, 1 CV 31

32 CEB - Basic Data Aalysis Priciples Sample Sizes Sufficietly Large What sample size is eeded for judgmets withi %? For a 6.3% two-tailed CI, we have a = = 31.7% ad thus a/ = 1.9% 1.9% 1.9% Suppose we have a CV of 3% 6.3% CV t.19, +/- 4 3% % 9 3% % 16 3% 1.34 % CV 3% 1.44 t 6% a /, % % We would like to be able to make judgmets withi about % poits, so with a CV of 3%, we eed 36 t a/,-1 Note: for a 9% CI we would use a =.. The t multipliers would vary from.7 to.3 Uit III - Module 6 63 Sample Sizes Rule of Thumb For a easy rule of thumb, we ca just roud the t value to t = 1 CV The, we use simply CV t.19, Exact +/- Thumb rule 4 3% % 1% 9 3% % % 16 3% 1.34 CV % % 3% 1.44 ta /, 1 6% 6% 36 3% % % Tip: For a 6.3% CI, use CV. For a 9% CI, use CV. Uit III - Module

33 CEB - Basic Data Aalysis Priciples Outlier Idetificatio Rules Rule Outlier(s) Iff Ratioale Chauveet s Criterio Grubbs Test Dixo s Q Test x x 1. s 1 G, t a / a / Gap/Rage > (critical value from table), where Gap = distace betwee outlier ad its closest eighbor t Uit III - Module 6 6 Normal distributio properties Normal distributio properties, where x x G Max s Uclear. Will ot detect two approximately equal outliers. IQR-Based x ot i the iterval Ca customize k based o choice of distributio, α, ad. For Q 1 kq3 Q1, Q3 kq3 Q1 example, i a ormal distributio, k = 3 implies that < % of poits should fall outside the rage., NEW! 33

GCSE STATISTICS. 4) How to calculate the range: The difference between the biggest number and the smallest number.

GCSE STATISTICS. 4) How to calculate the range: The difference between the biggest number and the smallest number. GCSE STATISTICS You should kow: 1) How to draw a frequecy diagram: e.g. NUMBER TALLY FREQUENCY 1 3 5 ) How to draw a bar chart, a pictogram, ad a pie chart. 3) How to use averages: a) Mea - add up all

More information

Measures of Spread and Boxplots Discrete Math, Section 9.4

Measures of Spread and Boxplots Discrete Math, Section 9.4 Measures of Spread ad Boxplots Discrete Math, Sectio 9.4 We start with a example: Example 1: Comparig Mea ad Media Compute the mea ad media of each data set: S 1 = {4, 6, 8, 10, 1, 14, 16} S = {4, 7, 9,

More information

Case Study. Normal and t Distributions. Density Plot. Normal Distributions

Case Study. Normal and t Distributions. Density Plot. Normal Distributions Case Study Normal ad t Distributios Bret Halo ad Bret Larget Departmet of Statistics Uiversity of Wiscosi Madiso October 11 13, 2011 Case Study Body temperature varies withi idividuals over time (it ca

More information

PSYCHOLOGICAL STATISTICS

PSYCHOLOGICAL STATISTICS UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION B Sc. Cousellig Psychology (0 Adm.) IV SEMESTER COMPLEMENTARY COURSE PSYCHOLOGICAL STATISTICS QUESTION BANK. Iferetial statistics is the brach of statistics

More information

Non-life insurance mathematics. Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring

Non-life insurance mathematics. Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring No-life isurace mathematics Nils F. Haavardsso, Uiversity of Oslo ad DNB Skadeforsikrig Mai issues so far Why does isurace work? How is risk premium defied ad why is it importat? How ca claim frequecy

More information

I. Chi-squared Distributions

I. Chi-squared Distributions 1 M 358K Supplemet to Chapter 23: CHI-SQUARED DISTRIBUTIONS, T-DISTRIBUTIONS, AND DEGREES OF FREEDOM To uderstad t-distributios, we first eed to look at aother family of distributios, the chi-squared distributios.

More information

5: Introduction to Estimation

5: Introduction to Estimation 5: Itroductio to Estimatio Cotets Acroyms ad symbols... 1 Statistical iferece... Estimatig µ with cofidece... 3 Samplig distributio of the mea... 3 Cofidece Iterval for μ whe σ is kow before had... 4 Sample

More information

Confidence Intervals for One Mean

Confidence Intervals for One Mean Chapter 420 Cofidece Itervals for Oe Mea Itroductio This routie calculates the sample size ecessary to achieve a specified distace from the mea to the cofidece limit(s) at a stated cofidece level for a

More information

Output Analysis (2, Chapters 10 &11 Law)

Output Analysis (2, Chapters 10 &11 Law) B. Maddah ENMG 6 Simulatio 05/0/07 Output Aalysis (, Chapters 10 &11 Law) Comparig alterative system cofiguratio Sice the output of a simulatio is radom, the comparig differet systems via simulatio should

More information

Hypothesis testing. Null and alternative hypotheses

Hypothesis testing. Null and alternative hypotheses Hypothesis testig Aother importat use of samplig distributios is to test hypotheses about populatio parameters, e.g. mea, proportio, regressio coefficiets, etc. For example, it is possible to stipulate

More information

Now here is the important step

Now here is the important step LINEST i Excel The Excel spreadsheet fuctio "liest" is a complete liear least squares curve fittig routie that produces ucertaity estimates for the fit values. There are two ways to access the "liest"

More information

Exploratory Data Analysis

Exploratory Data Analysis 1 Exploratory Data Aalysis Exploratory data aalysis is ofte the rst step i a statistical aalysis, for it helps uderstadig the mai features of the particular sample that a aalyst is usig. Itelliget descriptios

More information

1 Correlation and Regression Analysis

1 Correlation and Regression Analysis 1 Correlatio ad Regressio Aalysis I this sectio we will be ivestigatig the relatioship betwee two cotiuous variable, such as height ad weight, the cocetratio of a ijected drug ad heart rate, or the cosumptio

More information

Center, Spread, and Shape in Inference: Claims, Caveats, and Insights

Center, Spread, and Shape in Inference: Claims, Caveats, and Insights Ceter, Spread, ad Shape i Iferece: Claims, Caveats, ad Isights Dr. Nacy Pfeig (Uiversity of Pittsburgh) AMATYC November 2008 Prelimiary Activities 1. I would like to produce a iterval estimate for the

More information

Z-TEST / Z-STATISTIC: used to test hypotheses about. µ when the population standard deviation is unknown

Z-TEST / Z-STATISTIC: used to test hypotheses about. µ when the population standard deviation is unknown Z-TEST / Z-STATISTIC: used to test hypotheses about µ whe the populatio stadard deviatio is kow ad populatio distributio is ormal or sample size is large T-TEST / T-STATISTIC: used to test hypotheses about

More information

Chapter 7: Confidence Interval and Sample Size

Chapter 7: Confidence Interval and Sample Size Chapter 7: Cofidece Iterval ad Sample Size Learig Objectives Upo successful completio of Chapter 7, you will be able to: Fid the cofidece iterval for the mea, proportio, ad variace. Determie the miimum

More information

Biology 171L Environment and Ecology Lab Lab 2: Descriptive Statistics, Presenting Data and Graphing Relationships

Biology 171L Environment and Ecology Lab Lab 2: Descriptive Statistics, Presenting Data and Graphing Relationships Biology 171L Eviromet ad Ecology Lab Lab : Descriptive Statistics, Presetig Data ad Graphig Relatioships Itroductio Log lists of data are ofte ot very useful for idetifyig geeral treds i the data or the

More information

Determining the sample size

Determining the sample size Determiig the sample size Oe of the most commo questios ay statisticia gets asked is How large a sample size do I eed? Researchers are ofte surprised to fid out that the aswer depeds o a umber of factors

More information

Analyzing Longitudinal Data from Complex Surveys Using SUDAAN

Analyzing Longitudinal Data from Complex Surveys Using SUDAAN Aalyzig Logitudial Data from Complex Surveys Usig SUDAAN Darryl Creel Statistics ad Epidemiology, RTI Iteratioal, 312 Trotter Farm Drive, Rockville, MD, 20850 Abstract SUDAAN: Software for the Statistical

More information

Lesson 17 Pearson s Correlation Coefficient

Lesson 17 Pearson s Correlation Coefficient Outlie Measures of Relatioships Pearso s Correlatio Coefficiet (r) -types of data -scatter plots -measure of directio -measure of stregth Computatio -covariatio of X ad Y -uique variatio i X ad Y -measurig

More information

Maximum Likelihood Estimators.

Maximum Likelihood Estimators. Lecture 2 Maximum Likelihood Estimators. Matlab example. As a motivatio, let us look at oe Matlab example. Let us geerate a radom sample of size 00 from beta distributio Beta(5, 2). We will lear the defiitio

More information

THE REGRESSION MODEL IN MATRIX FORM. For simple linear regression, meaning one predictor, the model is. for i = 1, 2, 3,, n

THE REGRESSION MODEL IN MATRIX FORM. For simple linear regression, meaning one predictor, the model is. for i = 1, 2, 3,, n We will cosider the liear regressio model i matrix form. For simple liear regressio, meaig oe predictor, the model is i = + x i + ε i for i =,,,, This model icludes the assumptio that the ε i s are a sample

More information

CHAPTER 7: Central Limit Theorem: CLT for Averages (Means)

CHAPTER 7: Central Limit Theorem: CLT for Averages (Means) CHAPTER 7: Cetral Limit Theorem: CLT for Averages (Meas) X = the umber obtaied whe rollig oe six sided die oce. If we roll a six sided die oce, the mea of the probability distributio is X P(X = x) Simulatio:

More information

Chapter 7 Methods of Finding Estimators

Chapter 7 Methods of Finding Estimators Chapter 7 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 011 Chapter 7 Methods of Fidig Estimators Sectio 7.1 Itroductio Defiitio 7.1.1 A poit estimator is ay fuctio W( X) W( X1, X,, X ) of

More information

Overview. Learning Objectives. Point Estimate. Estimation. Estimating the Value of a Parameter Using Confidence Intervals

Overview. Learning Objectives. Point Estimate. Estimation. Estimating the Value of a Parameter Using Confidence Intervals Overview Estimatig the Value of a Parameter Usig Cofidece Itervals We apply the results about the sample mea the problem of estimatio Estimatio is the process of usig sample data estimate the value of

More information

1 Computing the Standard Deviation of Sample Means

1 Computing the Standard Deviation of Sample Means Computig the Stadard Deviatio of Sample Meas Quality cotrol charts are based o sample meas ot o idividual values withi a sample. A sample is a group of items, which are cosidered all together for our aalysis.

More information

Approximating Area under a curve with rectangles. To find the area under a curve we approximate the area using rectangles and then use limits to find

Approximating Area under a curve with rectangles. To find the area under a curve we approximate the area using rectangles and then use limits to find 1.8 Approximatig Area uder a curve with rectagles 1.6 To fid the area uder a curve we approximate the area usig rectagles ad the use limits to fid 1.4 the area. Example 1 Suppose we wat to estimate 1.

More information

1. C. The formula for the confidence interval for a population mean is: x t, which was

1. C. The formula for the confidence interval for a population mean is: x t, which was s 1. C. The formula for the cofidece iterval for a populatio mea is: x t, which was based o the sample Mea. So, x is guarateed to be i the iterval you form.. D. Use the rule : p-value

More information

Normal Distribution.

Normal Distribution. Normal Distributio www.icrf.l Normal distributio I probability theory, the ormal or Gaussia distributio, is a cotiuous probability distributio that is ofte used as a first approimatio to describe realvalued

More information

University of California, Los Angeles Department of Statistics. Distributions related to the normal distribution

University of California, Los Angeles Department of Statistics. Distributions related to the normal distribution Uiversity of Califoria, Los Ageles Departmet of Statistics Statistics 100B Istructor: Nicolas Christou Three importat distributios: Distributios related to the ormal distributio Chi-square (χ ) distributio.

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics We leared to describe data sets graphically. We ca also describe a data set umerically. Measures of Locatio Defiitio The sample mea is the arithmetic average of values. We deote

More information

Data Analysis and Statistical Behaviors of Stock Market Fluctuations

Data Analysis and Statistical Behaviors of Stock Market Fluctuations 44 JOURNAL OF COMPUTERS, VOL. 3, NO. 0, OCTOBER 2008 Data Aalysis ad Statistical Behaviors of Stock Market Fluctuatios Ju Wag Departmet of Mathematics, Beijig Jiaotog Uiversity, Beijig 00044, Chia Email:

More information

Chapter 6: Variance, the law of large numbers and the Monte-Carlo method

Chapter 6: Variance, the law of large numbers and the Monte-Carlo method Chapter 6: Variace, the law of large umbers ad the Mote-Carlo method Expected value, variace, ad Chebyshev iequality. If X is a radom variable recall that the expected value of X, E[X] is the average value

More information

LECTURE 13: Cross-validation

LECTURE 13: Cross-validation LECTURE 3: Cross-validatio Resampli methods Cross Validatio Bootstrap Bias ad variace estimatio with the Bootstrap Three-way data partitioi Itroductio to Patter Aalysis Ricardo Gutierrez-Osua Texas A&M

More information

MEI Structured Mathematics. Module Summary Sheets. Statistics 2 (Version B: reference to new book)

MEI Structured Mathematics. Module Summary Sheets. Statistics 2 (Version B: reference to new book) MEI Mathematics i Educatio ad Idustry MEI Structured Mathematics Module Summary Sheets Statistics (Versio B: referece to ew book) Topic : The Poisso Distributio Topic : The Normal Distributio Topic 3:

More information

Chapter 7 - Sampling Distributions. 1 Introduction. What is statistics? It consist of three major areas:

Chapter 7 - Sampling Distributions. 1 Introduction. What is statistics? It consist of three major areas: Chapter 7 - Samplig Distributios 1 Itroductio What is statistics? It cosist of three major areas: Data Collectio: samplig plas ad experimetal desigs Descriptive Statistics: umerical ad graphical summaries

More information

In nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008

In nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008 I ite Sequeces Dr. Philippe B. Laval Keesaw State Uiversity October 9, 2008 Abstract This had out is a itroductio to i ite sequeces. mai de itios ad presets some elemetary results. It gives the I ite Sequeces

More information

Properties of MLE: consistency, asymptotic normality. Fisher information.

Properties of MLE: consistency, asymptotic normality. Fisher information. Lecture 3 Properties of MLE: cosistecy, asymptotic ormality. Fisher iformatio. I this sectio we will try to uderstad why MLEs are good. Let us recall two facts from probability that we be used ofte throughout

More information

INVESTMENT PERFORMANCE COUNCIL (IPC) Guidance Statement on Calculation Methodology

INVESTMENT PERFORMANCE COUNCIL (IPC) Guidance Statement on Calculation Methodology Adoptio Date: 4 March 2004 Effective Date: 1 Jue 2004 Retroactive Applicatio: No Public Commet Period: Aug Nov 2002 INVESTMENT PERFORMANCE COUNCIL (IPC) Preface Guidace Statemet o Calculatio Methodology

More information

The following example will help us understand The Sampling Distribution of the Mean. C1 C2 C3 C4 C5 50 miles 84 miles 38 miles 120 miles 48 miles

The following example will help us understand The Sampling Distribution of the Mean. C1 C2 C3 C4 C5 50 miles 84 miles 38 miles 120 miles 48 miles The followig eample will help us uderstad The Samplig Distributio of the Mea Review: The populatio is the etire collectio of all idividuals or objects of iterest The sample is the portio of the populatio

More information

, a Wishart distribution with n -1 degrees of freedom and scale matrix.

, a Wishart distribution with n -1 degrees of freedom and scale matrix. UMEÅ UNIVERSITET Matematisk-statistiska istitutioe Multivariat dataaalys D MSTD79 PA TENTAMEN 004-0-9 LÖSNINGSFÖRSLAG TILL TENTAMEN I MATEMATISK STATISTIK Multivariat dataaalys D, 5 poäg.. Assume that

More information

Institute of Actuaries of India Subject CT1 Financial Mathematics

Institute of Actuaries of India Subject CT1 Financial Mathematics Istitute of Actuaries of Idia Subject CT1 Fiacial Mathematics For 2014 Examiatios Subject CT1 Fiacial Mathematics Core Techical Aim The aim of the Fiacial Mathematics subject is to provide a groudig i

More information

0.7 0.6 0.2 0 0 96 96.5 97 97.5 98 98.5 99 99.5 100 100.5 96.5 97 97.5 98 98.5 99 99.5 100 100.5

0.7 0.6 0.2 0 0 96 96.5 97 97.5 98 98.5 99 99.5 100 100.5 96.5 97 97.5 98 98.5 99 99.5 100 100.5 Sectio 13 Kolmogorov-Smirov test. Suppose that we have a i.i.d. sample X 1,..., X with some ukow distributio P ad we would like to test the hypothesis that P is equal to a particular distributio P 0, i.e.

More information

Inference on Proportion. Chapter 8 Tests of Statistical Hypotheses. Sampling Distribution of Sample Proportion. Confidence Interval

Inference on Proportion. Chapter 8 Tests of Statistical Hypotheses. Sampling Distribution of Sample Proportion. Confidence Interval Chapter 8 Tests of Statistical Hypotheses 8. Tests about Proportios HT - Iferece o Proportio Parameter: Populatio Proportio p (or π) (Percetage of people has o health isurace) x Statistic: Sample Proportio

More information

Lesson 15 ANOVA (analysis of variance)

Lesson 15 ANOVA (analysis of variance) Outlie Variability -betwee group variability -withi group variability -total variability -F-ratio Computatio -sums of squares (betwee/withi/total -degrees of freedom (betwee/withi/total -mea square (betwee/withi

More information

Quadrat Sampling in Population Ecology

Quadrat Sampling in Population Ecology Quadrat Samplig i Populatio Ecology Backgroud Estimatig the abudace of orgaisms. Ecology is ofte referred to as the "study of distributio ad abudace". This beig true, we would ofte like to kow how may

More information

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the.

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the. Cofidece Itervals A cofidece iterval is a iterval whose purpose is to estimate a parameter (a umber that could, i theory, be calculated from the populatio, if measuremets were available for the whole populatio).

More information

CHAPTER 3 THE TIME VALUE OF MONEY

CHAPTER 3 THE TIME VALUE OF MONEY CHAPTER 3 THE TIME VALUE OF MONEY OVERVIEW A dollar i the had today is worth more tha a dollar to be received i the future because, if you had it ow, you could ivest that dollar ad ear iterest. Of all

More information

hp calculators HP 12C Statistics - average and standard deviation Average and standard deviation concepts HP12C average and standard deviation

hp calculators HP 12C Statistics - average and standard deviation Average and standard deviation concepts HP12C average and standard deviation HP 1C Statistics - average ad stadard deviatio Average ad stadard deviatio cocepts HP1C average ad stadard deviatio Practice calculatig averages ad stadard deviatios with oe or two variables HP 1C Statistics

More information

One-sample test of proportions

One-sample test of proportions Oe-sample test of proportios The Settig: Idividuals i some populatio ca be classified ito oe of two categories. You wat to make iferece about the proportio i each category, so you draw a sample. Examples:

More information

Research Method (I) --Knowledge on Sampling (Simple Random Sampling)

Research Method (I) --Knowledge on Sampling (Simple Random Sampling) Research Method (I) --Kowledge o Samplig (Simple Radom Samplig) 1. Itroductio to samplig 1.1 Defiitio of samplig Samplig ca be defied as selectig part of the elemets i a populatio. It results i the fact

More information

7. Concepts in Probability, Statistics and Stochastic Modelling

7. Concepts in Probability, Statistics and Stochastic Modelling 7. Cocepts i Probability, Statistics ad Stochastic Modellig 1. Itroductio 169. Probability Cocepts ad Methods 170.1. Radom Variables ad Distributios 170.. Expectatio 173.3. Quatiles, Momets ad Their Estimators

More information

SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES

SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES Read Sectio 1.5 (pages 5 9) Overview I Sectio 1.5 we lear to work with summatio otatio ad formulas. We will also itroduce a brief overview of sequeces,

More information

15.075 Exam 3. Instructor: Cynthia Rudin TA: Dimitrios Bisias. November 22, 2011

15.075 Exam 3. Instructor: Cynthia Rudin TA: Dimitrios Bisias. November 22, 2011 15.075 Exam 3 Istructor: Cythia Rudi TA: Dimitrios Bisias November 22, 2011 Gradig is based o demostratio of coceptual uderstadig, so you eed to show all of your work. Problem 1 A compay makes high-defiitio

More information

INVESTMENT PERFORMANCE COUNCIL (IPC)

INVESTMENT PERFORMANCE COUNCIL (IPC) INVESTMENT PEFOMANCE COUNCIL (IPC) INVITATION TO COMMENT: Global Ivestmet Performace Stadards (GIPS ) Guidace Statemet o Calculatio Methodology The Associatio for Ivestmet Maagemet ad esearch (AIM) seeks

More information

Soving Recurrence Relations

Soving Recurrence Relations Sovig Recurrece Relatios Part 1. Homogeeous liear 2d degree relatios with costat coefficiets. Cosider the recurrece relatio ( ) T () + at ( 1) + bt ( 2) = 0 This is called a homogeeous liear 2d degree

More information

Systems Design Project: Indoor Location of Wireless Devices

Systems Design Project: Indoor Location of Wireless Devices Systems Desig Project: Idoor Locatio of Wireless Devices Prepared By: Bria Murphy Seior Systems Sciece ad Egieerig Washigto Uiversity i St. Louis Phoe: (805) 698-5295 Email: bcm1@cec.wustl.edu Supervised

More information

Modified Line Search Method for Global Optimization

Modified Line Search Method for Global Optimization Modified Lie Search Method for Global Optimizatio Cria Grosa ad Ajith Abraham Ceter of Excellece for Quatifiable Quality of Service Norwegia Uiversity of Sciece ad Techology Trodheim, Norway {cria, ajith}@q2s.tu.o

More information

Tradigms of Astundithi and Toyota

Tradigms of Astundithi and Toyota Tradig the radomess - Desigig a optimal tradig strategy uder a drifted radom walk price model Yuao Wu Math 20 Project Paper Professor Zachary Hamaker Abstract: I this paper the author iteds to explore

More information

Mann-Whitney U 2 Sample Test (a.k.a. Wilcoxon Rank Sum Test)

Mann-Whitney U 2 Sample Test (a.k.a. Wilcoxon Rank Sum Test) No-Parametric ivariate Statistics: Wilcoxo-Ma-Whitey 2 Sample Test 1 Ma-Whitey 2 Sample Test (a.k.a. Wilcoxo Rak Sum Test) The (Wilcoxo-) Ma-Whitey (WMW) test is the o-parametric equivalet of a pooled

More information

Subject CT5 Contingencies Core Technical Syllabus

Subject CT5 Contingencies Core Technical Syllabus Subject CT5 Cotigecies Core Techical Syllabus for the 2015 exams 1 Jue 2014 Aim The aim of the Cotigecies subject is to provide a groudig i the mathematical techiques which ca be used to model ad value

More information

COMPARISON OF THE EFFICIENCY OF S-CONTROL CHART AND EWMA-S 2 CONTROL CHART FOR THE CHANGES IN A PROCESS

COMPARISON OF THE EFFICIENCY OF S-CONTROL CHART AND EWMA-S 2 CONTROL CHART FOR THE CHANGES IN A PROCESS COMPARISON OF THE EFFICIENCY OF S-CONTROL CHART AND EWMA-S CONTROL CHART FOR THE CHANGES IN A PROCESS Supraee Lisawadi Departmet of Mathematics ad Statistics, Faculty of Sciece ad Techoology, Thammasat

More information

Bio-Plex Manager Software

Bio-Plex Manager Software Multiplex Suspesio Array Bio-Plex Maager Software Extract Kowledge Faster Move Your Research Forward Bio-Rad cotiues to iovate where it matters most. With Bio-Plex Maager 5.0 software, we offer valuable

More information

Forecasting. Forecasting Application. Practical Forecasting. Chapter 7 OVERVIEW KEY CONCEPTS. Chapter 7. Chapter 7

Forecasting. Forecasting Application. Practical Forecasting. Chapter 7 OVERVIEW KEY CONCEPTS. Chapter 7. Chapter 7 Forecastig Chapter 7 Chapter 7 OVERVIEW Forecastig Applicatios Qualitative Aalysis Tred Aalysis ad Projectio Busiess Cycle Expoetial Smoothig Ecoometric Forecastig Judgig Forecast Reliability Choosig the

More information

NATIONAL SENIOR CERTIFICATE GRADE 11

NATIONAL SENIOR CERTIFICATE GRADE 11 NATIONAL SENIOR CERTIFICATE GRADE MATHEMATICS P EXEMPLAR 007 MARKS: 50 TIME: 3 hours This questio paper cosists of pages, 4 diagram sheets ad a -page formula sheet. Please tur over Mathematics/P DoE/Exemplar

More information

A probabilistic proof of a binomial identity

A probabilistic proof of a binomial identity A probabilistic proof of a biomial idetity Joatho Peterso Abstract We give a elemetary probabilistic proof of a biomial idetity. The proof is obtaied by computig the probability of a certai evet i two

More information

UM USER SATISFACTION SURVEY 2011. Final Report. September 2, 2011. Prepared by. ers e-research & Solutions (Macau)

UM USER SATISFACTION SURVEY 2011. Final Report. September 2, 2011. Prepared by. ers e-research & Solutions (Macau) UM USER SATISFACTION SURVEY 2011 Fial Report September 2, 2011 Prepared by ers e-research & Solutios (Macau) 1 UM User Satisfactio Survey 2011 A Collaboratio Work by Project Cosultat Dr. Agus Cheog ers

More information

Chapter 5 Unit 1. IET 350 Engineering Economics. Learning Objectives Chapter 5. Learning Objectives Unit 1. Annual Amount and Gradient Functions

Chapter 5 Unit 1. IET 350 Engineering Economics. Learning Objectives Chapter 5. Learning Objectives Unit 1. Annual Amount and Gradient Functions Chapter 5 Uit Aual Amout ad Gradiet Fuctios IET 350 Egieerig Ecoomics Learig Objectives Chapter 5 Upo completio of this chapter you should uderstad: Calculatig future values from aual amouts. Calculatig

More information

Trigonometric Form of a Complex Number. The Complex Plane. axis. ( 2, 1) or 2 i FIGURE 6.44. The absolute value of the complex number z a bi is

Trigonometric Form of a Complex Number. The Complex Plane. axis. ( 2, 1) or 2 i FIGURE 6.44. The absolute value of the complex number z a bi is 0_0605.qxd /5/05 0:45 AM Page 470 470 Chapter 6 Additioal Topics i Trigoometry 6.5 Trigoometric Form of a Complex Number What you should lear Plot complex umbers i the complex plae ad fid absolute values

More information

Overview of some probability distributions.

Overview of some probability distributions. Lecture Overview of some probability distributios. I this lecture we will review several commo distributios that will be used ofte throughtout the class. Each distributio is usually described by its probability

More information

Definition. A variable X that takes on values X 1, X 2, X 3,...X k with respective frequencies f 1, f 2, f 3,...f k has mean

Definition. A variable X that takes on values X 1, X 2, X 3,...X k with respective frequencies f 1, f 2, f 3,...f k has mean 1 Social Studies 201 October 13, 2004 Note: The examples i these otes may be differet tha used i class. However, the examples are similar ad the methods used are idetical to what was preseted i class.

More information

*The most important feature of MRP as compared with ordinary inventory control analysis is its time phasing feature.

*The most important feature of MRP as compared with ordinary inventory control analysis is its time phasing feature. Itegrated Productio ad Ivetory Cotrol System MRP ad MRP II Framework of Maufacturig System Ivetory cotrol, productio schedulig, capacity plaig ad fiacial ad busiess decisios i a productio system are iterrelated.

More information

A Balanced Scorecard

A Balanced Scorecard A Balaced Scorecard with VISION A Visio Iteratioal White Paper Visio Iteratioal A/S Aarhusgade 88, DK-2100 Copehage, Demark Phoe +45 35430086 Fax +45 35434646 www.balaced-scorecard.com 1 1. Itroductio

More information

Statistical inference: example 1. Inferential Statistics

Statistical inference: example 1. Inferential Statistics Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either

More information

NATIONAL SENIOR CERTIFICATE GRADE 12

NATIONAL SENIOR CERTIFICATE GRADE 12 NATIONAL SENIOR CERTIFICATE GRADE MATHEMATICS P EXEMPLAR 04 MARKS: 50 TIME: 3 hours This questio paper cosists of 8 pages ad iformatio sheet. Please tur over Mathematics/P DBE/04 NSC Grade Eemplar INSTRUCTIONS

More information

ODBC. Getting Started With Sage Timberline Office ODBC

ODBC. Getting Started With Sage Timberline Office ODBC ODBC Gettig Started With Sage Timberlie Office ODBC NOTICE This documet ad the Sage Timberlie Office software may be used oly i accordace with the accompayig Sage Timberlie Office Ed User Licese Agreemet.

More information

Math C067 Sampling Distributions

Math C067 Sampling Distributions Math C067 Samplig Distributios Sample Mea ad Sample Proportio Richard Beigel Some time betwee April 16, 2007 ad April 16, 2007 Examples of Samplig A pollster may try to estimate the proportio of voters

More information

Incremental calculation of weighted mean and variance

Incremental calculation of weighted mean and variance Icremetal calculatio of weighted mea ad variace Toy Fich faf@cam.ac.uk dot@dotat.at Uiversity of Cambridge Computig Service February 009 Abstract I these otes I eplai how to derive formulae for umerically

More information

This document contains a collection of formulas and constants useful for SPC chart construction. It assumes you are already familiar with SPC.

This document contains a collection of formulas and constants useful for SPC chart construction. It assumes you are already familiar with SPC. SPC Formulas ad Tables 1 This documet cotais a collectio of formulas ad costats useful for SPC chart costructio. It assumes you are already familiar with SPC. Termiology Geerally, a bar draw over a symbol

More information

The analysis of the Cournot oligopoly model considering the subjective motive in the strategy selection

The analysis of the Cournot oligopoly model considering the subjective motive in the strategy selection The aalysis of the Courot oligopoly model cosiderig the subjective motive i the strategy selectio Shigehito Furuyama Teruhisa Nakai Departmet of Systems Maagemet Egieerig Faculty of Egieerig Kasai Uiversity

More information

PROCEEDINGS OF THE YEREVAN STATE UNIVERSITY AN ALTERNATIVE MODEL FOR BONUS-MALUS SYSTEM

PROCEEDINGS OF THE YEREVAN STATE UNIVERSITY AN ALTERNATIVE MODEL FOR BONUS-MALUS SYSTEM PROCEEDINGS OF THE YEREVAN STATE UNIVERSITY Physical ad Mathematical Scieces 2015, 1, p. 15 19 M a t h e m a t i c s AN ALTERNATIVE MODEL FOR BONUS-MALUS SYSTEM A. G. GULYAN Chair of Actuarial Mathematics

More information

where: T = number of years of cash flow in investment's life n = the year in which the cash flow X n i = IRR = the internal rate of return

where: T = number of years of cash flow in investment's life n = the year in which the cash flow X n i = IRR = the internal rate of return EVALUATING ALTERNATIVE CAPITAL INVESTMENT PROGRAMS By Ke D. Duft, Extesio Ecoomist I the March 98 issue of this publicatio we reviewed the procedure by which a capital ivestmet project was assessed. The

More information

Vladimir N. Burkov, Dmitri A. Novikov MODELS AND METHODS OF MULTIPROJECTS MANAGEMENT

Vladimir N. Burkov, Dmitri A. Novikov MODELS AND METHODS OF MULTIPROJECTS MANAGEMENT Keywords: project maagemet, resource allocatio, etwork plaig Vladimir N Burkov, Dmitri A Novikov MODELS AND METHODS OF MULTIPROJECTS MANAGEMENT The paper deals with the problems of resource allocatio betwee

More information

Learning objectives. Duc K. Nguyen - Corporate Finance 21/10/2014

Learning objectives. Duc K. Nguyen - Corporate Finance 21/10/2014 1 Lecture 3 Time Value of Moey ad Project Valuatio The timelie Three rules of time travels NPV of a stream of cash flows Perpetuities, auities ad other special cases Learig objectives 2 Uderstad the time-value

More information

Example 2 Find the square root of 0. The only square root of 0 is 0 (since 0 is not positive or negative, so those choices don t exist here).

Example 2 Find the square root of 0. The only square root of 0 is 0 (since 0 is not positive or negative, so those choices don t exist here). BEGINNING ALGEBRA Roots ad Radicals (revised summer, 00 Olso) Packet to Supplemet the Curret Textbook - Part Review of Square Roots & Irratioals (This portio ca be ay time before Part ad should mostly

More information

Time Value of Money. First some technical stuff. HP10B II users

Time Value of Money. First some technical stuff. HP10B II users Time Value of Moey Basis for the course Power of compoud iterest $3,600 each year ito a 401(k) pla yields $2,390,000 i 40 years First some techical stuff You will use your fiacial calculator i every sigle

More information

A Review and Comparison of Methods for Detecting Outliers in Univariate Data Sets

A Review and Comparison of Methods for Detecting Outliers in Univariate Data Sets A Review ad Compariso of Methods for Detectig Outliers i Uivariate Data Sets by Sogwo Seo BS, Kyughee Uiversity, Submitted to the Graduate Faculty of Graduate School of Public Health i partial fulfillmet

More information

Project Deliverables. CS 361, Lecture 28. Outline. Project Deliverables. Administrative. Project Comments

Project Deliverables. CS 361, Lecture 28. Outline. Project Deliverables. Administrative. Project Comments Project Deliverables CS 361, Lecture 28 Jared Saia Uiversity of New Mexico Each Group should tur i oe group project cosistig of: About 6-12 pages of text (ca be loger with appedix) 6-12 figures (please

More information

THE TWO-VARIABLE LINEAR REGRESSION MODEL

THE TWO-VARIABLE LINEAR REGRESSION MODEL THE TWO-VARIABLE LINEAR REGRESSION MODEL Herma J. Bieres Pesylvaia State Uiversity April 30, 202. Itroductio Suppose you are a ecoomics or busiess maor i a college close to the beach i the souther part

More information

Chapter XIV: Fundamentals of Probability and Statistics *

Chapter XIV: Fundamentals of Probability and Statistics * Objectives Chapter XIV: Fudametals o Probability ad Statistics * Preset udametal cocepts o probability ad statistics Review measures o cetral tedecy ad dispersio Aalyze methods ad applicatios o descriptive

More information

Confidence intervals and hypothesis tests

Confidence intervals and hypothesis tests Chapter 2 Cofidece itervals ad hypothesis tests This chapter focuses o how to draw coclusios about populatios from sample data. We ll start by lookig at biary data (e.g., pollig), ad lear how to estimate

More information

MATH 083 Final Exam Review

MATH 083 Final Exam Review MATH 08 Fial Eam Review Completig the problems i this review will greatly prepare you for the fial eam Calculator use is ot required, but you are permitted to use a calculator durig the fial eam period

More information

.04. This means $1000 is multiplied by 1.02 five times, once for each of the remaining sixmonth

.04. This means $1000 is multiplied by 1.02 five times, once for each of the remaining sixmonth Questio 1: What is a ordiary auity? Let s look at a ordiary auity that is certai ad simple. By this, we mea a auity over a fixed term whose paymet period matches the iterest coversio period. Additioally,

More information

Swaps: Constant maturity swaps (CMS) and constant maturity. Treasury (CMT) swaps

Swaps: Constant maturity swaps (CMS) and constant maturity. Treasury (CMT) swaps Swaps: Costat maturity swaps (CMS) ad costat maturity reasury (CM) swaps A Costat Maturity Swap (CMS) swap is a swap where oe of the legs pays (respectively receives) a swap rate of a fixed maturity, while

More information

How to read A Mutual Fund shareholder report

How to read A Mutual Fund shareholder report Ivestor BulletI How to read A Mutual Fud shareholder report The SEC s Office of Ivestor Educatio ad Advocacy is issuig this Ivestor Bulleti to educate idividual ivestors about mutual fud shareholder reports.

More information

3. If x and y are real numbers, what is the simplified radical form

3. If x and y are real numbers, what is the simplified radical form lgebra II Practice Test Objective:.a. Which is equivalet to 98 94 4 49?. Which epressio is aother way to write 5 4? 5 5 4 4 4 5 4 5. If ad y are real umbers, what is the simplified radical form of 5 y

More information

Page 1. Real Options for Engineering Systems. What are we up to? Today s agenda. J1: Real Options for Engineering Systems. Richard de Neufville

Page 1. Real Options for Engineering Systems. What are we up to? Today s agenda. J1: Real Options for Engineering Systems. Richard de Neufville Real Optios for Egieerig Systems J: Real Optios for Egieerig Systems By (MIT) Stefa Scholtes (CU) Course website: http://msl.mit.edu/cmi/ardet_2002 Stefa Scholtes Judge Istitute of Maagemet, CU Slide What

More information

Mathematical goals. Starting points. Materials required. Time needed

Mathematical goals. Starting points. Materials required. Time needed Level A1 of challege: C A1 Mathematical goals Startig poits Materials required Time eeded Iterpretig algebraic expressios To help learers to: traslate betwee words, symbols, tables, ad area represetatios

More information

Hypergeometric Distributions

Hypergeometric Distributions 7.4 Hypergeometric Distributios Whe choosig the startig lie-up for a game, a coach obviously has to choose a differet player for each positio. Similarly, whe a uio elects delegates for a covetio or you

More information

Hypothesis testing using complex survey data

Hypothesis testing using complex survey data Hypotesis testig usig complex survey data A Sort Course preseted by Peter Ly, Uiversity of Essex i associatio wit te coferece of te Europea Survey Researc Associatio Prague, 5 Jue 007 1 1. Objective: Simple

More information