Basic Data Analysis Principles. Acknowledgments
- Loraine Ball
- 8 years ago
CEB - Basic Data Analysis Principles
Unit III - Module 6

Basic Data Analysis Principles
What to do once you get the data

"When we reason about quantitative evidence, certain methods for displaying and analyzing data are better than others. Superior methods are more likely to produce truthful, credible, and precise findings. The difference between an excellent analysis and a faulty one can sometimes have momentous consequences."
Edward R. Tufte, "Visual and Statistical Thinking: Displays of Evidence for Making Decisions," Visual Explanations, Graphics Press

Acknowledgments
ICEAA is indebted to TASC, Inc., for the development and maintenance of the Cost Estimating Body of Knowledge (CEBoK).
ICEAA is also indebted to Technomics, Inc., for the independent review and maintenance of CEBoK.
ICEAA is also indebted to the following individuals who have made significant contributions to the development, review, and maintenance of CostPROF and CEBoK Module 6, Basic Data Analysis Principles:
- Lead authors: Megan E. Dameron, Bethia L. Cullis, Maureen L. Tedford
- Senior reviewers: Richard L. Coleman, Jessica R. Summerville, John S. Smuck, Fred K. Blackburn
- Reviewers: Samuel B. Toas, Kevin Cincotta, Matthew J. Pitlyk, Brian A. Welsh
- Managing editor: Peter J. Braxton
Unit Index
- Unit I: Cost Estimating
- Unit II: Cost Analysis Techniques
- Unit III: Analytical Methods
  6. Basic Data Analysis Principles
  7. Learning Curve Analysis
  8. Regression Analysis
  9. Cost and Schedule Risk Analysis
  10. Probability and Statistics
- Unit IV: Specialized Costing
- Unit V: Management Applications

Data Analysis Overview
Key Ideas
- Visual Display of Information
- Central Tendency of Data
- Dispersion (Spread) of Data
- Data accumulation
- Outliers
Analytical Constructs
- Descriptive statistics: mean, median, mode; variance, standard deviation, CV
- Functional forms
Practical Applications
- Making sense of your data
Related Topics
- Parametrics
- Distributions: Normal, Chi, t, F
- Probability and Statistics
Data Analysis Within the Cost Estimating Framework
- Past: understanding your historical data (historical data)
- Present: developing estimating tools (average cost; Mean = $34.19)
- Future: estimating the new system (confidence intervals; Confidence Interval = +/-$.76)
[Figure: three histograms of Monthly Gas Bill, Frequency vs. $]

Data Analysis Outline
- Core Knowledge
  - Types of Data
  - Univariate Data Analysis
  - Scatter Plots: Variables; Axes and Function Types
  - Data Validation: Descriptive Statistics; Outliers; Rules of Thumb
  - Two Cautionary Tales
- Summary
- Resources
- Related and Advanced Topics
Types of Data
- Univariate
- Bivariate
- Multivariate
- Time Series

Types of Data
- Univariate
  - Single variable
  - Use descriptive and inferential statistics
- Bivariate
  - One independent variable and one dependent variable (i.e., y is a function of x)
  - Use descriptive and inferential statistics
- Multivariate
  - Several independent variables and one dependent variable (i.e., y is a function of x1, x2, and x3)
  - Use descriptive and inferential statistics
Tip: Univariate data plus a nominal variable is really bivariate.
[Figures: histogram of Monthly Gas Bill (Frequency vs. $); scatter plot of Cost vs. Weight with a linear trend]
Types of Data - Time Series
- Time is the independent variable
- Interval matters! In Excel, make sure you use an XY (Scatter) chart and not a Line Chart unless the intervals are equally spaced
- Smooth trends are rarely found in time series
  - Possible rare exceptions (e.g., corrosion over time)
  - Standard trends such as investment and inflation
- Look for paradigm shifts, cycles, autocorrelation
- Use moving averages, or divide the data into groups and compare descriptive statistics
- Regression is often not useful, as it only picks up smooth trends unless AR1/ARIMA models are used
- ANOVA and mean comparisons are more useful

Univariate Data Analysis
- Visual Display of Information (What does it look like?): histogram, stem-and-leaf, box plot
- Measures of Central Tendency (What's your best guess?): mean (or median or mode)
- Measures of Variability (How much remains unexplained?): standard deviation (or variance), coefficient of variation (CV)
- Measures of Uncertainty (How precise are you?): confidence interval (CI)
- Statistical Tests (How can you be sure?): t test, chi-square test, Kolmogorov-Smirnov (K-S) test
Tip: This analysis framework is mirrored in bivariate and multivariate analysis.
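The moving-average suggestion for time series can be sketched in a few lines of Python. This is only an illustration, not part of the original module: the function name `moving_average` and the monthly values are hypothetical.

```python
def moving_average(values, window):
    """Return the simple moving averages of `values` over a sliding window."""
    if window < 1 or window > len(values):
        raise ValueError("window must be between 1 and len(values)")
    # One average per full window position, so the result is shorter than the input.
    return [sum(values[i:i + window]) / window
            for i in range(len(values) - window + 1)]


# Hypothetical monthly bills (illustrative only, not the module's gas bill data).
monthly_bills = [10.0, 20.0, 30.0, 40.0, 50.0]
print(moving_average(monthly_bills, 3))  # [20.0, 30.0, 40.0]
```

A longer window smooths more but reacts more slowly to a paradigm shift, so it is worth trying more than one window length.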
Visual Display - Histograms
- Histograms should be used to give an idea of the distribution of the data
- In Excel: Data Analysis Add-In, Histogram
Warning: Results of macros do not update if your data change!
Tip: Create the histogram manually using chart type Column so that results do update when data change!
[Figure: histogram of Monthly Gas Bill (Frequency vs. $); skew-right distribution, possibly Exponential, Triangular, or Lognormal]

Histograms - Bins
- It is important to carefully consider the number of bins used in a histogram
- Experiment with intervals to be sure you understand the data
Warning: Default bins in Excel histograms may not be optimal!
Warning: Histograms can be manipulated!
[Figure: two histograms of Monthly Gas Bill (Frequency vs. $); one allows Excel to choose the bins, the other specifies the bins. Which is clearer? Which sets a trap?]
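As a rough sketch of a histogram with analyst-specified bins, the counts can be computed directly from the data so that they always reflect the current values. The function name `histogram_counts`, the bin edges, and the data are hypothetical. The convention assumed here is that each value is counted in the first bin whose upper edge it does not exceed, with a final "More" bucket.

```python
def histogram_counts(data, upper_edges):
    """Count values per bin; the last count is the 'More' bucket."""
    counts = [0] * (len(upper_edges) + 1)
    for x in data:
        for i, edge in enumerate(upper_edges):
            if x <= edge:          # first bin whose upper edge is not exceeded
                counts[i] += 1
                break
        else:                      # larger than every edge
            counts[-1] += 1
    return counts


# Hypothetical bills and bin edges, for illustration only.
bills = [5, 10, 12, 25, 31, 47, 60]
print(histogram_counts(bills, [10, 20, 30, 40]))  # [2, 1, 1, 1, 2]
```

Re-running the function with different edges is a quick way to experiment with intervals before settling on a display.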
Central Tendency - Mean
- The sample mean of the data set {x_1, x_2, ..., x_n} is calculated as:
  $\bar{x} = \frac{1}{n}\sum_{i=1}^{n} x_i = \frac{x_1 + x_2 + \cdots + x_n}{n}$
- In Excel, use the AVERAGE( ) function
- Means of example data sets: Gas bill (74 months), $6.; Therms used (74 months), 14.
- The mean is the Expected Value of a random variable

Central Tendency - Median
- The sample median is the middle data point, with 50% of the remaining observations falling under that point and 50% above (AKA the 50th percentile)
- If a data set has an odd number of points, the middle value is the median
  - The median of the data set {…, …, 7, 9, …} is 7
- If a data set has an even number of points, the two middle values are averaged
  - The median of the set {3, 6, 8, 11, 13, …} is 9.5 (average of 8 and 11)
- In general, the k-th percentile is the point with k% of the data below and (100-k)% of the data above
  - Quartiles (25, 50, 75), deciles (10, 20, ..., 90), icosatiles (5, 10, 15, ..., 95)
- When there are extreme data points, the median may be more representative than the mean because outliers impact the mean more than the median (the median is robust)
  - "Representative" is a descriptive term, not a mathematical term
  - There are many mathematical reasons to prefer mean over median
[Figure legend: red = extreme points, blue = middle points]
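To illustrate how an extreme point moves the mean more than the median, here is a small sketch using Python's standard `statistics` module. The helper name `center` and both data sets are made up for illustration.

```python
import statistics


def center(data):
    """Return (mean, median) of a data set."""
    return statistics.mean(data), statistics.median(data)


# Hypothetical data: the second set replaces the largest point with an extreme value.
typical = [3, 6, 8, 11, 13, 40]
extreme = [3, 6, 8, 11, 13, 400]

print(center(typical))  # (13.5, 9.5)
print(center(extreme))  # (73.5, 9.5)
```

The median stays at 9.5 (the average of the two middle values, 8 and 11) while the mean jumps from 13.5 to 73.5.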
Mean, Median, and Skew
- The mean and the median are equal if the distribution is symmetric
- Unequal means and medians are an indication of skewness
  - Median = Mean: symmetric
  - Median < Mean: skew(ed) right
  - Median > Mean: skew(ed) left
[Figure: example densities for the Normal, Lognormal, and Beta distributions]

Central Tendency - Mode
- The sample mode is the most frequent point to occur in a data set
- The mode of a distribution is its peak: the value with the greatest probability mass (or density)
- The mode of the set {…, 4, 4, 7, 9, 9, 9} is 9
- The mode is a descriptive metric answering the question "what happens most frequently?"
- It can help give a visual idea of what the distribution looks like
- Most useful in discrete data
[Figure: histogram (Frequency vs. X Value) showing that the value 9 occurs most often; this is the mode]
Variability - Variance / Standard Deviation
- The sample variance measures the deviation of the data points from their mean:
  $s^2 = \frac{\sum_{i=1}^{n}(x_i - \bar{x})^2}{n-1} = \frac{\sum_{i=1}^{n} x_i^2 - n\bar{x}^2}{n-1}$
  - The first form is easy to remember; the second is easy to calculate
  - In Excel, use the VAR( ) function
- The sample standard deviation is simply $s = \sqrt{s^2}$
  - The standard deviation is expressed in the same units as the original data
  - In Excel, use the STDEV( ) function
Tip: Low variance indicates less dispersion, i.e., tighter data
Tip: s is the estimator for the population parameter σ

Variability - Coefficient of Variation
- The Coefficient of Variation (CV) expresses the standard deviation as a percent of the mean: $CV = \frac{s}{\bar{X}}$
- Large CVs indicate that the mean is a poor estimator
  - Consider regression on cost drivers
  - Examine data for multiple populations (outliers)
- CVs of example data sets: Gas bill, 74.4% (69.%); Therms used, 4.% (.%)
Tip: Low CV indicates less dispersion, i.e., tighter data. 15% or less is desired
Note that sums and averages tend to have smaller variances
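The variance, standard deviation, and CV definitions can be checked with a short sketch. It uses Python's `statistics` module, whose `variance` and `stdev` functions use the n - 1 denominator. The helper name `coefficient_of_variation` and the data are hypothetical.

```python
import statistics


def coefficient_of_variation(data):
    """Sample standard deviation expressed as a fraction of the mean."""
    return statistics.stdev(data) / statistics.mean(data)


# Hypothetical data set with mean 5 and sum of squared deviations 32.
data = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]

s2 = statistics.variance(data)   # 32 / (8 - 1)
s = statistics.stdev(data)       # square root of the sample variance
cv = coefficient_of_variation(data)
print(f"variance={s2:.3f}, std dev={s:.3f}, CV={cv:.1%}")
```

Because the CV is unit-less, it lets data sets measured in different units (dollars, therms) be compared for relative dispersion.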
Dispersion and CV
- These two data sets have the same mean, but different standard deviations
[Figure: two histograms (Frequency vs. Bin); one data set has a higher CV and more dispersion, the other has a lower CV and is more tightly distributed]

Confidence Interval Illustration
- A confidence interval (CI) suggests to us that we are (1-α)*100% confident that the true parameter value is contained within the calculated range*:
  $\left(\bar{x} - t_{\alpha/2,\,n-1}\frac{s}{\sqrt{n}},\ \bar{x} + t_{\alpha/2,\,n-1}\frac{s}{\sqrt{n}}\right)$
* Note: this statement provides a general sense of what a confidence interval does for us in concise language, for ease of understanding. The specific statistical interpretation is that if many independent samples are taken where the levels of the predictor variable are the same as in the data set, and a (1-α)*100% confidence interval is constructed for each sample, then (1-α)*100% of the intervals will contain the true value of the parameter.
[Figure: t distribution with area 1 - α between the critical values and area α/2 in each tail]
Sample Sizes - Sufficiently Large
- In general, we prefer n to be large; how large is a function of our tolerance for error
- The 68.3% CI for the mean is roughly ± CV/√n
- So, for CVs ranging around 30%, CV/√n gives the following approximate 68.3% confidence intervals:
  n = 4: +/- 15%
  n = 9: +/- 10%
  n = 16: +/- 7.5%
  n = 25: +/- 6%
  n = 36: +/- 5%
- If we would like to be able to make judgments within about 5% points with a CV of 30%, we need n ≥ 36
- We may have no choice but to deal with small n
- In any case, we can calculate the range of the estimated mean
Tip: 30 is not a magic number of data points

Prediction Intervals
- The previous confidence interval illustration gives the true average cost within a certain range
- If we want to know the predicted cost of a new item within a certain range, we need a prediction interval (PI)
- The PI suggests to us that we are (1-α)*100% confident that the next observation will be contained within the calculated range:
  $\left(\bar{x} - t_{\alpha/2,\,n-1}\, s\sqrt{1 + \frac{1}{n}},\ \bar{x} + t_{\alpha/2,\,n-1}\, s\sqrt{1 + \frac{1}{n}}\right)$
- The larger standard error in the PI accounts for both the uncertainty in the mean (captured by the CI) and the uncertainty in individual observations
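The CV/√n rule of thumb for a roughly 68.3% confidence interval can be turned into a quick sample-size check. This sketch assumes the t multiplier is rounded to 1, as the rule of thumb does. The function names `ci_half_width_pct` and `required_sample_size` are hypothetical, and CVs are passed in percent.

```python
import math


def ci_half_width_pct(cv_pct, n):
    """Approximate 68.3% CI half-width (percent of the mean): CV / sqrt(n)."""
    return cv_pct / math.sqrt(n)


def required_sample_size(cv_pct, target_pct):
    """Smallest n with CV / sqrt(n) <= target, i.e. n >= (CV / target)^2."""
    return math.ceil((cv_pct / target_pct) ** 2)


for n in (4, 9, 16, 25, 36):
    print(n, ci_half_width_pct(30, n))

print(required_sample_size(30, 5))  # 36
```

For a tighter confidence level the multiplier is no longer close to 1, so this shortcut should not be reused unchanged.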
Statistical Tests
- t test for mean
  - Is the Cost Growth Factor (CGF) for NAVAIR programs different than 1.0?
- Chi-square test for variance
  - Is 3% a reasonable CV to use for this variable?
  - Should the t test for equal means assume equal variances?
- Chi-square test for distribution
  - Are Line-Replaceable Unit (LRU) failures uniform across all deployed units?
- Kolmogorov-Smirnov test for distribution
  - Is the normal distribution appropriate for modeling uncertainty in design weight?

Scatter Plots
- Variables
- Axes
- Function Types
Scatter Plots
- A picture is worth a thousand words!
- A scatter plot can reveal a wealth of information about relationships present in the data
- Create scatter plots in Excel by using the Chart Wizard, XY (Scatter)
- Add a trend line in Excel by right-clicking the plotted data and choosing Add Trendline
  - Helps link graph and equation
  - Look at inferential statistics later
Tip: Scatter plots are the single most useful tool in all of analysis; they are the gift of sight to the analyst
[Figure: scatter plot of Light Ship Displacement vs. Year with a linear trend line]

Scatter Plots - Variables
- Plot cost (or other variable of interest, e.g., hours) as the dependent variable
- Look at a variety of different independent variables
  - Technical parameters such as weight, lines of code, etc.
  - Performance parameters such as speed, accuracy, etc.
  - Operational parameters such as crew size, flying hours, etc.
  - Cost of another element
- Think about which variables you believe should drive cost and collect that data!
Scatter Plots - Cost Drivers
- Scatter plots can help identify cost drivers
- R² interpretation: % of variation in y explained (linearly) by variation in x
Warning: R² is just an indicator; consult t and F statistics!
[Figure: three scatter plots of Cost vs. Variable with trend lines: significant correlation (potential cost driver), weak correlation, and uncorrelated]

Scatter Plots - Unit Space
- Data should first be plotted in unit space*
  - x is plotted on the horizontal axis (x-axis) and y is plotted on the vertical axis (y-axis)
- If the data have a non-linear relationship when plotted in unit space, investigate how the data can be made linear
  - Non-linear relationships can often be transformed to appear linear through the use of natural logs
  - Transformed data can then be regressed linearly
- Before the widespread use of computers, nonlinear data was graphed on semi-log or log-log paper
* Unit space refers to the original, untransformed data.
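The R² interpretation for a single candidate cost driver can be sketched as the squared sample correlation between x and y. This is a minimal illustration in plain Python. The function name `r_squared` and the data are hypothetical, and, as the warning says, R² alone does not establish significance.

```python
def r_squared(xs, ys):
    """Squared sample correlation: share of variation in y explained linearly by x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy * sxy / (sxx * syy)


# Hypothetical driver values and costs.
print(r_squared([1, 2, 3, 4], [2, 4, 6, 8]))    # 1.0 (perfectly linear)
print(r_squared([1, 2, 3, 4], [1, -1, -1, 1]))  # 0.0 (no linear relationship)
```

The second data set follows a clear curved pattern yet has R² = 0, which is one reason to look at the scatter plot before trusting the statistic.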
Scatter Plots - Linear Function
- The most common relationships are linear
  - Of the form y = mx + b [m = slope, b = y-intercept]
  - Plotted in unit space
Tip: Linear models are also the best approximations to non-linear models, by which we mean they take you least far afield if you guessed wrong.
[Figure: Cost vs. Weight with a linear trend line]

Scatter Plots - Power Function
- Power functions are of the form $y = ax^b$
- Can be transformed into linear functions
  - Taking the natural log of both sides gives $\ln(y) = \ln(a) + b\,\ln(x)$
  - Plot ln(x) on the horizontal axis and ln(y) on the vertical axis and look for a linear trend
- This transformation is shown graphically on the next slide
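The log-log transformation can be sketched as an ordinary least-squares line fitted to (ln x, ln y), with the slope read back as the exponent b and the intercept as ln(a). The function names `fit_line` and `fit_power` and the weight and cost values are hypothetical. Note that a fit in log space minimizes error in ln(y), which is not the same as minimizing error in unit space.

```python
import math


def fit_line(xs, ys):
    """Ordinary least-squares slope and intercept for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def fit_power(xs, ys):
    """Fit y = a * x**b by regressing ln(y) on ln(x); requires positive data."""
    b, ln_a = fit_line([math.log(x) for x in xs], [math.log(y) for y in ys])
    return math.exp(ln_a), b


# Hypothetical weights and costs generated from cost = 3 * weight**0.5.
weights = [1.0, 2.0, 4.0, 8.0]
costs = [3.0 * w ** 0.5 for w in weights]
a, b = fit_power(weights, costs)
print(round(a, 6), round(b, 6))  # 3.0 0.5
```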
Scatter Plots - Power Function
- This function is most commonly used for learning curves, but can also be used for CERs
- The slope on the log-log graph is the exponent of the power equation
- An alternative is to use Format Axis, Logarithmic scale
Tip: Another virtue of trend lines is that they can act as a Rosetta Stone for the values of a curve fit on transformed variables.
[Figure: Cost vs. Weight with a power trend, and the same data on log-log axes, ln(Cost) vs. ln(Weight), with a linear trend]

Scatter Plots - Exponential Function
- Exponential functions are of the form $y = ae^{bx} = a(e^{b})^{x} = ak^{x}$
- Models of this form can be transformed and made to be linear
  - Taking the natural log (ln) of both sides gives $\ln(y) = \ln(a) + bx$
  - The natural log (ln) is the inverse function of the exponential: $y = e^{x} \Leftrightarrow x = \ln(y)$
Tip: Exponential functions are seldom encountered in cost estimation outside of inflation
Scatter Plots - Exponential Function
- Then, x is plotted on the horizontal axis and ln(y) is plotted on the vertical axis
- This transformation is shown graphically below
- The slope on the semi-log graph is the coefficient of x in the exponential equation
[Figure: Cost vs. Weight with an exponential trend, and the same data on semi-log axes, ln(Cost) vs. Weight, with a linear trend]

Scatter Plots - Constant Terms
- Generalized power and exponential equations are of the form: $y = ax^{b} + c$, $y = ae^{bx} + c$
- Power and exponential models usually assume a constant term of c = 0
- However, c = 0 is more common in theory than in practice
- If c = 0 does not fit the data, consider using a model with c ≠ 0
  - Use the Excel Add-in Solver (or another, more robust optimization tool) to fit a curve to the data, where a, b, c are chosen simultaneously (GERM)
  - Minimize SSE or maximize unit-space R²
Warning: Excel forces power and exponential trendlines to have c = 0!
Reference: "To b or Not to b: The y-intercept in Cost Estimation," R. L. Coleman, J. R. Summerville, P. J. Braxton, B. L. Cullis, E. R. Druker, SCEA, 2007.
Data Validation
- Scatter plotting gives you an idea of the relationships present in the data
- What's next?
  - Look at descriptive statistics
  - Look for outliers
  - Compare to historical studies, industry standards, or rules of thumb

Descriptive Statistics
- Calculate descriptive statistics for each data group
  - Sample size
  - Raw mean
  - Standard deviation
  - Coefficient of variation (CV)
  - Weighted averages (e.g., dollar-weighted)
  - Moving averages (for time series data)
- In Excel, Tools, Data Analysis, Descriptive Statistics will easily calculate the most important descriptive statistics
Warning: Results of macros do not update if your data change!
Tip: Create formulae manually so that results do update when data change!
Descriptive Statistics - Bar Charts
- Bar charts can be used to compare the descriptive statistics for different groups
- Y-error bars can be added to show the standard deviation
Tip: Standard deviations are useful, but prediction intervals would be better, capturing the interaction of quantity and dispersion more succinctly and in an inferentially better way. Be sure to label which they are.
[Figure: bar chart "RDT&E Programs by Company (SAR Programs with EMD only)," $ Wtd DE CGF for Co. 1 through Co. 5, with the sample size n shown for each company]

Bar Charts in Excel
- Bar charts: Excel Chart Wizard, Column Chart
- Y-error bars:
  - Format Data Series, Y-error bars (Excel 2003)
  - Chart Tools, Layout, Analysis, Error Bars (Excel 2007)
- Histogram: Excel Data Analysis Add-In, Histogram
Tip: It is recommended that you create your own dynamic histograms with flexible bin spacing using COUNTIF() and Column Charts.
Outliers
- Outliers are data points that fall far away from the center of the data and are not representative of the population you are trying to model
- For normally distributed data sets, about 95.4% of the data should fall within two standard deviations of the mean
  - So, we'd expect about 4.6% to be outside two standard deviations
  - 99.7% of the data should be within three standard deviations of the mean
- If a data point is more than three standard deviations from the mean, it is a potential outlier
Tip: The normal distribution is a good first approximation, but if your data are significantly skewed, these rules of thumb should not be used to identify potential outliers.

Outliers and Trend Lines
- Outliers may bias the regression line
- Without the possible outlier, the slope of the regression line is steeper and the R² is higher
Tip: If using two graphs, do not change scale of axes when comparing!
[Figure: Cost vs. Weight, with trend lines for all data and with the potential outlier removed; the possible outlier is 4.4 standard deviations from the mean]
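The three-standard-deviation screen can be sketched as below. It only flags points for investigation, it does not justify removing them, and it assumes roughly normal data. The function name `flag_potential_outliers` and the values are hypothetical.

```python
import statistics


def flag_potential_outliers(data, k=3.0):
    """Return points more than k sample standard deviations from the mean."""
    mean = statistics.mean(data)
    s = statistics.stdev(data)
    return [x for x in data if abs(x - mean) > k * s]


# Hypothetical costs: nineteen values near 10 and one value of 50.
costs = [10, 11, 9, 10, 12, 8, 10, 11, 9, 10,
         10, 11, 9, 10, 12, 8, 10, 11, 9, 50]
print(flag_potential_outliers(costs))  # [50]
```

Note that the suspect point itself inflates both the mean and the standard deviation, so in very small samples no point can ever exceed three standard deviations.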
Removing Outliers
- Do not remove an outlier from the data without a good reason!
  - Doing so removes some of the variation present in history
  - Doing so can be a form of "cooking" the data
- Good reasons for removing an outlier:
  - Program was restructured or divided
  - "One of these is not like the others," e.g., a helo in a set of missile data
- Bad reasons for removing an outlier:
  - "Too high"
  - "Standard deviations away from the mean" [!]
Tip: Outlier treatment separates the analysts from the spin meisters

Rules of Thumb
- Compare your descriptive statistics to historical rules of thumb
  - NCCA Standard Factors handbook, for example
- Sanity check!
Tip: Comparison to history and cross checks separates the thorough from the sloppy
Two Cautionary Tales
- Expert's Eyeball: Descriptive Statistics and Visual Displays
- Technical Hunch: Outliers

Engineering Judgments
- Suppose we are given an estimate that has "engineering judgment" as its basis
- Engineering judgments should never be accepted without validation!
- The analyst must find out if the guess is correct, or at least in the ballpark
- Experts often possess insight or intuition regarding systems that bears on cost, but it is the analyst's job to make the estimate explicit and reproducible
Example: Expert's Eyeball
Follow Ship Support (Percent of First Ship)
  Hull   FF      DDG 37   Average
  2      7.1%    4.%      .9%
  3      9.4%    .1%      7.%
  4      9.%     4.3%     6.7%
- Is the average a good idea?
- Is the …th ship guess right?

Example: Expert's Eyeball
- The average is a good number!
- When the average line is extrapolated, it looks like the …th ship should be about 6%
- The …th ship guess of 4% looks too low!
Tip: Graphic ensures cost estimate credibility!
[Figure: "Decrease in Follow-Ship Support," Percent of First Ship vs. Hull for FF, DDG 37, and Average]
Example: Technical Hunch
- In this real-life example, we will look at the importance of correctly investigating outliers
- Scatter plots can be extremely useful in identifying potential outliers

Example: Technical Hunch
[Table: Shakedown Hours/Ton by Hull for a series of DD hulls]
- The expert's hunch: DD 963 is too low for a first ship
Wrong Outlier Rejected!
- Instead of DD 963, look into DD 9: that's the potential outlier!
- This line produces a more reasonable …th ship estimate
- The expert's curve is unrealistic at the …th ship!
[Figure: Hours per Ton vs. Hull]

Data Analysis Summary
Steps of basic data analysis:
1. Scatter plot: visual depiction of the relationships in the data
2. Descriptive statistics: calculate the means and CVs
   - If the CV is under 15%, the average may be a sufficient predictor; focus more attention on elements with higher CVs
   - If the CV is over 15%, focus on this element, using regression analysis to look for a better predictor than the average (CER development)
3. Look for outliers (data quality check)
4. Compare to history
Resources
- An Introduction to Mathematical Statistics and Its Applications, 3rd ed., Richard J. Larsen and Morris L. Marx, Prentice Hall
- Probability and Statistics for Engineering and the Sciences, 5th ed., Jay L. Devore, Brooks/Cole Publishing, 1999
- Calculus: Single Variable, Deborah Hughes-Hallett and Andrew Gleason, John Wiley & Sons, 199.
- How to Lie with Statistics, Darrell Huff, W.W. Norton & Company, 1954
- The Visual Display of Quantitative Information, Edward R. Tufte, Graphics Press, 1983
- Envisioning Information, Edward R. Tufte, Graphics Press, 1990
- Visual Explanations, Edward R. Tufte, Graphics Press, 1997
- Beautiful Evidence, Edward R. Tufte, Graphics Press, 2006

Related and Advanced Topics
- Visual Display of Information
- Additional Graph Types for Univariate
  - Stem-and-Leaf
  - Boxplots
- Bin Width and Number Rules
- Mean - Mental Math Trick
- Sample Sizes
  - Confidence Intervals
  - CI Simplified
  - Sufficiently Large
  - Rules of Thumb
- Outlier Identification Rules
Visual Display of Information
- Poor visual displays of information hinder understanding
- Excel's default scatter plot is not a one-size-fits-all information display
- Quick fixes ensure a graph can truly give the gift of sight
  - Use evocative colors to your advantage
  - Size matters: make sure the graph fills the space; the data is the main event!
  - Check the scale
  - Choose a font size
  - Check the placement of the legend
- Two possible displays follow

Visual Display of Information - Excel Default
[Figure: "Visual Display Example," Excel default scatter plot of Hours vs. Unit for Series1 through Series4]
Visual Display of Information - Another Display
[Figure: "Visual Display Example," Hours (Thousands) vs. Unit for Series1 through Series4]

Stem-and-Leaf Plots
- Similar to a histogram
- Horizontal numbers instead of vertical bars
- Example: Therms of natural gas used
[Figure: stem-and-leaf plot of therms used; Mode = 4 therms]
Box Plots
[Figure: annotated box plot]
- The box runs from the Lower Fourth to the Upper Fourth, with the Median marked inside
- f_s = Upper Fourth - Lower Fourth = Interquartile Distance
- Lower whisker: lowest data point within 1.5 f_s of the Lower Fourth
- Upper whisker: highest data point within 1.5 f_s of the Upper Fourth
- Data point between 1.5 f_s and 3 f_s from the Upper Fourth
- Potential Outlier: data point more than 3 f_s from the Upper Fourth

Box Plots - Application
- Box plots can be used to:
  - Show the center, spread, and symmetry of the data
  - Identify outliers
- A sample box plot is shown on the previous slide, and a real-world one below
[Figure: box plot of Monthly Gas Bill ($); Median Bill = $16.]
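The box plot fences can be sketched as follows. The exact definition of the fourths is an assumption here: the lower and upper fourths are taken as the medians of the lower and upper halves of the sorted data, with the middle point included in both halves when n is odd. Other quartile conventions give slightly different fences. The function names and the data are hypothetical.

```python
import statistics


def fourths(data):
    """Lower and upper fourths: medians of the lower and upper halves of the data."""
    xs = sorted(data)
    half = (len(xs) + 1) // 2        # the middle point is shared when n is odd
    return statistics.median(xs[:half]), statistics.median(xs[len(xs) - half:])


def box_plot_flags(data):
    """Split points beyond the fences into (1.5 to 3 f_s) and (more than 3 f_s)."""
    lower, upper = fourths(data)
    fs = upper - lower               # fourth spread (interquartile distance)
    beyond = [(x, max(lower - x, x - upper)) for x in data]
    mild = [x for x, d in beyond if 1.5 * fs < d <= 3 * fs]
    extreme = [x for x, d in beyond if d > 3 * fs]
    return mild, extreme


# Hypothetical data: fourths are 3.5 and 8.5, so f_s = 5.
values = [1, 2, 3, 4, 5, 6, 7, 8, 9, 20, 40]
print(box_plot_flags(values))  # ([20], [40])
```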
Bin Widths and Number Rules
Various rules for Bin Width (h) or Number of Bins (k) based on Number of Data Points (n), Sample Standard Deviation (s), and Interquartile Range (IQR):
- Square Root Rule
  - Number of bins: $k = \sqrt{n}$; bin width: $h = \frac{\max x_i - \min x_i}{k}$
  - Comments: used in the Excel Data Analysis Histogram tool
- Sturges' Rule
  - Number of bins: $k = 1 + \log_2 n \approx 1 + 3.3\log_{10} n$; bin width: $h = \frac{\max x_i - \min x_i}{k}$
  - Assumptions: Normal distribution
  - Comments: used by DAU
- Scott's Rule
  - Bin width: $h = \frac{3.5\,s}{\sqrt[3]{n}}$; number of bins: $k = \frac{\max x_i - \min x_i}{h}$
  - Assumptions: Normal distribution
  - Comments: reasonable default if data not too skewed
- Freedman-Diaconis Rule
  - Bin width: $h = \frac{2\,IQR}{\sqrt[3]{n}}$; number of bins: $k = \frac{\max x_i - \min x_i}{h}$
  - Comments: modifies Scott's Rule by focusing on IQR instead of s

Mean - Mental Math Trick
- The mean can also be written as an arbitrary number X* plus the average of the deviations from that number:
  $\bar{X} = \frac{1}{n}\sum_{i=1}^{n} X_i = X^{*} + \frac{1}{n}\sum_{i=1}^{n}\left(X_i - X^{*}\right)$
- Monthly average therms used data: {37, 6, 13, 3, 3, 3, 3, 3, 4, 7, 1, 4}
- Average = + ( )/1 = + 43/1 = 13.6
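The four bin rules can be compared side by side with a short sketch. The formulas are the commonly published forms of each rule, which is an assumption since the slide's own table is not legible. The IQR comes from `statistics.quantiles` with its default method, which may differ slightly from Excel's quartiles. Function names and data are hypothetical.

```python
import math
import statistics


def square_root_bins(n):
    """Square Root Rule: k = sqrt(n), rounded up."""
    return math.ceil(math.sqrt(n))


def sturges_bins(n):
    """Sturges' Rule: k = 1 + log2(n), rounded up."""
    return math.ceil(1 + math.log2(n))


def scott_width(data):
    """Scott's Rule: h = 3.5 * s / n^(1/3)."""
    return 3.5 * statistics.stdev(data) / len(data) ** (1 / 3)


def freedman_diaconis_width(data):
    """Freedman-Diaconis Rule: h = 2 * IQR / n^(1/3)."""
    q1, _, q3 = statistics.quantiles(data, n=4)
    return 2 * (q3 - q1) / len(data) ** (1 / 3)


# Hypothetical data set of 16 evenly spaced points.
data = list(range(1, 17))
print(square_root_bins(len(data)), sturges_bins(len(data)))  # 4 5
print(scott_width(data), freedman_diaconis_width(data))
```

Trying more than one rule, then adjusting by eye, is in keeping with the advice to experiment with intervals.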
Sample Sizes - Confidence Interval
- How big a sample size do we need so that a 68.3% Confidence Interval (one standard deviation) about the estimate is +/-5% of the estimate?
  - i.e., there is 68.3% probability that the population mean is within 5% of our estimated mean
- Consider the confidence interval for the mean of a normal distribution:
  $\left(\bar{x} - t_{\alpha/2,\,n-1}\frac{s}{\sqrt{n}},\ \bar{x} + t_{\alpha/2,\,n-1}\frac{s}{\sqrt{n}}\right)$
- Note that the size of the range around the estimate of the mean is a function of:
  - the variability, captured by the standard deviation, s, or the coefficient of variation, CV
  - the sample size, n
Note: we are assuming a normal distribution for simplicity

Sample Sizes - CI Simplified
- Instead of working with standard deviations, we would like to shift to CVs
  - CVs are unit-less and more intuitive (expressed in percents)
- So, divide the range by $\bar{x}$:
  $\left(1 - t_{\alpha/2,\,n-1}\frac{CV}{\sqrt{n}},\ 1 + t_{\alpha/2,\,n-1}\frac{CV}{\sqrt{n}}\right)$, where $CV = \frac{s}{\bar{x}}$
- This shifts the range into percents. The range is relative to 100% of the estimate: $\pm\, t_{\alpha/2,\,n-1}\frac{CV}{\sqrt{n}}$
Sample Sizes - Sufficiently Large
- What sample size is needed for judgments within 5%?
- For a 68.3% two-tailed CI, we have α = 100% - 68.3% = 31.7% and thus α/2 = 15.9%
- Suppose we have a CV of 30%; the half-width of the interval is $\pm\, t_{.159,\,n-1}\frac{CV}{\sqrt{n}}$
[Table: n = 4, 9, 16, 25, 36 with CV = 30%, the t multiplier for each n, and the resulting +/- half-width; for n = 25 the half-width is 6%]
- We would like to be able to make judgments within about 5% points, so with a CV of 30%, we need n ≥ 36
Note: for a 95% CI we would use α = 0.05. The t multipliers would be larger, about 2.03 at n = 36.

Sample Sizes - Rule of Thumb
- For an easy rule of thumb, we can just round the t value to t = 1
- Then, we use simply $\frac{CV}{\sqrt{n}}$
- With a CV of 30%, the thumb rule gives +/- 15%, 10%, 7.5%, 6%, and 5% for n = 4, 9, 16, 25, and 36
[Table: the same rows as above, comparing the exact half-width with the thumb rule; for n = 25 both are 6%]
Tip: For a 68.3% CI, use CV/√n. For a 95% CI, use 2CV/√n.
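Outlier Identification Rules
- Chauvenet's Criterion
  - Outlier(s) iff: $\frac{|x - \bar{x}|}{s}$ is so large that the normal probability of a deviation at least that large is less than $\frac{1}{2n}$
  - Rationale: Normal distribution properties
- Grubbs' Test
  - Outlier(s) iff: $G > \frac{n-1}{\sqrt{n}}\sqrt{\frac{t^2}{n - 2 + t^2}}$, where $G = \frac{\max_i |x_i - \bar{x}|}{s}$ and $t = t_{\alpha/(2n),\,n-2}$
  - Rationale: Normal distribution properties
- Dixon's Q Test
  - Outlier(s) iff: Gap/Range > (critical value from table), where Gap = distance between the outlier and its closest neighbor
  - Rationale: unclear. Will not detect two approximately equal outliers.
- IQR-Based
  - Outlier(s) iff: x is not in the interval $\left[Q_1 - k(Q_3 - Q_1),\ Q_3 + k(Q_3 - Q_1)\right]$
  - Rationale: can customize k based on choice of distribution, α, and n. For example, in a normal distribution, k = 3 implies that only a very small percentage of points should fall outside the range.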
GCSE STATISTICS. 4) How to calculate the range: The difference between the biggest number and the smallest number.
GCSE STATISTICS You should kow: 1) How to draw a frequecy diagram: e.g. NUMBER TALLY FREQUENCY 1 3 5 ) How to draw a bar chart, a pictogram, ad a pie chart. 3) How to use averages: a) Mea - add up all
More informationMeasures of Spread and Boxplots Discrete Math, Section 9.4
Measures of Spread ad Boxplots Discrete Math, Sectio 9.4 We start with a example: Example 1: Comparig Mea ad Media Compute the mea ad media of each data set: S 1 = {4, 6, 8, 10, 1, 14, 16} S = {4, 7, 9,
More informationCase Study. Normal and t Distributions. Density Plot. Normal Distributions
Case Study Normal ad t Distributios Bret Halo ad Bret Larget Departmet of Statistics Uiversity of Wiscosi Madiso October 11 13, 2011 Case Study Body temperature varies withi idividuals over time (it ca
More informationPSYCHOLOGICAL STATISTICS
UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION B Sc. Cousellig Psychology (0 Adm.) IV SEMESTER COMPLEMENTARY COURSE PSYCHOLOGICAL STATISTICS QUESTION BANK. Iferetial statistics is the brach of statistics
More informationNon-life insurance mathematics. Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring
No-life isurace mathematics Nils F. Haavardsso, Uiversity of Oslo ad DNB Skadeforsikrig Mai issues so far Why does isurace work? How is risk premium defied ad why is it importat? How ca claim frequecy
More informationI. Chi-squared Distributions
1 M 358K Supplemet to Chapter 23: CHI-SQUARED DISTRIBUTIONS, T-DISTRIBUTIONS, AND DEGREES OF FREEDOM To uderstad t-distributios, we first eed to look at aother family of distributios, the chi-squared distributios.
More information5: Introduction to Estimation
5: Itroductio to Estimatio Cotets Acroyms ad symbols... 1 Statistical iferece... Estimatig µ with cofidece... 3 Samplig distributio of the mea... 3 Cofidece Iterval for μ whe σ is kow before had... 4 Sample
More informationConfidence Intervals for One Mean
Chapter 420 Cofidece Itervals for Oe Mea Itroductio This routie calculates the sample size ecessary to achieve a specified distace from the mea to the cofidece limit(s) at a stated cofidece level for a
More informationOutput Analysis (2, Chapters 10 &11 Law)
B. Maddah ENMG 6 Simulatio 05/0/07 Output Aalysis (, Chapters 10 &11 Law) Comparig alterative system cofiguratio Sice the output of a simulatio is radom, the comparig differet systems via simulatio should
More informationHypothesis testing. Null and alternative hypotheses
Hypothesis testig Aother importat use of samplig distributios is to test hypotheses about populatio parameters, e.g. mea, proportio, regressio coefficiets, etc. For example, it is possible to stipulate
More informationNow here is the important step
LINEST i Excel The Excel spreadsheet fuctio "liest" is a complete liear least squares curve fittig routie that produces ucertaity estimates for the fit values. There are two ways to access the "liest"
More informationExploratory Data Analysis
1 Exploratory Data Aalysis Exploratory data aalysis is ofte the rst step i a statistical aalysis, for it helps uderstadig the mai features of the particular sample that a aalyst is usig. Itelliget descriptios
More information1 Correlation and Regression Analysis
1 Correlatio ad Regressio Aalysis I this sectio we will be ivestigatig the relatioship betwee two cotiuous variable, such as height ad weight, the cocetratio of a ijected drug ad heart rate, or the cosumptio
More informationCenter, Spread, and Shape in Inference: Claims, Caveats, and Insights
Ceter, Spread, ad Shape i Iferece: Claims, Caveats, ad Isights Dr. Nacy Pfeig (Uiversity of Pittsburgh) AMATYC November 2008 Prelimiary Activities 1. I would like to produce a iterval estimate for the
More informationZ-TEST / Z-STATISTIC: used to test hypotheses about. µ when the population standard deviation is unknown
Z-TEST / Z-STATISTIC: used to test hypotheses about µ whe the populatio stadard deviatio is kow ad populatio distributio is ormal or sample size is large T-TEST / T-STATISTIC: used to test hypotheses about
More informationChapter 7: Confidence Interval and Sample Size
Chapter 7: Cofidece Iterval ad Sample Size Learig Objectives Upo successful completio of Chapter 7, you will be able to: Fid the cofidece iterval for the mea, proportio, ad variace. Determie the miimum
More informationBiology 171L Environment and Ecology Lab Lab 2: Descriptive Statistics, Presenting Data and Graphing Relationships
Biology 171L Eviromet ad Ecology Lab Lab : Descriptive Statistics, Presetig Data ad Graphig Relatioships Itroductio Log lists of data are ofte ot very useful for idetifyig geeral treds i the data or the
More informationDetermining the sample size
Determiig the sample size Oe of the most commo questios ay statisticia gets asked is How large a sample size do I eed? Researchers are ofte surprised to fid out that the aswer depeds o a umber of factors
Analyzing Longitudinal Data from Complex Surveys Using SUDAAN
Analyzing Longitudinal Data from Complex Surveys Using SUDAAN Darryl Creel Statistics and Epidemiology, RTI International, 312 Trotter Farm Drive, Rockville, MD, 20850 Abstract SUDAAN: Software for the Statistical
Lesson 17 Pearson's Correlation Coefficient
Outline Measures of Relationships Pearson's Correlation Coefficient (r) -types of data -scatter plots -measure of direction -measure of strength Computation -covariation of X and Y -unique variation in X and Y -measuring
Maximum Likelihood Estimators.
Lecture 2 Maximum Likelihood Estimators. Matlab example. As a motivation, let us look at one Matlab example. Let us generate a random sample of size 00 from beta distribution Beta(5, 2). We will learn the definition
THE REGRESSION MODEL IN MATRIX FORM
We will consider the linear regression model in matrix form. For simple linear regression, meaning one predictor, the model is $Y_i = \beta_0 + \beta_1 x_i + \varepsilon_i$ for $i = 1, 2, 3, \ldots, n$. This model includes the assumption that the $\varepsilon_i$'s are a sample
CHAPTER 7: Central Limit Theorem: CLT for Averages (Means)
CHAPTER 7: Central Limit Theorem: CLT for Averages (Means) X = the number obtained when rolling one six sided die once. If we roll a six sided die once, the mean of the probability distribution is X P(X = x) Simulation:
Chapter 7 Methods of Finding Estimators
Chapter 7 for BST 695: Special Topics in Statistical Theory. Kui Zhang, 011 Chapter 7 Methods of Finding Estimators Section 7.1 Introduction Definition 7.1.1 A point estimator is any function $W(\mathbf{X}) = W(X_1, X_2, \ldots, X_n)$ of
Overview. Learning Objectives. Point Estimate. Estimation. Estimating the Value of a Parameter Using Confidence Intervals
Overview Estimating the Value of a Parameter Using Confidence Intervals We apply the results about the sample mean to the problem of estimation Estimation is the process of using sample data to estimate the value of
1 Computing the Standard Deviation of Sample Means
Computing the Standard Deviation of Sample Means Quality control charts are based on sample means not on individual values within a sample. A sample is a group of items, which are considered all together for our analysis.
Approximating Area under a curve with rectangles. To find the area under a curve we approximate the area using rectangles and then use limits to find
Approximating Area under a curve with rectangles To find the area under a curve we approximate the area using rectangles and then use limits to find the area. Example 1 Suppose we want to estimate
1. C. The formula for the confidence interval for a population mean is: $\bar{x} \pm t \, s/\sqrt{n}$, which was
1. C. The formula for the confidence interval for a population mean is: $\bar{x} \pm t \, s/\sqrt{n}$, which was based on the sample mean. So, $\bar{x}$ is guaranteed to be in the interval you form. 2. D. Use the rule: p-value
Normal Distribution.
Normal Distribution www.icrf.nl Normal distribution In probability theory, the normal or Gaussian distribution, is a continuous probability distribution that is often used as a first approximation to describe real-valued
University of California, Los Angeles Department of Statistics. Distributions related to the normal distribution
University of California, Los Angeles Department of Statistics Statistics 100B Instructor: Nicolas Christou Three important distributions: Distributions related to the normal distribution Chi-square ($\chi^2$) distribution.
Descriptive Statistics
Descriptive Statistics We learned to describe data sets graphically. We can also describe a data set numerically. Measures of Location Definition The sample mean is the arithmetic average of values. We denote
Data Analysis and Statistical Behaviors of Stock Market Fluctuations
44 JOURNAL OF COMPUTERS, VOL. 3, NO. 0, OCTOBER 2008 Data Analysis and Statistical Behaviors of Stock Market Fluctuations Jun Wang Department of Mathematics, Beijing Jiaotong University, Beijing 00044, China Email:
Chapter 6: Variance, the law of large numbers and the Monte-Carlo method
Chapter 6: Variance, the law of large numbers and the Monte-Carlo method Expected value, variance, and Chebyshev inequality. If X is a random variable recall that the expected value of X, E[X] is the average value
LECTURE 13: Cross-validation
LECTURE 13: Cross-validation Resampling methods Cross Validation Bootstrap Bias and variance estimation with the Bootstrap Three-way data partitioning Introduction to Pattern Analysis Ricardo Gutierrez-Osuna Texas A&M
MEI Structured Mathematics. Module Summary Sheets. Statistics 2 (Version B: reference to new book)
MEI Mathematics in Education and Industry MEI Structured Mathematics Module Summary Sheets Statistics 2 (Version B: reference to new book) Topic 1: The Poisson Distribution Topic 2: The Normal Distribution Topic 3:
Chapter 7 - Sampling Distributions. 1 Introduction. What is statistics? It consists of three major areas:
Chapter 7 - Sampling Distributions 1 Introduction What is statistics? It consists of three major areas: Data Collection: sampling plans and experimental designs Descriptive Statistics: numerical and graphical summaries
Infinite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008
Infinite Sequences Dr. Philippe B. Laval Kennesaw State University October 9, 2008 Abstract This hand out is an introduction to infinite sequences. It gives the main definitions and presents some elementary results. Infinite Sequences
Properties of MLE: consistency, asymptotic normality. Fisher information.
Lecture 3 Properties of MLE: consistency, asymptotic normality. Fisher information. In this section we will try to understand why MLEs are good. Let us recall two facts from probability that will be used often throughout
INVESTMENT PERFORMANCE COUNCIL (IPC) Guidance Statement on Calculation Methodology
Adoption Date: 4 March 2004 Effective Date: 1 June 2004 Retroactive Application: No Public Comment Period: Aug-Nov 2002 INVESTMENT PERFORMANCE COUNCIL (IPC) Preface Guidance Statement on Calculation Methodology
The following example will help us understand The Sampling Distribution of the Mean. C1 C2 C3 C4 C5 50 miles 84 miles 38 miles 120 miles 48 miles
The following example will help us understand The Sampling Distribution of the Mean Review: The population is the entire collection of all individuals or objects of interest The sample is the portion of the population
a Wishart distribution with n - 1 degrees of freedom and scale matrix.
UMEÅ UNIVERSITY Department of Mathematical Statistics Multivariate Data Analysis D MSTD79 PA EXAM 004-0-9 SUGGESTED SOLUTIONS TO THE EXAM IN MATHEMATICAL STATISTICS Multivariate Data Analysis D, 5 credits.. Assume that
Institute of Actuaries of India Subject CT1 Financial Mathematics
Institute of Actuaries of India Subject CT1 Financial Mathematics For 2014 Examinations Subject CT1 Financial Mathematics Core Technical Aim The aim of the Financial Mathematics subject is to provide a grounding in
Section 13 Kolmogorov-Smirnov test
Section 13 Kolmogorov-Smirnov test. Suppose that we have an i.i.d. sample $X_1, \ldots, X_n$ with some unknown distribution $P$ and we would like to test the hypothesis that $P$ is equal to a particular distribution $P_0$, i.e.
Inference on Proportion. Chapter 8 Tests of Statistical Hypotheses. Sampling Distribution of Sample Proportion. Confidence Interval
Chapter 8 Tests of Statistical Hypotheses 8. Tests about Proportions HT - Inference on Proportion Parameter: Population Proportion p (or π) (Percentage of people who have no health insurance) x Statistic: Sample Proportion
Lesson 15 ANOVA (analysis of variance)
Outline Variability -between group variability -within group variability -total variability -F-ratio Computation -sums of squares (between/within/total) -degrees of freedom (between/within/total) -mean square (between/within
Quadrat Sampling in Population Ecology
Quadrat Sampling in Population Ecology Background Estimating the abundance of organisms. Ecology is often referred to as the "study of distribution and abundance". This being true, we would often like to know how many
Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the.
Confidence Intervals A confidence interval is an interval whose purpose is to estimate a parameter (a number that could, in theory, be calculated from the population, if measurements were available for the whole population).
CHAPTER 3 THE TIME VALUE OF MONEY
CHAPTER 3 THE TIME VALUE OF MONEY OVERVIEW A dollar in the hand today is worth more than a dollar to be received in the future because, if you had it now, you could invest that dollar and earn interest. Of all
hp calculators HP 12C Statistics - average and standard deviation Average and standard deviation concepts HP12C average and standard deviation
HP 12C Statistics - average and standard deviation Average and standard deviation concepts HP12C average and standard deviation Practice calculating averages and standard deviations with one or two variables HP 12C Statistics
One-sample test of proportions
One-sample test of proportions The Setting: Individuals in some population can be classified into one of two categories. You want to make inference about the proportion in each category, so you draw a sample. Examples:
Research Method (I) --Knowledge on Sampling (Simple Random Sampling)
Research Method (I) --Knowledge on Sampling (Simple Random Sampling) 1. Introduction to sampling 1.1 Definition of sampling Sampling can be defined as selecting part of the elements in a population. It results in the fact
7. Concepts in Probability, Statistics and Stochastic Modelling
7. Concepts in Probability, Statistics and Stochastic Modelling 1. Introduction 2. Probability Concepts and Methods 2.1. Random Variables and Distributions 2.2. Expectation 2.3. Quantiles, Moments and Their Estimators
SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES
SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES Read Section 1.5 (pages 5 9) Overview In Section 1.5 we learn to work with summation notation and formulas. We will also introduce a brief overview of sequences,
15.075 Exam 3. Instructor: Cynthia Rudin TA: Dimitrios Bisias. November 22, 2011
15.075 Exam 3 Instructor: Cynthia Rudin TA: Dimitrios Bisias November 22, 2011 Grading is based on demonstration of conceptual understanding, so you need to show all of your work. Problem 1 A company makes high-definition
INVESTMENT PERFORMANCE COUNCIL (IPC)
INVESTMENT PERFORMANCE COUNCIL (IPC) INVITATION TO COMMENT: Global Investment Performance Standards (GIPS) Guidance Statement on Calculation Methodology The Association for Investment Management and Research (AIMR) seeks
Solving Recurrence Relations
Solving Recurrence Relations Part 1. Homogeneous linear 2nd degree relations with constant coefficients. Consider the recurrence relation $T(n) + aT(n-1) + bT(n-2) = 0$. This is called a homogeneous linear 2nd degree
Systems Design Project: Indoor Location of Wireless Devices
Systems Design Project: Indoor Location of Wireless Devices Prepared By: Brian Murphy Senior Systems Science and Engineering Washington University in St. Louis Phone: (805) 698-5295 Email: bcm1@cec.wustl.edu Supervised
Modified Line Search Method for Global Optimization
Modified Line Search Method for Global Optimization Crina Grosan and Ajith Abraham Center of Excellence for Quantifiable Quality of Service Norwegian University of Science and Technology Trondheim, Norway {crina, ajith}@q2s.ntnu.no
Trading the randomness
Trading the randomness - Designing an optimal trading strategy under a drifted random walk price model Yuao Wu Math 20 Project Paper Professor Zachary Hamaker Abstract: In this paper the author intends to explore
Mann-Whitney U 2 Sample Test (a.k.a. Wilcoxon Rank Sum Test)
Non-Parametric Univariate Statistics: Wilcoxon-Mann-Whitney U 2 Sample Test 1 Mann-Whitney U 2 Sample Test (a.k.a. Wilcoxon Rank Sum Test) The (Wilcoxon-) Mann-Whitney (WMW) test is the non-parametric equivalent of a pooled
Subject CT5 Contingencies Core Technical Syllabus
Subject CT5 Contingencies Core Technical Syllabus for the 2015 exams 1 June 2014 Aim The aim of the Contingencies subject is to provide a grounding in the mathematical techniques which can be used to model and value
COMPARISON OF THE EFFICIENCY OF S-CONTROL CHART AND EWMA-S² CONTROL CHART FOR THE CHANGES IN A PROCESS
COMPARISON OF THE EFFICIENCY OF S-CONTROL CHART AND EWMA-S² CONTROL CHART FOR THE CHANGES IN A PROCESS Supranee Lisawadi Department of Mathematics and Statistics, Faculty of Science and Technology, Thammasat
Bio-Plex Manager Software
Multiplex Suspension Array Bio-Plex Manager Software Extract Knowledge Faster Move Your Research Forward Bio-Rad continues to innovate where it matters most. With Bio-Plex Manager 5.0 software, we offer valuable
Forecasting. Forecasting Application. Practical Forecasting. Chapter 7 OVERVIEW KEY CONCEPTS. Chapter 7. Chapter 7
Forecasting Chapter 7 Chapter 7 OVERVIEW Forecasting Applications Qualitative Analysis Trend Analysis and Projection Business Cycle Exponential Smoothing Econometric Forecasting Judging Forecast Reliability Choosing the
NATIONAL SENIOR CERTIFICATE GRADE 11
NATIONAL SENIOR CERTIFICATE GRADE 11 MATHEMATICS P EXEMPLAR 007 MARKS: 50 TIME: 3 hours This question paper consists of pages, 4 diagram sheets and a -page formula sheet. Please turn over Mathematics/P DoE/Exemplar
A probabilistic proof of a binomial identity
A probabilistic proof of a binomial identity Jonathon Peterson Abstract We give an elementary probabilistic proof of a binomial identity. The proof is obtained by computing the probability of a certain event in two
UM USER SATISFACTION SURVEY 2011. Final Report. September 2, 2011. Prepared by. ers e-research & Solutions (Macau)
UM USER SATISFACTION SURVEY 2011 Final Report September 2, 2011 Prepared by ers e-research & Solutions (Macau) 1 UM User Satisfaction Survey 2011 A Collaboration Work by Project Consultant Dr. Angus Cheong ers
Chapter 5 Unit 1. IET 350 Engineering Economics. Learning Objectives Chapter 5. Learning Objectives Unit 1. Annual Amount and Gradient Functions
Chapter 5 Unit 1 Annual Amount and Gradient Functions IET 350 Engineering Economics Learning Objectives Chapter 5 Upon completion of this chapter you should understand: Calculating future values from annual amounts. Calculating
Trigonometric Form of a Complex Number. The Complex Plane.
Chapter 6 Additional Topics in Trigonometry 6.5 Trigonometric Form of a Complex Number What you should learn Plot complex numbers in the complex plane and find absolute values
Overview of some probability distributions.
Lecture Overview of some probability distributions. In this lecture we will review several common distributions that will be used often throughout the class. Each distribution is usually described by its probability
Definition. A variable X that takes on values X 1, X 2, X 3,...X k with respective frequencies f 1, f 2, f 3,...f k has mean
1 Social Studies 201 October 13, 2004 Note: The examples in these notes may be different than used in class. However, the examples are similar and the methods used are identical to what was presented in class.
*The most important feature of MRP as compared with ordinary inventory control analysis is its time phasing feature.
Integrated Production and Inventory Control System MRP and MRP II Framework of Manufacturing System Inventory control, production scheduling, capacity planning and financial and business decisions in a production system are interrelated.
A Balanced Scorecard
A Balanced Scorecard with VISION A Vision International White Paper Vision International A/S Aarhusgade 88, DK-2100 Copenhagen, Denmark Phone +45 35430086 Fax +45 35434646 www.balanced-scorecard.com 1 1. Introduction
Statistical inference: example 1. Inferential Statistics
Statistical inference: example 1 Inferential Statistics POPULATION SAMPLE A clothing store chain regularly buys from a supplier large quantities of a certain piece of clothing. Each item can be classified either
NATIONAL SENIOR CERTIFICATE GRADE 12
NATIONAL SENIOR CERTIFICATE GRADE 12 MATHEMATICS P EXEMPLAR 04 MARKS: 50 TIME: 3 hours This question paper consists of 8 pages and information sheet. Please turn over Mathematics/P DBE/04 NSC Grade 12 Exemplar INSTRUCTIONS
ODBC. Getting Started With Sage Timberline Office ODBC
ODBC Getting Started With Sage Timberline Office ODBC NOTICE This document and the Sage Timberline Office software may be used only in accordance with the accompanying Sage Timberline Office End User License Agreement.
Math C067 Sampling Distributions
Math C067 Sampling Distributions Sample Mean and Sample Proportion Richard Beigel Some time between April 16, 2007 and April 16, 2007 Examples of Sampling A pollster may try to estimate the proportion of voters
Incremental calculation of weighted mean and variance
Incremental calculation of weighted mean and variance Tony Finch fanf@cam.ac.uk dot@dotat.at University of Cambridge Computing Service February 009 Abstract In these notes I explain how to derive formulae for numerically
This document contains a collection of formulas and constants useful for SPC chart construction. It assumes you are already familiar with SPC.
SPC Formulas and Tables 1 This document contains a collection of formulas and constants useful for SPC chart construction. It assumes you are already familiar with SPC. Terminology Generally, a bar drawn over a symbol
The analysis of the Cournot oligopoly model considering the subjective motive in the strategy selection
The analysis of the Cournot oligopoly model considering the subjective motive in the strategy selection Shigehito Furuyama Teruhisa Nakai Department of Systems Management Engineering Faculty of Engineering Kansai University
PROCEEDINGS OF THE YEREVAN STATE UNIVERSITY AN ALTERNATIVE MODEL FOR BONUS-MALUS SYSTEM
PROCEEDINGS OF THE YEREVAN STATE UNIVERSITY Physical and Mathematical Sciences 2015, 1, p. 15-19 Mathematics AN ALTERNATIVE MODEL FOR BONUS-MALUS SYSTEM A. G. GULYAN Chair of Actuarial Mathematics
where: T = number of years of cash flow in investment's life n = the year in which the cash flow X n i = IRR = the internal rate of return
EVALUATING ALTERNATIVE CAPITAL INVESTMENT PROGRAMS By Ken D. Duft, Extension Economist In the March 98 issue of this publication we reviewed the procedure by which a capital investment project was assessed. The
Vladimir N. Burkov, Dmitri A. Novikov MODELS AND METHODS OF MULTIPROJECTS MANAGEMENT
Keywords: project management, resource allocation, network planning Vladimir N Burkov, Dmitri A Novikov MODELS AND METHODS OF MULTIPROJECTS MANAGEMENT The paper deals with the problems of resource allocation between
Learning objectives. Duc K. Nguyen - Corporate Finance 21/10/2014
1 Lecture 3 Time Value of Money and Project Valuation The timeline Three rules of time travels NPV of a stream of cash flows Perpetuities, annuities and other special cases Learning objectives 2 Understand the time-value
Example 2 Find the square root of 0. The only square root of 0 is 0 (since 0 is not positive or negative, so those choices don't exist here).
BEGINNING ALGEBRA Roots and Radicals (revised summer, 00 Olson) Packet to Supplement the Current Textbook - Part Review of Square Roots & Irrationals (This portion can be any time before Part and should mostly
Time Value of Money. First some technical stuff. HP10B II users
Time Value of Money Basis for the course Power of compound interest $3,600 each year into a 401(k) plan yields $2,390,000 in 40 years First some technical stuff You will use your financial calculator in every single
A Review and Comparison of Methods for Detecting Outliers in Univariate Data Sets
A Review and Comparison of Methods for Detecting Outliers in Univariate Data Sets by Songwon Seo BS, Kyunghee University, Submitted to the Graduate Faculty of Graduate School of Public Health in partial fulfillment
Project Deliverables. CS 361, Lecture 28. Outline. Project Deliverables. Administrative. Project Comments
Project Deliverables CS 361, Lecture 28 Jared Saia University of New Mexico Each Group should turn in one group project consisting of: About 6-12 pages of text (can be longer with appendix) 6-12 figures (please
THE TWO-VARIABLE LINEAR REGRESSION MODEL
THE TWO-VARIABLE LINEAR REGRESSION MODEL Herman J. Bierens Pennsylvania State University April 30, 202. Introduction Suppose you are an economics or business major in a college close to the beach in the southern part
Chapter XIV: Fundamentals of Probability and Statistics *
Objectives Chapter XIV: Fundamentals of Probability and Statistics * Present fundamental concepts of probability and statistics Review measures of central tendency and dispersion Analyze methods and applications of descriptive
Confidence intervals and hypothesis tests
Chapter 2 Confidence intervals and hypothesis tests This chapter focuses on how to draw conclusions about populations from sample data. We'll start by looking at binary data (e.g., polling), and learn how to estimate
MATH 083 Final Exam Review
MATH 083 Final Exam Review Completing the problems in this review will greatly prepare you for the final exam Calculator use is not required, but you are permitted to use a calculator during the final exam period
.04. This means $1000 is multiplied by 1.02 five times, once for each of the remaining six-month
Question 1: What is an ordinary annuity? Let's look at an ordinary annuity that is certain and simple. By this, we mean an annuity over a fixed term whose payment period matches the interest conversion period. Additionally,
Swaps: Constant maturity swaps (CMS) and constant maturity Treasury (CMT) swaps
Swaps: Constant maturity swaps (CMS) and constant maturity Treasury (CMT) swaps A Constant Maturity Swap (CMS) swap is a swap where one of the legs pays (respectively receives) a swap rate of a fixed maturity, while
How to read A Mutual Fund shareholder report
Investor Bulletin How to read A Mutual Fund shareholder report The SEC's Office of Investor Education and Advocacy is issuing this Investor Bulletin to educate individual investors about mutual fund shareholder reports.
3. If x and y are real numbers, what is the simplified radical form
Algebra II Practice Test Objective:.a. Which is equivalent to 98 94 4 49?. Which expression is another way to write 5 4? 5 5 4 4 4 5 4 5 3. If x and y are real numbers, what is the simplified radical form of 5 y
Real Options for Engineering Systems. What are we up to? Today's agenda. J1: Real Options for Engineering Systems. Richard de Neufville
Real Options for Engineering Systems J1: Real Options for Engineering Systems By (MIT) Stefan Scholtes (CU) Course website: http://msl.mit.edu/cmi/ardet_2002 Stefan Scholtes Judge Institute of Management, CU Slide What
Mathematical goals. Starting points. Materials required. Time needed
Level A1 of challenge: C A1 Mathematical goals Starting points Materials required Time needed Interpreting algebraic expressions To help learners to: translate between words, symbols, tables, and area representations
Hypergeometric Distributions
7.4 Hypergeometric Distributions When choosing the starting line-up for a game, a coach obviously has to choose a different player for each position. Similarly, when a union elects delegates for a convention or you
Hypothesis testing using complex survey data
Hypothesis testing using complex survey data A Short Course presented by Peter Lynn, University of Essex in association with the conference of the European Survey Research Association Prague, 5 June 007 1 1. Objective: Simple