International College of Economics and Finance Syllabus Probability Theory and Introductory Statistics Lecturer: Mikhail Zhitlukhin. 1. Course description Probability Theory and Introductory Statistics is a two-semester course for first-year students of the ICEF specializing in economics. The course is taught in Russian and English. The course is devoted to basic notions of statistics: data collection methods, the notion of a population and a sample, descriptive statistics, statistical methods of parameter estimation and hypothesis testing, regression models, etc. The course also includes topics from Probability Theory, which are essential for a consistent delivery. 2. Course objectives The main objective of the course is to provide students with knowledge of basic statistics. By the end of the course the students should understand the subject of statistics and master its basic methods. They acquire skills of primary data analysis (how to find a mean, a median, a standard deviation and other descriptive statistics), graphical representation of data (histograms, stem-and-leaf plots, dot plots, box plots). The students learn how to formulate and solve typical problems of basic statistics: point and interval parameter estimation, hypothesis testing, correlation analysis, regression models. An essential part of the course is devoted to basics of Probability Theory, which serves as a foundation of statistics. The students should understand the notion of a probability space, a random event, the probability of an event. They should know how to compute the probability of a complex event, solve basic combinatorial problems, understand the formula of total probability and the Bayes formula. The students should have a clear understanding of the concept of a random variable and its distribution. They should also understand the meaning of the Law of Large Numbers and the Central Limit Theorem. In classes students both solve theoretical problems and perform computer tasks with real data, develop practical skills and intuition. By the end of the course, students should understand the theory of statistical methods and be able to apply them in practice. In the course students obtain the amount of knowledge sufficient for the AP Statistics test. 3. Methods The following methods and forms of study are used in the course: lectures (2 hours a week), classes (2 hours a week), weekly home assignments, office hours, self-study. The course consists of 64 hours of lectures and 64 hours of classes. 4. Grade determination The students sit a written mid-term exam in the first module and a final semester exam in the end of the second module. The exams include multiple choice and free response questions. In the fourth module the students sit a mid-term exam of a similar form. After that the students have the AP Statistics exam. 1
The grade for the first two modules is made up of the autumn mid-term exam grade (30%), the winter exam grade (60%), the average grade for home assignments and classroom tests (10%). The final course grade is made up of the grade for the first two modules (25%), the grade for the spring mid-term exam (35%), the grade for the AP exam (35%), the average grade for home assignments and classroom tests in the 3rd and 4th modules (5%). 5. Main reading 1. Wonnacott R. J., Wonnacott T. H. Introductory Statistics for Business and Economics. John Wiley & Sons, fourth edition, 1990. 6. Supplementary reading 1. Айвазян С. А., Мхитарян В. С. Теория вероятностей и прикладная статистика. ЮНИТИ, 2001. 2. Гмурман В. Е. Теория вероятностей и математическая статистика. Высшая школа, 1998. 3. Гмурман В. Е. Руководство к решению задач по теории вероятностей и математической статистике. Высшая школа, 1998. 4. Шведов А. С. Теория вероятностей и математическая статистика. ВШЭ, 1995. 5. Ширяев А. Н. Вероятность-1. МЦНМО, 2007. 6. Ширяев А. Н., Эрлих И. Г., Яськов П. А. Вероятность в теоремах и задачах. МЦНМО, 2013. 7. Hogg R. V. and Tanis E. A. Probability and Statistical Inference. Prentice Hall, 1993. 7. Internet resources 1. Isotalo J. Basics of Statistics. http://www.mv.helsinki.fi/home/jmisotal/bos.pdf 2. Манита А. Д. Теория вероятностей и математическая статистика. http://teorver-online.narod.ru 3. The Data and Story Library. http://lib.stat.cmu.edu/dasl/ 4. Statistical Topics. http://www.stat.yale.edu/courses/1997-98/101/stat101.htm 5. Getting Started with Statistics Concepts. http://www.statsoft.com/textbook/esc.html 6. Engineering Statistics Handbook. http://www.itl.nist.gov/div898/handbook/pri/section3/pri3.htm 2
8. Course outline 1. Basic data analysis and descriptive statistics (WW, Ch.2) 1.1 Graphical representation of data. Dot plots. Steam-and-leaf plots. Histograms. 1.2 Characteristics of data. Outliers. Clusters. Histogram shape. 1.3 Descriptive statistics. Measures of center of a distribution: arithmetic average, median, mode. Measures of dispersion of a distribution: range, mean-square deviation, interquartile range, average absolute deviation, average relative deviation. Representation of data with box plots. 1.4 Transformation of basic statistics under a linear transform of data. 1.5 Measures of location in a sample: quartiles, percentiles, z-scale. 1.6 Computations with grouped data. 2. Data collection, planning and conducting an experiment (WW, Ch.1) 2.1 Methods of data collection: census, sample survey, experiment, observational study. 2.2 Population, sample, random sample. 2.3 Sources of bias in sampling and surveys. 2.4 Types of sampling: simple random sampling, stratified random sampling, cluster sampling. 2.5 Planning and conducting an experiment. 2.6 Control groups, random assignments, replication. 2.7 Sources of bias in experiments. Mixing factors, placebo effect, blinding. 2.8 Completely randomized design. Block design. 3. Elements of probability theory (WW, Ch. 3, 4) 3.1 Experiment with random outcomes. The notion of probability. The probability of an event as the limit of its relative frequency in a series of independent trials. The concept of the Law of Large Numbers. 3.2 Space of elementary outcomes as a mathematical model of an experiment with random outcomes. Algebra of events. Disjoint events. 3.3 Probability in a space of elementary outcomes. Classical probability. Elementary combinatorics. Probability of the sum of events. 3.4 Conditional probability. Probability of the product of events. Independent events. 3.5 The formula of total probability. The Bayes formula. 4. Discrete random variables (WW, Ch. 3, 4) 4.1 Examples of discrete random variables. Distribution of a discrete random variable. Relative frequencies and cumulative frequencies. 4.2 Mean value (expectation). Variance. Standard deviation. 4.3 Sequence of independent experiments. The binomial distribution. The geometric distribution. 5. Continuous random variables (WW, Ch. 3, 4) 5.1 Examples of continuous random variables. Distribution function. Distribution density. Mean value (expectation). Variance. Standard deviation. 5.2 The normal distribution, its properties. Normal distribution tables. 5.3 Linear transformations of random variables. 6. Two-dimensional distributions (WW, Ch. 5) 6.1 Joint distribution of two random variables. Marginal distribution. Conditional distribution. Conditional expectation. 6.2 Independent random variables. 6.3 Covariation coefficient. Correlation as a measure of the linear relationship between two random variables. Uncorrelated and independent random variables. The expectation and the variance of a linear combination of random variables. 3
7. Limit theorems 7.1 The Law of Large Numbers. 7.2 The Central Limit Theorem. Normal approximation of the binomial distribution. 8. Sampling distributions (WW, Ch. 6) 8.1 The distribution of the sample mean and the sample proportion. 8.2 The distribution of the difference of two proportions. The distribution of the difference of two independent sample means. 8.3 Student s t-distribution, the chi-squared distribution. 9. Point parameter estimation (WW, Ch. 7) 9.1 Point estimation of population parameters. Examples of point estimates: sample mean and sample variance. 9.2 Properties of estimates: unbiasedness, efficiency, consistency. 9.3 Estimates of mean and variance. 9.4 Estimates of proportion. 10. Interval parameter estimation (WW, Ch. 8) 10.1 The notion of a confidence interval. The confidence interval for the mean of a population. Normal approximation for large samples. Small samples (Student s distribution). 10.2 Confidence intervals for the difference of two population means (independent and matched samples). 10.3 Confidence intervals for the difference of two proportions. 10.4 Two-sided and one-sided confidence intervals. 11. Hypothesis testing (WW, Ch. 9) 11.1 Hypothesis and statistical test. Test for a population mean. Using confidence intervals and test-statistics. 11.2 Two-sided and one-sided tests. P -value. 11.3 Type I errors and type II errors. Significance and power of a test. 11.4 Standard tests: population mean, population proportion, difference of two independent and matched samples, difference of proportions. 11.5 Pearson s chi-squared test. Contingency tables. 12. Pair regression (WW, Ch. 11, 12) 12.1 X Y plot. Fitting a line. Ordinary least squares. 12.2 Transformations into a linear model. 12.3 Outliers. 12.4 Fitted values. 12.5 Errors and residuals. 12.6 Statistical properties of regression estimates. Confidence interval for the slope. Testing hypothesis for the slope. 4
9. Distribution of hours for topics and types of work No. Topics Number of hours lectures classes 1 Basic data analysis 6 6 2 Data collection, planning and conducting an experiment 4 4 3 Elements of Probability Theory 6 6 4 Discrete random variables 4 4 5 Continuous random variables 4 4 6 Two-dimensional distributions 4 4 7 Limit theorems 4 4 8 Sampling distributions 4 4 9 Point parameter estimation 6 6 10 Interval parameter estimation 8 8 11 Hypothesis testing 8 8 12 Pair regression 6 6 TOTAL 64 64 5