STA 248 Winter 2005 Assignment 1
|
|
- Margery Walton
- 7 years ago
- Views:
Transcription
1 STA 248 Winter 2005 Assignment 1 Due: Thursday, January 27 at beginning of lecture. (Late assignments will be subject to a deduction of 10% of the total marks for the assignment for each day late.) Please hand in your R code when used. On future assignments I won t be typing out the textbook problems. Let me know if you have any difficulty getting a copy of the text. Problems to be handed in for marking: Chapter 6: 12, 17, 25 Chapter 7: 9, 23, 25, 30, 59 Additional problems: 3 Problems from the textbook: Chapter 6: 8. (a) Acute exposure to cadmium produces respiratory distress and kidney and liver damage, and may even result in death. For this reason, the level of airborne cadmium dust and cadmium oxide fume in the air is monitored. This level is measured in milligrams cadmium per cubic meter of air. A sample of 35 readings yield the given data (available on the web). (a) Construct a stem-and-leaf diagram for these data. Use the numbers 02, 03, 04, 05, 06, and 07 as stems. (Do by hand.) (b) Would you be surprised to hear someone claim that the random variable X, the cadmium level in the air, is normally distributed? Explain. (d) Use R to construct a relative frequency histogram for these data. Does the histogram exhibit the bell-shape characteristic of a normal density? (e) Construct a relative cumulative frequency ogive for these data. Use the ogive to approximate that point above which 50% of the readings should fall. 12. (Percentiles.) Let X be a random variable. The point p k/100 such that P [X < p k/100 ] k/100 and P [X p k/100 ] k/100 is called the kth percentile for X. For example, let X be binomial with n = 20 and p =.5. The 25th percentile for X is the point p 25/100 = 8 since P [X < 8] = and P [X 8] = (a) Let X be binomial with n = 20 and p =.5. Find the 60th percentile for X. (b) Let X be Poisson with λ = 10. Find the 30th percentile for X. (d) Let X be exponentially distributed with β = 1. Show that the 20th percentile for X is ln.80. Hint: Find the point p such that p e x dx =
2 17. Consider the two given data sets (available on web). (a) Find the sample mean and sample median for each data set. (b) Find the sample range for each data set. (c) Find the sample variance and sample standard deviation for each data set. (d) Would you be surprised to hear someone claim that these data were drawn from the same population? Explain. Hint: Consider the shape of the distribution as well as the observed values of the sample statistics. 20. Use the data of Exercise 8 to approximate the mean, variance, and standard deviation of the random variable X, the level of airborne cadmium dust and cadmium oxide fumes. Assume that these approximations are fairly accurate. Between what two values would you expect approximately 95% of the readings to fall? Explain. 25. (Approximating σ via the range.) The range can play an important role in the design of statistical studies. To obtain a prespecified degree of accuracy when estimating population parameters, an adequate sized sample must be drawn. Most formulas used to determine sample size require knowledge of σ, the population standard deviation. Often the researcher will not have an estimate of σ available but will have an idea of the expected range of his or her data. When sampling from a normal distribution,. P [ 2σ < X µ < 2σ] =.95 If X is not normally distributed, then Chebyshev s inequality can be applied to conclude that P [ 3σ < X µ < 3σ].89 That is, X always lies within at most 3 standard deviations of its mean with high probability. From this it can be concluded that the estimated range covers an interval of roughly 4σ for normally distributed random variables and 6σ otherwise. In the normal case an estimate of σ can be obtained by solving the equation 4σ. = estimated range for σ. If X is not normally distributed, then σ. = (estimate range)/6 Data are given (available on the web) for the random variable X, the cpu time in seconds required to run a program using a statistical package. (a) Construct a stem-and-leaf diagram for these data. Is the assumption justified that X is normally distributed? (b) Approximate σ via the sample standard deviation s. (c) Find the sample range for these data, and use it to approximate σ. Compare your result to that obtained in part (b). 27. Let X be normally distributed with mean µ and variance σ 2. (a) Verify that q 3 = µ +.67σ and that q 1 = µ.67σ. (b) Find the interquartile range for X. (c) Verify that the inner fences for X are f 1 = µ 2.68σ and f 3 = µ σ. (d) Verify that the probability that X will fall beyond the inner fences is approximately
3 28. Temperature differences between the warm upper surface of the ocean and the colder deeper levels can be utilized to convert thermal energy to mechanical energy. This mechanical energy can in turn be used to produce electrical power using a vapor turbine. Let X denote the difference in temperature between the surface of the water and the water at a depth of 1 kilometer. Measurements are taken at 15 randomly selected sites in the Gulf of Mexico. The measured temperatures are available on the web. Use R to do the following. (a) Construct a double stem-and-leaf diagram for these data. (b) Find the sample mean, sample median, and sample standard deviation for these data. (c) Not that the observation with value 10.1 is very different from the others. It is a potential outlier. Construct a boxplot for these data to verify that the value 10.1 does appear to be an outlier. (d) To see the effect of this outlier, drop it from the data set and calculate the sample mean, median, and standard deviation for the remaining 14 observations. Which measure is least affected by the presence of the outlier? 36. It is known that power surges or line spikes can damage sensitive electronic equipment. A study of these surges is conducted. The purpose of the study is to ascertain whether or not there are differences in the frequency of these surges among the seven days of the week. Data for the study is found on the website. Variables are observation number; day, with m = Monday, t = Tuesday, w= Wednesday, th = Thursday, f = Friday, s = Saturday, and sn = Sunday; and number of spikes per day. Use R to do the following. (a) Obtain descriptive statistics on the number of spikes per day for each day of the week. Discuss any differences among days that appear to exist. (b) Construct boxplots for each day, and use the boxplots for a visual comparison of the days. Chapter 7: 1. Let X 1, X 2,..., X 20 be a random sample from a distribution with mean 8 and variance 5. Find the mean and variance of X. 5. Let X 1, X 2, X 3, X 4, X 5 be a random sample from a binomial distribution with n = 10 and p unknown. (a) Show that X/10 is an unbiased estimator for p. (b) Estimate p based on these data: 3, 4, 4, 5, (Weighted means.) Assume that one has k independent random samples of sizes n 1, n 2,..., n k from the same distribution. These samples generate k unbiased estimators for the mean, namely, X 1, X 2,..., X k. (a) Show that the arithmetic average of these estimators, (X 1 + X 2 + X k )/k, is also unbiased for µ. (b) Certain mineral elements required by plants are classed as macronutrients. Macronutrients are measured in terms of their percentage of the dry weight of the plant. Proportions of each element vary in different species and in the same species grown under differeing conditions. One macronutrient is sulfur. In a 3
4 study of winter cress, a member of the mustard family, these data, based on three independent random samples, are obtained: x 1 =.8 x 2 =.95 x 3 =.7 n 1 = 9 n 2 = 3 n 3 = 200 Use the result of part (a) to obtain an unbiased estimate for µ, the mean proportion of sulfur by dry weight in winter cress. By averaging the three values.8,.95, and.7 to obtain the estimate for µ, each sample is being given equal importance or weight. Does this seem reasonable in this problem? Explain. (c) To take sample sizes into account, a weighted mean is used. This estimator, ˆµ W, is given by ˆµ W = n 1X n k X k n n k Show that ˆµ W is an unbiased estimator for µ. (d) Use the data of part (b) to find the weighted estimate for the mean proportion of sulfur by dry weight in winter cress. Compare your answer to the estimate found in part (b). 16. Let X 1, X 2,..., X m be a random sample of size m from a binomial distribution with parameters n, assumed to be known, and p. Show that the method of moments estimator for p is ˆp = X/n. 17. Let X 1, X 2,..., X n be a random sample from a Poisson distribution with parameter λ. Find the method of moments estimate for λ. 23. Find the method of moments estimator for the parameter p of a geometric distribution. 25. Using the method of moments estimator for p found in Exercise 23, find an estimator for σ 2 for the geometric distribution. (You don t have to do the rest of this question that is in the text.) 27. Carbon dioxide is an odorless, colorless gass that constitutes about.035% by volume of the atmosphere. It affects the heat balance by acting as a one-way screen. It lets in the sun s heat to warm the oceans and the land but blocks some of the infrared heat that is radiated from the earth. This reflected heat is absorbed into the lower atmosphere, producing a greenhouse effect which causes the earth s surface to become warmer than it would be otherwise. Systematic measurements of CO 2 began in 1957 with Charles D. Keeling monitoring at Mauna Loa in Hawaii. (a) Given the data (available on the web) that are CO 2 readings in ppm, construct a stem-and-leaf plot (by hand) for these data using 31, 32, 32, 33, 33, 34, 34, 35 at stems. Graph leaves 0-4 on the first of each repeated stem and leaves 5-9 on the other. Is it reasonable to assume that the CO 2 level in the atmosphere is normally distributed? Explain. (b) Estimate µ and σ 2 using the method of moments estimators. (c) Find an unbiased estimate for σ Based on the data of Exercise 27, what are the maximum likelihood estimates for the mean and variance of the atmospheric CO 2 level? 4
5 30. Let X 1, X 2,..., X m be a random sample of size m from a binomial distribution with parameters n, assumed to be known, and p. Find the maximum likelihood estimator for p. Does it differ from the method of moments estimator found in Exercise 16? 31. Let W be an exponential random variable with parameter β unknown. Find the maximum likelihood estimator for β based on a sample of size n. Does it differ from the method of moments estimator (derived in lecture)? 34. Computer terminals have a battery pack that maintains the configuration of the terminal. These packs must be replaced occasionally. Let X denote the life span in years of such a battery. Assume that X is exponentially distributed with unknown parameter β. Find the maximum likelihood estimate for β based on the given data (available on the web). 35. To esimate the proportion of defective microprocessor chips being produced by a particular maker, samples of five chips are selected at 10 randomly selected times during the day. These chips are inspected, and X, the number of defective chips in each batch of size 5, is recorded. Assume that X is binomially distributed with n = 5 and p unknown. Use the data given (available on the web) to find the maximum likelihood estimate for p. 54. Let X denote the unit price of a 3.5-inch floppy diskette. Observations are obtained from a random sample of 10 suppliers. (Data are available on web.) (a) Find an unbiased estimate for the mean price of these diskettes. (b) Find an unbiased estimate for the variance in the price of these diskettes. (c) Find the sample standard deviation. Is this an unbiased estimate for σ? (d) Assume that X is normally distributed. Find the maximum likelihood estimate for σ 2. Does this agree with your answer to (b)? 59. Consider the random variable X with density given by f(x) = (1/θ 2 )xe x/θ, x > 0 (b) Show that E(X) = 2θ. (c) Find the method of moments estimator for θ. (d) Find the maximum likelihood estimator for θ based on a random sample of size n. Does this estimator differ from that found in part (c)? (e) Estimate θ based on these data: (f) Are the estimators found in parts (c) and (d) unbiased estimators for θ? Additional problems: 1. Which of the following statistics can be made arbitrarily large by making one number out of a batch of 100 numbers arbitrarily large: the mean, the median, the 10% trimmed mean, the standard deviation, the interquartile range? 2. Suppose X 1,..., X n are n identically distributed random variables with E(X i ) = µ, i = i,..., n. Show that (X) 2 is not an unbiased estimate of µ 2. 5
6 3. What general features are evident in a boxplot of data from a normal distribution? from a skewed distribution? from a distribution that is symmetric and bell-shaped like the normal distribution, but has less probability in the tails (the extreme values)? from a distribution that is symmetric and bell-shaped like the normal distribution, but has more probability in the tails (the extreme values)? 4. In data compression of text, a probability model is used where the probability of the next letter is heavily influenced by the preceding letters. In a first-order Markov model, the probability of the next letter depends only on the one letter immediately preceding it. Suppose we are interested in a model for the compression of a binary string. I ll label the values b for black and w for white. For a first-order Markov model we need the following probabilities for the value of a letter given the value preceding it: P (w w) = p w, P (b w) = 1 p w, P (b b) = p b, P (w b) = 1 p b Suppose X i is the random variable that is 1 if the ith letter is w and 0 if the ith letter is b. Then given that the (i 1)th letter is w (say), the probability function of X i is P (X i = x X i 1 = 1) = p x w (1 p w) 1 x. Suppose the string bbbbwwwbbbbbwwbbbbbbwwwwb is observed. Use maximum likelihood to estimate the parameters p w and p b. 6
BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I
BNG 202 Biomechanics Lab Descriptive statistics and probability distributions I Overview The overall goal of this short course in statistics is to provide an introduction to descriptive and inferential
More informationChapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs
Types of Variables Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs Quantitative (numerical)variables: take numerical values for which arithmetic operations make sense (addition/averaging)
More informationImportant Probability Distributions OPRE 6301
Important Probability Distributions OPRE 6301 Important Distributions... Certain probability distributions occur with such regularity in real-life applications that they have been given their own names.
More informationGETTING TO THE CORE: THE LINK BETWEEN TEMPERATURE AND CARBON DIOXIDE
DESCRIPTION This lesson plan gives students first-hand experience in analyzing the link between atmospheric temperatures and carbon dioxide ( ) s by looking at ice core data spanning hundreds of thousands
More informationSummary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)
Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume
More informationDescriptive Statistics
Y520 Robert S Michael Goal: Learn to calculate indicators and construct graphs that summarize and describe a large quantity of values. Using the textbook readings and other resources listed on the web
More informationLecture 1: Review and Exploratory Data Analysis (EDA)
Lecture 1: Review and Exploratory Data Analysis (EDA) Sandy Eckel seckel@jhsph.edu Department of Biostatistics, The Johns Hopkins University, Baltimore USA 21 April 2008 1 / 40 Course Information I Course
More informationDensity Curve. A density curve is the graph of a continuous probability distribution. It must satisfy the following properties:
Density Curve A density curve is the graph of a continuous probability distribution. It must satisfy the following properties: 1. The total area under the curve must equal 1. 2. Every point on the curve
More informationEXAM #1 (Example) Instructor: Ela Jackiewicz. Relax and good luck!
STP 231 EXAM #1 (Example) Instructor: Ela Jackiewicz Honor Statement: I have neither given nor received information regarding this exam, and I will not do so until all exams have been graded and returned.
More informationDef: The standard normal distribution is a normal probability distribution that has a mean of 0 and a standard deviation of 1.
Lecture 6: Chapter 6: Normal Probability Distributions A normal distribution is a continuous probability distribution for a random variable x. The graph of a normal distribution is called the normal curve.
More informationCARBON THROUGH THE SEASONS
DESCRIPTION In this lesson plan, students learn about the carbon cycle and understand how concentrations of carbon dioxide (CO 2 ) in the Earth s atmosphere vary as the seasons change. Students also learn
More informationDescriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics
Descriptive statistics is the discipline of quantitatively describing the main features of a collection of data. Descriptive statistics are distinguished from inferential statistics (or inductive statistics),
More informationCA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction
CA200 Quantitative Analysis for Business Decisions File name: CA200_Section_04A_StatisticsIntroduction Table of Contents 4. Introduction to Statistics... 1 4.1 Overview... 3 4.2 Discrete or continuous
More informationPr(X = x) = f(x) = λe λx
Old Business - variance/std. dev. of binomial distribution - mid-term (day, policies) - class strategies (problems, etc.) - exponential distributions New Business - Central Limit Theorem, standard error
More informationExploratory data analysis (Chapter 2) Fall 2011
Exploratory data analysis (Chapter 2) Fall 2011 Data Examples Example 1: Survey Data 1 Data collected from a Stat 371 class in Fall 2005 2 They answered questions about their: gender, major, year in school,
More informationEnergy Pathways in Earth s Atmosphere
BRSP - 10 Page 1 Solar radiation reaching Earth s atmosphere includes a wide spectrum of wavelengths. In addition to visible light there is radiation of higher energy and shorter wavelength called ultraviolet
More informationList of Examples. Examples 319
Examples 319 List of Examples DiMaggio and Mantle. 6 Weed seeds. 6, 23, 37, 38 Vole reproduction. 7, 24, 37 Wooly bear caterpillar cocoons. 7 Homophone confusion and Alzheimer s disease. 8 Gear tooth strength.
More informationExploratory Data Analysis
Exploratory Data Analysis Johannes Schauer johannes.schauer@tugraz.at Institute of Statistics Graz University of Technology Steyrergasse 17/IV, 8010 Graz www.statistics.tugraz.at February 12, 2008 Introduction
More informationSTT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables
Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables Discrete vs. continuous random variables Examples of continuous distributions o Uniform o Exponential o Normal Recall: A random
More information5.1 Identifying the Target Parameter
University of California, Davis Department of Statistics Summer Session II Statistics 13 August 20, 2012 Date of latest update: August 20 Lecture 5: Estimation with Confidence intervals 5.1 Identifying
More informationThe Normal Distribution
Chapter 6 The Normal Distribution 6.1 The Normal Distribution 1 6.1.1 Student Learning Objectives By the end of this chapter, the student should be able to: Recognize the normal probability distribution
More information6.4 Normal Distribution
Contents 6.4 Normal Distribution....................... 381 6.4.1 Characteristics of the Normal Distribution....... 381 6.4.2 The Standardized Normal Distribution......... 385 6.4.3 Meaning of Areas under
More informationChapter 6: Point Estimation. Fall 2011. - Probability & Statistics
STAT355 Chapter 6: Point Estimation Fall 2011 Chapter Fall 2011 6: Point1 Estimat / 18 Chap 6 - Point Estimation 1 6.1 Some general Concepts of Point Estimation Point Estimate Unbiasedness Principle of
More informationAP * Statistics Review. Descriptive Statistics
AP * Statistics Review Descriptive Statistics Teacher Packet Advanced Placement and AP are registered trademark of the College Entrance Examination Board. The College Board was not involved in the production
More informationMATH 103/GRACEY PRACTICE EXAM/CHAPTERS 2-3. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.
MATH 3/GRACEY PRACTICE EXAM/CHAPTERS 2-3 Name MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide an appropriate response. 1) The frequency distribution
More information8. THE NORMAL DISTRIBUTION
8. THE NORMAL DISTRIBUTION The normal distribution with mean μ and variance σ 2 has the following density function: The normal distribution is sometimes called a Gaussian Distribution, after its inventor,
More informationComplete a table of values. Graph the values given in a table. Create an equation representing the information in a table or graph.
Activity III: Surface Area of a Leaf (Grades 7-9) Objectives: Complete a table of values. Graph the values given in a table. Create an equation representing the information in a table or graph. NCTM Standards
More information4. Continuous Random Variables, the Pareto and Normal Distributions
4. Continuous Random Variables, the Pareto and Normal Distributions A continuous random variable X can take any value in a given range (e.g. height, weight, age). The distribution of a continuous random
More informationStatistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013
Statistics I for QBIC Text Book: Biostatistics, 10 th edition, by Daniel & Cross Contents and Objectives Chapters 1 7 Revised: August 2013 Chapter 1: Nature of Statistics (sections 1.1-1.6) Objectives
More informationLecture 5 : The Poisson Distribution
Lecture 5 : The Poisson Distribution Jonathan Marchini November 10, 2008 1 Introduction Many experimental situations occur in which we observe the counts of events within a set unit of time, area, volume,
More informationChapter 3. The Normal Distribution
Chapter 3. The Normal Distribution Topics covered in this chapter: Z-scores Normal Probabilities Normal Percentiles Z-scores Example 3.6: The standard normal table The Problem: What proportion of observations
More informationMeans, standard deviations and. and standard errors
CHAPTER 4 Means, standard deviations and standard errors 4.1 Introduction Change of units 4.2 Mean, median and mode Coefficient of variation 4.3 Measures of variation 4.4 Calculating the mean and standard
More informationSTATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI
STATS8: Introduction to Biostatistics Data Exploration Babak Shahbaba Department of Statistics, UCI Introduction After clearly defining the scientific problem, selecting a set of representative members
More information3: Summary Statistics
3: Summary Statistics Notation Let s start by introducing some notation. Consider the following small data set: 4 5 30 50 8 7 4 5 The symbol n represents the sample size (n = 0). The capital letter X denotes
More informationFACTS ABOUT CLIMATE CHANGE
FACTS ABOUT CLIMATE CHANGE 1. What is climate change? Climate change is a long-term shift in the climate of a specific location, region or planet. The shift is measured by changes in features associated
More informationLecture 2: Descriptive Statistics and Exploratory Data Analysis
Lecture 2: Descriptive Statistics and Exploratory Data Analysis Further Thoughts on Experimental Design 16 Individuals (8 each from two populations) with replicates Pop 1 Pop 2 Randomly sample 4 individuals
More information6 3 The Standard Normal Distribution
290 Chapter 6 The Normal Distribution Figure 6 5 Areas Under a Normal Distribution Curve 34.13% 34.13% 2.28% 13.59% 13.59% 2.28% 3 2 1 + 1 + 2 + 3 About 68% About 95% About 99.7% 6 3 The Distribution Since
More informationRandom Variables. Chapter 2. Random Variables 1
Random Variables Chapter 2 Random Variables 1 Roulette and Random Variables A Roulette wheel has 38 pockets. 18 of them are red and 18 are black; these are numbered from 1 to 36. The two remaining pockets
More informationQuantitative Methods for Finance
Quantitative Methods for Finance Module 1: The Time Value of Money 1 Learning how to interpret interest rates as required rates of return, discount rates, or opportunity costs. 2 Learning how to explain
More informationMBA 611 STATISTICS AND QUANTITATIVE METHODS
MBA 611 STATISTICS AND QUANTITATIVE METHODS Part I. Review of Basic Statistics (Chapters 1-11) A. Introduction (Chapter 1) Uncertainty: Decisions are often based on incomplete information from uncertain
More informationQuestion: What is the probability that a five-card poker hand contains a flush, that is, five cards of the same suit?
ECS20 Discrete Mathematics Quarter: Spring 2007 Instructor: John Steinberger Assistant: Sophie Engle (prepared by Sophie Engle) Homework 8 Hints Due Wednesday June 6 th 2007 Section 6.1 #16 What is the
More informationCenter: Finding the Median. Median. Spread: Home on the Range. Center: Finding the Median (cont.)
Center: Finding the Median When we think of a typical value, we usually look for the center of the distribution. For a unimodal, symmetric distribution, it s easy to find the center it s just the center
More informationChapter 3 RANDOM VARIATE GENERATION
Chapter 3 RANDOM VARIATE GENERATION In order to do a Monte Carlo simulation either by hand or by computer, techniques must be developed for generating values of random variables having known distributions.
More informationNotes on Continuous Random Variables
Notes on Continuous Random Variables Continuous random variables are random quantities that are measured on a continuous scale. They can usually take on any value over some interval, which distinguishes
More informationBusiness Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.
Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing
More information1. Let A, B and C are three events such that P(A) = 0.45, P(B) = 0.30, P(C) = 0.35,
1. Let A, B and C are three events such that PA =.4, PB =.3, PC =.3, P A B =.6, P A C =.6, P B C =., P A B C =.7. a Compute P A B, P A C, P B C. b Compute P A B C. c Compute the probability that exactly
More informationa. mean b. interquartile range c. range d. median
3. Since 4. The HOMEWORK 3 Due: Feb.3 1. A set of data are put in numerical order, and a statistic is calculated that divides the data set into two equal parts with one part below it and the other part
More informationSta 309 (Statistics And Probability for Engineers)
Instructor: Prof. Mike Nasab Sta 309 (Statistics And Probability for Engineers) Chapter 2 Organizing and Summarizing Data Raw Data: When data are collected in original form, they are called raw data. The
More informationBASIC STATISTICAL METHODS FOR GENOMIC DATA ANALYSIS
BASIC STATISTICAL METHODS FOR GENOMIC DATA ANALYSIS SEEMA JAGGI Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-110 012 seema@iasri.res.in Genomics A genome is an organism s
More informationThe Normal Distribution
The Normal Distribution Continuous Distributions A continuous random variable is a variable whose possible values form some interval of numbers. Typically, a continuous variable involves a measurement
More informationData Modeling & Analysis Techniques. Probability & Statistics. Manfred Huber 2011 1
Data Modeling & Analysis Techniques Probability & Statistics Manfred Huber 2011 1 Probability and Statistics Probability and statistics are often used interchangeably but are different, related fields
More information7 CONTINUOUS PROBABILITY DISTRIBUTIONS
7 CONTINUOUS PROBABILITY DISTRIBUTIONS Chapter 7 Continuous Probability Distributions Objectives After studying this chapter you should understand the use of continuous probability distributions and the
More informationChapter 4 Lecture Notes
Chapter 4 Lecture Notes Random Variables October 27, 2015 1 Section 4.1 Random Variables A random variable is typically a real-valued function defined on the sample space of some experiment. For instance,
More informationStatistics courses often teach the two-sample t-test, linear regression, and analysis of variance
2 Making Connections: The Two-Sample t-test, Regression, and ANOVA In theory, there s no difference between theory and practice. In practice, there is. Yogi Berra 1 Statistics courses often teach the two-sample
More informationThe right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median
CONDENSED LESSON 2.1 Box Plots In this lesson you will create and interpret box plots for sets of data use the interquartile range (IQR) to identify potential outliers and graph them on a modified box
More informationSummarizing and Displaying Categorical Data
Summarizing and Displaying Categorical Data Categorical data can be summarized in a frequency distribution which counts the number of cases, or frequency, that fall into each category, or a relative frequency
More informationThursday 8 November 2012 Afternoon
H Thursday 8 November 2012 Afternoon GCSE MATHEMATICS B J567/04 Paper 4 (Higher Tier) *J517181112* Candidates answer on the Question Paper. OCR supplied materials: None Other materials required: Geometrical
More informationProbability and Statistics Vocabulary List (Definitions for Middle School Teachers)
Probability and Statistics Vocabulary List (Definitions for Middle School Teachers) B Bar graph a diagram representing the frequency distribution for nominal or discrete data. It consists of a sequence
More informationAP STATISTICS REVIEW (YMS Chapters 1-8)
AP STATISTICS REVIEW (YMS Chapters 1-8) Exploring Data (Chapter 1) Categorical Data nominal scale, names e.g. male/female or eye color or breeds of dogs Quantitative Data rational scale (can +,,, with
More informationSKEWNESS. Measure of Dispersion tells us about the variation of the data set. Skewness tells us about the direction of variation of the data set.
SKEWNESS All about Skewness: Aim Definition Types of Skewness Measure of Skewness Example A fundamental task in many statistical analyses is to characterize the location and variability of a data set.
More informationProbability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur
Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur Module No. #01 Lecture No. #15 Special Distributions-VI Today, I am going to introduce
More informationMATH BOOK OF PROBLEMS SERIES. New from Pearson Custom Publishing!
MATH BOOK OF PROBLEMS SERIES New from Pearson Custom Publishing! The Math Book of Problems Series is a database of math problems for the following courses: Pre-algebra Algebra Pre-calculus Calculus Statistics
More informationThe Big Picture. Describing Data: Categorical and Quantitative Variables Population. Descriptive Statistics. Community Coalitions (n = 175)
Describing Data: Categorical and Quantitative Variables Population The Big Picture Sampling Statistical Inference Sample Exploratory Data Analysis Descriptive Statistics In order to make sense of data,
More informationInstitute of Actuaries of India Subject CT3 Probability and Mathematical Statistics
Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2015 Examinations Aim The aim of the Probability and Mathematical Statistics subject is to provide a grounding in
More informationDescriptive Statistics
Descriptive Statistics Suppose following data have been collected (heights of 99 five-year-old boys) 117.9 11.2 112.9 115.9 18. 14.6 17.1 117.9 111.8 16.3 111. 1.4 112.1 19.2 11. 15.4 99.4 11.1 13.3 16.9
More informationMaster s Theory Exam Spring 2006
Spring 2006 This exam contains 7 questions. You should attempt them all. Each question is divided into parts to help lead you through the material. You should attempt to complete as much of each problem
More informationFigure 1. Basic structure of the leaf, with a close up of the leaf surface showing Stomata and Guard cells.
BIOL100 Laboratory Assignment 3: Analysis of Stomata Name: Stomata (singular=stoma) are the respiratory control structures in plants (see Figure 1 below). They are essentially small holes in the surface
More informationCourse Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics
Course Text Business Statistics Lind, Douglas A., Marchal, William A. and Samuel A. Wathen. Basic Statistics for Business and Economics, 7th edition, McGraw-Hill/Irwin, 2010, ISBN: 9780077384470 [This
More informationLecture 19: Chapter 8, Section 1 Sampling Distributions: Proportions
Lecture 19: Chapter 8, Section 1 Sampling Distributions: Proportions Typical Inference Problem Definition of Sampling Distribution 3 Approaches to Understanding Sampling Dist. Applying 68-95-99.7 Rule
More informationA Correlation of. to the. South Carolina Data Analysis and Probability Standards
A Correlation of to the South Carolina Data Analysis and Probability Standards INTRODUCTION This document demonstrates how Stats in Your World 2012 meets the indicators of the South Carolina Academic Standards
More informationAssignment #03: Time Management with Excel
Technical Module I Demonstrator: Dereatha Cross dac4303@ksu.edu Assignment #03: Time Management with Excel Introduction Success in any endeavor depends upon time management. One of the optional exercises
More informationChapter 4. Probability and Probability Distributions
Chapter 4. robability and robability Distributions Importance of Knowing robability To know whether a sample is not identical to the population from which it was selected, it is necessary to assess the
More informationAP Statistics Solutions to Packet 2
AP Statistics Solutions to Packet 2 The Normal Distributions Density Curves and the Normal Distribution Standard Normal Calculations HW #9 1, 2, 4, 6-8 2.1 DENSITY CURVES (a) Sketch a density curve that
More informationRandom variables, probability distributions, binomial random variable
Week 4 lecture notes. WEEK 4 page 1 Random variables, probability distributions, binomial random variable Eample 1 : Consider the eperiment of flipping a fair coin three times. The number of tails that
More informationREPEATED TRIALS. The probability of winning those k chosen times and losing the other times is then p k q n k.
REPEATED TRIALS Suppose you toss a fair coin one time. Let E be the event that the coin lands heads. We know from basic counting that p(e) = 1 since n(e) = 1 and 2 n(s) = 2. Now suppose we play a game
More informationPermutation Tests for Comparing Two Populations
Permutation Tests for Comparing Two Populations Ferry Butar Butar, Ph.D. Jae-Wan Park Abstract Permutation tests for comparing two populations could be widely used in practice because of flexibility of
More informationTutorial 5: Hypothesis Testing
Tutorial 5: Hypothesis Testing Rob Nicholls nicholls@mrc-lmb.cam.ac.uk MRC LMB Statistics Course 2014 Contents 1 Introduction................................ 1 2 Testing distributional assumptions....................
More informationDESCRIPTIVE STATISTICS - CHAPTERS 1 & 2 1
DESCRIPTIVE STATISTICS - CHAPTERS 1 & 2 1 OVERVIEW STATISTICS PANIK...THE THEORY AND METHODS OF COLLECTING, ORGANIZING, PRESENTING, ANALYZING, AND INTERPRETING DATA SETS SO AS TO DETERMINE THEIR ESSENTIAL
More informationVariables. Exploratory Data Analysis
Exploratory Data Analysis Exploratory Data Analysis involves both graphical displays of data and numerical summaries of data. A common situation is for a data set to be represented as a matrix. There is
More informationDongfeng Li. Autumn 2010
Autumn 2010 Chapter Contents Some statistics background; ; Comparing means and proportions; variance. Students should master the basic concepts, descriptive statistics measures and graphs, basic hypothesis
More informationHow To Check For Differences In The One Way Anova
MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way
More informationFairfield Public Schools
Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity
More informationYou flip a fair coin four times, what is the probability that you obtain three heads.
Handout 4: Binomial Distribution Reading Assignment: Chapter 5 In the previous handout, we looked at continuous random variables and calculating probabilities and percentiles for those type of variables.
More informationMilitary Reliability Modeling William P. Fox, Steven B. Horton
Military Reliability Modeling William P. Fox, Steven B. Horton Introduction You are an infantry rifle platoon leader. Your platoon is occupying a battle position and has been ordered to establish an observation
More informationCharacteristics of Binomial Distributions
Lesson2 Characteristics of Binomial Distributions In the last lesson, you constructed several binomial distributions, observed their shapes, and estimated their means and standard deviations. In Investigation
More informationc. Construct a boxplot for the data. Write a one sentence interpretation of your graph.
MBA/MIB 5315 Sample Test Problems Page 1 of 1 1. An English survey of 3000 medical records showed that smokers are more inclined to get depressed than non-smokers. Does this imply that smoking causes depression?
More informationTHE BINOMIAL DISTRIBUTION & PROBABILITY
REVISION SHEET STATISTICS 1 (MEI) THE BINOMIAL DISTRIBUTION & PROBABILITY The main ideas in this chapter are Probabilities based on selecting or arranging objects Probabilities based on the binomial distribution
More informationIntroduction to Statistics for Psychology. Quantitative Methods for Human Sciences
Introduction to Statistics for Psychology and Quantitative Methods for Human Sciences Jonathan Marchini Course Information There is website devoted to the course at http://www.stats.ox.ac.uk/ marchini/phs.html
More informationDepartment of Mathematics, Indian Institute of Technology, Kharagpur Assignment 2-3, Probability and Statistics, March 2015. Due:-March 25, 2015.
Department of Mathematics, Indian Institute of Technology, Kharagpur Assignment -3, Probability and Statistics, March 05. Due:-March 5, 05.. Show that the function 0 for x < x+ F (x) = 4 for x < for x
More informationBINOMIAL DISTRIBUTION
MODULE IV BINOMIAL DISTRIBUTION A random variable X is said to follow binomial distribution with parameters n & p if P ( X ) = nc x p x q n x where x = 0, 1,2,3..n, p is the probability of success & q
More informationUnit 5 Photosynthesis and Cellular Respiration
Unit 5 Photosynthesis and Cellular Respiration Advanced Concepts What is the abbreviated name of this molecule? What is its purpose? What are the three parts of this molecule? Label each part with the
More information1. Theoretical background
1. Theoretical background We consider the energy budget at the soil surface (equation 1). Energy flux components absorbed or emitted by the soil surface are: net radiation, latent heat flux, sensible heat
More informationCHI-SQUARE: TESTING FOR GOODNESS OF FIT
CHI-SQUARE: TESTING FOR GOODNESS OF FIT In the previous chapter we discussed procedures for fitting a hypothesized function to a set of experimental data points. Such procedures involve minimizing a quantity
More informationDiagrams and Graphs of Statistical Data
Diagrams and Graphs of Statistical Data One of the most effective and interesting alternative way in which a statistical data may be presented is through diagrams and graphs. There are several ways in
More informationCruise Line Agencies of Alaska. Cruise Ship Calendar for 2016 FOR PORT(S) = KTN AND SHIP(S) = ALL AND VOYAGES = ALL
Cruise Line Agencies of Alaska Cruise Ship Calendar for 2016 FOR PORT(S) = KTN AND SHIP(S) = ALL AND VOYAGES = ALL Page 1 of 5 Sunday, May 1 07:0-18:0 Monday, May 2 Tuesday, May 3 Wednesday, May 4 Thursday,
More informationHow To Write A Data Analysis
Mathematics Probability and Statistics Curriculum Guide Revised 2010 This page is intentionally left blank. Introduction The Mathematics Curriculum Guide serves as a guide for teachers when planning instruction
More informationThe Earth s Atmosphere
THE SUN-EARTH SYSTEM III The Earth s Atmosphere Composition and Distribution of the Atmosphere The composition of the atmosphere and the way its gases interact with electromagnetic radiation determine
More informationclimate science A SHORT GUIDE TO This is a short summary of a detailed discussion of climate change science.
A SHORT GUIDE TO climate science This is a short summary of a detailed discussion of climate change science. For more information and to view the full report, visit royalsociety.org/policy/climate-change
More informationHow Far is too Far? Statistical Outlier Detection
How Far is too Far? Statistical Outlier Detection Steven Walfish President, Statistical Outsourcing Services steven@statisticaloutsourcingservices.com 30-325-329 Outline What is an Outlier, and Why are
More informationCruise Line Agencies of Alaska. Cruise Ship Calendar for 2016 FOR PORT(S) = KTN AND SHIP(S) = ALL AND VOYAGES = ALL
Cruise Line Agencies of Alaska Cruise Ship Calendar for 06 FOR PORT(S) = KTN AND SHIP(S) = ALL AND VOYAGES = ALL 6: Friday, April 5, 06 Cruise Line Agencies of Alaska, Cruise Ship Calendar for 06 Page
More information