Rarefaction Method DRAFT 1/5/2016 Our data base combines taxonomic counts from 23 agencies. The number of organisms identified and counted per sample
|
|
|
- Cory Boyd
- 9 years ago
- Views:
Transcription
1 Rarefaction Method DRAFT 1/5/2016 Our data base combines taxonomic counts from 23 agencies. The number of organisms identified and counted per sample differs among agencies. Some count 100 individuals while others count more. A positive relationship exists between the probability of finding a new taxon and the number of individuals counted (Figure 1). This relationship influences important metrics such as taxa richness and diversity indices. The agency with a greater standard count will typically have higher estimates of richness and a better representation of rare taxa. Rarefaction is often used to overcome the bias in taxa richness and diversity metrics created by different counting protocols. Rarefaction randomly selects a prescribed number of individuals (e.g., 100, 200) without replacement from a sample s larger raw count and creates a rarified sample 1. The vegan package in R provides several useful rarefaction functions. The function rarefy provides a mean taxa richness value for the rarified sample and the error associated with that value. Mean taxa richness is the average number of taxa that can be expected in the rarified sample for a prescribed total count. Another function, rrarefy, creates a taxonomic assemblage for a rarefied sample by randomly selecting taxa without replacement from the original sample. The rarefied assemblage can then be used as raw counts to calculate unbiased richness and diversity metrics associated with the selected total count. Due to the random selection process, different combinations of taxa (assemblages) are usually created each time the rrarefy function is run (Table 1). Richness and diversity metric values vary with each iteration, making it difficult to score a sample consistently. The problem can be avoided by using the actual taxonomic proportions in the raw count to generate the assemblage most likely to be found in the rarified sample. When the rrarefy function is repeated many times, and the frequency of each unique taxonomic assemblage is plotted, a normalized curve is produced. The most frequently found assemblages occur in the middle of the normalized curve. They are represented by the more abundant taxa in the sample because these taxa have a higher probability of being randomly selected. We used the mean taxa richness and the taxonomic proportions in the original sample raw counts to subsample the original sample and create the sample s rarified assemblage. This approach reduces the amount of variability that results when the rrarefy function is applied, and produces the assemblage that would occur most frequently with many iterations of rrarefy. A sample s raw counts are first sorted in descending order. The mean richness value generated by the rarefy function is then applied to the descending list of taxa. This identifies the taxa most likely to occur in the rarified sample. Examples of this approach for different raw sample counts are shown in Tables 2-4. In Table 2, the rarefy function indicates a mean richness value of 6 (rounding up from 5.2) for the rarefied sample. The 6 taxa with the highest raw counts become the rarefied assemblage. The proportion (percentage) of each taxon in the raw count is then used to calculate an expected taxon count in the rarified assemblage. It is not possible to count a fraction of an individual taxon (e.g., Table 2 Family A = individuals), therefore, each estimated count is rounded up to the nearest whole number in the rarified sample. Rounding up typically results in total counts at or above the prescribed count for the rarified sample (e.g., for a prescribed total count of 100). However, richness and diversity metrics for samples that had different raw counts are now comparable.
2 In some cases, the original sample s taxonomic list cannot be subsampled easily because several taxa near the mean richness cutoff have the same raw count. In Table 3 and Table 4, for example, the count of the last taxon included in the subset assemblage (the count just above the dashed line) has a count identical to that of one or more neighboring taxa. When identical raw counts occur, a random selection without replacement needs to be applied to the affected taxa to avoid bias when determining the mean taxa richness value. Without random sampling of these taxa with identical counts, the program will select the taxon that occurs when the mean richness value is reached (i.e., F in Tables 3 and 4). This creates an unnecessary and odd bias based on a taxon s Latin name - a result of taxa with identical counts being automatically alphabetized. Therefore, a random sample without replacement is applied to taxa with identical counts in order to reach the appropriate number of individuals necessary to meet the richness requirements of the rarefied assemblage. In Table 3, Taxon G was randomly selected to fill the final space in the rarefied assemblage. In Table 4, 3 taxa are needed to meet the prescribed richness value of 6. Random selection picked Taxa E, F, and H. Taxa with the same count values are usually singletons (represented by one individual) or doubletons (represented by two individuals). Therefore, the variability produced by the final, random selection step has minimal influence on most metrics. Richness metrics for specific taxon or taxa may be more influenced by this variability (e.g., EPT richness, Diptera richness, Collector richness). 1 Within the BIBI2 package, the user can specify the taxonomic level (e.g. family, genus, species, etc.) at which the rarefaction function samples the assemblage and can also select the rarefaction sample size (e.g. 100, 200, etc.). We use a standard number of 100 for the Chessie BIBI, and specify the family or genus taxonomic level depending on which metrics are being calculated.
3 Simplified Explanation 1. The taxa are sorted by raw count in descending order for each sampling event (Table 1). 2. The vegan package function rarefy is used to calculate the mean richness for a prescribed sample count (e.g., 100, 200, etc.). 3. The mean richness (rounded up to the nearest whole number) is used as a cutoff for identifying a sample s taxa with the greatest probability of being randomly selected for the prescribed richness value (e.g., Table 2). a. If the count of the final taxon above the threshold is equal to the count of one or more neighboring taxa in the list, a random selection without replacement occurs among these taxa to fill the remaining space(s) in the subset assemblage (Tables 3 and 4). 4. The counts from the new subset assemblage are then represented as a proportion of the new subset s total, and rounded up to the nearest whole number. 5. Richness and diversity metric are calculated on the rarefied assemblage.
4 Richness Sample Count Figure 1. The number of individuals identified (Sample Count) plotted against the number of families observed (Richness). All of the data points (n = 5,271) represent macroinvertebrate assemblages collected in streams with a kick-net in streams of Strahler order 4 between 1992 and The trend line depicts the positive relationship between the number of individuals identified and the richness observed.
5 Table 1. Example of three different rarefied assemblages produced with the function rrarefy from the vegan package for a prescribed total count of 100 organisms. Each rrarefy column represents an iteration of the rrarefy function on the sample s raw count. Richness and diversity metric values are different for each rrarefy iterations. The bottom row is the mean taxa richness found using the function rarefy for the prescribed 100 count subsample. Taxa Raw Counts rrarefy 1 rrarefy 2 rrarefy 3 A B C D E F G H Total Richness Mean Taxa Richness 5.2 (6) Table 2. Example of how the rarefy function from the vegan package is used to create the most probable taxonomic assemblage for a rarified sample. In this example, the rarefy function returns a mean richness value of 5.2 unique taxa for a count of 100 individuals. 5.2 is rounded up to 6 taxa and the six most abundant taxa become the taxa comprising the rarified sample ( Subset ). The percentage that each taxa represents in the Subset sample (n = 602) is then used to estimate taxa counts in the rarified sample (Rarified Count). Taxa counts will remain the same each time richness and diversity metrics are calculated on the rarified sample. Taxa Raw Count Subset Percentage Rarefied Count A B C D E F G 2 H 1 Total Richness Mean Taxa Richness 5.2 (6)
6 Table 3. Example of how the most probable taxonomic assemblage for a rarified sample is created when three taxa at the richness cutoff are equally abundant. The rarefy function from the vegan package returns a value of 5.3 unique taxa (rounded up to 6) in a count of 100 organisms, and the dashed line indicates where the assemblage should be subset. Since the program orders the taxa list first by raw count and then alphabetically, Taxa F will always be selected for the rarified sample. This creates an unnecessary bias because Taxa F, G, and H each have a raw count of one and should have an equal chance of being selected. To avoid this bias, the program randomly selects one of the three equally abundant taxa for the rarefied assemblage. In this example, it selected Taxa G. Taxa Raw Count Subset Percentage Rarefied Count A B C D E F 1 G H 1 Total Richness 5.3 (6) Table 4. Example of how the most probable taxonomic assemblage for a rarified sample is created when five taxa at the richness cutoff are equally abundant and three need to be selected to create the rarified assemblage. The rarefy function from the vegan package returns a value of 5.3 unique taxa (rounded up to 6) in a count of 100 organisms, and the dashed line indicates where the assemblage should be subset. The 4 th through 8 th taxa (D H) each have raw counts of one and should have equal probability of being selected. In this example, the program randomly selects three of the five taxa with raw counts of one to be included in the rarefied assemblage. If three taxa were not randomly selected, the program would favor the taxa that appear first in alphabetical order (i.e., Taxa D F) and create an unnecessary bias. Taxa Raw Count Subset Percentage Rarefied Count A B C D 1 E F G 1 H Total Richness 5.04 (6) 6 6 6
Week 3&4: Z tables and the Sampling Distribution of X
Week 3&4: Z tables and the Sampling Distribution of X 2 / 36 The Standard Normal Distribution, or Z Distribution, is the distribution of a random variable, Z N(0, 1 2 ). The distribution of any other normal
Ecology and Simpson s Diversity Index
ACTIVITY BRIEF Ecology and Simpson s Diversity Index The science at work Ecologists, such as those working for the Environmental Agency, are interested in species diversity. This is because diversity is
z-scores AND THE NORMAL CURVE MODEL
z-scores AND THE NORMAL CURVE MODEL 1 Understanding z-scores 2 z-scores A z-score is a location on the distribution. A z- score also automatically communicates the raw score s distance from the mean A
7.S.8 Interpret data to provide the basis for predictions and to establish
7 th Grade Probability Unit 7.S.8 Interpret data to provide the basis for predictions and to establish experimental probabilities. 7.S.10 Predict the outcome of experiment 7.S.11 Design and conduct an
STATISTICA Formula Guide: Logistic Regression. Table of Contents
: Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary
CALCULATIONS & STATISTICS
CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents
Sampling and Sampling Distributions
Sampling and Sampling Distributions Random Sampling A sample is a group of objects or readings taken from a population for counting or measurement. We shall distinguish between two kinds of populations
Module 2: Introduction to Quantitative Data Analysis
Module 2: Introduction to Quantitative Data Analysis Contents Antony Fielding 1 University of Birmingham & Centre for Multilevel Modelling Rebecca Pillinger Centre for Multilevel Modelling Introduction...
InfiniteInsight 6.5 sp4
End User Documentation Document Version: 1.0 2013-11-19 CUSTOMER InfiniteInsight 6.5 sp4 Toolkit User Guide Table of Contents Table of Contents About this Document 3 Common Steps 4 Selecting a Data Set...
Vulnerability Assessment of New England Streams: Developing a Monitoring Network to Detect Climate Change Effects
Vulnerability Assessment of New England Streams: Developing a Monitoring Network to Detect Climate Change Effects National Water Quality Monitoring Council 2012 Meeting Britta Bierwagen, National Center
Step-by-Step Guide to Bi-Parental Linkage Mapping WHITE PAPER
Step-by-Step Guide to Bi-Parental Linkage Mapping WHITE PAPER JMP Genomics Step-by-Step Guide to Bi-Parental Linkage Mapping Introduction JMP Genomics offers several tools for the creation of linkage maps
Shannon diversity. Site Description. Hill numbers. Simpson diversity H =
Vegetation Analysis ite Description lide 1 Vegetation Analysis ite Description lide 2 hannon diversity ite Description 1. Diversity indices 2. pecies abundance models 3. pecies area relationship H = p
Testing Research and Statistical Hypotheses
Testing Research and Statistical Hypotheses Introduction In the last lab we analyzed metric artifact attributes such as thickness or width/thickness ratio. Those were continuous variables, which as you
Example: Find the expected value of the random variable X. X 2 4 6 7 P(X) 0.3 0.2 0.1 0.4
MATH 110 Test Three Outline of Test Material EXPECTED VALUE (8.5) Super easy ones (when the PDF is already given to you as a table and all you need to do is multiply down the columns and add across) Example:
6.4 Normal Distribution
Contents 6.4 Normal Distribution....................... 381 6.4.1 Characteristics of the Normal Distribution....... 381 6.4.2 The Standardized Normal Distribution......... 385 6.4.3 Meaning of Areas under
In This Issue: Excel Sorting with Text and Numbers
In This Issue: Sorting with Text and Numbers Microsoft allows you to manipulate the data you have in your spreadsheet by using the sort and filter feature. Sorting is performed on a list that contains
APP INVENTOR. Test Review
APP INVENTOR Test Review Main Concepts App Inventor Lists Creating Random Numbers Variables Searching and Sorting Data Linear Search Binary Search Selection Sort Quick Sort Abstraction Modulus Division
W6.B.1. FAQs CS535 BIG DATA W6.B.3. 4. If the distance of the point is additionally less than the tight distance T 2, remove it from the original set
http://wwwcscolostateedu/~cs535 W6B W6B2 CS535 BIG DAA FAQs Please prepare for the last minute rush Store your output files safely Partial score will be given for the output from less than 50GB input Computer
Improving the Performance of Data Mining Models with Data Preparation Using SAS Enterprise Miner Ricardo Galante, SAS Institute Brasil, São Paulo, SP
Improving the Performance of Data Mining Models with Data Preparation Using SAS Enterprise Miner Ricardo Galante, SAS Institute Brasil, São Paulo, SP ABSTRACT In data mining modelling, data preparation
Density Curve. A density curve is the graph of a continuous probability distribution. It must satisfy the following properties:
Density Curve A density curve is the graph of a continuous probability distribution. It must satisfy the following properties: 1. The total area under the curve must equal 1. 2. Every point on the curve
1 J (Gr 6): Summarize and describe distributions.
MAT.07.PT.4.TRVLT.A.299 Sample Item ID: MAT.07.PT.4.TRVLT.A.299 Title: Travel Time to Work (TRVLT) Grade: 07 Primary Claim: Claim 4: Modeling and Data Analysis Students can analyze complex, real-world
MAP Reports. Teacher Report (by Goal Descriptors) Displays teachers class data for current testing term sorted by RIT score.
Goal Performance: These columns summarize the students performance in the goal strands tested in this subject. Data will display in these columns only if a student took a Survey w/ Goals test. Goal performance
Key Concept. Density Curve
MAT 155 Statistical Analysis Dr. Claude Moore Cape Fear Community College Chapter 6 Normal Probability Distributions 6 1 Review and Preview 6 2 The Standard Normal Distribution 6 3 Applications of Normal
Biology 300 Homework assignment #1 Solutions. Assignment:
Biology 300 Homework assignment #1 Solutions Assignment: Chapter 1, Problems 6, 15 Chapter 2, Problems 6, 8, 9, 12 Chapter 3, Problems 4, 6, 15 Chapter 4, Problem 16 Answers in bold. Chapter 1 6. Identify
ISEB Assessments Year 8 Level 2 Maths Test 1 Author: ISEB
ISEB Assessments Year 8 Level 2 Maths Test 1 Author: ISEB This test contains a selected set of 10 questions in a particular topic order. 100 marks are available in total. You should take no more than 1
Adverse Impact Ratio for Females (0/ 1) = 0 (5/ 17) = 0.2941 Adverse impact as defined by the 4/5ths rule was not found in the above data.
1 of 9 12/8/2014 12:57 PM (an On-Line Internet based application) Instructions: Please fill out the information into the form below. Once you have entered your data below, you may select the types of analysis
Def: The standard normal distribution is a normal probability distribution that has a mean of 0 and a standard deviation of 1.
Lecture 6: Chapter 6: Normal Probability Distributions A normal distribution is a continuous probability distribution for a random variable x. The graph of a normal distribution is called the normal curve.
MASCOT Search Results Interpretation
The Mascot protein identification program (Matrix Science, Ltd.) uses statistical methods to assess the validity of a match. MS/MS data is not ideal. That is, there are unassignable peaks (noise) and usually
3. Data Analysis, Statistics, and Probability
3. Data Analysis, Statistics, and Probability Data and probability sense provides students with tools to understand information and uncertainty. Students ask questions and gather and use data to answer
Model Selection. Introduction. Model Selection
Model Selection Introduction This user guide provides information about the Partek Model Selection tool. Topics covered include using a Down syndrome data set to demonstrate the usage of the Partek Model
Pure1 Manage User Guide
User Guide 11/2015 Contents Overview... 2 Pure1 Manage Navigation... 3 Pure1 Manage - Arrays Page... 5 Card View... 5 Expanded Card View... 7 List View... 10 Pure1 Manage Replication Page... 11 Pure1
Number boards for mini mental sessions
Number boards for mini mental sessions Feel free to edit the document as you wish and customise boards and questions to suit your learners levels Print and laminate for extra sturdiness. Ideal for working
NUMBER CORNER YEARLONG CONTENT OVERVIEW
August & September Workouts Calendar Grid Quilt Block Symmetries Identifying shapes and symmetries Calendar Collector Two Penny Toss Probability and data analysis Computational Fluency Mental Math Fluently
Estimating species richness
CHAPTER 4 Estimating species richness Nicholas J. Gotelli and Robert K. Colwell 4.1 Introduction Measuring species richness is an essential objective for many community ecologists and conservation biologists.
Reporting Manual. Prepared by. NUIT Support Center Northwestern University
Reporting Manual Prepared by NUIT Support Center Northwestern University Updated: February 2013 CONTENTS 1. Introduction... 1 2. Reporting... 1 2.1 Reporting Functionality... 1 2.2 Creating Reports...
Using Data Analytics to Detect Fraud
Using Data Analytics to Detect Fraud Fundamental Data Analysis Techniques 2016 Association of Certified Fraud Examiners, Inc. Discussion Question For each data analysis technique discussed in this section,
Expression. Variable Equation Polynomial Monomial Add. Area. Volume Surface Space Length Width. Probability. Chance Random Likely Possibility Odds
Isosceles Triangle Congruent Leg Side Expression Equation Polynomial Monomial Radical Square Root Check Times Itself Function Relation One Domain Range Area Volume Surface Space Length Width Quantitative
Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.
Statistical Learning: Chapter 4 Classification 4.1 Introduction Supervised learning with a categorical (Qualitative) response Notation: - Feature vector X, - qualitative response Y, taking values in C
Descriptive Statistics
Descriptive Statistics Suppose following data have been collected (heights of 99 five-year-old boys) 117.9 11.2 112.9 115.9 18. 14.6 17.1 117.9 111.8 16.3 111. 1.4 112.1 19.2 11. 15.4 99.4 11.1 13.3 16.9
Pulling a Random Sample from a MAXQDA Dataset
In this guide you will learn how to pull a random sample from a MAXQDA dataset, using the random cell function in Excel. In this process you will learn how to export and re-import variables from MAXQDA.
MATH 140 Lab 4: Probability and the Standard Normal Distribution
MATH 140 Lab 4: Probability and the Standard Normal Distribution Problem 1. Flipping a Coin Problem In this problem, we want to simualte the process of flipping a fair coin 1000 times. Note that the outcomes
CITATION METRICS WORKSHOP ANALYSIS & INTERPRETATION WEB OF SCIENCE Prepared by Bibliometric Team, NUS Libraries. April 2014.
CITATION METRICS WORKSHOP ANALYSIS & INTERPRETATION WEB OF SCIENCE Prepared by Bibliometric Team, NUS Libraries. April 2014. Analysis & Interpretation of Results using Web of Science Steps Technique Page
Chapter 7 Sampling (Reminder: Don t forget to utilize the concept maps and study questions as you study this and the other chapters.
Chapter 7 Sampling (Reminder: Don t forget to utilize the concept maps and study questions as you study this and the other chapters.) The purpose of Chapter 7 it to help you to learn about sampling in
Lesson 3: Calculating Conditional Probabilities and Evaluating Independence Using Two-Way Tables
Calculating Conditional Probabilities and Evaluating Independence Using Two-Way Tables Classwork Example 1 Students at Rufus King High School were discussing some of the challenges of finding space for
Dimension: Data Handling Module: Organization and Representation of data Unit: Construction and Interpretation of Simple Diagrams and Graphs
Topic: Stem and Leaf Diagrams S1 Topic 13 Level: Key Stage 3 Dimension: Data Handling Module: Organization and Representation of data Unit: Construction and Interpretation of Simple Diagrams and Graphs
A Percentile is a number in a SORTED LIST that has a given percentage of the data below it.
Section 3 3B: Percentiles A Percentile is a number in a SORTED LIST that has a given percentage of the data below it. If n represents the 70 th percentile then 70% of the data is less than n We use the
Evaluation & Validation: Credibility: Evaluating what has been learned
Evaluation & Validation: Credibility: Evaluating what has been learned How predictive is a learned model? How can we evaluate a model Test the model Statistical tests Considerations in evaluating a Model
III.1. Biodiversity: Concepts, Patterns, and Measurement Robert K. Colwell
III.1 Biodiversity: Concepts, Patterns, and Measurement Robert K. Colwell OUTLINE 1. What is biodiversity? 2. Relative abundance: Common species and rare ones 3. Measuring and estimating species richness
Shark Talent Management System Performance Reports
Shark Talent Management System Performance Reports Goals Reports Goal Details Report. Page 2 Goal Exception Report... Page 4 Goal Hierarchy Report. Page 6 Goal Progress Report.. Page 8 Goal Status Report...
Mining Social-Network Graphs
342 Chapter 10 Mining Social-Network Graphs There is much information to be gained by analyzing the large-scale data that is derived from social networks. The best-known example of a social network is
Accurately and Efficiently Measuring Individual Account Credit Risk On Existing Portfolios
Accurately and Efficiently Measuring Individual Account Credit Risk On Existing Portfolios By: Michael Banasiak & By: Daniel Tantum, Ph.D. What Are Statistical Based Behavior Scoring Models And How Are
The Research Proposal
Describes the: The Research Proposal Researchable question itself Why it's important (i.e., the rationale and significance of your research) Propositions that are known or assumed to be true (i.e., axioms
Predicting Successful Completion of the Nursing Program: An Analysis of Prerequisites and Demographic Variables
Predicting Successful Completion of the Nursing Program: An Analysis of Prerequisites and Demographic Variables Introduction In the summer of 2002, a research study commissioned by the Center for Student
Gaming the Law of Large Numbers
Gaming the Law of Large Numbers Thomas Hoffman and Bart Snapp July 3, 2012 Many of us view mathematics as a rich and wonderfully elaborate game. In turn, games can be used to illustrate mathematical ideas.
Assessment and Instruction: Two Sides of the Same Coin.
Assessment and Instruction: Two Sides of the Same Coin. Abstract: In this paper we describe our research and development efforts to integrate assessment and instruction in ways that provide immediate feedback
South Carolina College- and Career-Ready (SCCCR) Probability and Statistics
South Carolina College- and Career-Ready (SCCCR) Probability and Statistics South Carolina College- and Career-Ready Mathematical Process Standards The South Carolina College- and Career-Ready (SCCCR)
Lesson 3.1 Factors and Multiples of Whole Numbers Exercises (pages 140 141)
Lesson 3.1 Factors and Multiples of Whole Numbers Exercises (pages 140 141) A 3. Multiply each number by 1, 2, 3, 4, 5, and 6. a) 6 1 = 6 6 2 = 12 6 3 = 18 6 4 = 24 6 5 = 30 6 6 = 36 So, the first 6 multiples
RIFIS Ad Hoc Reports
RIFIS Ad Hoc Reports To retrieve the entire list of all Ad Hoc Reports, including the Base reports and any additional reports published to your Role, select Ad Hoc for the Type under Filter Report By and
Experimental Analysis
Experimental Analysis Instructors: If your institution does not have the Fish Farm computer simulation, contact the project directors for information on obtaining it free of charge. The ESA21 project team
Chapter 2 Exploratory Data Analysis
Chapter 2 Exploratory Data Analysis 2.1 Objectives Nowadays, most ecological research is done with hypothesis testing and modelling in mind. However, Exploratory Data Analysis (EDA), which uses visualization
Valor Christian High School Mrs. Bogar Biology Graphing Fun with a Paper Towel Lab
1 Valor Christian High School Mrs. Bogar Biology Graphing Fun with a Paper Towel Lab I m sure you ve wondered about the absorbency of paper towel brands as you ve quickly tried to mop up spilled soda from
Agility, Uncertainty, and Software Project Estimation Todd Little, Landmark Graphics
Agility, Uncertainty, and Software Project Estimation Todd Little, Landmark Graphics Summary Prior studies in software development project estimation have demonstrated large variations in the estimated
Stat 20: Intro to Probability and Statistics
Stat 20: Intro to Probability and Statistics Lecture 16: More Box Models Tessa L. Childers-Day UC Berkeley 22 July 2014 By the end of this lecture... You will be able to: Determine what we expect the sum
A Procedure for Classifying New Respondents into Existing Segments Using Maximum Difference Scaling
A Procedure for Classifying New Respondents into Existing Segments Using Maximum Difference Scaling Background Bryan Orme and Rich Johnson, Sawtooth Software March, 2009 Market segmentation is pervasive
5/31/2013. 6.1 Normal Distributions. Normal Distributions. Chapter 6. Distribution. The Normal Distribution. Outline. Objectives.
The Normal Distribution C H 6A P T E R The Normal Distribution Outline 6 1 6 2 Applications of the Normal Distribution 6 3 The Central Limit Theorem 6 4 The Normal Approximation to the Binomial Distribution
IBM SPSS Modeler Social Network Analysis 15 User Guide
IBM SPSS Modeler Social Network Analysis 15 User Guide Note: Before using this information and the product it supports, read the general information under Notices on p. 25. This edition applies to IBM
STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables
Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables Discrete vs. continuous random variables Examples of continuous distributions o Uniform o Exponential o Normal Recall: A random
Database Searching Tutorial/Exercises Jimmy Eng
Database Searching Tutorial/Exercises Jimmy Eng Use the PETUNIA interface to run a search and generate a pepxml file that is analyzed through the PepXML Viewer. This tutorial will walk you through the
Kaplan-Meier Survival Analysis 1
Version 4.0 Step-by-Step Examples Kaplan-Meier Survival Analysis 1 With some experiments, the outcome is a survival time, and you want to compare the survival of two or more groups. Survival curves show,
In mathematics, there are four attainment targets: using and applying mathematics; number and algebra; shape, space and measures, and handling data.
MATHEMATICS: THE LEVEL DESCRIPTIONS In mathematics, there are four attainment targets: using and applying mathematics; number and algebra; shape, space and measures, and handling data. Attainment target
Two-sample inference: Continuous data
Two-sample inference: Continuous data Patrick Breheny April 5 Patrick Breheny STA 580: Biostatistics I 1/32 Introduction Our next two lectures will deal with two-sample inference for continuous data As
This document contains Chapter 2: Statistics, Data Analysis, and Probability strand from the 2008 California High School Exit Examination (CAHSEE):
This document contains Chapter 2:, Data Analysis, and strand from the 28 California High School Exit Examination (CAHSEE): Mathematics Study Guide published by the California Department of Education. The
NORTH PACIFIC RESEARCH BOARD SEMIANNUAL PROGRESS REPORT
1. PROJECT INFORMATION NPRB Project Number: 1303 Title: Assessing benthic meiofaunal community structure in the Alaskan Arctic: A high-throughput DNA sequencing approach Subaward period July 1, 2013 Jun
Sampling. COUN 695 Experimental Design
Sampling COUN 695 Experimental Design Principles of Sampling Procedures are different for quantitative and qualitative research Sampling in quantitative research focuses on representativeness Sampling
We begin by presenting the current situation of women s representation in physics departments. Next, we present the results of simulations that
Report A publication of the AIP Statistical Research Center One Physics Ellipse College Park, MD 20740 301.209.3070 [email protected] July 2013 Number of Women in Physics Departments: A Simulation Analysis
Foundation of Quantitative Data Analysis
Foundation of Quantitative Data Analysis Part 1: Data manipulation and descriptive statistics with SPSS/Excel HSRS #10 - October 17, 2013 Reference : A. Aczel, Complete Business Statistics. Chapters 1
Environmental Remote Sensing GEOG 2021
Environmental Remote Sensing GEOG 2021 Lecture 4 Image classification 2 Purpose categorising data data abstraction / simplification data interpretation mapping for land cover mapping use land cover class
DRAFT. New York State Testing Program Grade 7 Common Core Mathematics Test. Released Questions with Annotations
DRAFT New York State Testing Program Grade 7 Common Core Mathematics Test Released Questions with Annotations August 2014 Developed and published under contract with the New York State Education Department
Analyzing and interpreting data Evaluation resources from Wilder Research
Wilder Research Analyzing and interpreting data Evaluation resources from Wilder Research Once data are collected, the next step is to analyze the data. A plan for analyzing your data should be developed
Protein Prospector and Ways of Calculating Expectation Values
Protein Prospector and Ways of Calculating Expectation Values 1/16 Aenoch J. Lynn; Robert J. Chalkley; Peter R. Baker; Mark R. Segal; and Alma L. Burlingame University of California, San Francisco, San
LESSON PLANS FOR PERCENTAGES, FRACTIONS, DECIMALS, AND ORDERING Lesson Purpose: The students will be able to:
LESSON PLANS FOR PERCENTAGES, FRACTIONS, DECIMALS, AND ORDERING Lesson Purpose: The students will be able to: 1. Change fractions to decimals. 2. Change decimals to fractions. 3. Change percents to decimals.
Chapter 4. Probability and Probability Distributions
Chapter 4. robability and robability Distributions Importance of Knowing robability To know whether a sample is not identical to the population from which it was selected, it is necessary to assess the
Knowledge Discovery and Data Mining
Knowledge Discovery and Data Mining Unit # 10 Sajjad Haider Fall 2012 1 Supervised Learning Process Data Collection/Preparation Data Cleaning Discretization Supervised/Unuspervised Identification of right
Statistical Data Mining. Practical Assignment 3 Discriminant Analysis and Decision Trees
Statistical Data Mining Practical Assignment 3 Discriminant Analysis and Decision Trees In this practical we discuss linear and quadratic discriminant analysis and tree-based classification techniques.
Prentice Hall Mathematics: Course 1 2008 Correlated to: Arizona Academic Standards for Mathematics (Grades 6)
PO 1. Express fractions as ratios, comparing two whole numbers (e.g., ¾ is equivalent to 3:4 and 3 to 4). Strand 1: Number Sense and Operations Every student should understand and use all concepts and
Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics
Descriptive statistics is the discipline of quantitatively describing the main features of a collection of data. Descriptive statistics are distinguished from inferential statistics (or inductive statistics),
FTS Real Time Client: Equity Portfolio Rebalancer
FTS Real Time Client: Equity Portfolio Rebalancer Many portfolio management exercises require rebalancing. Examples include Portfolio diversification and asset allocation Indexation Trading strategies
Pre-Algebra Lecture 6
Pre-Algebra Lecture 6 Today we will discuss Decimals and Percentages. Outline: 1. Decimals 2. Ordering Decimals 3. Rounding Decimals 4. Adding and subtracting Decimals 5. Multiplying and Dividing Decimals
A Simple Feature Extraction Technique of a Pattern By Hopfield Network
A Simple Feature Extraction Technique of a Pattern By Hopfield Network A.Nag!, S. Biswas *, D. Sarkar *, P.P. Sarkar *, B. Gupta **! Academy of Technology, Hoogly - 722 *USIC, University of Kalyani, Kalyani
Quick Reference ebook
This file is distributed FREE OF CHARGE by the publisher Quick Reference Handbooks and the author. Quick Reference ebook Click on Contents or Index in the left panel to locate a topic. The math facts listed
CLUSTER ANALYSIS FOR SEGMENTATION
CLUSTER ANALYSIS FOR SEGMENTATION Introduction We all understand that consumers are not all alike. This provides a challenge for the development and marketing of profitable products and services. Not every
