Distance sampling methods for estimating the density and/or abundance capture recapture methods spatially explicit capture recapture
|
|
|
- Bertram Shaw
- 10 years ago
- Views:
Transcription
1 Distance sampling Distance sampling is a widely used group of closely related methods for estimating the density and/or abundance of biological populations. The main methods are line-transect sampling and point-transect sampling (also called variable circular plot sampling). These have been used successfully in a very diverse array of taxa, including trees, shrubs and herbs, insects, amphibians, reptiles, birds, fish, and marine and land mammals. In both cases, the basic idea is the same. One or more observers perform a standardized survey along a randomly located set of lines or points, searching for objects of interest (usually animals or clusters of animals). For each object detected, they record the distance from the line or point to the object. Not all the objects will be detected, but a fundamental assumption of the basic methods is that all objects that are actually on the line or point are detected. Intuitively, one would expect that objects become harder to detect with increasing distance from the line or point, resulting in fewer detections with increasing distance. The key to distance sampling analyses is to fit a detection function to the observed distances, and use this fitted function to estimate the proportion of objects missed during the survey. From here, we can readily obtain point and interval estimates for the density and abundance of objects in the survey area. The basic methods (sometimes called standard or conventional distance sampling) are described in detail in Ref. 1, which is an updated version of Ref. 2. Various extensions and more advanced methods are considered in Ref. 3. Free software, Distance [4], provides for the design and analysis of distance sampling surveys, implementing the methods described in Ref. 1 and many of those in Ref. 3. Distance sampling is an extension of quadratbased sampling methods. Two forms of quadrat sampling are strip transects, in which one or more observers move along a line, counting all objects within a predetermined distance of the line, and point Based in part on the article Distance sampling by Len Thomas, Stephen T. Buckland, Kenneth P. Burnham, David R. Anderson, Jeffrey L. Laake, David L. Borches, and Samantha Strindberg, which appeared in the Encyclopedia of Environmetrics. counts, in which numbers of objects (usually birds or plants) in a circle about a point are counted. Population density is then estimated by dividing the total count by the total area surveyed. A fundamental assumption of these methods is that all objects within the strip or circle are counted. This assumption is difficult to meet for many populations, and cannot be tested using the survey data. Furthermore, especially for scarce species, the methods are wasteful because detections of objects beyond the strip or circle boundary are ignored. If the width of the strip or the radius of the circle is made sufficiently small to ensure that detection of any object within the surveyed area is almost certain, then a large proportion of detections are outside the surveyed area and so are ignored. Distance sampling extends quadrat-based methods by relaxing the assumption that all objects within the circle or strip are counted. By measuring distances to the objects that are observed, the probability of observing an object within the circle or strip can be estimated. Another approach to estimating wildlife abundance involves capture recapture methods. These are often more labor-intensive and more sensitive to failures of assumptions than distance sampling. However, they are applicable to some species that are not amenable to distance sampling methods, and can yield estimates of survival and recruitment rates, which distance sampling cannot do. Capture recapture methods based on remote camera traps have been used successfully for cryptic or elusive species that show individual natural markings; DNA-based methods using hair-snares may also be possible. Capture recapture can be efficient for populations that aggregate at some location each year, whereas distance sampling methods are more effective on dispersed populations. The two methods should therefore be seen as different tools for different purposes. We also note that various hybrid approaches take elements from both distance sampling and capture recapture these include spatially explicit capture recapture and mark recapture distance sampling (see the sections titled Related Methods and Standard Methods). In fish population estimation, catch per unit effort, catch-at-age, and catch-at-length are all commonly used to estimate abundance [5], as they require that the commercial catch is sampled, which is more cost-effective than sampling the living fish. Acoustic
2 2 Distance sampling surveys of fish schools can provide data amenable to distance sampling methods. Alternative methods for estimating animal abundance are reviewed and compared in Refs 6 8. First, the basic methods for line and point transects are considered, before briefly reviewing some related and extended methods. Survey Design As with any sampling exercise, obtaining reliable results from a distance sampling survey depends critically on good survey design. This relies on the fundamental sampling principles of replication and randomization. Systematic random placement of sampling units (lines or points) is preferable to a completely random design [1, 3], and stratification can be employed to reduce variance. The design process is facilitated by the software Distance, which has built-in (GIS) functionality and implements automated design algorithms [3, 9]. It can generate survey plans based on a range of point- and line-transect designs, as well as performing simulations to compare the efficiency of different designs and to investigate design properties such as probability of each location in the survey area being sampled (referred to as coverage probability). Line-transect Sampling In line-transect sampling, a series of (systematic) randomly located straight lines is traversed by one or more observers. This may be achieved in various ways, depending on the study species. In terrestrial studies, potential modes of movement include walking, horseback, all-terrain vehicle, airplane, and helicopter. Transect surveys in aquatic environments can be conducted by snorkeling or with SCUBA gear, from submarines, surface vessels, aircraft, or from sleds with mounted video units pulled underwater by a surface vessel. Passive acoustic surveys are gaining popularity for marine mammal species (and other taxa), in which case the observer is a towed hydrophone array. In the case of large observation platforms, there is typically a team of observers. Estimation Perpendicular distances x are determined from the line to each detected object of interest. In practice, (radial) detection distances r and detection angles θ are often recorded, from which perpendicular distances are calculated as x = r sin θ. Suppose k lines of lengths l 1,...,l k (with l j = L) are positioned according to some randomized scheme, and that animals further than some distance w from the line (the truncation distance) are not recorded. Then, the surveyed area is a = 2wL, within which n animals are detected at perpendicular distances x 1,...,x n.letp a be the probability that a randomly chosen animal within the surveyed area is detected, and suppose an estimate Pa ˆ is available. Animal density D is then estimated by n ˆD = (1) 2wLPˆ a To provide a framework for estimating P a, we define the detection function g(x) to be the probability that an object at distance x from the line is detected, 0 x w, and assume that g(0) = 1 (i.e., that we are certain to detect an animal on the transect line). If we plot the recorded perpendicular distances in a histogram, then conceptually the problem is to specify a suitable model for g(x) andtofitittothe perpendicular distance data. As shown in Figure 1, if we define μ = w 0 g(x)dx, thenp a = μ/w. The parameter μ is called the effective strip (half-) width; g(x) 1.0 m m 1.0 w Figure 1 The area μ under the detection function g(x), when expressed as a proportion of the area w of the rectangle, is the probability that an object within the surveyed area is detected; μ is also the effective strip width, and takes a value between 0 and w. (Source: Reproduced from Ref. 10. John Wiley & Sons, Ltd, 1998.) w x
3 Distance sampling 3 it is the distance from the line for which as many objects are detected beyond μ as are missed within μ (Figure 1). Thus, ˆD = n = apˆ a n 2wL ˆμ/w = n 2 ˆμL (2) We now need to estimate μ. We can turn this into a more familiar estimation problem by noting that the probability density function (pdf) of perpendicular distances to detected objects, denoted by f(x), isthe detection function g(x), rescaled so that it integrates to unity (see Frequency curves). That is, f(x)= g(x)/μ. In particular, because we assume g(0) = 1, it follows that f(0) = 1/μ (Figure 2). Hence, ˆD = n 2 ˆμL = n f(0) ˆ (3) 2L The problem is reduced to modeling the pdf of perpendicular distances, and evaluating the fitted function at x = 0. The large literature for fitting density functions is now available to us. The Distance software uses the methods of Ref. 1, in which a parametric key function is selected and, if it fails to provide an adequate fit, polynomial or cosine series adjustments are added until the fit is judged to be satisfactory by one or more criteria. Often, the perpendicular distances are recorded by distance category, so that each exact distance need not be measured, or data are grouped into distance categories before analysis. Standard likelihood methods for multinomial data are used to fit such grouped (or interval ) data [1]. Variance and Interval Estimation Thevarianceof ˆD is well approximated [1] using the delta method: [ var(n) var( ˆD) = ˆD 2 n 2 + var[ ] f(0)] ˆ (4) [ f(0)] ˆ 2 The variance of n generally is estimated from the sample variance in encounter rates, n j /l j. Examination of a range of possible estimators of this variance [11] showed that one based on weighting by line lengths squared performed best when lines are placed at random within the study area. This is the default in the Distance software (versions 6 and higher). The authors also proposed and evaluated estimators for the case of systematic random parallel line placement (a recommended design strategy); an updated estimator with even better performance has since been developed [12]. Given that f (0) is estimated by maximum likelihood, its variance can be estimated from the information matrix. If we assume that ˆD is lognormally distributed, approximately 95% confidence limits are given by ( ˆD/C, ˆDC) where f (x) with C = exp{1.96[ var(ln ˆD)] 0.5 } (5) [ var(ln ˆD) = ln 1 + var( ] ˆD) ˆD 2 (6) Freq m w x Figure 2 The pdf of perpendicular distances, f(x), plotted on a histogram of perpendicular distance frequencies (scaled so that the total area of histogram bars is unity). The area below the curve is unity by definition. Because the two shaded areas are equal in size, the area of the rectangle, μf (0), is also unity. Hence μ = 1/f (0). (Source: Reproduced from Ref. 10. John Wiley & Sons, Ltd, 1998.) Bootstrap resampling may also be used for variance and interval estimation. In this case, resamples are usually generated by sampling with replacement from the lines, so that independence between the lines is assumed, but independence between detections on the same line is not. If the model selection procedure for the detection function is applied independently to each resample, the bootstrap variance includes a component because of model selection uncertainty. Cluster Size Estimation Animals often occur in groups, which we term clusters. These may be flocks of birds, herds of ungulates,
4 4 Distance sampling pods of whales, and so on. If one animal in a cluster is detected, then it is assumed that the whole cluster is detected, and the distance to the center of the cluster is recorded. Equation (3) then gives an estimate of the density of clusters. To obtain the estimated density of individuals, we must multiply by an estimate of mean cluster size in the population, E(s): ˆD = n f(0)ê(s) ˆ (7) 2L Probability of detection is often a function of cluster size, so that the sample of detected cluster sizes exhibits size bias (larger clusters are easier to detect and so are overrepresented in the sample). In the absence of size bias, we can take Ê(s) = s, the mean size of detected clusters. Several methods exist for estimating E(s) in the presence of size bias [1] (see Size-biased sampling). One that works well in practice is to regress log(s) onĝ(x), the estimated probability of detection at distance x ignoring the effect of cluster size, and then predict log(s) when detection is certain, ĝ(x) = 1, as there can be no size bias in that circumstance. The prediction is backtransformed using a bias adjustment. Another option to account for size bias is to include cluster size as an additional covariate in the detection function (see the section titled Multiple Covariate Distance Sampling). Assumptions The physical setting for line-transect sampling is idealized as follows: 1. N objects are distributed through an area of size A according to some stochastic process with average rate parameter D = N/A. 2. Lines, placed according to some randomized design, are surveyed and a sample of n objects is detected. It is not necessary that the objects be randomly (i.e., Poisson) distributed. Rather, it is critical that the line or point be placed randomly with respect to the local distribution of objects. This ensures that objects in the surveyed strip are uniformly distributed with respect to distance from the line. Thus, if the strip has half-width w, object-to-line distances available for detection are uniformly distributed between zero and w. Random line placement also allows valid designbased extrapolation from the sample to the study area. There are three key assumptions of the basic method. Many extensions have been developed (see section titled Extensions to Standard Methods) with the aim of relaxing these assumptions. 1. Objects directly on the line are always detected, i.e., g(0) = 1. Missing objects on the line causes a corresponding underestimation of D. 2. Objects do not move. Conceptually, distance sampling is a snapshot method: we would like to freeze animals in position while we conduct the survey. In practice, nonresponsive movement is not problematic provided it is slow relative to the speed of the observer. Responsive movement before detection is, however, problematic [13, 14]. 3. Distances are measured accurately (for ungrouped distance data), or objects are correctly allocated to distance interval (for grouped data). Provided distance measurements are approximately unbiased, bias tends to be small in the presence of measurement errors [1, 15 17]. Biased measurements, if uncorrected, are problematic, and in cases when detection distances and angles are recorded, rounding of small angles to zero can also cause problems with estimation. A fourth assumption is made in many derivations of estimators and variances: whether an object is detected is independent of whether any other object is detected. Point estimates of D are robust to violations of the assumption of independence, and robust variance estimates are obtained by taking the line to be the sampling unit, either by bootstrapping on lines, or by calculating a weighted sample variance of encounter rates by line. It is also important that the detection function has a shoulder ; that is, the probability of detection remains at or close to one initially as distance from the line increases from zero. This is not an assumption, but a property that allows more reliable estimation of object density. Given the above, and assuming a suitably flexible method for estimating g(x), the point and interval estimates of D are extremely robust to variation in g(x) due to other factors such as observer, habitat, weather, and so on. This very useful property of standard distance sampling methods is known as pooling robustness [1, 3], and is not shared by capture recapture methods, which are intrinsically
5 Distance sampling 5 nonrobust to unmodeled variation in detectability [18]. Large variations in density over the study area are also not a problem although if areas of differing density can be defined in advance and then stratification of survey effort should be used to increase precision. g(r) Freq r Point-transect Sampling In point-transect sampling, an observer visits a number of points, the locations of which are determined by some (systematic) random design. The method is usually (but not exclusively) used for songbird populations, in which typically many species are recorded and most detections are aural. By recording from points, the observer can concentrate on detecting the objects of interest, without having to navigate along a line, and without having to negotiate a randomly positioned line through possibly difficult terrain. The principal disadvantages are that detections made while traveling from one point to the next are not utilized, a problem especially for scarce species, and the method is unsuited to species that are generally detected by flushing them, or to species that typically change their location appreciably over the time period of the count (see below). An increasing application is in passive acoustic density estimation of cetaceans (and other taxa) from fixed locations, where recording devices may be left at a sample of sites for long time periods. Estimation Detection distances r are measured from the point to each detected object. Suppose the design comprises k points, and distances less than or equal to w are recorded. Then the surveyed area is a = kπw 2, within which n objects are detected. As for linetransect sampling, denote the probability that an object within the surveyed area a is detected by P a with estimate Pˆ a. Then, we estimate the object density D by n ˆD = (8) kπw 2 Pˆ a We now define the detection function g(r) to be the probability that an object at distance r from the point is detected, and we again assume that g(0) = 1. For line transects, the area of an incremental strip at distance x from the lines is L dx, independent of x, which leads to the result that the pdf of distances (a) f(r) Freq (b) Figure 3 Histograms of detection distances from a point-transect survey. In (a), each histogram frequency has been scaled by dividing by the midpoint of the corresponding group interval. Also shown are the corresponding fits of the detection function [g(r) in (a)] and the pdf of detection distances [f(r) in (b)]. (Source: Reproduced from Ref. 10. John Wiley & Sons, Ltd, 1998.) differs from the detection function only in scale. By contrast, an incremental annulus at distance r from a point has area 2πr dr, proportional to r, sothat the pdf of detection distances is f(r)= 2πrg(r)/ν, where ν = 2π w 0 rg(r)dr. The respective shapes of the two functions g(r) and f(r) are illustrated in Figure 3. If we define an effective radius ρ, analogous to the effective strip width of line-transect sampling, then ν = πρ 2 is the effective area surveyed per point (Figure 4). Hence, ˆD = n n = apˆ a kπw 2 π ˆρ 2 /πw 2 = n (9) k ˆν The area of the triangle in Figure 4 is ρ 2 f (0)/2, where f (0) is the slope of f(r) at r = 0. Since this is equal to the area under f(r), which is unity, it w w r r
6 6 Distance sampling f(r) r w r Figure 4 The pdf of detection distances, f(r). The area under the curve is unity by definition. Because the two shaded areas are equal in size, the area of the triangle, ρ 2 f (0)/2, is also unity. Hence ν = πρ 2 = 2π/f (0). (Source: Reproduced from Ref. 10. John Wiley & Sons, Ltd, 1998.) follows that ν = πρ 2 = 2π/f (0), and ˆD = n f ˆ (0) 2πk We therefore need to model the pdf of detection distances, and evaluate the slope of the fitted function at r = 0. The software Distance does this using the same set of models for the detection function as for line-transect sampling. Variance and Interval Estimation The methods for variance and interval estimation for line-transect sampling also apply to point transects with minor modifications. The variance of n is usually estimated from the sample variance in encounter rates between points. However, point-transect surveys are sometimes designed by defining a series of lines, as if a line-transect survey is to be carried out, then locating a series of points along each line. If the distance between neighboring points on the same line is smaller than the distance between neighboring points on different lines, then the data for all points on the same line should be pooled and the variance of n estimated from the sample variance in encounter rates between lines, weighted by the number of points on each line. Similarly, in this situation, bootstrap variance estimates should be calculated by resampling lines with replacement, rather than individual points. Assumptions Assumptions are virtually unchanged from those for line-transect sampling. As there, the standard analyses are very robust to failure of the assumption of independent detections. Point-transect sampling is, however, more subject to bias than line-transect sampling when objects move through the area around a point. In principle, we try to obtain a snapshot, locating each object at the position it occupied at one instant in time. However, the count is not instantaneous, because the observer needs time to detect all objects close to that point. If, during that time, movement brings new objects into the neighborhood of the point, then object density will be overestimated. To minimize bias, we recommend that the amount of time spent at the point before and after the snapshot instant be fixed in advance, and be as small as possible, given the requirement to detect all objects close to the point. An alternative approach is to use a cue counting method (see the section titled Related Methods), where the object counted is an instantaneous cue such as a bird song. Point-transect methods are used to estimate the density of cues (i.e., bird songs per unit area and time), and this is converted to animal density by dividing by an estimate of cue rate, obtained separately. The relative merits of these and other methods for songbirds are considered in Ref. 19. Related Methods Trapping webs [1] and trapping line transects [3] provide an alternative to traditional capture recapture sampling for estimating animal density. In these, traps are placed along lines radiating from randomly chosen points (trapping webs) or with declining density either side of randomly chosen lines (trapping line transects), and the distances of trapped animals from the points or lines are used to estimate a distance sampling detection function. However, these methods suffer a number of potential biases [3], and have been largely superseded by spatially explicit capture recapture. Cue counting [20] was developed as an alternative to line-transect sampling for estimating whale abundance from sighting surveys. Observers on a ship or aircraft record all sighting cues within a sector ahead of the platform and their distance from the platform.
7 Distance sampling 7 The cue used depends on species, but might be the blow of a whale at the surface. The sighting distances are converted into the estimated number of cues per unit time per unit area using a point-transect modeling framework. The cue rate is estimated from separate studies, in which individual animals or clusters are monitored over a period of time. Cue counting has also been used in passive acoustic point-transect surveys of songbirds [19] and whales [21]. Indirect methods are often used when the animals are rare, cryptic, or tend to move away before being detected. Instead of counting the animals, the objects counted are something produced by the animals, for example, animal dung (e.g., deer [22] or elephant [23]) or nests (e.g., great apes [23, 24]). To convert object density to animal density, one must then estimate two further parameters: object production rate and object disappearance rate, from separate studies. Distance sampling can be viewed as a quadratbased method where detection within the sampled quadrats is not assumed certain, and auxiliary information, the distances, is gathered and used to estimate detection probability. In some cases, it may be better (or more feasible) to estimate detection probability from other sources. An example is a trapping point transect, where an auxiliary survey is used to determine the detection function for traps set at known distances from radiotracked small mammals; this function is then used to estimate the effective area surveyed by a grid of traps in a main survey that covers the study area [25]. Similar methods have been used to estimate a passive acoustic detection function for cetaceans, using a sample of animals at known locations [26, 27]. A related method, used when animals can be lured in, for example, using a playback, is the lure point transect [28]. At the extreme, models based on theoretical considerations could be used to predict detectability, as with Ref. 29 for passive acoustic density estimation of sperm whales; however, this is clearly a method of last resort. Related techniques sometimes used by botanists to estimate densities (and sometimes also termed distance sampling) are nearest-neighbor methods and point-to-nearest object methods [30]. These approaches do not involve modeling the detection function, and so are outside the definition of distance samplingusedhere. Extensions to Standard Methods The basic theory of distance sampling is now well established, as are the standard estimation and field methods [1]. More recent research (e.g., Ref. 3) has focused on methods for increasing precision and relaxing the assumptions of the standard methods, and on advanced design issues. Multiple Covariate Distance Sampling Generally, probability of detection is a function of many factors other than distance of the object from the line or point. We have considered briefly one other factor, cluster size, because if we do not allow for size bias in detection when objects occur in clusters, then our object density estimator may be biased. Other sources of heterogeneity contribute little to bias, provided g(0) = 1. Nevertheless, higher precision might be anticipated if additional covariates are recorded and their effects on g(x) modeled. One approach, first used by Ref. 31, is to allow covariates to affect the scale of the detection function but not its shape. Building on this work, extensions to thedetection function estimation methods outlined in the section on line-transect sampling above allow the scale parameter of the key function to be a function of covariates [3, 32]. See also Ref. 33 for an accessible guide to these extensions. Mark recapture Distance Sampling In some surveys, detection on the transect line is not certain (g(0)<1), either because observers miss potentially detectable animals because of environmental conditions, fatigue, and so on (perception bias), or because animals are unavailable for detection, for example, because they are underground or underwater (availability bias), or both. In the case of perception bias, capture recapture methods may be combined with distance sampling, through the use of two observation platforms [3, 34]. The platforms might be treated as mutually independent so that, provided that animals detected by both platforms (duplicate detections) can be identified, two-sample capture recapture methods that incorporate covariates can be used. Bias in such methods is typically large enough to be of concern unless heterogeneity in detectability is well modeled. However, it is seldom possible to record covariates that
8 8 Distance sampling reflect this heterogeneity adequately. For example, if a whale produces a blow that is particularly visible from one platform, due to light conditions or some other factor in the environment that is difficult to measure, then it will tend to be more visible from the other platform too, and abundance will be underestimated. These problems may be reduced by separating the areas of search for the two platforms, and using one to set up trials for the other. Various analytic approaches are also potentially available [35]. The resulting binary data may then be modeled using logistic regression [36]. In some studies, the platform that sets up the trials could be provided, for example, by a radiotagging study, where locations of animals are known, or by an underwater acoustic array (so long as species could be identified accurately). In double-platform methods, Horvitz Thompson-like estimators are used to estimate density, given the estimated probability of detection for each observation (see Sampling, environmental). All of the above work has taken place in the context of line transects, but similar methods for point transects are also available [37]. When animals are not available for detection by either platform, the above methods cannot correct for all the bias. Separate estimates of availability can be included in the analysis; alternatively the availability process can be modeled (e.g., Refs 38, 39). Density Surface Modeling The standard framework invokes properties of the sampling design to scale up from density in the surveyed transects to density in the study area. However, it may be advantageous to use modelbased methods for this stage instead, particularly methods where spatially indexed covariates are used and hence a spatial density surface model is fitted. First, a model-based approach allows data collected from nonrandom surveys (platforms of opportunity) to be used; second, animal density may be related to habitat and environmental variables, potentially increasing precision and improving understanding of factors affecting abundance; third, abundance may be estimated for any subregion of interest, by integrating under the fitted spatial density surface; and last, for study areas that are small relative to the width of the transects, there may be information gain by jointly modeling detection probability and spatial variation in density. One approach [40] is to conceptualize the distribution of animals as an inhomogeneous Poisson process, in which the detection function represents a thinning process. Joint modeling of detectability and density is possible (e.g., Refs 41 43), but in many cases the study area is large compared with the transect width, and there seems little loss in modeling detectability first using the observed distances, and then density variation over the study area conditional on the estimated detection function [3, 40]. Issues exist such as adequately modeling local small-scale density hotspots [44] and dealing with complex spatial topography [45]. Bayesian approaches to density surface modeling have been implemented [46, 47]. Other Extensions Adaptive sampling [48] (see Adaptive designs) offers a means of increasing sample size, and hence increasing precision, by concentrating survey effort where most observations occur. Standard adaptive sampling methods can readily be extended to distance sampling surveys [48]. A major practical problem of adaptive sampling is that the required survey effort is not known in advance. Methods have been developed for line-transect sampling to allow a fixed total effort, using a variable adaptation rule [49]. An experimental trial on a survey of harbor porpoise in the Gulf of Maine yielded substantially more detections and somewhat better precision than did conventional linetransect sampling [49]. Distance sampling surveys are sometimes used as part of an experimental design, for example, to compare density between control and treatment plots. Methods have been developed for such situations, which allow for the dependence between sample units when parameters of the detection function are shared between units [50]. Nonrandom placement of samplers requires the use of density surface models for encounter rate, covered above. However, another potential consequence of not using random locations is that the density of objects with respect to distance from the line or point is not necessarily the standard form, and hence, the distribution of observed distances cannot be assumed to tell us directly about detectability. Other auxiliary sources of information about nonstandard density gradients can, however, be incorporated, as demonstrated by Refs. 15, 51 and 52.
9 Distance sampling 9 Another assumption of the standard methods is that distances are measured without error. However, if there are errors, but information is available about their form, this can potentially be incorporated in estimation [15, 16, 53]. Although the standard methods are firmly established and perform well in many circumstances, there are still situations where neither they, nor the extensions above, are satisfactory. There is much research underway, and much left to be done, and hence the subject of distance sampling is still a lively one for statistics and ecology. References [1] Buckland, S.T., Anderson, D.R., Burnham, K.P., Laake, J.L., Borchers, D.L., & Thomas, L. (2001). Introduction to Distance Sampling, Oxford University Press, Oxford. [2] Buckland, S.T., Anderson, D.R., Burnham, K.P., & Laake, J.L. (1993). Distance Sampling: Estimating Abundance of Biological Populations, Chapman & Hall, London, reprinted (1999) by Research Unit for Wildlife Population Assessment, St Andrews. [3] Buckland, S.T., Anderson, D.R., Burnham, K.P., Laake, J.L., Borchers, D.L., & Thomas, L., eds (2004). Advanced Distance Sampling, Oxford University Press. [4] Thomas, L., Buckland, S.T., Rexstad, E.A., Laake, J.L., Strindberg, S., Hedley, S.L., Bishop, J.R.B., Marques, T.A., & Burnham, K.P. (2010). Distance software: design and analysis of distance sampling surveys for estimating population size, Journal of Applied Ecology 47, [5] Hilborn, R. & Walters, C.J. (1992). Quantitative Fisheries Stock Assessment: Choice, Dynamics and Uncertainty, Chapman & Hall, New York. [6] Seber, G.A.F. (1982). The Estimation of Animal Abundance and Related Parameters, 2nd Edition, Macmillan, New York. [7] Schwarz, C.J.& Seber, G.A.F. (1999). A review of estimating animal abundance II, Statistical Science 14, [8] Borchers, D.L., Buckland, S.T., & Zucchini, W. (2002). Estimating Animal Abundance: Closed Populations, Springer-Verlag, London, p [9] Strindberg, S. & Buckland, S.T. (2004). Zigzag survey designs in line transect sampling, Journal of Agricultural, Biological and Environmental Statistics 9, [10] Buckland, S.T., Anderson, D.R., Burnham, K.P., & Laake, J.L. (1998). Distance sampling, in Encyclopedia of Biostatistics, P. Armitage & T. Colton, eds, John Wiley & Sons, Ltd, Chichester. [11] Fewster, R.M., Buckland, S.T., Burnham, K.P., Borchers, D.L., Jupp, P.E., Laake, J.L., & Thomas, L. (2009). Estimating the encounter rate variance in distance sampling, Biometrics 65, [12] Fewster, R.M. (2011). Variance estimation for systematic designs in spatial surveys, Biometrics 67, [13] Buckland, S.T. (1992). Fitting density functions using polynomials, Applied Statistics 41, [14] Fewster, R.M., Southwell, C., Borchers, D.L., Buckland, S.T., & Pople, A.R. (2008). The influence of animal mobility on the assumption of uniform distances in aerial line transect surveys, Wildlife Research 35, [15] Marques, T.A. (2007). Incorporating measurement error and density gradients in distance sampling surveys, PhD thesis, University of St Andrews. [16] Borchers, D.L., Marques, T.A., Gunnlaugsson, T., & Jupp, P.E. (2010). Estimating distance sampling detection functions when distances are measured with errors, Journal of Agricultural, Biological, and Environmental Statistics 15, [17] Borchers, D.L., Pike, D., Gunlaugsson, T., & Vikingson, G. (2009). Minke whale abundance estimation from the NASS 1987 and 2001 cue counting surveys taking account of distance estimation errors, Vol. 7, North Atlantic Sightings Surveys ( ), North Atlantic Marine Mammal Commission Publications, pp [18] Link, W.A. (2003). Nonidentifiability of population size from capture recapture data with heterogeneous detection probabilities, Biometrics 59, [19] Buckland, S.T. (2006). Point transect surveys for songbirds: robust methodologies, The Auk 123, [20] Hiby, A.R. (1985). An approach to estimating population densities of great whales from sighting surveys, IMA Journal of Mathematics Applied in Medicine and Biology 2, [21] Marques, T.A., Thomas, L., Munger, L., Wiggins, S., & Hildebrand, J.A. (2011). Estimating North Pacific right whale (Eubalaena japonica) density using passive acoustic cue counting, Endangered Species Research 13, [22] Marques, F.F.C., Buckland, S.T., Goffin, D., Dixon, C.E., Borchers, D.L., Mayle, B.A., & Peace, A.J. (2001). Estimating deer abundance from line transect surveys of dung: sika deer in southern Scotland, Journal of Applied Ecology 38, [23] Stokes, E.J., Strindberg, S., Bakabana, P.C., Elkan, P.W., Iyenguet, F.C., Madzoké, B., Malanda, G.A., Mowawa, B.S., Moukoumbou, C., Ouakabadio, B.S., & Rainey, H.J. (2010). Monitoring great ape and elephant abundance at large spatial scales: measuring effectiveness of a conservation landscape, PLoS ONE 5, e DOI: /journal.pone [24] Plumptre, A.J. (2000). Monitoring mammal populations with line transect techniques in African forests, Journal of Applied Ecology 37, [25] Potts J.M. (2011). Estimating abundance of rare, small mammals: a case study of the Key Largo woodrat (Neotoma floridana small), PhD thesis, University of St Andrews.
10 10 Distance sampling [26] Marques, T.A., Thomas, L., Ward, J., DiMarzio, N., & Tyack, P.L. (2009). Estimating cetacean population density using fixed passive acoustic sensors: an example with beaked whales, Journal of the Acoustical Society of America 125, [27] Kyhn L.A., Tougaard, J., Thomas, L., Duve, L.R., Steinback, J., Amundin, M., Desportes, G., & Teilmann, J. (2012). From echolocation clicks to animal density acoustic sampling of harbour porpoises with static dataloggers, Journal of the Acoustical Society of America 131, [28] Buckland, S.T., Summers, R.W., Borchers, D.L., & Thomas, L. (2006). Point transect sampling with traps or lures, Journal of Applied Ecology 43, [29] Küsel, E.T., Mellinger, D.K., Thomas, L., Marques, T.A., Moretti, D.J., & Ward, J. (2011). Cetacean population density from single fixed sensors using passive acoustics, Journal of the Acoustical Society of America 129, [30] Diggle, P.J. (1983). Statistical Analysis of Spatial Point Patterns, Academic Press, London. [31] Ramsey, F.L., Wildman, V., & Engbirng, J. (1987). Covariate adjustments to effective area in variable-area wildlife surveys, Biometrics 43, [32] Marques, F.F.C. & Buckland, S.T. (2003). Incorporating covariates into standard line transect analysis, Biometrics 59, [33] Marques, T.A., Thomas, L., Fancy, S.G., & Buckland, S.T. (2007). Improving estimates of bird density using multiple covariate distance sampling, The Auk 127, [34] Borchers, D.L., Zucchini, W., & Fewster, R.M. (1998). Mark-recapture models for line transect surveys, Biometrics 54, [35] Buckland, S.T., Laake, J.L., & Borchers, D.L. (2010). Double-observer line transect methods: levels of independence, Biometrics 66, [36] Borchers, D.L., Buckland, S.T., Goedhart, P.W., Clarke, E.D., & Hedley, S.L. (1998). Horvitz Thompson estimators for double-platform line transect surveys, Biometrics 54, [37] Laake, J.L., Collier, B.A., Morrison, M.L., & Wilkins, R.N. (2011). Point-based mark recapture distance sampling, Journal of Agricultural, Biological and Environmental Statistics 16, [38] Schweder, T., Skaug, H.J., Langaas, M., & Dimakos, X. (1999). Simulated likelihood methods for complex double-platform line transect surveys, Biometrics 55, [39] Kleppe, T., Skaug, H.J., & Okamura, H. (2010). Asymptotic bias of the hazard probability model under model mis-specification, Journal of Cetacean Research and Management 11(3), [40] Hedley, S.L. & Buckland, S.T. (2004). Spatial models for line transect sampling. Journal of Agricultural, Biological and Environmental Statistics 9, [41] Johnson, D.S., Laake, J.L., & Ver Hoef, J.M. (2010). A model based approach for making ecological inference from distance sampling data, Biometrics 66, [42] Buckland, S.T., Borchers, D.L., Johnston, A., Henrys, P.A., & Marques, T.A. (2007). Line transect methods for plant surveys, Biometrics 63, [43] Royle, J.A., Dawson, D.K., & Bates, S. (2004). Modelling abundance effects in distance sampling, Ecology 85, [44] Skaug, H.J. (2006). Markov modulated Poisson processes for clustered line transect data, Environmental and Ecological Statistics 13, [45] Wood, S.N., Bravington, M.V., & Hedley, S.L. (2008). Soap film smoothing, Journal of the Royal Statistical Society, Series B 70, [46] Niemi, A. & Fernandez, C. (2010). Bayesian spatial point process modelling of line transect survey data, Journal of Agricultural, Biological and Environmental Statistics 15, [47] Royle, J.A. & Dorazio, R.M. (2008). Hierarchical Inference and Modelling in Ecology, Academic Press. [48] Thompson, S.K. & Seber, G.A.F. (1996). Adaptive Sampling, John Wiley & Sons, Inc., New York. [49] Pollard, J.H., Palka, D., & Buckland, S.T. (2002). Adaptive line transect sampling, Biometrics 58, [50] Buckland, S.T., Russell, R.E., Dickson, B.G., Saab, V.A., Gorman, D.G., & Block, W.M. (2009). Analysing designed experiments in distance sampling, Journal of Agricultural, Biological and Environmental Statistics 14, [51] Marques, T.A., Buckland, S.T., Borchers, D.L., Tosh, D., & McDonald, R.A. (2010). Point transect sampling along linear features, Biometrics 66, [52] Cox, M.J., Borchers, D.L., Demer, D.A., Cutter, G.R., & Brierley, A.S. (2011). Estimating the density of Antarctic krill (Euphausia superba) from multi-beam echo-sounder observations using distance sampling methods, Journal of the Royal Statistical Society: Series C (Applied Statistics) 60, [53] Marques, T.A. (2004). Predicting and correcting bias caused by measurement error in line transect sampling using multiplicative error models, Biometrics 60, (See also Abundance: population size and density estimation; Line-transect sampling, new approaches; Capture recapture methodology; Capture recapture models, spatially explicit) LEN THOMAS, STEPHEN T. BUCKLAND, KENNETH P. BURNHAM, DAVID R. ANDERSON, JEFFREY L. LAAKE, DAVID L. BORCHERS, & SAMANTHA STRINDBERG
Avian Management Indicator Species on Cimarron and Comanche National Grasslands
Avian Management Indicator Species on Cimarron and Comanche National Grasslands September 2008 In Cooperation With: Rocky Mountain Bird Observatory PO Box 1232 Brighton, CO 80601-1232 303.659.4348 www.rmbo.org
POINT TRANSECT SURVEYS FOR SONGBIRDS: ROBUST METHODOLOGIES
Running head: Point Transect Surveys for Songbirds POINT TRANSECT SURVEYS FOR SONGBIRDS: ROBUST METHODOLOGIES STEPHEN T. BUCKLAND Centre for Research into Ecological and Environmental Modelling, The Observatory,
Least Squares Estimation
Least Squares Estimation SARA A VAN DE GEER Volume 2, pp 1041 1045 in Encyclopedia of Statistics in Behavioral Science ISBN-13: 978-0-470-86080-9 ISBN-10: 0-470-86080-4 Editors Brian S Everitt & David
Avian surveys and monitoring
85 Chapter 6 Avian surveys and monitoring A more complete understanding of the role that wild birds play in the ecology of wildlife diseases require baseline studies of those species likely to host, transmit
A Method of Population Estimation: Mark & Recapture
Biology 103 A Method of Population Estimation: Mark & Recapture Objectives: 1. Learn one method used by wildlife biologists to estimate population size of wild animals. 2. Learn how sampling size effects
Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus
Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus Tihomir Asparouhov and Bengt Muthén Mplus Web Notes: No. 15 Version 8, August 5, 2014 1 Abstract This paper discusses alternatives
POPULATION DYNAMICS. Zoo 511 Ecology of Fishes
POPULATION DYNAMICS Zoo 511 Ecology of Fishes Today s goals Understand why and how population dynamics are important in fisheries ecology Gain experience in a variety of mark-recapture methods What are
Handling attrition and non-response in longitudinal data
Longitudinal and Life Course Studies 2009 Volume 1 Issue 1 Pp 63-72 Handling attrition and non-response in longitudinal data Harvey Goldstein University of Bristol Correspondence. Professor H. Goldstein
SAS Software to Fit the Generalized Linear Model
SAS Software to Fit the Generalized Linear Model Gordon Johnston, SAS Institute Inc., Cary, NC Abstract In recent years, the class of generalized linear models has gained popularity as a statistical modeling
Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.
Course Catalog In order to be assured that all prerequisites are met, students must acquire a permission number from the education coordinator prior to enrolling in any Biostatistics course. Courses are
Algebra 1 Course Information
Course Information Course Description: Students will study patterns, relations, and functions, and focus on the use of mathematical models to understand and analyze quantitative relationships. Through
Quantitative Inventory Uncertainty
Quantitative Inventory Uncertainty It is a requirement in the Product Standard and a recommendation in the Value Chain (Scope 3) Standard that companies perform and report qualitative uncertainty. This
Statistics Graduate Courses
Statistics Graduate Courses STAT 7002--Topics in Statistics-Biological/Physical/Mathematics (cr.arr.).organized study of selected topics. Subjects and earnable credit may vary from semester to semester.
Multivariate Normal Distribution
Multivariate Normal Distribution Lecture 4 July 21, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Lecture #4-7/21/2011 Slide 1 of 41 Last Time Matrices and vectors Eigenvalues
Introduction to Regression and Data Analysis
Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it
Exploratory Spatial Data Analysis
Exploratory Spatial Data Analysis Part II Dynamically Linked Views 1 Contents Introduction: why to use non-cartographic data displays Display linking by object highlighting Dynamic Query Object classification
Missing data and net survival analysis Bernard Rachet
Workshop on Flexible Models for Longitudinal and Survival Data with Applications in Biostatistics Warwick, 27-29 July 2015 Missing data and net survival analysis Bernard Rachet General context Population-based,
Machine Learning and Pattern Recognition Logistic Regression
Machine Learning and Pattern Recognition Logistic Regression Course Lecturer:Amos J Storkey Institute for Adaptive and Neural Computation School of Informatics University of Edinburgh Crichton Street,
Fixed-Effect Versus Random-Effects Models
CHAPTER 13 Fixed-Effect Versus Random-Effects Models Introduction Definition of a summary effect Estimating the summary effect Extreme effect size in a large study or a small study Confidence interval
Why Count Birds? (cont.)
AVIAN CENSUS TECHNIQUES: Why count birds? Descriptive Studies = asks what types of birds occur in a particular habitat? - Provides gross overview of bird occurrence and perhaps a crude estimate of abundance
Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012
Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization GENOME 560, Spring 2012 Data are interesting because they help us understand the world Genomics: Massive Amounts
STATISTICA Formula Guide: Logistic Regression. Table of Contents
: Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary
VEHICLE TRACKING USING ACOUSTIC AND VIDEO SENSORS
VEHICLE TRACKING USING ACOUSTIC AND VIDEO SENSORS Aswin C Sankaranayanan, Qinfen Zheng, Rama Chellappa University of Maryland College Park, MD - 277 {aswch, qinfen, rama}@cfar.umd.edu Volkan Cevher, James
Multivariate Analysis of Ecological Data
Multivariate Analysis of Ecological Data MICHAEL GREENACRE Professor of Statistics at the Pompeu Fabra University in Barcelona, Spain RAUL PRIMICERIO Associate Professor of Ecology, Evolutionary Biology
Algebra 1 2008. Academic Content Standards Grade Eight and Grade Nine Ohio. Grade Eight. Number, Number Sense and Operations Standard
Academic Content Standards Grade Eight and Grade Nine Ohio Algebra 1 2008 Grade Eight STANDARDS Number, Number Sense and Operations Standard Number and Number Systems 1. Use scientific notation to express
11. Time series and dynamic linear models
11. Time series and dynamic linear models Objective To introduce the Bayesian approach to the modeling and forecasting of time series. Recommended reading West, M. and Harrison, J. (1997). models, (2 nd
Approaches for Analyzing Survey Data: a Discussion
Approaches for Analyzing Survey Data: a Discussion David Binder 1, Georgia Roberts 1 Statistics Canada 1 Abstract In recent years, an increasing number of researchers have been able to access survey microdata
X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)
CORRELATION AND REGRESSION / 47 CHAPTER EIGHT CORRELATION AND REGRESSION Correlation and regression are statistical methods that are commonly used in the medical literature to compare two or more variables.
Big Ideas in Mathematics
Big Ideas in Mathematics which are important to all mathematics learning. (Adapted from the NCTM Curriculum Focal Points, 2006) The Mathematics Big Ideas are organized using the PA Mathematics Standards
Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur
Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur Module No. #01 Lecture No. #15 Special Distributions-VI Today, I am going to introduce
Comparison of Estimation Methods for Complex Survey Data Analysis
Comparison of Estimation Methods for Complex Survey Data Analysis Tihomir Asparouhov 1 Muthen & Muthen Bengt Muthen 2 UCLA 1 Tihomir Asparouhov, Muthen & Muthen, 3463 Stoner Ave. Los Angeles, CA 90066.
How To Understand The Theory Of Probability
Graduate Programs in Statistics Course Titles STAT 100 CALCULUS AND MATR IX ALGEBRA FOR STATISTICS. Differential and integral calculus; infinite series; matrix algebra STAT 195 INTRODUCTION TO MATHEMATICAL
Experiment #1, Analyze Data using Excel, Calculator and Graphs.
Physics 182 - Fall 2014 - Experiment #1 1 Experiment #1, Analyze Data using Excel, Calculator and Graphs. 1 Purpose (5 Points, Including Title. Points apply to your lab report.) Before we start measuring
Chapter 11 Introduction to Survey Sampling and Analysis Procedures
Chapter 11 Introduction to Survey Sampling and Analysis Procedures Chapter Table of Contents OVERVIEW...149 SurveySampling...150 SurveyDataAnalysis...151 DESIGN INFORMATION FOR SURVEY PROCEDURES...152
Multiple Imputation for Missing Data: A Cautionary Tale
Multiple Imputation for Missing Data: A Cautionary Tale Paul D. Allison University of Pennsylvania Address correspondence to Paul D. Allison, Sociology Department, University of Pennsylvania, 3718 Locust
User's Guide. Distance 6.0 Beta 5. Research Unit for Wildlife Population Assessment
User's Guide Distance 6.0 Beta 5 Research Unit for Wildlife Population Assessment Contents Chapter 1 - Introduction 1 Welcome...1 Conventions in This Documentation...2 Where to Go Next...2 Citation for
Statistical Rules of Thumb
Statistical Rules of Thumb Second Edition Gerald van Belle University of Washington Department of Biostatistics and Department of Environmental and Occupational Health Sciences Seattle, WA WILEY AJOHN
STA 4273H: Statistical Machine Learning
STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! [email protected]! http://www.cs.toronto.edu/~rsalakhu/ Lecture 6 Three Approaches to Classification Construct
VISUALIZATION OF DENSITY FUNCTIONS WITH GEOGEBRA
VISUALIZATION OF DENSITY FUNCTIONS WITH GEOGEBRA Csilla Csendes University of Miskolc, Hungary Department of Applied Mathematics ICAM 2010 Probability density functions A random variable X has density
CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS
Examples: Regression And Path Analysis CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS Regression analysis with univariate or multivariate dependent variables is a standard procedure for modeling relationships
GEOENGINE MSc in Geomatics Engineering (Master Thesis) Anamelechi, Falasy Ebere
Master s Thesis: ANAMELECHI, FALASY EBERE Analysis of a Raster DEM Creation for a Farm Management Information System based on GNSS and Total Station Coordinates Duration of the Thesis: 6 Months Completion
Chapter 3 RANDOM VARIATE GENERATION
Chapter 3 RANDOM VARIATE GENERATION In order to do a Monte Carlo simulation either by hand or by computer, techniques must be developed for generating values of random variables having known distributions.
CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA
Examples: Multilevel Modeling With Complex Survey Data CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA Complex survey data refers to data obtained by stratification, cluster sampling and/or
The primary goal of this thesis was to understand how the spatial dependence of
5 General discussion 5.1 Introduction The primary goal of this thesis was to understand how the spatial dependence of consumer attitudes can be modeled, what additional benefits the recovering of spatial
Removal fishing to estimate catch probability: preliminary data analysis
Removal fishing to estimate catch probability: preliminary data analysis Raymond A. Webster Abstract This project examined whether or not removal sampling was a useful technique for estimating catch probability
Tail-Dependence an Essential Factor for Correctly Measuring the Benefits of Diversification
Tail-Dependence an Essential Factor for Correctly Measuring the Benefits of Diversification Presented by Work done with Roland Bürgi and Roger Iles New Views on Extreme Events: Coupled Networks, Dragon
Models for Product Demand Forecasting with the Use of Judgmental Adjustments to Statistical Forecasts
Page 1 of 20 ISF 2008 Models for Product Demand Forecasting with the Use of Judgmental Adjustments to Statistical Forecasts Andrey Davydenko, Professor Robert Fildes [email protected] Lancaster
A Basic Introduction to Missing Data
John Fox Sociology 740 Winter 2014 Outline Why Missing Data Arise Why Missing Data Arise Global or unit non-response. In a survey, certain respondents may be unreachable or may refuse to participate. Item
Estimating Asian Elephant Population in Dindugul, Kodaikanal and Theni Forest Divisions, Western Ghats Tamil Nadu
Estimating Asian Elephant Population in Dindugul, Kodaikanal and Theni Forest Divisions, Western Ghats Tamil Nadu A. Kumaraguru 1, K. Karunanithi 2, S. Asokan 2 and N. Baskaran 3 1 CCMB, Mayiladuthurai,
Instrumentation for Monitoring around Marine Renewable Energy Devices
Instrumentation for Monitoring around Marine Renewable Energy Devices 1 Introduction As marine renewable energy has developed, a set of consistent challenges has emerged following attempts to understand
Comparison of resampling method applied to censored data
International Journal of Advanced Statistics and Probability, 2 (2) (2014) 48-55 c Science Publishing Corporation www.sciencepubco.com/index.php/ijasp doi: 10.14419/ijasp.v2i2.2291 Research Paper Comparison
Exam C, Fall 2006 PRELIMINARY ANSWER KEY
Exam C, Fall 2006 PRELIMINARY ANSWER KEY Question # Answer Question # Answer 1 E 19 B 2 D 20 D 3 B 21 A 4 C 22 A 5 A 23 E 6 D 24 E 7 B 25 D 8 C 26 A 9 E 27 C 10 D 28 C 11 E 29 C 12 B 30 B 13 C 31 C 14
UNDERGRADUATE DEGREE DETAILS : BACHELOR OF SCIENCE WITH
QATAR UNIVERSITY COLLEGE OF ARTS & SCIENCES Department of Mathematics, Statistics, & Physics UNDERGRADUATE DEGREE DETAILS : Program Requirements and Descriptions BACHELOR OF SCIENCE WITH A MAJOR IN STATISTICS
**BEGINNING OF EXAMINATION** The annual number of claims for an insured has probability function: , 0 < q < 1.
**BEGINNING OF EXAMINATION** 1. You are given: (i) The annual number of claims for an insured has probability function: 3 p x q q x x ( ) = ( 1 ) 3 x, x = 0,1,, 3 (ii) The prior density is π ( q) = q,
MISSING DATA TECHNIQUES WITH SAS. IDRE Statistical Consulting Group
MISSING DATA TECHNIQUES WITH SAS IDRE Statistical Consulting Group ROAD MAP FOR TODAY To discuss: 1. Commonly used techniques for handling missing data, focusing on multiple imputation 2. Issues that could
Teaching Multivariate Analysis to Business-Major Students
Teaching Multivariate Analysis to Business-Major Students Wing-Keung Wong and Teck-Wong Soon - Kent Ridge, Singapore 1. Introduction During the last two or three decades, multivariate statistical analysis
SENSITIVITY ANALYSIS AND INFERENCE. Lecture 12
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables
Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables Discrete vs. continuous random variables Examples of continuous distributions o Uniform o Exponential o Normal Recall: A random
Life Table Analysis using Weighted Survey Data
Life Table Analysis using Weighted Survey Data James G. Booth and Thomas A. Hirschl June 2005 Abstract Formulas for constructing valid pointwise confidence bands for survival distributions, estimated using
BASIC STATISTICAL METHODS FOR GENOMIC DATA ANALYSIS
BASIC STATISTICAL METHODS FOR GENOMIC DATA ANALYSIS SEEMA JAGGI Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-110 012 [email protected] Genomics A genome is an organism s
Missing Data: Part 1 What to Do? Carol B. Thompson Johns Hopkins Biostatistics Center SON Brown Bag 3/20/13
Missing Data: Part 1 What to Do? Carol B. Thompson Johns Hopkins Biostatistics Center SON Brown Bag 3/20/13 Overview Missingness and impact on statistical analysis Missing data assumptions/mechanisms Conventional
Fairfield Public Schools
Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity
TRINITY COLLEGE. Faculty of Engineering, Mathematics and Science. School of Computer Science & Statistics
UNIVERSITY OF DUBLIN TRINITY COLLEGE Faculty of Engineering, Mathematics and Science School of Computer Science & Statistics BA (Mod) Enter Course Title Trinity Term 2013 Junior/Senior Sophister ST7002
BayesX - Software for Bayesian Inference in Structured Additive Regression
BayesX - Software for Bayesian Inference in Structured Additive Regression Thomas Kneib Faculty of Mathematics and Economics, University of Ulm Department of Statistics, Ludwig-Maximilians-University Munich
Statistics in Retail Finance. Chapter 6: Behavioural models
Statistics in Retail Finance 1 Overview > So far we have focussed mainly on application scorecards. In this chapter we shall look at behavioural models. We shall cover the following topics:- Behavioural
11. Analysis of Case-control Studies Logistic Regression
Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:
Practical Calculation of Expected and Unexpected Losses in Operational Risk by Simulation Methods
Practical Calculation of Expected and Unexpected Losses in Operational Risk by Simulation Methods Enrique Navarrete 1 Abstract: This paper surveys the main difficulties involved with the quantitative measurement
From the help desk: Bootstrapped standard errors
The Stata Journal (2003) 3, Number 1, pp. 71 80 From the help desk: Bootstrapped standard errors Weihua Guan Stata Corporation Abstract. Bootstrapping is a nonparametric approach for evaluating the distribution
Reflection and Refraction
Equipment Reflection and Refraction Acrylic block set, plane-concave-convex universal mirror, cork board, cork board stand, pins, flashlight, protractor, ruler, mirror worksheet, rectangular block worksheet,
Integration of Marine Mammal Movement and Behavior into the Effects of Sound on the Marine Environment
DISTRIBUTION STATEMENT A: Approved for public release; distribution is unlimited. Integration of Marine Mammal Movement and Behavior into the Effects of Sound on the Marine Environment Dorian S. Houser
Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.
Statistical Learning: Chapter 4 Classification 4.1 Introduction Supervised learning with a categorical (Qualitative) response Notation: - Feature vector X, - qualitative response Y, taking values in C
Survival Analysis of Left Truncated Income Protection Insurance Data. [March 29, 2012]
Survival Analysis of Left Truncated Income Protection Insurance Data [March 29, 2012] 1 Qing Liu 2 David Pitt 3 Yan Wang 4 Xueyuan Wu Abstract One of the main characteristics of Income Protection Insurance
99.37, 99.38, 99.38, 99.39, 99.39, 99.39, 99.39, 99.40, 99.41, 99.42 cm
Error Analysis and the Gaussian Distribution In experimental science theory lives or dies based on the results of experimental evidence and thus the analysis of this evidence is a critical part of the
Paper Reference. Ruler graduated in centimetres and millimetres, protractor, compasses, pen, HB pencil, eraser. Tracing paper may be used.
Centre No. Candidate No. Paper Reference 1 3 8 0 3 H Paper Reference(s) 1380/3H Edexcel GCSE Mathematics (Linear) 1380 Paper 3 (Non-Calculator) Higher Tier Monday 18 May 2009 Afternoon Time: 1 hour 45
Lecture 12: The Black-Scholes Model Steven Skiena. http://www.cs.sunysb.edu/ skiena
Lecture 12: The Black-Scholes Model Steven Skiena Department of Computer Science State University of New York Stony Brook, NY 11794 4400 http://www.cs.sunysb.edu/ skiena The Black-Scholes-Merton Model
Applying Statistics Recommended by Regulatory Documents
Applying Statistics Recommended by Regulatory Documents Steven Walfish President, Statistical Outsourcing Services [email protected] 301-325 325-31293129 About the Speaker Mr. Steven
Introduction to mixed model and missing data issues in longitudinal studies
Introduction to mixed model and missing data issues in longitudinal studies Hélène Jacqmin-Gadda INSERM, U897, Bordeaux, France Inserm workshop, St Raphael Outline of the talk I Introduction Mixed models
Solutions to Exercises, Section 5.1
Instructor s Solutions Manual, Section 5.1 Exercise 1 Solutions to Exercises, Section 5.1 1. Find all numbers t such that ( 1 3,t) is a point on the unit circle. For ( 1 3,t)to be a point on the unit circle
Financial Risk Forecasting Chapter 8 Backtesting and stresstesting
Financial Risk Forecasting Chapter 8 Backtesting and stresstesting Jon Danielsson London School of Economics 2015 To accompany Financial Risk Forecasting http://www.financialriskforecasting.com/ Published
Exercise 1.12 (Pg. 22-23)
Individuals: The objects that are described by a set of data. They may be people, animals, things, etc. (Also referred to as Cases or Records) Variables: The characteristics recorded about each individual.
A Model of Optimum Tariff in Vehicle Fleet Insurance
A Model of Optimum Tariff in Vehicle Fleet Insurance. Bouhetala and F.Belhia and R.Salmi Statistics and Probability Department Bp, 3, El-Alia, USTHB, Bab-Ezzouar, Alger Algeria. Summary: An approach about
An introduction to Value-at-Risk Learning Curve September 2003
An introduction to Value-at-Risk Learning Curve September 2003 Value-at-Risk The introduction of Value-at-Risk (VaR) as an accepted methodology for quantifying market risk is part of the evolution of risk
Modeling and Analysis of Call Center Arrival Data: A Bayesian Approach
Modeling and Analysis of Call Center Arrival Data: A Bayesian Approach Refik Soyer * Department of Management Science The George Washington University M. Murat Tarimcilar Department of Management Science
An Introduction to Point Pattern Analysis using CrimeStat
Introduction An Introduction to Point Pattern Analysis using CrimeStat Luc Anselin Spatial Analysis Laboratory Department of Agricultural and Consumer Economics University of Illinois, Urbana-Champaign
Java Modules for Time Series Analysis
Java Modules for Time Series Analysis Agenda Clustering Non-normal distributions Multifactor modeling Implied ratings Time series prediction 1. Clustering + Cluster 1 Synthetic Clustering + Time series
Two Topics in Parametric Integration Applied to Stochastic Simulation in Industrial Engineering
Two Topics in Parametric Integration Applied to Stochastic Simulation in Industrial Engineering Department of Industrial Engineering and Management Sciences Northwestern University September 15th, 2014
Sound Power Measurement
Sound Power Measurement A sound source will radiate different sound powers in different environments, especially at low frequencies when the wavelength is comparable to the size of the room 1. Fortunately
MMP Goal - effectively manage the (manatee) population in perpetuity throughout Florida by securing habitat and minimizing threats
Statewide Monitoring of the Florida Manatee Presentation for the Marine Mammal Commission 2015 Annual Meeting Leslie Ward-Geiger, Marine Mammal Research Program Leader Florida Fish and Wildlife Conservation
Solving Simultaneous Equations and Matrices
Solving Simultaneous Equations and Matrices The following represents a systematic investigation for the steps used to solve two simultaneous linear equations in two unknowns. The motivation for considering
Section 1.1. Introduction to R n
The Calculus of Functions of Several Variables Section. Introduction to R n Calculus is the study of functional relationships and how related quantities change with each other. In your first exposure to
Important Probability Distributions OPRE 6301
Important Probability Distributions OPRE 6301 Important Distributions... Certain probability distributions occur with such regularity in real-life applications that they have been given their own names.
