Using rainfall radar data to improve interpolated maps of dose rate in the Netherlands


Paul H. Hiemstra (a,*), Edzer J. Pebesma (b), Gerard B.M. Heuvelink (c), Chris J.W. Twenhöfel (d)

(a) University of Utrecht, Department of Physical Geography, P.O. Box 80.115, 3508 TC Utrecht, The Netherlands
(b) University of Münster, Institute for Geoinformatics, Weseler Straße 253, Münster, Germany
(c) Environmental Sciences Group, Wageningen University, P.O. Box 47, 6700 AA Wageningen, The Netherlands
(d) National Institute for Public Health and the Environment (RIVM), Antonie van Leeuwenhoeklaan 9, 3721 MA Bilthoven, The Netherlands

Abstract

The radiation monitoring network in the Netherlands is designed to detect and track increased radiation levels, more specifically dose rate, in 10 minute intervals. The network consists of 153 monitoring stations. Washout of radon progeny by rainfall is the most important cause of natural variations in dose rate. The increase in dose rate at a given time is a function of the amount of progeny decaying, which in turn is a balance between deposition of progeny by rainfall and radioactive decay. The increase in progeny is closely related to average rainfall intensity over the last 2.5 hours. We included decay of progeny by using weighted averaged rainfall intensity, where the weight decreases back in time. The decrease in weight is related to the half-life of radon progeny. In this paper we show for a rain storm on the 20th of July 2007 that weighted averaged rainfall intensity estimated from rainfall radar images, collected every 5 minutes, performs much better as a predictor of increases in dose rate than the non-averaged rainfall intensity. In addition, we show through cross-validation that including weighted averaged rainfall intensity in an interpolated map using universal kriging (UK) does not necessarily lead to a more accurate map.
This might be attributed to the high density of monitoring stations in comparison to the spatial extent of a typical rain event. When we reduced the network density, universal kriging produced a more accurate map than ordinary kriging (no trend). Consequently, in a less dense network the positive influence of including a trend is likely to increase. Furthermore, we suspect that UK better reproduces the sharp boundaries present in rainfall maps, but that the lack of short-distance monitoring station pairs prevents cross-validation from revealing this effect.

Keywords: ordinary kriging, universal kriging, interpolation, dose rate, rainfall intensity, rainfall radar, trend, network density

* Corresponding author. E-mail addresses: p.hiemstra@geo.uu.nl (Paul H. Hiemstra), edzer.pebesma@uni-muenster.de (Edzer J. Pebesma)

Preprint submitted to Science of the Total Environment, August 27, 2010

1. Introduction

In case of releases of radioactive material into the atmosphere, a fast and accurate estimate of the spatial distribution of radiation levels is needed to estimate health effects on the population. In the Netherlands, radiation levels are measured by 153 monitoring stations of the National Radioactivity Monitoring network (NRM), see figure 1. The NRM provides point information on radiation level, more specifically dose rate; no data are available between the stations. Interpolated maps fill this gap by estimating the dose rate between the monitoring stations, and thus provide an estimate of the spatial distribution of dose rate.

EUR EN (2005), Dubois et al. (2007) and Hiemstra et al. (2009) explored the mapping of dose rate using geostatistics. Geostatistical mapping, i.e. kriging, has an advantage over simpler interpolation methods in that it can take trends into account and provides an estimate of the prediction error. Hiemstra et al. (2009) focused on the interpolation of dose rate in non-emergency, background situations, providing a first step towards an interpolation system suitable for emergency situations. In addition, Hiemstra et al. (2009) suggested using trend information in a universal kriging (UK) approach (Chilès and Delfiner, 1999; Christensen, 1996), with soil type as the trend variable, to improve the interpolated map. Many other studies (Knotters et al., 1995; Bishop and McBratney, 2001; Bourennane and King, 2003; Lloyd, 2005; Yemefack et al., 2005; Hengl et al., 2007) showed that accounting for a trend can improve an interpolated map.

Rainfall intensity is a major factor determining the spatial distribution of dose rate on a short time scale (Smetsers and Blaauboer, 1997b; Horng and Jiang, 2004). Therefore, we hypothesized that accounting for rainfall intensity as a predictor would improve the accuracy of the interpolated maps of dose rate. Rainfall intensity influences dose rate because radon daughter products or progeny, primarily bismuth ($^{214}$Bi) and lead ($^{214}$Pb), are washed out of clouds and the atmosphere and are deposited on the ground. Radioactive decay of the radon progeny increases dose rate at that location. The increase in dose rate at any given time is proportional to the amount of radon progeny that is decaying at that time. In turn, the amount of radon progeny is a balance between the deposition of progeny by rainfall and the radioactive decay of those progeny.

We hypothesized that we could model this balance between deposition and decay by taking the weighted average over the history of rainfall intensity. Taking the average over the history of rainfall captured the deposition of radon progeny. In addition, letting the weight drop back in time captured the decay of already deposited radon progeny. The rate at which the weight dropped was related to the half-life of the radon progeny.

The goal of this study was to make interpolated maps of increase in dose rate based on NRM data and rainfall intensity estimated by rainfall radar. We defined the following research questions:

1. How well does the weighted averaged rainfall intensity perform as a predictor for increase in dose rate in comparison to non-averaged rainfall intensity, measured at an individual time step?
2. Can the relationship between rainfall intensity and increase in dose rate improve our interpolated map?
3. Is there a relationship between monitoring network density and the improvement mentioned in research question 2?

We fitted a linear model to a rain storm travelling over the Netherlands from southwest to northeast on the 20th of July 2007. We used the goodness of fit $R^2$ as a measure for how well both non-averaged and weighted averaged rainfall intensity explained the variation in increase in dose rate. Subsequently, we made maps of increase in dose rate with and without accounting for rainfall, and compared them using leave-one-out cross-validation and the mean kriging prediction variance. Furthermore, we reduced monitoring network density and repeated the cross-validation procedure.

2. Methods

In this study we used rainfall and dose rate data from the 20th of July 2007. On this day, a large rainstorm passed over the Netherlands and caused significant increases in dose rate.

2.1. Measuring dose rate

Dose rate in the Netherlands is measured at 153 locations (figure 1) every 10 minutes by the NRM (Twenhöfel et al., 2005). Dose rate is commonly expressed in ambient dose equivalent rate, $\dot{H}^*(10)$ (ICRU, 1993), abbreviated in this study to dose rate. The unit used for dose rate in this study was nanosievert per hour (nSv/h). Deposition of radon daughter products by rainfall increases the dose rate. Consequently, we were only interested in the increase of dose rate, not in the absolute value. We determined the increase in dose rate by subtracting the mean dose rate of each NRM station for the 20th of July from the 10-min dose rate data of that station. Note that the mean dose rate was calculated based on times without rainfall. Using the increase in dose rate has the added advantage of eliminating variations between stations, for example those caused by calibration differences or soil type (Smetsers and Blaauboer, 1996).

2.2. Estimating rainfall intensity

In this study rainfall intensity maps were estimated every five minutes using two C-band Doppler radars operated by the Royal Netherlands Meteorological Institute (KNMI) (figure 1). The radar emits radio waves and registers reflectivity. Increased reflectivity indicates more water present in the air, and thus a higher rainfall intensity. The radar provides the spatial distribution of radar reflectivity ($Z$, mm$^6$ m$^{-3}$) for 2.5 km × 2.5 km grid cells. Battan (1973) describes how the radar reflectivity can be converted to the rainfall intensity at the surface ($R_u$, mm h$^{-1}$). Figure 2 shows the radar rainfall intensity maps for the 20th of July 2007 from 8AM to 7PM in hourly time steps. From the rainfall intensity maps we derived the rainfall intensity at the monitoring stations of the NRM.
The rainfall intensity at a particular NRM monitoring station was taken from the grid cell of the rainfall intensity map whose centre was closest to the station.

2.3. Weighted averaged rainfall intensity

Rainfall intensity was averaged over time using a weighted average where the weight was determined by the half-life of the deposited radon progeny. We calibrated an overall half-life on that part of the data that clearly shows the effect of radon progeny, without disturbances. We used large-scale bound-constrained optimization (Zhu et al., 1997) to perform the calibration, which led to an overall half-life of 25.8 minutes. Using weighted averaged rainfall intensity assumes that the deposition of radon progeny is only influenced by rainfall intensity and not by how long it has already rained. This assumption is supported by the work of Fujinami (1996). The weighted averaged rainfall intensity ($R_w$) at time $t$ was determined by:

$$R_w(t) = \frac{\sum_{i=0}^{m} \alpha_i R_u(t_i)}{\sum_{i=0}^{m} \alpha_i} \qquad (1)$$

where $t$ is the time for which we calculate the average, $m$ is the number of time steps of five minutes over which we averaged, $R_u$ is the rainfall intensity measured at an individual time step and $\alpha_i$ is the weight at $t_i = t - i\,\Delta t$, where $\Delta t$ is the size of the time step. We chose $m$ equal to 30, i.e. a window of 2.5 hours, because at $i = 30$ the weight $\alpha_i$ is very low. The weight $\alpha_i$ decays exponentially with the lag $i\,\Delta t$:

$$\alpha_i = \exp\left(-\frac{\ln 2}{t_{1/2}}\, i\,\Delta t\right) \qquad (2)$$
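As a minimal sketch of equations (1) and (2), the weighted average could be computed from the rainfall history at one station as shown below. This is an illustration, not the authors' implementation: the function names, the newest-first ordering of the history, and the default of 30 five-minute steps (a 2.5-hour window) are assumptions; the default half-life is the calibrated 25.8 minutes reported in the text.

```python
import math


def decay_weights(m, dt_min, half_life_min):
    """Weights alpha_i = exp(-ln(2) * i * dt / t_half) for i = 0..m (equation 2)."""
    return [math.exp(-math.log(2) * i * dt_min / half_life_min)
            for i in range(m + 1)]


def weighted_rainfall(history, m=30, dt_min=5.0, half_life_min=25.8):
    """Weighted averaged rainfall intensity R_w(t) (equation 1).

    history: rainfall intensities R_u in mm/h, ordered newest first, so that
             history[i] is the value at t_i = t - i * dt (assumed layout).
    """
    if len(history) < m + 1:
        raise ValueError("need at least m + 1 time steps of rainfall history")
    alpha = decay_weights(m, dt_min, half_life_min)
    # zip stops after the m + 1 weights, so older values are ignored.
    numerator = sum(a * r for a, r in zip(alpha, history))
    return numerator / sum(alpha)
```

Because the weights are normalised by their sum, a constant rainfall history returns that same constant, and a shower in the most recent time step counts more than an equally intense shower 2.5 hours earlier.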

In equation (2), $t_{1/2}$ is the overall half-life. Figure 3 shows maps comparing $R_w$ to $R_u$ at 5PM.

2.4. Relating dose rate to rainfall intensity

To determine how well $R_w$ described the increase in dose rate ($H$) compared to $R_u$, we investigated the temporal relationship between $H$ and both $R_u$ and $R_w$. For the temporal relationship we kept location constant and varied time. The temporal relationship was determined using linear regression. The assumption in linear regression is that the $n$ observations of increase in dose rate, $H$, at a certain location can be described by the following linear model (Christensen, 1996):

$$H = X\beta + e, \quad E(e) = 0, \quad \mathrm{Cov}(e) = \sigma^2 I \qquad (3)$$

where $X$ is the $n \times 2$ design matrix whose $i$-th row equals $(1, R_u(t_i))$ or $(1, R_w(t_i))$, $\beta = (\beta_0, \beta_1)'$ are the unknown regression coefficients describing the temporal relationship between $H$ and $R_u$ or $R_w$, and $e$ is the residual. We used $R^2$ as the goodness of fit for the fitted regression coefficients:

$$R^2 = 1 - \frac{SS_e}{SS_{tot}} = 1 - \frac{\sum_{i=1}^{n} (H_i - \hat{H}_i)^2}{\sum_{i=1}^{n} (H_i - \bar{H})^2} \qquad (4)$$

where $SS_e$ is the residual sum of squares, $SS_{tot}$ is the total sum of squares, $H_i$ is the observed $H$, $\hat{H}_i$ is the $H$ estimated by the linear regression and $\bar{H}$ is the mean of $H$.

2.5. Mapping dose rate with and without trend

We compared ordinary kriging (OK) to universal kriging (UK) (Chilès and Delfiner, 1999; Christensen, 1996) to determine whether including a spatial trend improved the accuracy of the interpolated map. Note that in contrast to section 2.4, the trend is fitted in space and not in time. An important step in kriging is fitting the variogram model to the residuals. For UK these are residuals to a trend, for OK these are residuals to a spatially constant mean. The variogram model was automatically fitted to the residuals as described in Hiemstra et al. (2009). Based on the sample variogram of the residuals, we made an initial guess of the variogram parameters: nugget, sill and range.
After that, we used iteratively reweighted least squares, or Gauss-Newton fitting (Cressie, 1993), to fit the variogram model to the sample variogram. We fitted a single isotropic variogram model to the entire study area. We used ordinary least squares (OLS) residuals (assuming uncorrelated residuals) instead of generalized least squares (GLS) residuals to fit the variogram model. More information on using OLS residuals to find the variogram model is found in Kitanidis (1993). We used the gstat package (Pebesma, 2004) in the statistical computing environment R (R Development Core Team, 2010) for all geostatistical calculations.

2.6. Quantifying the accuracy of the map

We quantified the accuracy of the maps produced by OK and UK using three different measures. The first was the Root Mean Squared Error (RMSE) of the leave-one-out cross-validation residuals:

$$\mathrm{RMSE} = \sqrt{\frac{1}{n} \sum_{i=1}^{n} (\hat{H}_{cv,i} - H_i)^2} \qquad (5)$$

where $n$ is the number of observations, $\hat{H}_{cv,i}$ is the increase in dose rate estimated by cross-validation and $H_i$ is the measured increase in dose rate. A smaller RMSE indicates a smaller error and thus a more accurate map. The second measure was the Mean Error (ME), defined as:

$$\mathrm{ME} = \frac{1}{n} \sum_{i=1}^{n} (\hat{H}_{cv,i} - H_i) \qquad (6)$$

ME provides an indication of the systematic error or bias in the cross-validation residuals. The third measure was the Mean Kriging Variance (MKV), defined as the mean of the kriging variance calculated for each prediction location (Christensen, 1996).

3. Results

3.1. Weighted averaged vs. non-averaged rainfall intensity

To compare non-averaged rainfall intensity ($R_u$) and weighted averaged rainfall intensity ($R_w$) as predictors for increase in dose rate ($H$), figures 4 and 5 show time series of these three variables for four monitoring stations. In addition, scatterplots of $R_u$ versus $H$ and $R_w$ versus $H$ are shown with the fitted regression line. The goodness of fit ($R^2$) is shown below the scatterplots in the x-axis caption.
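The goodness of fit shown in these scatterplots follows from the linear model of equation (3) and the definition in equation (4). A minimal sketch for a single station is given below, assuming a simple least-squares line fit; the function names are hypothetical and this is not the R code used in the study.

```python
def fit_line(x, y):
    """Least-squares intercept b0 and slope b1 for y = b0 + b1 * x (equation 3)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    if sxx == 0:
        raise ValueError("predictor is constant; the slope is undefined")
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    b1 = sxy / sxx
    b0 = mean_y - b1 * mean_x
    return b0, b1


def r_squared(x, y):
    """Goodness of fit R^2 = 1 - SS_e / SS_tot (equation 4)."""
    b0, b1 = fit_line(x, y)
    mean_y = sum(y) / len(y)
    ss_e = sum((yi - (b0 + b1 * xi)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return 1.0 - ss_e / ss_tot
```

Calling `r_squared` once with the $R_u$ series and once with the $R_w$ series of the same station, against the same $H$ series, gives the pair of values compared per station.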
At all four monitoring stations the $R^2$ increased when we used $R_w$ instead of $R_u$. To compare the $R^2$ between using $R_u$ and $R_w$ for all monitoring stations, figure 6 shows these $R^2$ values. Filled dots represent the $R^2$ value for $R_w$, open dots the $R^2$ for $R_u$. On average, $R^2$ increased from 0.17 for $R_u$ to 0.78 for $R_w$.

3.2. Estimation of the spatial distribution of dose rate

Figure 7 shows scatterplots of $R_w$ versus $H$ (location varies, time is constant) between 8AM and 7PM in hourly time steps. For all the hourly time steps the fitted regression parameters are significant (p < .25). The goodness of fit of the fitted regression parameters varies between 0.42 and 0.71. Figure 8 shows variograms of $R_w$ and the correlation length in kilometers. The correlation length is determined by fitting a spherical variogram model to the sample variogram and using the range of the variogram model as the correlation length. The correlation lengths are quite large in comparison to the typical distance between the NRM stations, which is about 12 km. Figure 9 shows the fitted variogram models of $H$ for both OK and UK between 8AM and 7PM. The fitted variogram models show a drop in both sill and range for UK in comparison to OK. The semivariance in these plots is on the log-scale, to make the differences between the fitted models clearer. Figure 10 shows interpolated maps for OK and UK for three moments in time.

Table 1 shows the mean increase in dose rate ($\bar{H}$), the root mean squared error (RMSE) of the cross-validation residuals, the mean error (ME) of the cross-validation residuals and the mean kriging variance (MKV) for OK and UK. The differences between OK and UK in terms of RMSE are small compared to $\bar{H}$, indicating that the results for OK and UK are comparable. In addition, ME and MKV for OK and UK are comparable in size. RMSE is large in comparison to the ME, suggesting that there is no bias in the cross-validation residuals.

We reduced the network density to see how this would affect the RMSE. We randomly took out 20%, 40%, 60% and 80% of the stations. We repeated this procedure a number of times for each moment in time. From these randomly reduced networks we selected the one that had the change in RMSE most favorable to UK. Note that we kept the variogram models and the regression coefficients for cross-validating the reduced network equal to those of the full network. Figure 11 shows the results of reducing the network, with panels for different reduction percentages, time on the y-axis and the change in RMSE on the x-axis. The vertical lines represent the mean values for the full and reduced network respectively. These lines indicate that the change in RMSE shifts in favor of UK when the network is reduced.

4. Discussion

4.1. Weighted averaged vs. non-averaged rainfall intensity

Weighted averaged rainfall intensity ($R_w$) performs much better as a predictor for the increase in dose rate ($H$) than non-averaged rainfall intensity ($R_u$). The $R^2$ increases for all monitoring stations when we use $R_w$ instead of $R_u$ (figure 6), on average from 0.17 to 0.78.
This confirms our hypothesis that a weighted averaged rainfall intensity is a much better description of the radon washout and decay process. The high $R^2$ values for $R_w$ underline the fact that rainout of radon progeny is an important process in describing the variations in dose rate.

Figure 5 shows that not all monitoring stations show a high $R^2$. We offer two possible explanations. Firstly, the observed dose rate enhancements include contributions from sources other than rainfall. The spiky patterns in e.g. figure 5(a) may well be attributed to the transport of medical radioactive sources or to radiographic screening during welding activities in the vicinity of the monitoring station. Secondly, the correlation between rainfall at surface level and the rainfall radar images may fail occasionally (figure 5(b)), for example when rainfall detected by the radar system high up in the atmosphere does not reach the ground surface. In addition, reflectance from buildings or large flocks of birds can produce false rainfall patterns. EUR 26 EN (2003) provides a more thorough description of complications when using rainfall radar to estimate rainfall intensity.

Our estimate of the rainfall intensity could be improved by combining rainfall radar with ground measurements of rainfall intensity (Schuurmans et al., 2007). Including ground measurements combines the spatial coverage of the radar images with the accuracy of ground measurements.

In this study we expressed the temporal structure of the relation between rainfall intensity and increase in dose rate by taking a weighted average. An alternative approach could be to use space-time kriging, see e.g. Jost et al. (2005). Space-time kriging captures the temporal aspect by defining a variogram model not only in space, but also in time.

4.2. Estimation of the spatial distribution of dose rate

The accuracy in terms of RMSE, ME and MKV is more or less the same for OK and UK (table 1).
Consequently, there is no significant improvement in our estimate of the distribution of dose rate when we take into account the relationship between $R_w$ and $H$. This is surprising given the fact that UK takes into account a significant trend with a goodness of fit of up to 0.71 (figure 7). We discuss this fact for MKV and RMSE separately in the next two sections. Because ME is very small in comparison to RMSE, i.e. there is no bias in the cross-validation residuals, we will not discuss ME further.

MKV

We expected MKV to drop because the fitted linear model explains part of the variance in the data (the sill becomes lower), decreasing the kriging variance. To illustrate why the MKV sometimes does not drop for UK, we discuss the way the kriging variance is calculated and the role of the variogram model in this calculation (see figure 9 for the fitted variogram models). The kriging variance at a prediction location is calculated as a weighted average of the semivariance of the surrounding observations, similar to the kriging prediction. The semivariance of the surrounding observations is obtained from the variogram model and the weights are equal to the kriging weights. In our case the kriging weight is mainly distributed over the points within a 40 km radius. Consequently, the kriging variance is mainly determined by the behaviour of the variogram model in this distance interval. When the variogram model shows greater semivariances in this distance interval for UK than for OK, the MKV for UK increases. A good example of such an increase when using UK is given by the fitted variogram models for 9AM (see figure 9). The sill drops, but the decrease in range causes the semivariance values in the range up to 40 km to be greater for UK than for OK. In conclusion: the total variance in the dataset (the sill) drops, but because of the density of the monitoring network the kriging weights are mainly distributed over stations that are closer than the range of the variogram model. Consequently, we do not take advantage of the decrease in the sill and the MKV does not drop significantly.

RMSE

The correlation length of $R_w$ is large in comparison to the average distance between the monitoring stations, about 12 km. Figure 8 shows an average correlation length of 184 kilometers. So the NRM is dense compared to the correlation length of the rain storm. The density of the network allows OK to be successful in interpolating the increases in dose rate caused by rainfall. The success of OK is also apparent from the interpolated maps in figure 10. The interpolated maps by OK and UK broadly show the same pattern. So the density of the network causes OK and UK to perform equally well in reproducing the spatial pattern of $H$, and thus to have a comparable RMSE in cross-validation. In conclusion: when the monitoring network is dense in comparison to the phenomenon causing the trend in the data, the increase in accuracy when including the trend is likely to be small. This conclusion is in line with the work of Journel and Rossi (1989).

In case the correlation length is smaller than the typical distance between the monitoring stations, we expect an improvement in the accuracy of the map. This is supported by the results shown in figure 11. The figure shows that for a less dense network UK has smaller RMSE values. This indicates that in a less dense network, the positive influence of adding a trend increases. In addition to a less dense network, we expect an improvement in RMSE for rain storms with a smaller correlation length. Smaller correlation lengths occur with more localized thunderstorms or in mountainous regions where rain storms are restricted to the valleys.

Although for the current network density UK does not perform much better than OK in terms of cross-validation statistics, we consider the maps resulting from UK to be the more realistic ones.
OK tends to create highly smooth surfaces, whereas UK shows much sharper boundaries; see for example the interpolated map of 12:00 in figure 10. Rainfall time series show that boundaries are often sharp rather than smooth, see e.g. figure 4(b). That this more realistic pattern does not lead to better cross-validation results might be attributed to the even spread of monitoring stations over the country. The lack of monitoring station pairs at short distances prohibits the detection, and thereby the validation, of sharp boundaries.

5. Conclusions

Our results show that the weighted averaged rainfall intensity performs much better as a predictor for increase in dose rate than the non-averaged rainfall intensity. This conclusion, in combination with the results from literature (Smetsers and Blaauboer, 1997a; Horng and Jiang, 2003; Fujinami, 1996), shows that washout of radon progeny is a very important process in describing the variations in dose rate.

The accuracy of maps produced by ordinary kriging (OK, no trend) and universal kriging (UK) is comparable. This is mainly caused by the density of the NRM in comparison to the scale of the rainfall radar data. When the monitoring network is dense in comparison to the phenomenon causing the trend in the data, the increase in accuracy when including the trend is likely to be small. In support of this conclusion, our results show that for networks with a decreased density the performance of UK in comparison to OK increases. In a less dense network the positive effect of including a trend increases. In addition, for rainfall patterns with a shorter correlation length, we expect to see an improved performance of UK in comparison to OK.

Furthermore, we suspect that cross-validating the evenly spread monitoring stations works to the advantage of OK when the external variable (rainfall) exhibits sharp boundaries. Maps resulting from UK better follow the sharp boundaries present in rainfall, but the lack of short-distance monitoring station pairs prevents cross-validation from revealing this effect.

Acknowledgements

The authors thank the Royal Netherlands Meteorological Institute (KNMI) for supplying the rainfall radar data. We gratefully acknowledge financial support from the innovation programme Space for Geo-Information (RGI), project RGI-32. This work has been partially funded by the European Commission, under the Sixth Framework Programme, by the INTAMAP project Contract N with the DG INFSO, action Line IST ICT for Environmental Risk Management. The views expressed herein are those of the authors and are not necessarily those of the RGI or the European Commission. The authors would also like to thank Stephanie Melles and three anonymous reviewers for providing comments that improved the manuscript.

6 References Battan, L. J., Radar Observations of the Atmosphere. University of Chicago Press. Bishop, T. F. A., McBratney, A. B., 21. A comparison of prediction methods for the creation of field-extent soil property maps. Geoderma 13 (1-2), Bourennane, H., King, D., 23. Using multiple external drifts to estimate a soil variable. Geoderma 114 (1-2), Chilès, J. P., Delfiner, P., Geostatistics: Modeling Spatial Uncertainty. John Wiley & Sons, New York, 72p. Christensen, R., Plane Answers to Complex Questions: The Theory of Linear Models, 2nd Edition. Springer, New York, 496p. Cressie, N. A., Statistics for Spatial Data. Wiley, NY, 9p. Dubois, G., Pebesma, E. J., Bossew, P., 27. Automatic mapping in emergency: A geostatistical perspective. International Journal of Emergency Management 4 (3), Fujinami, N., Observational study of the scavenging of radon daughters by precipitation from the atmosphere. Environment International 22 (Supplement 1), Hengl, T., Heuvelink, G. B. M., Rossiter, D. G., 27. About regression-kriging: From equations to case studies. Computers & Geosciences 33 (1), Hiemstra, P. H., Pebesma, E. J., Twenhöfel, C. J. W., Heuvelink, G. B. M., 29. Real-time automatic interpolation of ambient gamma dose rates from the dutch radioactivity monitoring network. Computers & Geosciences 35 (8), Horng, M., Jiang, S., Dec. 23. A rainout model for the study of the additional exposure rate due to rainfall. Radiation Measurements 37 (6), Horng, M., Jiang, S., Feb. 24. In situ measurements of gamma-ray intensity from radon progeny in rainwater. Radiation Measurements 38 (1), ICRU, Quantities and units in radiation protection dosimetry. ICRU report 51. Tech. rep., Bethesda MD. Jost, G., Heuvelink, G., Papritz, A., 25. Analysing the space-time distribution of soil water storage of a forest ecosystem using spatiotemporal kriging. Geoderma 128 (3-4 SPEC. ISS.), Journel, A. G., Rossi, M. E., Oct When do we need a trend model in kriging? 
Mathematical Geology 21 (7), Kitanidis, P. K., Generalized covariance functions in estimation. Mathematical Geology 25 (5), Knotters, M., Brus, D. J., Oude Voshaar, J. H., A comparison of kriging, co-kriging and kriging combined with regression for spatial interpolation of horizon depth with censored observations. Geoderma 67 (3-4), Lloyd, C. D., 25. Assessing the effect of integrating elevation data into the estimation of monthly precipitation in great britain. Journal of Hydrology 38 (1-4), EUR 26 EN, 23. Quality and assimilation of radar data for NWP. Alberoni, P. P., Ducrocq, V., Gregoric, G., Haase, G., Holleman, I., Lindskog, M., Macpherson, B., Nuret, M. and A. Rossa (Eds). Office for Official Publications of the European Communities, Luxembourg., 38 p. EUR EN, 25. Automatic mapping algorithms for routine and emergency monitoring data. Report on the Spatial Interpolation Comparison (SIC24) exercise. Dubois G. (Ed). Office for Official Publications of the European Communities, Luxembourg, 15 p. Pebesma, E. J., 24. Multivariable geostatistics in S: the gstat package. Computers & Geosciences 3 (7), R Development Core Team, 21. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria, ISBN URL Schuurmans, J. M., Bierkens, M. F. P., Pebesma, E. J., Uijlenhoet, R., 27. Automatic prediction of high-resolution daily rainfall fields for multiple extents: The potential of operational radar. Journal of Hydrometeorology 8 (6), Smetsers, R. C. G., Blaauboer, R. O., Variations in outdoor radiation levels in the Netherlands. Ph.D. thesis, Rijksuniversiteit Groningen. 6 Smetsers, R. C. G., Blaauboer, R. O., 1997a. A dynamic compensation method for natural ambient dose rate based on 6 years data from the dutch radioactivity monitoring network. Radiation Protection Dosimetry 69 (1), Smetsers, R. C. G., Blaauboer, R. O., 1997b. 
Source-dependent probability densities explaining frequency distributions of ambient dose rate in the Netherlands. Radiation Protection Dosimetry 69 (1), Twenhöfel, C. J. W., de Hoog van Beynen, C., van Lunenburg, A. P. P. A., Slagt, G. J. E., Tax, R. B., van Westerlaak, P. J. M., Aldenkamp, F. J., 25. Operation of the Dutch 3rd generation national radioactivity monitoring network. In: EUR EN, 25. Automatic mapping algorithms for routine and emergency monitoring data. Report on the Spatial Interpolation Comparison (SIC24) exercise. Dubois G. (Ed). Office for Official Publications of the European Communities, Luxembourg, pp Yemefack, M., Rossiter, D. G., Njomgang, R., 25. Multi-scale characterization of soil variability within an agricultural landscape mosaic system in southern Cameroon. Geoderma 125 (1-2), Zhu, C., Byrd, R., Lu, P., Nocedal, J., Algorithm 778: L- bfgs-b: Fortran subroutines for large-scale bound-constrained optimization. ACM Transactions on Mathematical Software 23 (4),

Table 1: Mean increase in dose rate ($\bar{H}$), Root Mean Squared Error (RMSE) and Mean Error (ME) of the leave-one-out cross-validation residuals, and the Mean Kriging Variance (MKV) for ordinary and universal kriging, together with the difference between them. We performed cross-validation for the data between 8AM and 7PM. A negative difference for either MKV or RMSE means that UK was performing better than OK.

Table layout: one column per hourly time step t (8:00, 9:00, 10:00, ...); rows for $\bar{H}$, RMSE OK, RMSE UK, difference in RMSE, MKV OK, MKV UK, difference in MKV, ME OK and ME UK.
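The two cross-validation statistics in Table 1 follow equations (5) and (6). As a minimal sketch, assuming the leave-one-out predictions and the measurements are available as two equally long lists (the function and argument names are hypothetical):

```python
import math


def rmse(predicted, observed):
    """Root Mean Squared Error of cross-validation residuals (equation 5)."""
    if len(predicted) != len(observed) or not observed:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(p - o) ** 2 for p, o in zip(predicted, observed)]
    return math.sqrt(sum(squared) / len(squared))


def mean_error(predicted, observed):
    """Mean Error of cross-validation residuals (equation 6); near zero means no bias."""
    if len(predicted) != len(observed) or not observed:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(p - o for p, o in zip(predicted, observed)) / len(observed)
```

The difference reported in the table would then be, for instance, `rmse(uk_predictions, h) - rmse(ok_predictions, h)`, which is negative when UK performs better; the variable names here are again only illustrative.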

Figure 1: Monitoring stations of the National Radioactivity Monitoring network and the locations of the rainfall radar stations.

Figure 2: Rainfall intensity in mm/h estimated from rainfall radar images.

Figure 3: Maps of rainfall intensity and weighted averaged rainfall intensity in mm/h at 5PM.

Figure 4: Non-averaged rainfall intensity ($R_u$), weighted averaged rainfall intensity ($R_w$) and increase in dose rate ($H$) versus time (left), and scatterplots of $R_u$ and $R_w$ vs $H$ (right) for two stations (a, b) that show a high correlation between $R_w$ and $H$. Axes: time and $R_u$, $R_w$ (mm/h) and $H$ (nSv/h). Goodness of fit in the scatterplots: (a) $R^2$ = 0.19 for $R_u$ and 0.9 for $R_w$; (b) $R^2$ = 0.26 for $R_u$ and 0.94 for $R_w$.

Figure 5: Non-averaged rainfall intensity (R_u), weighted averaged rainfall intensity (R_w) and increase in dose rate (H) versus time (left), and scatterplots of R_u and R_w vs H (right) for two stations (a, b) that show a low correlation between R_w and H.
[Plot residue: R_u and R_w in mm/h, H in nSv/h. Goodness of fit for R_w: R² = 0.16 for station (a) and 0.14 for station (b).]

Figure 6: Goodness of fit (R²) between increase in dose rate and non-averaged rainfall intensity (open dots) and weighted averaged rainfall intensity (filled dots) per station. Note how the R² shifts in favor of weighted averaged rainfall intensity.

Figure 7: Weighted averaged rainfall intensity (mm/h) versus increase in dose rate (nSv/h) from 8 AM to 7 PM. Lines are fitted using linear regression. The numbers in the plots represent the goodness of fit (R²).
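The goodness of fit quoted in Figure 7 is the R² of a straight-line fit. A minimal sketch of an ordinary least-squares fit and its R² follows; the function name `linear_fit_r2` is hypothetical, and nothing here is taken from the authors' own scripts.

```python
def linear_fit_r2(x, y):
    """Fit y = intercept + slope * x by least squares and return
    (slope, intercept, r_squared).

    x: predictor values, e.g. weighted averaged rainfall intensity (mm/h).
    y: response values, e.g. increase in dose rate (nSv/h).
    """
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # R^2 = 1 - (residual sum of squares) / (total sum of squares)
    ss_res = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return slope, intercept, 1.0 - ss_res / ss_tot
```

Applying this per hour to all stations gives one R² per panel, which is how the caption describes the numbers in the plots.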

Figure 8: Spherical variogram models fitted to weighted averaged rainfall intensity from 8 AM to 7 PM. The number in the lower right corner is the correlation length.
[Plot residue: semivariance versus distance (km), one panel per hour, each annotated with the fitted range in km.]
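For reference, the spherical variogram model named in the Figure 8 caption has the standard form γ(h) = c0 + c (1.5 h/a − 0.5 (h/a)³) for 0 < h < a and γ(h) = c0 + c for h ≥ a, where a is the range (the correlation length in the caption), c0 the nugget and c the partial sill. The sketch below evaluates that textbook model; the parameter values in the usage note are hypothetical and are not the fitted values from the paper.

```python
def spherical_variogram(distance_km, nugget, partial_sill, range_km):
    """Semivariance of the spherical model at a given separation distance.

    nugget: semivariance just above zero distance (c0).
    partial_sill: sill minus nugget (c).
    range_km: distance beyond which the semivariance stays at the sill (a).
    """
    if distance_km <= 0.0:
        return 0.0  # by definition the semivariance at zero distance is zero
    if distance_km >= range_km:
        return nugget + partial_sill
    ratio = distance_km / range_km
    return nugget + partial_sill * (1.5 * ratio - 0.5 * ratio ** 3)
```

For example, `spherical_variogram(100.0, 0.1, 0.7, 200.0)` evaluates the model at half of an illustrative 200 km range; beyond the range the function returns the constant sill, which is why the range is read as a correlation length.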

Figure 9: Hourly sample variograms and fitted models for ordinary kriging (o) and universal kriging (+) from 8 AM to 7 PM. Note that semivariances are shown on the log scale.

Figure 10: Interpolated maps of dose rate (nSv/h) at 8 AM, 12 noon and 5 PM using ordinary kriging (OK) and universal kriging (UK).

Figure 11: The effect of reducing the size of the network by 20%, 40%, 60% and 80% on the change in cross-validation RMSE between OK and UK. Time is on the y-axis, the change in RMSE on the x-axis. A negative change in RMSE means that UK is outperforming OK and vice versa. The plusses (+) show the best RMSE in favor of UK for the reduced network and the crosses (x) show the RMSE for the full network. The vertical lines represent the mean values of RMSE for the reduced and the full network.


CORRECTIONS TO RADAR-ESTIMATED PRECIPITATION USING OBSERVED RAIN GAUGE DATA. A Thesis. Presented to the Faculty of the Graduate School CORRECTIONS TO RADAR-ESTIMATED PRECIPITATION USING OBSERVED RAIN GAUGE DATA A Thesis Presented to the Faculty of the Graduate School of Cornell University in Partial Fulfillment of the Requirements for

More information

Estimation of σ 2, the variance of ɛ

Estimation of σ 2, the variance of ɛ Estimation of σ 2, the variance of ɛ The variance of the errors σ 2 indicates how much observations deviate from the fitted surface. If σ 2 is small, parameters β 0, β 1,..., β k will be reliably estimated

More information

Data Preparation and Statistical Displays

Data Preparation and Statistical Displays Reservoir Modeling with GSLIB Data Preparation and Statistical Displays Data Cleaning / Quality Control Statistics as Parameters for Random Function Models Univariate Statistics Histograms and Probability

More information

Statistical Machine Learning

Statistical Machine Learning Statistical Machine Learning UoC Stats 37700, Winter quarter Lecture 4: classical linear and quadratic discriminants. 1 / 25 Linear separation For two classes in R d : simple idea: separate the classes

More information

Metrological features of a beta absorption particulate air monitor operating with wireless communication system

Metrological features of a beta absorption particulate air monitor operating with wireless communication system NUKLEONIKA 2008;53(Supplement 2):S37 S42 ORIGINAL PAPER Metrological features of a beta absorption particulate air monitor operating with wireless communication system Adrian Jakowiuk, Piotr Urbański,

More information

Algebra 1 Course Information

Algebra 1 Course Information Course Information Course Description: Students will study patterns, relations, and functions, and focus on the use of mathematical models to understand and analyze quantitative relationships. Through

More information

Multiple Regression: What Is It?

Multiple Regression: What Is It? Multiple Regression Multiple Regression: What Is It? Multiple regression is a collection of techniques in which there are multiple predictors of varying kinds and a single outcome We are interested in

More information

STATISTICA Formula Guide: Logistic Regression. Table of Contents

STATISTICA Formula Guide: Logistic Regression. Table of Contents : Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary

More information

7 Time series analysis

7 Time series analysis 7 Time series analysis In Chapters 16, 17, 33 36 in Zuur, Ieno and Smith (2007), various time series techniques are discussed. Applying these methods in Brodgar is straightforward, and most choices are

More information

SPECIAL PERTURBATIONS UNCORRELATED TRACK PROCESSING

SPECIAL PERTURBATIONS UNCORRELATED TRACK PROCESSING AAS 07-228 SPECIAL PERTURBATIONS UNCORRELATED TRACK PROCESSING INTRODUCTION James G. Miller * Two historical uncorrelated track (UCT) processing approaches have been employed using general perturbations

More information

Location matters. 3 techniques to incorporate geo-spatial effects in one's predictive model

Location matters. 3 techniques to incorporate geo-spatial effects in one's predictive model Location matters. 3 techniques to incorporate geo-spatial effects in one's predictive model Xavier Conort xavier.conort@gear-analytics.com Motivation Location matters! Observed value at one location is

More information

How To Use Statgraphics Centurion Xvii (Version 17) On A Computer Or A Computer (For Free)

How To Use Statgraphics Centurion Xvii (Version 17) On A Computer Or A Computer (For Free) Statgraphics Centurion XVII (currently in beta test) is a major upgrade to Statpoint's flagship data analysis and visualization product. It contains 32 new statistical procedures and significant upgrades

More information

Index-Velocity Rating Development (Calibration) for H-ADCP Real-Time Discharge Monitoring in Open Channels

Index-Velocity Rating Development (Calibration) for H-ADCP Real-Time Discharge Monitoring in Open Channels Index-Velocity Rating Development (Calibration) for H-ADCP Real-Time Discharge Monitoring in Open Channels Hening Huang Teledyne RD Instruments, Inc., 14020 Stowe Drive, Poway, CA. 92064, USA (Tel: 858-842-2600,

More information

Developing sub-domain verification methods based on Geographic Information System (GIS) tools

Developing sub-domain verification methods based on Geographic Information System (GIS) tools APPROVED FOR PUBLIC RELEASE: DISTRIBUTION UNLIMITED U.S. Army Research, Development and Engineering Command Developing sub-domain verification methods based on Geographic Information System (GIS) tools

More information