Local Quantile Regression
|
|
|
- Felicia Sharp
- 5 years ago
- Views:
From this document you will learn the answers to the following questions:
What is the measure of?
Transcription
1 Wolfgang K. Härdle Vladimir Spokoiny Weining Wang Ladislaus von Bortkiewicz Chair of Statistics C.A.S.E. - Center for Applied Statistics and Economics Humboldt-Universität zu Berlin
2 Motivation 1-1 logratio rangesca Figure 1: 221 observations, a light detection and ranging (LIDAR) experiment, X: range distance traveled before the light is reflected back to its source, Y: logarithm of the ratio of received light from two laser sources. 95% quantile.
3
4
5 Motivation 1-2 Quantile Estimation Figure 2: Y: HK stock cheung kong returns, X: HK stock clpholdings returns, daily 01/01/ /26/2010, 90% quantile.
6 Motivation 1-3 Electricity Demand Figure 3: Electricity Demand over Time
7 Motivation 1-4 InitValues$hseq[index] :length(index) logratio rangesca l_tau Figure 4: Plot of quantile for standardized weather residuals over 51 years at Berlin, 90% quantile.
8 Motivation 1-5 Weather Residuals cbind(aaav, bbbv, cccv)[, 1:3] Figure 5: Estimated 90% quantile of variance functions, Berlin, average over , ,
9 Motivation 1-6 Conditional Quantile Regression Median regression = mean regression (symmetric) Describes the conditional behavior of a response Y in a broader spectrum Example Financial Market & Econometrics VaR (Value at Risk) tool Detect conditional heteroskedasticity
10 Motivation 1-7 Example Labor Market To detect discrimination effects, need split other effects at first. log (Income) = A(year, age, education, etc) +β B(gender, nationality, union status, etc) + ε
11 Motivation 1-8 Parametric e.g. polynomial model Interior point method, Koenker and Bassett (1978), Koenker and Park (1996), Portnoy and Koenker (1996). Nonparametric model Check function approach, Fan et al. (1994), Yu and Jones (1997, 1998). Double-kernel technique, Fan et al. (1996). Weighted NW estimation, Hall et al. (1999), Cai (2002). LCP method, Spokoiny (2009). LMS, Katkovnik and Spokoiny (2007)
12 Outline 1. Motivation 2. Basic Setup 3. Parametric Exponential Bounds 4. Calibration by "Propagation" Condition 5. "Small Modeling Bias" and Propagation Properties 6. "Stability" and "Oracle" Property 7. Simulation 8. Application 9. Further work
13 Basic Setup 2-1 Basic Setup {(X i, Y i )} n i=1 i.i.d., pdf f (x, y), f (y x) l(x) = F 1 y x (τ), τ-quantile regression curve l n (x) quantile smoother
14 Basic Setup 2-2 Asymmetric Laplace Distribution (ALD) x Purple are 5,10,...,95 percentiles Figure 6: The cdf and pdf of an ALD with µ = 0, σ = 1.5, τ = 0.8.
15 Basic Setup 2-3 Quantile & ALD Location Model Y i = θ + ε i, ε i ALD τ (0, 1) l = F 1 (τ) = θ = 0 f (u, θ ) = τ(1 τ) exp{ ρ τ (u θ )} l(u, θ ) = log{τ(1 τ)} ρ τ (u θ ) ρ τ (u) def = τu1{u (0, )} (1 τ)u1{u (, 0)}
16 Basic Setup 2-4 Check Function Figure 7: Check function ρ τ conditional mean regression for τ=0.9, τ=0.5 and weight function in
17 Basic Setup 2-5 Quasi-likelihood Approach θ = arg min E λ0 (.) ρ τ (Y θ) θ θ = arg min L(θ) def = arg min θ θ Tail probabilities determine F : n { log τ(1 τ) + ρ τ (Y i θ)} i=1 λ + 0 (y) = (τ 1 y) 1 log{τ 1 P(Y 1 > y)}, y > 0 λ 0 (y) = {(1 τ) 1 y} 1 log{(1 τ) 1 P(Y 1 < y)}, y < 0 Assume λ +( ) 0 (y){λ 0 (y)} 0 as y +/ (heavy tails).
18 Basic Setup 2-6 Conditional Quantile Regression l(x) = argmin θ E λ0 (.){ρ τ (Y θ) X = x} l n (θ) = argmin θ n 1 n ρ τ (Y i θ)k h(x) (x X i ) i=1 K h(x) (u) = h(x) 1 K{u/h(x)} is a kernel with bandwidth h(x). L(θ, θ ) def = L(Y, θ) L(Y, θ )
19 Basic Setup 2-7 Local Polynomial Quantile Regression l(u) θ 0 (u) + θ 1 (u)(x u) + θ 2 (u)(x u) θ p (u)(x u) p, θ(u) = { θ 0 (u), θ 1 (u),..., θ p (u)} (1) n n = argmax θ Θ log τ(1 τ) w i ρ τ (Y i θ ψ i )w i i=1 = argmin θ Θ ρ τ (Y i θ ψ i )w i, i=1 i=1 where ψ i = {(X i u), (X i u) 2,..., (X i u) p } and w i = K{(x i u)/h}, for i in 1, 2,..., n.
20 Parametric Exponential Bounds 3-1 First Step to Happiness
21 Parametric Exponential Bounds 3-2 Parametric Exponential Bounds Theorem 1 Let V 0 = V (θ ). Let R satisfy λ qr/µ for µ = a/(2ν 0 q 3 ). We have for any r < R, if R µλ /(ν 0 q), then for any z > 0 with 2z/a R 2, it holds P λ0 (.){L( θ, θ ) > z} exp{ za/(ν 0 q 3 ) Q 0 (p, q)} + exp{ g(r)}, (2) where g(r) is defined as: g(r) def = inf sup θ Θ µ M = inf θ Θ g(r, θ, θ ) {µr + C(µ, θ, θ )} Go to details
22 Parametric Exponential Bounds 3-3 Parametric Exponential Bounds Theorem 2 Under technical assumptions, where R r > 0 is a constant. E λ0 (.) L(θ, θ ) r R r,
23 Calibration by "Propagation" Condition 4-1 Adaptation Scale Fix x, and a sequence of ordered weights Define LPA: l(x i ) l θ (X i ) W (k) = (w (k) 1, w (k) 2,..., w n (k) ) w (k) i = K hk (x X i ), (h 1 < h 2 <... < h K ) θ k = argmax θ def = argmax θ n i=1 l{y i, l θ (X i )}w (k) i L(W (k), θ)
24 Calibration by "Propagation" Condition 4-2 LMS Procedure Construct an estimate ˆθ = ˆθ(x), based on θ 1, θ 2,..., θ K. Start with ˆθ 1 = θ 1. For k 2, θ k is accepted and ˆθ k = θ k if θ k 1 was accepted and L(W (l), θ l, θ k ) z l, l = 1,..., k 1 ˆθ k is the latest accepted estimate after the first k steps. L(W (k), θ, θ ) = L(W (k), θ) L(W (k), θ )
25 Local Model Selection procedure Calibration by "Propagation" Condition 4-3 LMS procedure LMS: Illustration Illustration CS θ k + 1 θ 2 θ1 θ 3 θ k * k * k +1 Stop
26 Calibration by "Propagation" Condition 4-4 "Propagation" Condition The critical value is determined via a false alarm consideration: E θ L(W (k), θ k, ˆθ k ) r R r (W (k) ) α, where R r (W (k) ) is a risk bound depending on W (k).
27 Calibration by "Propagation" Condition 4-5 Critical Values Consider first z 1 letting z 2 =... = z K 1 =. Leads to the estimates ˆθ k (z 1 ) for k = 2,..., K. The value z 1 is selected as the minimal one for which L{W (k), θ k, ˆθ k (z 1 )} r sup E θ,λ(.) θ R r (W (k) ) α, k = 2,..., K. K 1 Set z k+1 =... = z K 1 = and fix z k gives the set of parameters z 1,..., z k,,..., and the estimates ˆθ m (z 1,..., z k ) for m = k + 1,..., K. Select z k s.t. L{W (k), θ m, ˆθ m (z 1, z 2,..., z k )} r sup E θ,λ(.) θ R r (W (k) ) m = k + 1,..., K. kα K 1,
28 Calibration by "Propagation" Condition 4-6 c hseq Figure 8: Critical value with λ(.) corresponding to standard normal distribution, t(3), pareto(0.5,1) ρ = 0.2, r = 0.5, τ = 0.75.
29 Calibration by "Propagation" Condition 4-7 Bounds for Critical Values Theorem 3 Under the assumptions of probability bounds, there are a 0, a 1, a 2, s.t. the propagation condition is fulfilled with the choice of z k = a 0 r log(ρ 1 ) + a 1 r log(h K /h k ) + a 2 log(nh k ).
30 "Small Modeling Bias" and Propagation Properties 5-1 Small Modeling Bias λ0 (.)(W (k), θ) = n i=1 K{P l(xi ),λ 0 (.), P lθ (X i ),λ 0 (.)}1{w (k) i > 0} "Small Modeling Bias" Condition: λ0 (.)(W (k), θ), k < k E λ0 (.) log{1 + L(W (k), θ k, θ) r /R r (W (k) )} + 1,
31 "Small Modeling Bias" and Propagation Properties 5-2 Stability The attained quality of estimation during "propagation" can not get lost at further steps. Theorem 4 L(W (k ), θ k, θˆk)1{ˆk > k } z k
32 "Small Modeling Bias" and Propagation Properties 5-3 Oracle Combing the "propagation" and "stability" results yields Theorem 5 Let (W (k), θ) for some θ Θ and k k. Then E λ0 (.) log {1 + L(W (k ), θ k, θ) r } R r (W (k ) + 1 ) E λ0 (.) log {1 + L(W (k ), θ k, ˆθ) r } R r (W (k ) + α ) z k + log{1 + R r (W (k ) ) }
33 Monte Carlo Simulation Figure 9: The bandwidth sequence (upper panel), the data with exp(2) noise, the adaptive estimation of 0.75 quantile, the quantile smoother with fixed optimal bandwidth = 0.08 (yellow)
34 Monte Carlo Simulation Figure 10: The block residuals of the fixed bandwidth estimation (upper panel)and the adaptive estimation (lower panel)
35 Monte Carlo Simulation 6-3 Figure 11: The bandwidth sequence (upper panel), the data with standard normal noise, the adaptive estimation of 0.75 quantile, the quantile smoother with fixed optimal bandwidth = (yellow),
36 Monte Carlo Simulation Figure 12: The bandwidth sequence (upper panel), the data with t(3) noise, the adaptive estimation of 0.75 quantile, the quantile smoother with fixed optimal bandwidth = 0.06 (yellow)
37 Monte Carlo Simulation 6-5 Local linear Figure 13: The bandwidth sequence (upper left panel), the data with t(3) noise, the adaptive estimation of 0.80 quantile, the quantile smoother with fixed optimal bandwidth = 0.06 (yellow) ; The blocked error fixed vs. adaptive(right panel).
38 Monte Carlo Simulation Figure 14: The bandwidth sequence (upper left panel), the true trend curve (lower left panel), the adaptive estimation of 0.80 quantile,the quantile smoother with fixed optimal bandwidth = (yellow); The blocked error fixed vs. adaptive (right panel).
39 Application 7-1 Tail Dependency y Figure 15: Plot of quantile curve for two normal random variables with ρ = x
40 Application 7-2 q[ ] q q _ Figure 16: Plot of bandwidth sequence (upper left), plot of quantile smoother (upper left), scatter plot on scaled X (lower left), original scale (lower right)
41 Application Figure 17: 221 observations, a light detection and ranging (LIDAR) experiment, X: range distance travelled before the light is reflected back to its source, Y: logarithm of the ratio of received light from two laser sources.
42
43 Application 7-4 cbind(aaav, bbbv, cccv)[, 1:3] Figure 18: Estimated 90% quantile of variance functions, Kaoshiung, average over , ,
44 Wolfgang K. Härdle Vladimir Spokoiny Weining Wang Ladislaus von Bortkiewicz Chair of Statistics C.A.S.E. - Center for Applied Statistics and Economics Humboldt-Universität zu Berlin
45 Assumptions and Definitions C(µ, θ, θ ) = q 1 M(qµ, θ, θ ) (p+1) log + {ɛ 1 µ V (θ θ ) } Q(p, q) and Q(p, q) = ν 0 qɛ 2 /2 log(2 + 2p) pc(q) log(1 a (p+1) )
46 Appendix 8-7 Assumptions and Definitions ζ(θ) ζ(θ, θ ) N(µ, θ, θ ) M(µ, θ, θ ) def = L(θ) E L(θ) (3) def = ζ(θ) ζ(θ ) def = log E exp{µζ(θ, θ )} def = log E exp{µl(θ, θ )}
47 Appendix 8-8 Assumptions and Definitions ζ(θ) satisfying the condition (ED): λ > 0 and a symmetric positive matrix V 2 s.t. 0 < λ λ, log E exp{λγ ζ(θ)} ν 0 λ 2 V 2 γ 2 /2, (4) for some fixed ν 0 1 and γ R p+1. Set V 2 (θ) = n i=1 ψ iψi p i (ψ θ)w i. (L) For R > 0, there exists constant a = a(r) > 0 such that it holds on the set Θ 0 (R) = {θ : V (θ θ ) R}: E L(θ, θ ) a V (θ θ ) 2 /2 return
48 Appendix 8-9 References Z. W. Cai Regression quantiles for time series Econometric Theory, 18: , J. Fan and T. C. Hu and Y. K. Troung Robust nonparametric function estimation Scandinavian Journal of Statistics, 21: , Y. Golubev and V. Spokoiny Exponential bounds for minimum contrast estimators Electronic Journal of Statistics, ISSN:
49 Appendix 8-10 References W. Härdle, P. Janssen and R. Serfling Strong uniform consistency rates for estimators of conditional functionals Ann. Statist., 16: X. He and H. Liang Quantile regression estimates for a class of linear and partially linear errors-in-variables model Ann. Statist., 10: K. Jeong and W. Härdle A Consistent Nonparametric Test for Causality in Quantile SFB 649 Discussion Paper, 2008.
50 Appendix 8-11 References V. Katkovnik and V. Spokoiny Spatially adaptive estimation via fitted local likelihood (FLL) techniques R. Koenker and G. W. Bassett Regression quantiles Econometrica, 46:33-50, 1978.
51 Appendix 8-12 References R. Koenker and B. J. Park An interior point algorithm for nonlinear quantile regression Journal of Econometrics, 71: M. G. Lejeune and P. Sarda Quantile regression: A nonparametric approach Computational Statistics & Data Analysis, 6: K. Murphy and F. Welch Empirical age-earnings profiles Journal of Labor Economics, 8(2):
52 Appendix 8-13 References S. Portnoy and R. Koenker The Gaussian hare and the Laplacian tortoise: computability of squared-error versus absolute -error estimatiors (with discussion) Statistical Sciences, 12: , V. Spokoiny Local parametric estimation Heildelberg: Springer Verlag, 2008 K. Yu and M. C. Jones A comparison of local constant and local linear regression quantile estimation Computational Statistics and Data Analysis, 25: , 1997.
53 Appendix 8-14 References K. Yu and M. C. Jones Local linear quantile regression J. Amer. Statist. Assoc., 93: , K. Yu, Z. Lu and J. Stander Quantile regression: applications and current research areas J. Roy. Statistical Society: Series D (The Statistician), 52: , 2003.
α α λ α = = λ λ α ψ = = α α α λ λ ψ α = + β = > θ θ β > β β θ θ θ β θ β γ θ β = γ θ > β > γ θ β γ = θ β = θ β = θ β = β θ = β β θ = = = β β θ = + α α α α α = = λ λ λ λ λ λ λ = λ λ α α α α λ ψ + α =
Maximum Likelihood Estimation
Math 541: Statistical Theory II Lecturer: Songfeng Zheng Maximum Likelihood Estimation 1 Maximum Likelihood Estimation Maximum likelihood is a relatively simple method of constructing an estimator for
Basics of Statistical Machine Learning
CS761 Spring 2013 Advanced Machine Learning Basics of Statistical Machine Learning Lecturer: Xiaojin Zhu [email protected] Modern machine learning is rooted in statistics. You will find many familiar
Monte Carlo Simulation
1 Monte Carlo Simulation Stefan Weber Leibniz Universität Hannover email: [email protected] web: www.stochastik.uni-hannover.de/ sweber Monte Carlo Simulation 2 Quantifying and Hedging
Two Topics in Parametric Integration Applied to Stochastic Simulation in Industrial Engineering
Two Topics in Parametric Integration Applied to Stochastic Simulation in Industrial Engineering Department of Industrial Engineering and Management Sciences Northwestern University September 15th, 2014
Exact Confidence Intervals
Math 541: Statistical Theory II Instructor: Songfeng Zheng Exact Confidence Intervals Confidence intervals provide an alternative to using an estimator ˆθ when we wish to estimate an unknown parameter
Generating Random Numbers Variance Reduction Quasi-Monte Carlo. Simulation Methods. Leonid Kogan. MIT, Sloan. 15.450, Fall 2010
Simulation Methods Leonid Kogan MIT, Sloan 15.450, Fall 2010 c Leonid Kogan ( MIT, Sloan ) Simulation Methods 15.450, Fall 2010 1 / 35 Outline 1 Generating Random Numbers 2 Variance Reduction 3 Quasi-Monte
Recent Developments of Statistical Application in. Finance. Ruey S. Tsay. Graduate School of Business. The University of Chicago
Recent Developments of Statistical Application in Finance Ruey S. Tsay Graduate School of Business The University of Chicago Guanghua Conference, June 2004 Summary Focus on two parts: Applications in Finance:
An Introduction to Machine Learning
An Introduction to Machine Learning L5: Novelty Detection and Regression Alexander J. Smola Statistical Machine Learning Program Canberra, ACT 0200 Australia [email protected] Tata Institute, Pune,
How to assess the risk of a large portfolio? How to estimate a large covariance matrix?
Chapter 3 Sparse Portfolio Allocation This chapter touches some practical aspects of portfolio allocation and risk assessment from a large pool of financial assets (e.g. stocks) How to assess the risk
Dongfeng Li. Autumn 2010
Autumn 2010 Chapter Contents Some statistics background; ; Comparing means and proportions; variance. Students should master the basic concepts, descriptive statistics measures and graphs, basic hypothesis
For a partition B 1,..., B n, where B i B j = for i. A = (A B 1 ) (A B 2 ),..., (A B n ) and thus. P (A) = P (A B i ) = P (A B i )P (B i )
Probability Review 15.075 Cynthia Rudin A probability space, defined by Kolmogorov (1903-1987) consists of: A set of outcomes S, e.g., for the roll of a die, S = {1, 2, 3, 4, 5, 6}, 1 1 2 1 6 for the roll
Sales forecasting # 2
Sales forecasting # 2 Arthur Charpentier [email protected] 1 Agenda Qualitative and quantitative methods, a very general introduction Series decomposition Short versus long term forecasting
Credit Risk Models: An Overview
Credit Risk Models: An Overview Paul Embrechts, Rüdiger Frey, Alexander McNeil ETH Zürich c 2003 (Embrechts, Frey, McNeil) A. Multivariate Models for Portfolio Credit Risk 1. Modelling Dependent Defaults:
VI. Real Business Cycles Models
VI. Real Business Cycles Models Introduction Business cycle research studies the causes and consequences of the recurrent expansions and contractions in aggregate economic activity that occur in most industrialized
Spatial Statistics Chapter 3 Basics of areal data and areal data modeling
Spatial Statistics Chapter 3 Basics of areal data and areal data modeling Recall areal data also known as lattice data are data Y (s), s D where D is a discrete index set. This usually corresponds to data
Working Paper A simple graphical method to explore taildependence
econstor www.econstor.eu Der Open-Access-Publikationsserver der ZBW Leibniz-Informationszentrum Wirtschaft The Open Access Publication Server of the ZBW Leibniz Information Centre for Economics Abberger,
An Internal Model for Operational Risk Computation
An Internal Model for Operational Risk Computation Seminarios de Matemática Financiera Instituto MEFF-RiskLab, Madrid http://www.risklab-madrid.uam.es/ Nicolas Baud, Antoine Frachot & Thierry Roncalli
Note on the EM Algorithm in Linear Regression Model
International Mathematical Forum 4 2009 no. 38 1883-1889 Note on the M Algorithm in Linear Regression Model Ji-Xia Wang and Yu Miao College of Mathematics and Information Science Henan Normal University
A LOGNORMAL MODEL FOR INSURANCE CLAIMS DATA
REVSTAT Statistical Journal Volume 4, Number 2, June 2006, 131 142 A LOGNORMAL MODEL FOR INSURANCE CLAIMS DATA Authors: Daiane Aparecida Zuanetti Departamento de Estatística, Universidade Federal de São
Geostatistics Exploratory Analysis
Instituto Superior de Estatística e Gestão de Informação Universidade Nova de Lisboa Master of Science in Geospatial Technologies Geostatistics Exploratory Analysis Carlos Alberto Felgueiras [email protected]
A HYBRID GENETIC ALGORITHM FOR THE MAXIMUM LIKELIHOOD ESTIMATION OF MODELS WITH MULTIPLE EQUILIBRIA: A FIRST REPORT
New Mathematics and Natural Computation Vol. 1, No. 2 (2005) 295 303 c World Scientific Publishing Company A HYBRID GENETIC ALGORITHM FOR THE MAXIMUM LIKELIHOOD ESTIMATION OF MODELS WITH MULTIPLE EQUILIBRIA:
Department of Mathematics, Indian Institute of Technology, Kharagpur Assignment 2-3, Probability and Statistics, March 2015. Due:-March 25, 2015.
Department of Mathematics, Indian Institute of Technology, Kharagpur Assignment -3, Probability and Statistics, March 05. Due:-March 5, 05.. Show that the function 0 for x < x+ F (x) = 4 for x < for x
A SURVEY ON CONTINUOUS ELLIPTICAL VECTOR DISTRIBUTIONS
A SURVEY ON CONTINUOUS ELLIPTICAL VECTOR DISTRIBUTIONS Eusebio GÓMEZ, Miguel A. GÓMEZ-VILLEGAS and J. Miguel MARÍN Abstract In this paper it is taken up a revision and characterization of the class of
Java Modules for Time Series Analysis
Java Modules for Time Series Analysis Agenda Clustering Non-normal distributions Multifactor modeling Implied ratings Time series prediction 1. Clustering + Cluster 1 Synthetic Clustering + Time series
Errata and updates for ASM Exam C/Exam 4 Manual (Sixteenth Edition) sorted by page
Errata for ASM Exam C/4 Study Manual (Sixteenth Edition) Sorted by Page 1 Errata and updates for ASM Exam C/Exam 4 Manual (Sixteenth Edition) sorted by page Practice exam 1:9, 1:22, 1:29, 9:5, and 10:8
Probabilistic Models for Big Data. Alex Davies and Roger Frigola University of Cambridge 13th February 2014
Probabilistic Models for Big Data Alex Davies and Roger Frigola University of Cambridge 13th February 2014 The State of Big Data Why probabilistic models for Big Data? 1. If you don t have to worry about
A Uniform Asymptotic Estimate for Discounted Aggregate Claims with Subexponential Tails
12th International Congress on Insurance: Mathematics and Economics July 16-18, 2008 A Uniform Asymptotic Estimate for Discounted Aggregate Claims with Subexponential Tails XUEMIAO HAO (Based on a joint
Detection of changes in variance using binary segmentation and optimal partitioning
Detection of changes in variance using binary segmentation and optimal partitioning Christian Rohrbeck Abstract This work explores the performance of binary segmentation and optimal partitioning in the
Online Appendices to the Corporate Propensity to Save
Online Appendices to the Corporate Propensity to Save Appendix A: Monte Carlo Experiments In order to allay skepticism of empirical results that have been produced by unusual estimators on fairly small
Fitting Subject-specific Curves to Grouped Longitudinal Data
Fitting Subject-specific Curves to Grouped Longitudinal Data Djeundje, Viani Heriot-Watt University, Department of Actuarial Mathematics & Statistics Edinburgh, EH14 4AS, UK E-mail: [email protected] Currie,
R 2 -type Curves for Dynamic Predictions from Joint Longitudinal-Survival Models
Faculty of Health Sciences R 2 -type Curves for Dynamic Predictions from Joint Longitudinal-Survival Models Inference & application to prediction of kidney graft failure Paul Blanche joint work with M-C.
Lecture 8: More Continuous Random Variables
Lecture 8: More Continuous Random Variables 26 September 2005 Last time: the eponential. Going from saying the density e λ, to f() λe λ, to the CDF F () e λ. Pictures of the pdf and CDF. Today: the Gaussian
1 Prior Probability and Posterior Probability
Math 541: Statistical Theory II Bayesian Approach to Parameter Estimation Lecturer: Songfeng Zheng 1 Prior Probability and Posterior Probability Consider now a problem of statistical inference in which
A Log-Robust Optimization Approach to Portfolio Management
A Log-Robust Optimization Approach to Portfolio Management Dr. Aurélie Thiele Lehigh University Joint work with Ban Kawas Research partially supported by the National Science Foundation Grant CMMI-0757983
Statistical Machine Learning
Statistical Machine Learning UoC Stats 37700, Winter quarter Lecture 4: classical linear and quadratic discriminants. 1 / 25 Linear separation For two classes in R d : simple idea: separate the classes
Permutation Tests for Comparing Two Populations
Permutation Tests for Comparing Two Populations Ferry Butar Butar, Ph.D. Jae-Wan Park Abstract Permutation tests for comparing two populations could be widely used in practice because of flexibility of
Lecture 2 ESTIMATING THE SURVIVAL FUNCTION. One-sample nonparametric methods
Lecture 2 ESTIMATING THE SURVIVAL FUNCTION One-sample nonparametric methods There are commonly three methods for estimating a survivorship function S(t) = P (T > t) without resorting to parametric models:
Using the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, and Discrete Changes
Using the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, Discrete Changes JunXuJ.ScottLong Indiana University August 22, 2005 The paper provides technical details on
Principle of Data Reduction
Chapter 6 Principle of Data Reduction 6.1 Introduction An experimenter uses the information in a sample X 1,..., X n to make inferences about an unknown parameter θ. If the sample size n is large, then
On Marginal Effects in Semiparametric Censored Regression Models
On Marginal Effects in Semiparametric Censored Regression Models Bo E. Honoré September 3, 2008 Introduction It is often argued that estimation of semiparametric censored regression models such as the
Chapter 3 RANDOM VARIATE GENERATION
Chapter 3 RANDOM VARIATE GENERATION In order to do a Monte Carlo simulation either by hand or by computer, techniques must be developed for generating values of random variables having known distributions.
Chapter 13 Introduction to Nonlinear Regression( 非 線 性 迴 歸 )
Chapter 13 Introduction to Nonlinear Regression( 非 線 性 迴 歸 ) and Neural Networks( 類 神 經 網 路 ) 許 湘 伶 Applied Linear Regression Models (Kutner, Nachtsheim, Neter, Li) hsuhl (NUK) LR Chap 10 1 / 35 13 Examples
M1 in Economics and Economics and Statistics Applied multivariate Analysis - Big data analytics Worksheet 1 - Bootstrap
Nathalie Villa-Vialanei Année 2015/2016 M1 in Economics and Economics and Statistics Applied multivariate Analsis - Big data analtics Worksheet 1 - Bootstrap This worksheet illustrates the use of nonparametric
A Study on the Comparison of Electricity Forecasting Models: Korea and China
Communications for Statistical Applications and Methods 2015, Vol. 22, No. 6, 675 683 DOI: http://dx.doi.org/10.5351/csam.2015.22.6.675 Print ISSN 2287-7843 / Online ISSN 2383-4757 A Study on the Comparison
Nonlinear Regression:
Zurich University of Applied Sciences School of Engineering IDP Institute of Data Analysis and Process Design Nonlinear Regression: A Powerful Tool With Considerable Complexity Half-Day : Improved Inference
Nonparametric adaptive age replacement with a one-cycle criterion
Nonparametric adaptive age replacement with a one-cycle criterion P. Coolen-Schrijner, F.P.A. Coolen Department of Mathematical Sciences University of Durham, Durham, DH1 3LE, UK e-mail: [email protected]
Cyber-Security Analysis of State Estimators in Power Systems
Cyber-Security Analysis of State Estimators in Electric Power Systems André Teixeira 1, Saurabh Amin 2, Henrik Sandberg 1, Karl H. Johansson 1, and Shankar Sastry 2 ACCESS Linnaeus Centre, KTH-Royal Institute
Gamma Distribution Fitting
Chapter 552 Gamma Distribution Fitting Introduction This module fits the gamma probability distributions to a complete or censored set of individual or grouped data values. It outputs various statistics
problem arises when only a non-random sample is available differs from censored regression model in that x i is also unobserved
4 Data Issues 4.1 Truncated Regression population model y i = x i β + ε i, ε i N(0, σ 2 ) given a random sample, {y i, x i } N i=1, then OLS is consistent and efficient problem arises when only a non-random
Bandwidth Selection for Nonparametric Distribution Estimation
Bandwidth Selection for Nonparametric Distribution Estimation Bruce E. Hansen University of Wisconsin www.ssc.wisc.edu/~bhansen May 2004 Abstract The mean-square efficiency of cumulative distribution function
Monte Carlo and Empirical Methods for Stochastic Inference (MASM11/FMS091)
Monte Carlo and Empirical Methods for Stochastic Inference (MASM11/FMS091) Magnus Wiktorsson Centre for Mathematical Sciences Lund University, Sweden Lecture 5 Sequential Monte Carlo methods I February
Growth Charts of Body Mass Index (BMI) with Quantile Regression
Growth Charts of Body Mass Index (BMI) with Quantile Regression Colin Chen SAS Institute Inc. Cary, NC, U.S.A. Abstract Growth charts of body mass index (BMI) are constructed from the recent four-year
HYPOTHESIS TESTING: POWER OF THE TEST
HYPOTHESIS TESTING: POWER OF THE TEST The first 6 steps of the 9-step test of hypothesis are called "the test". These steps are not dependent on the observed data values. When planning a research project,
1 Sufficient statistics
1 Sufficient statistics A statistic is a function T = rx 1, X 2,, X n of the random sample X 1, X 2,, X n. Examples are X n = 1 n s 2 = = X i, 1 n 1 the sample mean X i X n 2, the sample variance T 1 =
Contents. List of Figures. List of Tables. List of Examples. Preface to Volume IV
Contents List of Figures List of Tables List of Examples Foreword Preface to Volume IV xiii xvi xxi xxv xxix IV.1 Value at Risk and Other Risk Metrics 1 IV.1.1 Introduction 1 IV.1.2 An Overview of Market
Supplement to Call Centers with Delay Information: Models and Insights
Supplement to Call Centers with Delay Information: Models and Insights Oualid Jouini 1 Zeynep Akşin 2 Yves Dallery 1 1 Laboratoire Genie Industriel, Ecole Centrale Paris, Grande Voie des Vignes, 92290
Numerical methods for American options
Lecture 9 Numerical methods for American options Lecture Notes by Andrzej Palczewski Computational Finance p. 1 American options The holder of an American option has the right to exercise it at any moment
Chapter 4: Statistical Hypothesis Testing
Chapter 4: Statistical Hypothesis Testing Christophe Hurlin November 20, 2015 Christophe Hurlin () Advanced Econometrics - Master ESA November 20, 2015 1 / 225 Section 1 Introduction Christophe Hurlin
Sovereign Defaults. Iskander Karibzhanov. October 14, 2014
Sovereign Defaults Iskander Karibzhanov October 14, 214 1 Motivation Two recent papers advance frontiers of sovereign default modeling. First, Aguiar and Gopinath (26) highlight the importance of fluctuations
EMPIRICAL RISK MINIMIZATION FOR CAR INSURANCE DATA
EMPIRICAL RISK MINIMIZATION FOR CAR INSURANCE DATA Andreas Christmann Department of Mathematics homepages.vub.ac.be/ achristm Talk: ULB, Sciences Actuarielles, 17/NOV/2006 Contents 1. Project: Motor vehicle
NIST GCR 06-906 Probabilistic Models for Directionless Wind Speeds in Hurricanes
NIST GCR 06-906 Probabilistic Models for Directionless Wind Speeds in Hurricanes Mircea Grigoriu Cornell University NIST GCR 06-906 Probabilistic Models for Directionless Wind Speeds in Hurricanes Prepared
Planning And Scheduling Maintenance Resources In A Complex System
Planning And Scheduling Maintenance Resources In A Complex System Martin Newby School of Engineering and Mathematical Sciences City University of London London m.j.newbycity.ac.uk Colin Barker QuaRCQ Group
Lecture 3: Linear methods for classification
Lecture 3: Linear methods for classification Rafael A. Irizarry and Hector Corrada Bravo February, 2010 Today we describe four specific algorithms useful for classification problems: linear regression,
MATH4427 Notebook 2 Spring 2016. 2 MATH4427 Notebook 2 3. 2.1 Definitions and Examples... 3. 2.2 Performance Measures for Estimators...
MATH4427 Notebook 2 Spring 2016 prepared by Professor Jenny Baglivo c Copyright 2009-2016 by Jenny A. Baglivo. All Rights Reserved. Contents 2 MATH4427 Notebook 2 3 2.1 Definitions and Examples...................................
A Comparison of Correlation Coefficients via a Three-Step Bootstrap Approach
ISSN: 1916-9795 Journal of Mathematics Research A Comparison of Correlation Coefficients via a Three-Step Bootstrap Approach Tahani A. Maturi (Corresponding author) Department of Mathematical Sciences,
Comparison of resampling method applied to censored data
International Journal of Advanced Statistics and Probability, 2 (2) (2014) 48-55 c Science Publishing Corporation www.sciencepubco.com/index.php/ijasp doi: 10.14419/ijasp.v2i2.2291 Research Paper Comparison
1.1 Introduction, and Review of Probability Theory... 3. 1.1.1 Random Variable, Range, Types of Random Variables... 3. 1.1.2 CDF, PDF, Quantiles...
MATH4427 Notebook 1 Spring 2016 prepared by Professor Jenny Baglivo c Copyright 2009-2016 by Jenny A. Baglivo. All Rights Reserved. Contents 1 MATH4427 Notebook 1 3 1.1 Introduction, and Review of Probability
Reject Inference in Credit Scoring. Jie-Men Mok
Reject Inference in Credit Scoring Jie-Men Mok BMI paper January 2009 ii Preface In the Master programme of Business Mathematics and Informatics (BMI), it is required to perform research on a business
Option Valuation Using Intraday Data
Option Valuation Using Intraday Data Peter Christoffersen Rotman School of Management, University of Toronto, Copenhagen Business School, and CREATES, University of Aarhus 2nd Lecture on Thursday 1 Model
Standard errors of marginal effects in the heteroskedastic probit model
Standard errors of marginal effects in the heteroskedastic probit model Thomas Cornelißen Discussion Paper No. 320 August 2005 ISSN: 0949 9962 Abstract In non-linear regression models, such as the heteroskedastic
Linear Classification. Volker Tresp Summer 2015
Linear Classification Volker Tresp Summer 2015 1 Classification Classification is the central task of pattern recognition Sensors supply information about an object: to which class do the object belong
Class #6: Non-linear classification. ML4Bio 2012 February 17 th, 2012 Quaid Morris
Class #6: Non-linear classification ML4Bio 2012 February 17 th, 2012 Quaid Morris 1 Module #: Title of Module 2 Review Overview Linear separability Non-linear classification Linear Support Vector Machines
Average Redistributional Effects. IFAI/IZA Conference on Labor Market Policy Evaluation
Average Redistributional Effects IFAI/IZA Conference on Labor Market Policy Evaluation Geert Ridder, Department of Economics, University of Southern California. October 10, 2006 1 Motivation Most papers
INDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition)
INDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition) Abstract Indirect inference is a simulation-based method for estimating the parameters of economic models. Its
VISUALIZATION OF DENSITY FUNCTIONS WITH GEOGEBRA
VISUALIZATION OF DENSITY FUNCTIONS WITH GEOGEBRA Csilla Csendes University of Miskolc, Hungary Department of Applied Mathematics ICAM 2010 Probability density functions A random variable X has density
Course 4 Examination Questions And Illustrative Solutions. November 2000
Course 4 Examination Questions And Illustrative Solutions Novemer 000 1. You fit an invertile first-order moving average model to a time series. The lag-one sample autocorrelation coefficient is 0.35.
Comparing Neural Networks and ARMA Models in Artificial Stock Market
Comparing Neural Networks and ARMA Models in Artificial Stock Market Jiří Krtek Academy of Sciences of the Czech Republic, Institute of Information Theory and Automation. e-mail: [email protected]
Normal Distribution. Definition A continuous random variable has a normal distribution if its probability density. f ( y ) = 1.
Normal Distribution Definition A continuous random variable has a normal distribution if its probability density e -(y -µ Y ) 2 2 / 2 σ function can be written as for < y < as Y f ( y ) = 1 σ Y 2 π Notation:
MATH 10: Elementary Statistics and Probability Chapter 5: Continuous Random Variables
MATH 10: Elementary Statistics and Probability Chapter 5: Continuous Random Variables Tony Pourmohamad Department of Mathematics De Anza College Spring 2015 Objectives By the end of this set of slides,
Credit Scoring Modelling for Retail Banking Sector.
Credit Scoring Modelling for Retail Banking Sector. Elena Bartolozzi, Matthew Cornford, Leticia García-Ergüín, Cristina Pascual Deocón, Oscar Iván Vasquez & Fransico Javier Plaza. II Modelling Week, Universidad
Syntax Menu Description Options Remarks and examples Stored results Methods and formulas References Also see. Description
Title stata.com lpoly Kernel-weighted local polynomial smoothing Syntax Menu Description Options Remarks and examples Stored results Methods and formulas References Also see Syntax lpoly yvar xvar [ if
Investors and Central Bank s Uncertainty Embedded in Index Options On-Line Appendix
Investors and Central Bank s Uncertainty Embedded in Index Options On-Line Appendix Alexander David Haskayne School of Business, University of Calgary Pietro Veronesi University of Chicago Booth School
Linear Threshold Units
Linear Threshold Units w x hx (... w n x n w We assume that each feature x j and each weight w j is a real number (we will relax this later) We will study three different algorithms for learning linear
University of Maryland Fraternity & Sorority Life Spring 2015 Academic Report
University of Maryland Fraternity & Sorority Life Academic Report Academic and Population Statistics Population: # of Students: # of New Members: Avg. Size: Avg. GPA: % of the Undergraduate Population
Univariate Time Series Analysis; ARIMA Models
Econometrics 2 Fall 25 Univariate Time Series Analysis; ARIMA Models Heino Bohn Nielsen of4 Univariate Time Series Analysis We consider a single time series, y,y 2,..., y T. We want to construct simple
General Sampling Methods
General Sampling Methods Reference: Glasserman, 2.2 and 2.3 Claudio Pacati academic year 2016 17 1 Inverse Transform Method Assume U U(0, 1) and let F be the cumulative distribution function of a distribution
Estimation and attribution of changes in extreme weather and climate events
IPCC workshop on extreme weather and climate events, 11-13 June 2002, Beijing. Estimation and attribution of changes in extreme weather and climate events Dr. David B. Stephenson Department of Meteorology
