M5MS09. Graphical Modelling

Similar documents
Probability and Random Variables. Generation of random variables (r.v.)

5 Directed acyclic graphs

THE NUMBER OF GRAPHS AND A RANDOM GRAPH WITH A GIVEN DEGREE SEQUENCE. Alexander Barvinok

Statistical machine learning, high dimension and big data

Introduction. Appendix D Mathematical Induction D1

CONDITIONAL, PARTIAL AND RANK CORRELATION FOR THE ELLIPTICAL COPULA; DEPENDENCE MODELLING IN UNCERTAINTY ANALYSIS

Bayesian networks - Time-series models - Apache Spark & Scala

The Basics of Graphical Models

Statistics Graduate Courses

3. The Junction Tree Algorithms

SECTION 10-2 Mathematical Induction

Extracting correlation structure from large random matrices

Precalculus REVERSE CORRELATION. Content Expectations for. Precalculus. Michigan CONTENT EXPECTATIONS FOR PRECALCULUS CHAPTER/LESSON TITLES

FOREWORD. Executive Secretary

1 Causality and graphical models in time series analysis

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.

Graphical Modeling for Genomic Data

Module 3: Correlation and Covariance

Data Structures and Algorithms Written Examination

How To Understand And Solve Algebraic Equations

STA 4273H: Statistical Machine Learning

NEW YORK STATE TEACHER CERTIFICATION EXAMINATIONS

Bayesian Machine Learning (ML): Modeling And Inference in Big Data. Zhuhua Cai Google, Rice University

Load Balancing and Switch Scheduling

Graph Theory Origin and Seven Bridges of Königsberg -Rhishikesh

Alabama Department of Postsecondary Education

Least-Squares Intersection of Lines

SHARP BOUNDS FOR THE SUM OF THE SQUARES OF THE DEGREES OF A GRAPH

McDougal Littell California:

Machine Learning and Data Analysis overview. Department of Cybernetics, Czech Technical University in Prague.

Statistics in Retail Finance. Chapter 6: Behavioural models

QUALITY ENGINEERING PROGRAM

7 Gaussian Elimination and LU Factorization

Course Syllabus For Operations Management. Management Information Systems


Using graphical modelling in official statistics

Bayesian Information Criterion The BIC Of Algebraic geometry

Online Model Predictive Control of a Robotic System by Combining Simulation and Optimization

Diablo Valley College Catalog

Graphical Models in Time Series Analysis

Mathematics Online Instructional Materials Correlation to the 2009 Algebra I Standards of Learning and Curriculum Framework

Spatial Statistics Chapter 3 Basics of areal data and areal data modeling

Geometry Solve real life and mathematical problems involving angle measure, area, surface area and volume.

Math 55: Discrete Mathematics

Algebra Unpacked Content For the new Common Core standards that will be effective in all North Carolina schools in the school year.

Just the Factors, Ma am

LECTURE 4. Last time: Lecture outline

Algebra 1 Course Title

BayesX - Software for Bayesian Inference in Structured Additive Regression

Tensor Factorization for Multi-Relational Learning

Mathematics Course 111: Algebra I Part IV: Vector Spaces

Properties of Real Numbers

MATH ADVISEMENT GUIDE

DRAFT. Further mathematics. GCE AS and A level subject content

1 The Brownian bridge construction

oil liquid water water liquid Answer, Key Homework 2 David McIntyre 1

Piecewise Cubic Splines

A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution

Common Core Unit Summary Grades 6 to 8

Minnesota Academic Standards

Exam Introduction Mathematical Finance and Insurance

2. (a) Explain the strassen s matrix multiplication. (b) Write deletion algorithm, of Binary search tree. [8+8]

What is Linear Programming?

Software Review: ITSM 2000 Professional Version 6.0.

A Sublinear Bipartiteness Tester for Bounded Degree Graphs

DATA ANALYSIS II. Matrix Algorithms

SIMPLIFIED PERFORMANCE MODEL FOR HYBRID WIND DIESEL SYSTEMS. J. F. MANWELL, J. G. McGOWAN and U. ABDULWAHID

Lecture 7: NP-Complete Problems

Small Maximal Independent Sets and Faster Exact Graph Coloring

Solutions to Exam in Speech Signal Processing EN2300

The number of marks is given in brackets [ ] at the end of each question or part question. The total number of marks for this paper is 72.

CS&SS / STAT 566 CAUSAL MODELING

Performance Level Descriptors Grade 6 Mathematics

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

Microeconomic Theory: Basic Math Concepts

Introduction to General and Generalized Linear Models

Analyzing Functions Intervals of Increase & Decrease Lesson 76

Probability Calculator

2.3. Finding polynomial functions. An Introduction:

Introduction to Multivariate Analysis

each college c i C has a capacity q i - the maximum number of students it will admit

Factor analysis. Angela Montanari

IB Maths SL Sequence and Series Practice Problems Mr. W Name

Monitoring the Behaviour of Credit Card Holders with Graphical Chain Models

Stochastic Processes and Queueing Theory used in Cloud Computer Performance Simulations

Multivariate Normal Distribution

Overview of Factor Analysis

Elliptical copulae. Dorota Kurowicka, Jolanta Misiewicz, Roger Cooke

DETERMINANTS IN THE KRONECKER PRODUCT OF MATRICES: THE INCIDENCE MATRIX OF A COMPLETE GRAPH

Granger-causality graphs for multivariate time series

3.2. Solving quadratic equations. Introduction. Prerequisites. Learning Outcomes. Learning Style

Transportation Polytopes: a Twenty year Update

096 Professional Readiness Examination (Mathematics)

Estimated Pre Calculus Pacing Timeline

Prentice Hall Mathematics: Algebra Correlated to: Utah Core Curriculum for Math, Intermediate Algebra (Secondary)

Year 9 mathematics test

Probabilistic Models for Big Data. Alex Davies and Roger Frigola University of Cambridge 13th February 2014

AN ALGORITHM FOR DETERMINING WHETHER A GIVEN BINARY MATROID IS GRAPHIC

Tracking based on graph of pairs of plots

To give it a definition, an implicit function of x and y is simply any relationship that takes the form:

Transcription:

Course: MMS09 Setter: Walden Checker: Ginzberg Editor: Calderhead External: Wood Date: April, 0 MSc EXAMINATIONS (STATISTICS) May-June 0 MMS09 Graphical Modelling Setter s signature Checker s signature Editor s signature................................................... MMS09 Graphical Modelling (0) Page of

MSc EXAMINATIONS (STATISTICS) May-June 0 MMS09 Graphical Modelling Date: Friday st May 0 Time:.0 6.00 Credit will be given for all questions attempted but extra credit will be given for complete or nearly complete answers. Calculators may not be used. c 0 Imperial College London MMS09 Page of

. (a) Consider X = [X, X, X, X ] T having pdf f X (x) = C exp(x + x x + x x x ) on the four dimensional cube 0 < x j <, j =,..., where the constant C ensures integration to unity. (i) Using the factorization theorem draw the corresponding conditional independence graph G. What are the cliques of the graph? Explain their relationship to the factorization of f X (x) according to G. (b) (i) Let G = (V, E) be an undirected graph and consider ( apple j < k apple p). Let A be the adjacency matrix such that A jk = if (j, k) E and zero otherwise. Define the Gaussian graphical model M(G). In the context of a Gaussian graphical model describe what is meant by the datagenerating distribution being faithful to the graph. (c) Let X,..., X N be a sample from a multivariate normal distribution N (0, ) in a Gaussian graphical model M(G) where G = (V, E) is an undirected graph with a single missing edge (, ). The sample covariance matrix and its inverse are b = 0 0 ; b = (i) Carefully describe the single-step Wermuth-Scheidt algorithm and use it to find the modified covariance matrix T such that T jk = 0 if (j, k) 6 E, where as usual T jk is the (j, k)th element of T. Where constraints are imposed by more than one missing edge how is the singlestep Wermuth-Scheidt algorithm modified? (Equations are not required, a short description is su cient). (d) Briefly describe the lasso-type regularization approach to estimating a Gaussian graphical model proposed by Yuan and Lin. What is the key property of the lasso-type approach? The BIC formula is used for determining an appropriate value for the constraint parameter. How are the degrees of freedom calculated in the BIC formula? MMS09 Graphical Modelling (0) Page of

Figure : DAG G!.. (a) (i) State the well-numbered pairwise directed Markov property determined by any DAG graph G!. (iii) Write down the five pairwise conditional independencies implied by the wellnumbered pairwise directed Markov property for the DAG G! in Fig.. Use the conditional independencies from to find the recursive factorization identity for the joint density f X,X,X,X,X in simplified form. Fully explain each step of the simplification. (b) (i) Consider a chain consisting of the vertices i 0, i,..., i m of a DAG G!. Define a collider and a noncollider. Define what it means for two of the vertices of the DAG G! to be d-separated. (iii) State the global directed Markov property of the DAG G!. (iv) Derive the five pairwise conditional independencies of (a) for the DAG G! in Fig. using the global directed Markov property, giving full justifications. MMS09 Graphical Modelling (0) Page of

. (a) (i) Define the undirected partial correlation graph for p-vector-valued stationary time series. In the building of a partial correlation graph researchers typically make use of partial coherence or the spectral matrix and its inverse. Why are such approaches favoured over use of the partial cross-covariance sequence? (b) Describe, in a step-by-step fashion, the building of a partial correlation graph for p- vector-valued stationary time series using the Kullback-Leibler divergence and multiple hypothesis testing. It is not necessary to go into full mathematical details regarding the exact form of the Kullback-Leibler divergence or the form of Matsuda s test statistic. The steps and statistical ideas/terms being used must be clearly explained/defined. MMS09 Graphical Modelling (0) Page of