2. Descriptive statistics in EViews



Similar documents
Data exploration with Microsoft Excel: univariate analysis

A Short Introduction to Eviews

2. Filling Data Gaps, Data validation & Descriptive Statistics

Below is a very brief tutorial on the basic capabilities of Excel. Refer to the Excel help files for more information.

An introduction to using Microsoft Excel for quantitative data analysis

Data exploration with Microsoft Excel: analysing more than one variable

Java Modules for Time Series Analysis

Quantitative Methods for Finance

Data Analysis Tools. Tools for Summarizing Data

Descriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion

Improving the Performance of Data Mining Models with Data Preparation Using SAS Enterprise Miner Ricardo Galante, SAS Institute Brasil, São Paulo, SP

MBA 611 STATISTICS AND QUANTITATIVE METHODS

Data analysis and regression in Stata

business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics

Introduction; Descriptive & Univariate Statistics

Engineering Problem Solving and Excel. EGN 1006 Introduction to Engineering

Probability and Statistics Vocabulary List (Definitions for Middle School Teachers)

Description. Textbook. Grading. Objective

PROPERTIES OF THE SAMPLE CORRELATION OF THE BIVARIATE LOGNORMAL DISTRIBUTION

Forecasting in STATA: Tools and Tricks

Module 3: Correlation and Covariance

Descriptive Statistics

Lecture 1: Review and Exploratory Data Analysis (EDA)

How to Use EViews (Econometric Views)

OLS Examples. OLS Regression

Introduction to Risk, Return and the Historical Record

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

Data Analysis. Using Excel. Jeffrey L. Rummel. BBA Seminar. Data in Excel. Excel Calculations of Descriptive Statistics. Single Variable Graphs

seven Statistical Analysis with Excel chapter OVERVIEW CHAPTER

3. What is the difference between variance and standard deviation? 5. If I add 2 to all my observations, how variance and mean will vary?

Technical Efficiency Accounting for Environmental Influence in the Japanese Gas Market

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur

GeoGebra Statistics and Probability

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

This unit will lay the groundwork for later units where the students will extend this knowledge to quadratic and exponential functions.

Financial Econometrics MFE MATLAB Introduction. Kevin Sheppard University of Oxford

03 The full syllabus. 03 The full syllabus continued. For more information visit PAPER C03 FUNDAMENTALS OF BUSINESS MATHEMATICS

Overview Classes Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7)

Chapter 5 Functions. Introducing Functions

How To Understand And Solve A Linear Programming Problem

Forecasting Using Eviews 2.0: An Overview

Multiple Linear Regression

Department of Mathematics, Indian Institute of Technology, Kharagpur Assignment 2-3, Probability and Statistics, March Due:-March 25, 2015.

User Guide.

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I

Probability Distributions

Concepts in Investments Risks and Returns (Relevant to PBE Paper II Management Accounting and Finance)

Investment Statistics: Definitions & Formulas

Confidence Intervals for the Difference Between Two Means

KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management

A Guide to Using EViews with Using Econometrics: A Practical Guide

Lecture 2: Descriptive Statistics and Exploratory Data Analysis

Exercise 1.12 (Pg )

Risk and return (1) Class 9 Financial Management,

Geostatistics Exploratory Analysis

ijcrb.com INTERDISCIPLINARY JOURNAL OF CONTEMPORARY RESEARCH IN BUSINESS AUGUST 2014 VOL 6, NO 4

Module 4: Data Exploration

STATISTICAL ANALYSIS WITH EXCEL COURSE OUTLINE

Expression. Variable Equation Polynomial Monomial Add. Area. Volume Surface Space Length Width. Probability. Chance Random Likely Possibility Odds

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Bill Burton Albert Einstein College of Medicine April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1

How To Write A Data Analysis

ADD-INS: ENHANCING EXCEL

COMPARISON MEASURES OF CENTRAL TENDENCY & VARIABILITY EXERCISE 8/5/2013. MEASURE OF CENTRAL TENDENCY: MODE (Mo) MEASURE OF CENTRAL TENDENCY: MODE (Mo)

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Simple linear regression

Foundation of Quantitative Data Analysis

Introduction to Quantitative Methods

Math Review. for the Quantitative Reasoning Measure of the GRE revised General Test

CALCULATIONS & STATISTICS

Exploratory data analysis (Chapter 2) Fall 2011

Final Exam Review: VBA

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY

Lecture Notes Module 1

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

Algebra 1 Course Information

Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012

4 Other useful features on the course web page. 5 Accessing SAS

430 Statistics and Financial Mathematics for Business

Week 1. Exploratory Data Analysis

Additional sources Compilation of sources:

A Short Guide to R with RStudio

Probability Distribution for Discrete Random Variables

SPSS Introduction. Yi Li

STA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance

MATH BOOK OF PROBLEMS SERIES. New from Pearson Custom Publishing!

STAT355 - Probability & Statistics

4. Continuous Random Variables, the Pareto and Normal Distributions

The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median

Forecasting the US Dollar / Euro Exchange rate Using ARMA Models

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Algebra Academic Content Standards Grade Eight and Grade Nine Ohio. Grade Eight. Number, Number Sense and Operations Standard

LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE

MEASURES OF VARIATION

Chapter 3. The Normal Distribution

In mathematics, there are four attainment targets: using and applying mathematics; number and algebra; shape, space and measures, and handling data.

Transcription:

2. Descriptive statistics in EViews Features of EViews: Data processing (importing, editing, handling, exporting data) Basic statistical tools (descriptive statistics, inference, graphical tools) Regression analysis Time series analysis Specification diagnostics, specification testing Forecasting, simulation studies Programming 7

2.1. Introduction to EViews Fundamental concept behind EViews: EViews is based on objects Some typical EViews objects: Data series (single: series, collection of series: groups) graphs equations How to enter EViews commands: Via the EViews menu (clicking) Via the command line (typing commands) 8

EViews screenshot 9

Basis of all EViews actions: workfile Definition of a workfile: Container for all EViews objects with which you want to work (series, graphs, equations) Features of a workfile: Prespecified data frequency Prespecified sampling period 10

Creating an EViews-workfile: Either by typing the command create Or by clicking through the menu items File/New/ Workfile dialogue requesting two pieces of information: (1) Data frequency (2) Start date and end date 11

Data frequency and data representation Frequency Representation annual 2014, 2015, etc. semi-annual 2015:1, 2015:2 quarterly 2015:1,..., 2015:4 monthly 2015:01,..., 2015:12 weekly mm/dd/yyyy, e.g. 03/26/2015 daily (5 days weeks) mm/dd/yyyy daily (7 days weeks) mm/dd/yyyy integer date 1,..., 150 12

Generating data series: Manual data input (invoking the EViews data editor by the command data) Importing data from external data bases (e.g. from Excel, Lotus,...) Afterwards, we may use data series to generate graphs in statistcial and econometric routines 13

Two fundamental EViews concepts: Transformating data series (via the genr command) Setting the active sample (via the smpl command) Objective of many data transformations: Creating new data series from existing data series 14

Example: Assume we are given the following series in EViews: EX RATE: the nominal Euro-USD exchange rate P EURO: the overall price level in Euroland P US: the overall price level in the US Creating the real exchange-rate series: genr EX RATE REAL = EX RATE * P US / P EURO 15

Some operators and functions for the genr command Operator Meaning Example + Sum - Difference * Product / Ratio ^ Power genr H = (A+B/(H+K))^2 log(x) Natural log genr Z = log(x) exp(x) Natural exp abs(x) Absolute value sqr(x) Square root sin(x) Sine cos(x) Cosine genr Z = log(sqr(sin(y))) 16

Lagged values (lag operator, lags): Let P t denote an overall price level at date t The inflation rate π t between the dates t 1 and t is defined as π t = P t P t 1 P t 1 Lag operator in EViews: Let P be the price-level series in EViews The inflation rates may be generated via the command genr INFL RATE = (P-P(-1))/P(-1) 17

Setting the active sample: Sometimes, it may not be reasonable to consider all observations of a series in statistical operations Via the smpl command we are able to restrict the data range to be processed Example: Assume that your worfile contains yearly GDP data between 1950 and 2015: If you only need to consider the time period 1970 until 2010, you set smpl 1970 2010 Then, all subsequent EViews operations only process these data 18

Remarks: The smpl command allows us to further restrict our data base via the if statement If you only need to analyze the years between 1970 and 2010, in which the inflation rate exceeded 2%, you set smpl 1970 2010 if INFL RATE > 2.0 19

2.2. Descriptive statistics Notation: Consider the data series x 1,..., x T T is the number of observations, x t is the t-th observation The ordered series is x (1) x (2)... x (T ) 20

Example: Prices (in euros) of the mutual fund DEKALUX-JAPAN during the calender weeks #10 and #11 in 2002 Date t x t x (t) 03/04/2002 1 527.54 x (3) 03/05/2002 2 523.79 x (2) 03/06/2002 3 521.92 x (1) 03/07/2002 4 540.91 x (7) 03/08/2002 5 551.68 x (9) 03/11/2002 6 556.54 x (10) 03/12/2002 7 543.45 x (8) 03/13/2002 8 530.52 x (4) 03/14/2002 9 534.60 x (5) 03/15/2002 10 538.04 x (6) 21

2.2.1. Histogram and empirical cumulative distribution function Definition 2.1: (Histogram) The histogram divides the series range (the distance between the maximum and minimum values) into a number of equal length intervals (bins) and displays a count of the number of observations that fall into each bin. Definition 2.2: (Empirical cumulative distribution function) Given the data series x 1,..., x T, for every x R the empirical cumulative distribution function F T : R [0, 1] is defined as F T (x) = number of x t x. T 22

Histogram with descriptive statistics in EViews 3 2 1 Series: DEKALUX Sample 3/04/2002 3/15/2002 Observations 10 Mean 536.8990 Median 536.3200 Maximum 556.5400 Minimum 521.9200 Std. Dev. 11.51973 Skewness 0.340804 Kurtosis 2.018182 Jarque-Bera 0.595232 Probability 0.742587 0 520 525 530 535 540 545 550 555 560 23

Empirical cumulative distribution function in EViews 1.0 DEKALUX 0.8 Probability 0.6 0.4 0.2 0.0 524 528 532 536 540 544 548 552 556 24

2.2.2. Measures of a single series Minimum, maximum: Formulae: x min = x (1), x max = x (T ) EViews commands: =@min(dekalux), =@max(dekalux) Arithmetic mean: Formula: x = 1 T (x 1 + x 2 +... + x T ) = 1 T EViews command: =@mean(dekalux) T x t t=1 25

Median: Formula: x med = x ([T +1]/2) 1 2 [ ] x (T/2) + x ([T +2]/2), if T odd, if T even EViews command: =@median(dekalux) Variance, standard deviation: Formulae: s 2 = 1 T 1 T t=1 (x t x) 2, s = 1 T 1 T t=1 (x t x) 2 EViews commands: =@vars(dekalux), =@stdev(dekalux) 26

Skewness: Formula: x skew = 1 T T t=1 x t x 1T Tt=1 (x t x) 2 3 EViews command: =@skew(dekalux) Kurtosis: Formula: x kurt = 1 T T t=1 x t x 1T Tt=1 (x t x) 2 4 EViews command: =@kurt(dekalux) 27

2.2.3. Covariance and correlation Now: Assume that you have collected pairwise observations (x 1, y 1 ),..., (x T, y T ) for the two data series X and Y in EViews Covariance: Formula: S XY = 1 T 1 T t=1 EViews command: =@covs(x,y) (x t x)(y t y) 28

Correlation coefficient: Formula: R XY = S XY S X S Y = Tt=1 (x t x)(y t y) [ Tt=1 (x t x) 2] [ Tt=1 (y t y) 2] EViews command: =@cor(x,y) 29