Econometrics 1[edit | edit source]
This course is concerned with theory and application of linear regression methods, including an examination of the classical regression model and the statistical properties of the estimator. The effect of violations of the classical assumptions are considered, and appropriate estimation methods are introduced. This course is the first of a two-course sequence. At course completion, a successful student will:
- understand the statistical foundations of the classical regression model.
- be able to explain the properties of the least-squares estimator and related test statistics.
- be able to apply these methods to data and interpret the results.
Introduction and Review[edit | edit source]
Economic Questions and Data[edit | edit source]
Review of Statistics[edit | edit source]
Review of Probability[edit | edit source]
Random variables[edit | edit source]
How do we handle random processes? We can define a random variable as a measurable function defined on a probability space. Bievens(2004)
See set theory. Kolmorgorov gives us the axioms of probability.
- The probability of some outcome, A, is greater than or equal to zero, for A contained in the probability space S .
- The probability over the sample space is equal to one, .
- The probability of A and B is equal to the probability of A plus the probability of B, if A and B are disjoint and contained in the sample space S .
For example, in a coin toss, the probability space is heads and tails. If x is a function over the probability space, we can say that x(heads) equals one and x(tails) equals zero. So the probability that x is one equals the probability of heads, and the probability that x is zero equals the probability of tails, and they are disjoint probabilities. So if the probability of heads is Ph, the probability of tails is 1-Ph. (Note that this example does not assume that you have a fair coin, though you could.)
Review of Matrices[edit | edit source]
positive semidefinite (PSD), positive definite (PD), quadratic forms, symettric, idempotent, diagonal, block diagonal,
The Probability Distribution Function is where
II. Fundamentals of Regression Analysis[edit | edit source]
Linear Regression with one regressor[edit | edit source]
Linear Regression with Multiple Regressors[edit | edit source]
Linear regression[edit | edit source]
By having a set of ys, assumed to be dependent on a set of xs, we solve for some constant . (Where k is the number of observations, and n is the number of predictor or independent variables, y is a kx1 vector and x is a kxn matrix multiplied by an nx1 vector which relates y and x.)
To solve for , which will give us an expected value of that we will call beta hat we use this linear operation:
This can be derived from OLS, ordinary least squares, which minimizes the squared residuals. Since , then . The residuals are then the estimated errors , found from the estimated model: The squared residuals are then . By deriving this expression with respect to and solving for gives the OLS estimator.
Then or x times beta hat equals y hat, where y hat is our expected value for y given x and our calculated value of beta, beta hat.
Now the difference between the predicted values of y and the actual values of y are given by: where is residual values, sometimes called error.
Properties of Ordinary Least Squares (OLS estimator beta hat is unbiased for beta, and the Covariance Matrix for beta hat is ), Classical Normal, The Information Matrix, Chi Square Distribution, The relationship between e and epsilon, The Maximum Likelihood Estimator of the variance is biased, Distribution of the variance under normality,
regression, principles of estimation and testing, stationary time series models, limited dependent variable models, longitudinal (panel) data models, generalized methods of moments, instrumental variable models, non-stationarity, stochastic trends, co-integration,
Errors in functional form specification[edit | edit source]
Omission of relevant explanatory variables
Inclusion of irrelevant explanatory variables
Nonlinear Regression Functions[edit | edit source]
Squared explanatory variables
Cubed explanatory variables
Assessing Studies Based on Multiple Regression[edit | edit source]
Multicollinearity[edit | edit source]
Rarely is perfect multicollinearity a problem, but multicollinearity, nearly linear relationships between explanatory variables, often is. Multicollinearity causes an increase in the variance of ß estimations, which in turn decreases the probability of rejecting the null hypotheses that such ßs are insignificant.
How to find out if two or more variables are correlated:
- Correlation matrix
- Variance inflation factor (VIF)
Resources[edit | edit source]
- Davidson and MacKinnon "Econometric Theory and Methods"
- DeGroot and Schervish "Probability and Statistics" 3rd edition
- A review of matrix algebra is recommended. In econometrics, it is necessary to work with very large sets of data. In order to manipulate the data and follow the discussion, you must be familiar with matrices.
- Econometric Theory Wikibook Econometric Theory