This course is concerned with theory and application of linear regression methods, including an examination of the classical regression model and the statistical properties of the estimator. The effect of violations of the classical assumptions are considered, and appropriate estimation methods are introduced. This course is the first of a two-course sequence. At course completion, a successful student will:
- understand the statistical foundations of the classical regression model.
- be able to explain the properties of the least-squares estimator and related test statistics.
- be able to apply these methods to data and interpret the results.
Introduction and Review
Economic Questions and Data
Review of Statistics
Review of Probability
How do we handle random processes? We can define a random variable as a measurable function defined on a probability space. Bievens(2004)
See set theory. Kolmorgorov gives us the axioms of probability.
- The probability of some outcome, A, is greater than or equal to zero, for A contained in the probability space S .
- The probability over the sample space is equal to one, .
- The probability of A and B is equal to the probability of A plus the probability of B, if A and B are disjoint and contained in the sample space S .
For example, in a coin toss, the probability space is heads and tails. If x is a function over the probability space, we can say that x(heads) equals one and x(tails) equals zero. So the probability that x is one equals the probability of heads, and the probability that x is zero equals the probability of tails, and they are disjoint probabilities. So if the probability of heads is Ph, the probability of tails is 1-Ph. (Note that this example does not assume that you have a fair coin, though you could.)
Review of Matrices
positive semidefinite (PSD), positive definite (PD), quadratic forms, symettric, idempotent, diagonal, block diagonal,
The Probability Distribution Function is where
II. Fundamentals of Regression Analysis
Linear Regression with one regressor
Linear Regression with Multiple Regressors
By having a set of ys, assumed to be dependent on a set of xs, we solve for some constant . (Where k is the number of observations, and n is the number of predictor or independent variables, y is a kx1 vector and x is a kxn matrix multiplied by an nx1 vector which relates y and x.)
To solve for , which will give us an expected value of that we will call beta hat we use this linear operation:
This can be derived from OLS, ordinary least squares, which minimizes the squared residuals. Since , then . The residuals are then the estimated errors , found from the estimated model: The squared residuals are then . By deriving this expression with respect to and solving for gives the OLS estimator.
Then or x times beta hat equals y hat, where y hat is our expected value for y given x and our calculated value of beta, beta hat.
Now the difference between the predicted values of y and the actual values of y are given by: where is residual values, sometimes called error.
Properties of Ordinary Least Squares (OLS estimator beta hat is unbiased for beta, and the Covariance Matrix for beta hat is ), Classical Normal, The Information Matrix, Chi Square Distribution, The relationship between e and epsilon, The Maximum Likelihood Estimator of the variance is biased, Distribution of the variance under normality,
regression, principles of estimation and testing, stationary time series models, limited dependent variable models, longitudinal (panel) data models, generalized methods of moments, instrumental variable models, non-stationarity, stochastic trends, co-integration,
Errors in functional form specification
Omission of relevant explanatory variables
Inclusion of irrelevant explanatory variables
Nonlinear Regression Functions
Squared explanatory variables
Cubed explanatory variables
Assessing Studies Based on Multiple Regression
Rarely is perfect multicollinearity a problem, but multicollinearity, nearly linear relationships between explanatory variables, often is. Multicollinearity causes an increase in the variance of ß estimations, which in turn decreases the probability of rejecting the null hypotheses that such ßs are insignificant.
- Davidson and MacKinnon "Econometric Theory and Methods"
- DeGroot and Schervish "Probability and Statistics" 3rd edition
- A review of matrix algebra is recommended. In econometrics, it is necessary to work with very large sets of data. In order to manipulate the data and follow the discussion, you must be familiar with matrices.
- Econometric Theory Wikibook Econometric Theory