Consider a dataset obtained from a study on the impact of various factors on the number of bicyclists hit by motor vehicles in Eugene. The dataset includes a random sample of 26 months and the following variables:
\(\text{Hits}_i\): The total number of bicyclists hit by motor vehicles in month \(i\).
\(\text{Rain}_i\): The total number of days it rained in month \(i\).
\(\text{Temp}_i\): The average daytime temperature in month \(i\) (in degrees Celsius).
\(\text{Days}_i\): The total number of days in month \(i\).
\(\text{WeekDays}_i\): The total number of weekdays (Monday-Friday) in month \(i\).
\(\text{School}_i\): A binary variable indicating if the University of Oregon is in session in month \(i\).
You then estimate the following regression models via OLS:
The corresponding values of RSS for each regression model are as follows:
Regression
RSS
Regression 1
400
Regression 2
410
Regression 3
450
Regression 4
420
Regression 5
445
Questions
1. Interpretation of Coefficients (5 points)
In Regression 1, interpret the coefficient \(\beta_3\) (the coefficient for the variable \(\text{Temp}_i\)). What does this coefficient tell us about the relationship between the average daytime temperature and the number of bicyclists hit by motor vehicles?
Solution:
For each additional degree Celsius increase in the average daytime temperature, the number of bicyclists hit by motor vehicles is expected to increase by \(\beta_3\) units, holding all other variables constant.
2. Hypothesis Testing
Test the following hypothesis at the 1% significance level (parameter subscripts refer to Regression 1):
where \(q\) is the number of restrictions, \(n\) is the number of observations, and \(k\) is the number of regressors in the unrestricted model. The critical value for the \(F\)-statistic at the 1% significance level is:
F_crit =qf(0.99, q, n-k-1)F_crit
[1] 5.925879
Do you reject or fail to reject the null hypothesis? Explain your answer.
Solution:
The big model is Regression 1. The small model is Regression 4. So RSS for the small model is 420 and RSS for the big model is 400. We have ( m = 6 ) and ( k = 4 ). The F-statistic is: