Assignment #3
- PPol 604
- Due: Thursday, 31 January 2013
Type up your answers. Give proper credit to those you work with and/or the text(s).
Solve the following problems. Show all of your work, but keep your answers concise.
Highlight your (final) answer
to distinguish it from your other numbers and text. Include a copy of your input
(e.g. do file) or output (e.g. log file),
when it is an appropriate way to show your work.
However, do not include unnecessary output (i.e. no data dumps), and format any output
so that it is easily readable.
An appropriate time to include output is when you put your results
in a table--if your results are wrong, then the grader has no idea how you came to your
conclusions (i.e. give partial credit) unless you provide some output. Explanation
includes statistical and substantive explanation (explain so that a statistical
layperson can understand it, and so that a
statistical analyst will see your erudition).
- {5} Do Problem 12.9 in Stock and Watson.
- {30} [from Bartels 1991] Use the Stata dataset
here
(spending86.dta) to solve this problem.
This dataset contains information on 250 non-freshmen incumbents in the 1986 election
who had major-party challengers in
the 1986 and 1984 elections (excluding a couple of outliers).
See the data set for variable
descriptions. Note that the spending variables are logged to reflect diminishing
returns to spending. Investigate how incumbent and challenger spending affect the
percentage of the vote that the incumbent receives. Put your results in a table
with four columns for parts b, e, f, and h.
a. Before running a regression, hypothesize how party (democrat),
challenger quality (hq), previous vote (vote_1), incumbent
spending (lispend), and challenger spending (lcspend) will affect the
two-party vote percentage, vote, and explain why.
b. Run an OLS regression using the variables in part a. (do not worry about endogeneity yet).
Compare your hypotheses with the results. If there are any differences, why?
c. Give two reasons why incumbent spending might be endogenous.
d. The variable incumbent spending in the previous election cycle (lispend1) is
proposed as an instrument for incumbent spending in the current election cycle.
Which reason(s) of endogeneity might this fix? Why?
e. Conduct TSLS (sometimes called 2SLS) "by hand," i.e. you run the two stages
yourself, without using the Stata command ivregress (or commands like it).
f. Conduct TSLS using the Stata command ivregress 2sls. Compare the results to
the previous two parts. Interpret and explain (statistically and substantively) your
findings. Compare your hypotheses with the results. If there are any differences, why?
g. Conduct a test whether incumbent spending is endogenous.
Use whatever tests you can for instrument validity. If you cannot include
a test, explain why.
h. There is a theory that challenger quality does not directly affect the vote, but
only affects it through incumbent spending. Thus, challenger quality could be used
as an instrument and not included as a control variable. Using this specification,
test whether incumbent spending is endogenous. Use whatever tests you can for
instrument validity. If you cannot include a test, explain why. What do you conclude
about this theory?
i. What other instruments could you suggest for incumbent spending? Would these
overcome both types of endogeneity discussed in part c.? Why might challenger
spending be endogenous? What instruments could you suggest for challenger
spending?
j. Justify or criticize the races that are excluded from the data set: open seats (where
there is no incumbent), unopposed incumbents (in either 1984 or 1986), first-year
incumbents, and the outliers (who were financing national campaigns).
- {40} [from Mullahy and Sindelar 1996 via Stock 2007] Case study: Alcohol and jobs.
Conventional wisdom holds that heavy drinking can result in difficulties in the workplace
and problems holding a job. Job loss associated with heavy drinking could exacerbate
other problems within the drinker’s family and increase reliance on publicly provided
social services. But, as a quantitative matter, what is the effect of heavy drinking on the
ability of a worker to find and to hold a job? Looked at from a different perspective,
suppose there were an effective intervention that could reduce heavy drinking (a drug
treatment, support groups, counseling, etc.); would this reduction in drinking improve
the job market prospects for the individual? In this problem, you will analyze data on
job market status and alcohol consumption. The data are 9822 observations on
individual-level data from the 1988 Alcohol Supplement of the National Health Interview
Survey. The data (alcojobs.dta) is available
here. The variables are described
within the data set. (Use the command describe after opening the file.)
As an analyst for the Congressional Research Service, you have been tasked by a member
of Congress to address this issue to determine whether an intervention would be worthwhile.
Write a professional report that provides an estimate of the effect of heavy
drinking on holding a job. [Remember that the main text of such a report is one page, followed by
a (statistical) appendix that often includes professional tables (i.e. not Stata output)
and other technical comments, followed by something that
shows your work for the grader, e.g. do-files, formattted Stata output.]
Is this effect substantively large? How confident are you in the
validity of your estimated effect, and why? What are the limitations of your analysis?
If you could be given more time and money, what other variables would you include
in your analysis? The report should both provide your best
estimate (or estimates) of this effect, based on the data provided, and provide a careful
assessment of the validity of your estimate(s). Provide the reader with a sense of the
statistical uncertainty surrounding your estimates.
Notes on the data set: The data are from the 1988 Alcohol Supplement to the
National Health Interview Survey of the adult (age 18 and older) U.S. population.
The data are for men only. The data set contains information on family characteristics,
employment status, socioeconomic indicators. These data were linked to state-level
data on alcohol and cigarette taxes and to other pertinent labor market and health data.
The survey asks, among other things, about alcohol consumption in the two weeks
prior to the survey. The measure of heavy drinking used here is that the respondent’s
alcohol consumption falls at or above the 90th percentile of alcohol consumption,
which is 18.0 ounces of ethanol per two-week period (approximately 36 drinks in two
weeks). The survey asks about other health indicators and includes an overall index of
health status (excellent, very good, good, fair, poor), constructed using the self-reported
health status indicators.
Three categories of labor market status are reported: employed, which means the
respondent has a paying job; unemployed, which means the respondent does not
have a paying job but has engaged in some job search activity over the past two
weeks; and out of the labor market, which means the respondent which means the
respondent does not have a paying job and is not looking for one.
- {25} Research Project Proposal:
Turn in a 1-2 page, double-spaced proposal (standard font and margins) outlining the
research question you plan to address, explains a potential causal connection linking an
independent and dependent variable, suggests which econometric model(s) will be used, offers at least 4 relevant citations, and discusses possible data sources to be used. If you do not have a ready idea that will use the tools taught this semester, you may wish to look at a panel data set, or some other hierarchical data set.
Back to
Assignments
page