Age &
Income, Part 1
Background
Economists
have done considerable work on the relationship between age and income. They find that (on average) people who are
very old or very young have lower incomes than those in between. The question is how exactly to model this
relationship.
1.
Why
does it make sense that the incomes of the old and the young should be lower
than the incomes of those whose age is in-between?
The
most famous work on this issue is a book by Jacob Mincer [Min], from 1974. Mizner realized that age was less important
than the number of years of work experience.
This means that we must consider, for example, college graduates
separately from workers with only a a high-school diploma.
2.
Assuming
that a similar relationship between years of work experience and age exists for
high-school graduates and college graduates, how do you think the graphs will
differ? I.e. what shifts, stretches,
etc. might be necessary to transform one graph into the other?
In
this project, we will try to develop a reasonable model for the relationship
between age and income.
Data
Below
is data from the Current Population Survey [CPS] for the year 1999. This table gives the average (mean) income
for white male college graduates who worked full-time for the previous year,
broken down by age. By restricting to a
single educational group (college grads), we can just consider age, rather than
years of work experience.
|
Age |
Mean Income |
|
|
|
|
18 to 24 years |
31,793 |
|
25 to 29 years |
42,245 |
|
30 to 34 years |
57,040 |
|
35 to 39 years |
70,479 |
|
40 to 44 years |
76,018 |
|
45 to 49 years |
78,855 |
|
50 to 54 years |
82,253 |
|
55 to 59 years |
92,773 |
|
60 to 64 years |
85,283 |
|
65 to 69 years |
80,875 |
|
70 to 74 years |
73,355 |
|
75 years and over |
54,329 |
Problems
1.
Graph
this data. How are you selecting values
for the "age" coordinate of each data point? What is the shape of the graph (roughly)?
2.
Mincer
suggested that the data should be modeled by a quadratic function (a
parabola). Assuming this data is
roughly parabolic, estimate the coordinates of the vertex.
3.
Now
find the parabola which best fits the data (by trial and error). Using linear regression, discuss how well
your parabola fits the data.
4.
Graph
the parabola with the data. Plot the
residuals between the model and the data.
Discuss the fit of the model with the data in more detail. At what ages
is the fit better or worse? Why do you
think that is? For what ages do you
think your model is valid? Why?
References
[CPS] Current Population Survey website, http://www.bls.census.gov/cps/cpsmain.htm
[Min] Mincer, Jacob. Schooling, Experience and Earnings. New York:
National Bureau of Economic Research, 1974.