Example: Job training and worker productivity
\[wage = f(educ,exper,training)\]
- \(wage\): hourly wage
- \(educ\): years of formal education
- \(exper\): years of workforce experience
- \(training\): weeks spent in job training
Note
Depending on questions you would like to answer, the economic model can (and should) be much more involved
We have built a conceptual model:
\[wage = f(educ,exper,training)\]
Now, the form of the function \(f(\cdot)\) must be specified (almost always) before we can undertake an econometric analysis
\[
wage = \beta_0 + \beta_1 educ + \beta_2 exper + \beta_3 training + u
\]
\(\beta_0,\beta_1,\beta_2,\beta_3\)
- are the parameters of the econometric model.
- describe the directions and strengths of the relationship between \(wage\) and the factors used to determine \(wage\) in the model
\(u\)
- is called error term
- includes ALL the other factors that can affect wage other than the included variables (like innate ability)
We can collect data using various ways. Some of them include survey, websites, experiments. Let’s look at different data types:
- Sample of individuals, households, firms, cities, states, countries, or a variety of other units, taken at a given point in time
- The data on all units do not correspond to precisely the same time period
- some families surveyed during different weeks within a year
What a cross-sectional data looks like on R
wage educ exper female married
<num> <int> <int> <int> <int>
1: 3.10 11 2 1 0
2: 3.24 12 22 1 1
3: 3.00 11 2 0 0
4: 6.00 8 44 0 1
5: 5.30 12 7 0 1
---
522: 15.00 16 14 1 1
523: 2.27 10 2 1 0
524: 4.67 15 13 0 1
525: 11.56 16 5 0 1
526: 3.50 14 5 1 0
Observations on a variable or several variables over time + corn price + oil price
Note
- The econometric frameworks necessary to analyze time series data are quite different from those for cross-sectional data
- We do NOT learn time-series econometric methods
Time series data for each cross-sectional member in the data set ( same cross-sectional units are tracked over a given period of time)
Example
- wage data for individuals collected every five years over the past 30 years
- yearly GDP data for 60 countries over the past 10 years
What a panel data looks like on R
county year crmrte prbarr prbpris
<int> <int> <num> <num> <num>
1: 1 81 0.0398849 0.289696 0.472222
2: 1 82 0.0383449 0.338111 0.506993
3: 1 83 0.0303048 0.330449 0.479705
4: 1 84 0.0347259 0.362525 0.520104
5: 1 85 0.0365730 0.325395 0.497059
---
626: 197 83 0.0155747 0.226667 0.428571
627: 197 84 0.0136619 0.204188 0.372727
628: 197 85 0.0130857 0.180556 0.333333
629: 197 86 0.0128740 0.112676 0.244444
630: 197 87 0.0141928 0.207595 0.360825
This is what you learn for the next few months!!
- estimate the model using econometrics
- test hypothesis