resume 1 data from 4,870 applications| Variable | Description |
|---|---|
received_callback |
Whether applicant received call from employer |
job_city |
Location of job (Boston or Chicago) |
college_degree |
Indicator: whether resume listed college degree |
years_experience |
Number of years of experience listed on resume |
honors |
Indicator: whether resume listed some sort of honors (e.g., employee of the month) |
military |
Indicator: whether resume listed military experience |
has_email_address |
Indicator: whether resume listed applicant’s email address |
race |
Race of applicant (implied by first name) |
sex |
Sex of applicant (implied by first name) |
Let’s look at the data.
Sample sizes
| race | female | male |
|---|---|---|
| black | 1,886 | 549 |
| white | 1,860 | 575 |
Proportions of applicants receiving calls back from employer
| race | female | male |
|---|---|---|
| black | 0.0663 | 0.0583 |
| white | 0.0989 | 0.0887 |
received_callback, is categorical with two levels: 0 (no) and 1 (yes)received_callback = 1) to be a success(3/4)/(1/4) = 3/1 (“3 to 1”)heart_transplant 1 dataset is from a study that tracked 5-year survival rates of heart transplant candidatesis_alive based on the variable survivedRows: 103
Columns: 8
$ id <int> 15, 43, 61, 75, 6, 42, 54, 38, 85, 2, 103, 12, 48, 102, 35,…
$ acceptyear <int> 68, 70, 71, 72, 68, 70, 71, 70, 73, 68, 67, 68, 71, 74, 70,…
$ age <int> 53, 43, 52, 52, 54, 36, 47, 41, 47, 51, 39, 53, 56, 40, 43,…
$ survived <fct> dead, dead, dead, dead, dead, dead, dead, dead, dead, dead,…
$ survtime <int> 1, 2, 2, 2, 3, 3, 3, 5, 5, 6, 6, 8, 9, 11, 12, 16, 16, 16, …
$ prior <fct> no, no, no, no, no, no, no, no, no, no, no, no, no, no, no,…
$ transplant <fct> control, control, control, control, control, control, contr…
$ wait <int> NA, NA, NA, NA, NA, NA, NA, 5, NA, NA, NA, NA, NA, NA, NA, …
Scatter plot (jittered) showing survival vs. age. Curve shows predicted probability of survival using logistic model.
The resulting model is
\[\begin{array}{rcl}\log\left(\frac{\hat{p}}{1-\hat{p}}\right) &=& -2.66 \\ & - & 0.44\times job\_cityChicago \\ & - & 0.07 \times college\_degree \\ & + & 0.020 \times years\_experience \\ & + & 0.77 \times honors \\ & - & 0.34 \times military \\ & + & 0.22 \times has\_email\_address \\ & + & 0.44 \times racewhite \\ & - & 0.18 \times sexm\end{array} \]
| Variable | Value |
|---|---|
job_city |
Boston |
college_degree |
has college degree |
years_experience |
3 |
honors |
No honors |
military |
No military experience |
has_email_address |
Resume has email address |
race |
Black |
sex |
Female |
\[\begin{array}{rcl}\log\left(\frac{\hat{p}}{1-\hat{p}}\right) &=& -2.66 \\ & - & 0.44\times 0 \\ & - & 0.07 \times 1 \\ & + & 0.020 \times 3 \\ & + & 0.77 \times 0 \\ & - & 0.34 \times 0 \\ & + & 0.22 \times 1 \\ & + & 0.44 \times 0 \\ & - & 0.18 \times 0 \\ & = & -2.45\end{array} \]