Return to Video

Joshua Angrist Nobel Prize Lecture 2021

  • 0:01 - 0:04
    ♪ [music] ♪
  • 0:11 - 0:15
    - [Joshua] As I stilled my trembling
    iPhone early on October 11th,
  • 0:15 - 0:18
    my thoughts went to the question
    of whether Nobel-level recognition
  • 0:18 - 0:21
    might change life
    for the Angrist family.
  • 0:21 - 0:23
    Ours is a close-knit family.
  • 0:23 - 0:24
    We lack for nothing.
  • 0:24 - 0:27
    So I worried
    that stressful Nobel celebrity
  • 0:27 - 0:28
    might not be a plus.
  • 0:28 - 0:30
    But with the first cup of coffee,
  • 0:30 - 0:32
    I began to relax.
  • 0:32 - 0:33
    It occurred to me
  • 0:33 - 0:35
    that the matter
    of how public recognition
  • 0:35 - 0:37
    affects a scholars life
  • 0:37 - 0:40
    is, after all,
    a simple causal question.
  • 0:40 - 0:42
    The Nobel intervention
  • 0:42 - 0:45
    is substantial, sudden,
    and well-measured.
  • 0:45 - 0:48
    Outcomes like health and wealth
    are easy to record.
  • 0:49 - 0:52
    Having just been recognized
    with my co-laureates,
  • 0:52 - 0:54
    Guido Imbens and David Card,
  • 0:54 - 0:57
    for answering causal questions
    using observational data,
  • 0:57 - 1:00
    my thoughts moved
    from personal upheaval
  • 1:00 - 1:02
    to the more familiar demands
  • 1:02 - 1:06
    of identification and estimation
    of causal effects.
  • 1:06 - 1:08
    I was able to soothe
    my worried mind
  • 1:08 - 1:13
    by imagining a study
    of the Nobel Prize treatment effect.
  • 1:13 - 1:15
    How would such a study
    be organized?
  • 1:16 - 1:20
    in a 1999 essay published in the
    "Handbook of Labor Economics,"
  • 1:20 - 1:24
    Alan Krueger and I embraced
    the phrase "empirical strategy."
  • 1:24 - 1:26
    The handbook volume
    in question was edited
  • 1:26 - 1:29
    by two of my Princeton
    PhD thesis advisors,
  • 1:29 - 1:32
    Orley Ashenfelter and David Card,
  • 1:32 - 1:35
    among the most successful
    and prolific graduate advisors
  • 1:35 - 1:37
    economics has known
  • 1:37 - 1:39
    An empirical strategy
    is a research plan
  • 1:39 - 1:41
    that encompasses data collection,
  • 1:41 - 1:45
    identification,
    and econometric estimation.
  • 1:45 - 1:48
    Identification is the applied
    econometricians term
  • 1:48 - 1:50
    for research design --
  • 1:50 - 1:52
    a randomized, clinical trial.
  • 1:52 - 1:57
    And RCT is the simplest
    and most powerful research design.
  • 1:57 - 1:59
    In RCTs, causal effects
    are identified
  • 1:59 - 2:02
    by the random assignment
    of treatment.
  • 2:02 - 2:03
    Random assignment ensures
  • 2:03 - 2:05
    that treatment and control groups
  • 2:05 - 2:07
    are comparable
    in the absence of treatment.
  • 2:07 - 2:09
    So differences
    between them afterwards
  • 2:09 - 2:12
    reflect only the treatment effect.
  • 2:12 - 2:15
    Nobel prizes are probably
    not randomly assigned.
  • 2:15 - 2:17
    This challenge notwithstanding,
  • 2:17 - 2:20
    a compelling empirical strategy
    for the Nobel treatment effect
  • 2:20 - 2:21
    comes to mind,
  • 2:21 - 2:24
    at least as a flight of empirical fancy.
  • 2:24 - 2:28
    Imagine a pool of prize-eligible
    Nobel applicants,
  • 2:28 - 2:31
    the group under consideration
    for the prize.
  • 2:31 - 2:33
    Applicants need not apply themselves.
  • 2:33 - 2:36
    They are, I presume, nominated
    by their peer scholars.
  • 2:37 - 2:41
    My fanciful Nobel impact study
    looks only at Nobel applicants
  • 2:41 - 2:43
    since these are all elite scholars.
  • 2:43 - 2:45
    But that is only the first step.
  • 2:45 - 2:48
    Credible applicants, I imagine,
    are evaluated by judges
  • 2:48 - 2:54
    using criteria, like publications,
    citations, nominating statements.
  • 2:54 - 2:58
    I imagine this material is reviewed
    and assigned a numerical score,
  • 2:58 - 3:01
    using some kind of scoring rubric.
  • 3:01 - 3:04
    Top scorers up to three per field,
    in any single year,
  • 3:04 - 3:06
    win a prize.
  • 3:06 - 3:08
    Having identified applicants
  • 3:08 - 3:10
    that collected data
    on their scores,
  • 3:10 - 3:12
    the next step
    in my Nobel impact study
  • 3:12 - 3:15
    is to record the relevant cutoffs.
  • 3:15 - 3:17
    The Nobel cutoff
    is the lowest score
  • 3:17 - 3:19
    among those awarded a prize,
  • 3:20 - 3:23
    Many Nobel hopefuls
    just missed the cutoff.
  • 3:23 - 3:27
    Looking only at near-misses,
    along with the winners,
  • 3:27 - 3:28
    differences in scores
  • 3:28 - 3:30
    between those above
    and below the cutoff
  • 3:30 - 3:35
    begin to look serendipitous,
    almost randomly assigned.
  • 3:35 - 3:36
    After all,
  • 3:36 - 3:39
    near-Nobels are among
    the most eminent of scholars, too.
  • 3:40 - 3:43
    With one more
    high-impact publication,
  • 3:43 - 3:45
    a little more support
    from nominators.
  • 3:45 - 3:47
    they would have been awarded
    Nobel gold --
  • 3:47 - 3:50
    some of them, someday,
    surely will be.
  • 3:50 - 3:52
    The empirical strategy
    sketched here is called
  • 3:52 - 3:54
    a Regression Discontinuity Design,
  • 3:54 - 3:56
    RD.
  • 3:56 - 3:58
    RD exploits the jumps
    in human affairs,
  • 3:58 - 4:01
    induced by rules, regulations,
  • 4:01 - 4:05
    and the need to classify people
    for various assignment purposes.
  • 4:05 - 4:07
    When treatment
    or intervention is determined
  • 4:07 - 4:10
    by whether a tiebreaking variable
    crosses a threshold,
  • 4:10 - 4:13
    those just below the threshold
    become a natural control group
  • 4:13 - 4:15
    for those who clear it.
  • 4:15 - 4:17
    RD does not require
    that the variable,
  • 4:17 - 4:19
    whose causes we seek,
  • 4:19 - 4:22
    switch fully on or fully off
    at the cutoff.
  • 4:22 - 4:23
    We require only
  • 4:23 - 4:26
    that the average value
    of this variable
  • 4:26 - 4:27
    jump at the cutoff
  • 4:27 - 4:30
    RD can allow, for example,
    for the fact
  • 4:30 - 4:33
    that this year's near-Nobel
    might be next year's winner.
  • 4:33 - 4:36
    Allowing for this leads
    to the use of jumps
  • 4:36 - 4:38
    and the rate at which
    treatment is assigned
  • 4:38 - 4:40
    to construct instrumental variables,
  • 4:40 - 4:41
    IV,
  • 4:41 - 4:44
    estimates of the effect
    of treatment received.
  • 4:44 - 4:47
    This sort of RD
    is said to be fuzzy,
  • 4:47 - 4:50
    But as Steve Pischke and I
    wrote in our first book:
  • 4:50 - 4:52
    "fuzzy RD is IV."
  • 4:52 - 4:53
    (cheering)
  • 4:53 - 4:55
    The first RD study I contributed to
  • 4:55 - 4:58
    was written with my frequent
    collaborator, Victor Lavy.
  • 4:58 - 4:59
    This study is motivated
  • 4:59 - 5:02
    by the high costs
    and uncertain returns
  • 5:02 - 5:04
    to smaller elementary school classes.
  • 5:04 - 5:07
    We exploited a rule
    used by Israeli elementary schools
  • 5:07 - 5:09
    to determine class size.
  • 5:09 - 5:12
    This rule is used to estimate
    class size effects,
  • 5:12 - 5:15
    as if in a class size RCT.
  • 5:16 - 5:19
    In the 1990s,
    Israeli classes were large.
  • 5:19 - 5:22
    Students enrolled
    in a grade cohort of 40.
  • 5:22 - 5:25
    We're likely to be
    seated in a class of 40.
  • 5:25 - 5:27
    That's the relevant cutoff.
  • 5:27 - 5:30
    Add another child
    to the cohort, making 41,
  • 5:30 - 5:33
    and the cohort
    was likely to be split
  • 5:33 - 5:35
    into two much smaller classes.
  • 5:35 - 5:39
    This leads to the Maimonide's
    rule research design,
  • 5:39 - 5:41
    so named because
    the 12th-century Rambam
  • 5:41 - 5:44
    proposed a maximum
    class size of 40.
  • 5:44 - 5:47
    This figure plots is rarely
    fourth grade class sizes
  • 5:47 - 5:50
    as a function
    of fourth grade enrollment,
  • 5:50 - 5:52
    overlaid with the theoretical
    class size rule,
  • 5:53 - 5:54
    Maimonides rule.
  • 5:54 - 5:56
    The fit isn't perfect.
  • 5:56 - 5:59
    That's a feature that makes
    this application of RD fuzzy.
  • 5:59 - 6:02
    But the gist of the thing
    is a marked class size drop,
  • 6:02 - 6:06
    at each integer multiple of 40,
    the relevant cutoff,
  • 6:06 - 6:08
    just as predicted by the rule.
  • 6:08 - 6:11
    As it turns out,
    these drops in class size
  • 6:11 - 6:12
    are reflected in jumps
  • 6:12 - 6:15
    in 4th and 5th grade math scores.
  • 6:16 - 6:18
    ♪ [music] ♪
  • 6:20 - 6:23
    Would a comparison
    of Nobel laureates to near laureates
  • 6:23 - 6:25
    really be a good natural experiment?
  • 6:26 - 6:28
    The logic behind this sort of claim
  • 6:28 - 6:30
    seems more compelling
    for comparisons of schools
  • 6:30 - 6:32
    with 40 and 41 4th graders
  • 6:33 - 6:36
    than for comparisons
    of laureates and near laureates.
  • 6:36 - 6:40
    Yet, both scenarios exploit
    a feature of the physical world.
  • 6:40 - 6:42
    Provided the tiebreaking variable
  • 6:42 - 6:45
    known to RD mavens
    as the running variable,
  • 6:45 - 6:47
    has a continuous distribution,
  • 6:47 - 6:51
    the probability of crossing
    the cutoff approaches one-half
  • 6:51 - 6:53
    when examined in a narrow window
  • 6:53 - 6:54
    around the cutoff.
  • 6:54 - 6:56
    In RD empirical work,
  • 6:56 - 6:59
    the window around such cutoffs
    is known as a bandwidth.
  • 7:00 - 7:04
    Importantly, this limiting
    probability is 0.5 for everybody
  • 7:04 - 7:06
    regardless of how qualified they look
  • 7:06 - 7:08
    going into the Nobel competition.
  • 7:09 - 7:12
    This remarkable fact can be seen
    in data on applicants
  • 7:12 - 7:15
    to one of New York's
    highly coveted screen schools.
  • 7:15 - 7:16
    By way of background,
  • 7:16 - 7:20
    roughly 40% of New York City's
    middle and high schools
  • 7:20 - 7:24
    select their applicants
    on the basis of test scores, grades,
  • 7:24 - 7:26
    and other exacting criteria.
  • 7:26 - 7:29
    In other words, the admissions
    regime for screen schools
  • 7:29 - 7:31
    is a lot like the scheme
    I've imagined
  • 7:31 - 7:32
    for the Nobel Prize.
  • 7:33 - 7:35
    Screen schools
    are but one of a number
  • 7:35 - 7:38
    of highly selective systems
    within a system
  • 7:38 - 7:40
    in large US school districts.
  • 7:40 - 7:43
    Boston, Chicago, San Francisco,
    and Washington, D.C
  • 7:43 - 7:46
    all feature highly selective
    institutions,
  • 7:46 - 7:48
    often known as exam schools.
  • 7:48 - 7:49
    Exam schools operate
  • 7:49 - 7:51
    as part of larger
    public school systems
  • 7:51 - 7:54
    that enroll students
    without screening.
  • 7:54 - 7:56
    Motivated by
    the enduring controversy
  • 7:56 - 7:58
    over the equity
    of screened admissions,
  • 7:58 - 8:00
    my blueprint
    labs collaborators and I
  • 8:00 - 8:04
    have examined the causal effects
    of exam school attendance
  • 8:04 - 8:06
    in Boston, Chicago, and New York.
  • 8:06 - 8:09
    This figure shows the probability
    of being offered a seat
  • 8:09 - 8:13
    at New York's storied
    Townsend Harris High School,
  • 8:13 - 8:15
    ranked 12th, nationwide.
  • 8:15 - 8:18
    Bar height in the figure
    marks the qualification rate --
  • 8:18 - 8:22
    that is, the likelihood of earning
    a Townsend Harris admission score,
  • 8:22 - 8:25
    above that of the lowest
    scoring applicant
  • 8:25 - 8:26
    offered a seat.
  • 8:26 - 8:30
    Importantly, the bars show
    qualification rates conditional
  • 8:30 - 8:34
    on a measure of pre-application
    baseline achievement.
  • 8:34 - 8:37
    In particular, the bars mark
    qualification rates conditional
  • 8:37 - 8:38
    on whether an applicant
  • 8:38 - 8:43
    has upper quartile or lower quartile
    6th grade math scores.
  • 8:43 - 8:46
    Townsend Harris applicants
    with high baseline scores
  • 8:46 - 8:48
    are much more likely to qualify
  • 8:48 - 8:50
    than applicants
    with low baseline scores.
  • 8:50 - 8:52
    This isn't surprising.
  • 8:52 - 8:55
    But in a shrinking
    symmetric bandwidth
  • 8:55 - 8:57
    around the schools cutoff,
  • 8:57 - 9:00
    qualification rates
    in the two groups converge
  • 9:00 - 9:03
    Qualification rate in the last
    and smallest groups
  • 9:04 - 9:06
    are both remarkably close
    to one-half
  • 9:07 - 9:09
    This is what we'd expect to see
  • 9:09 - 9:10
    where Townsend Harris
    to admit students,
  • 9:10 - 9:12
    by tossing a coin,
  • 9:12 - 9:15
    rather than by selecting
    only those who scored highly
  • 9:15 - 9:17
    on the school's entrance exam.
  • 9:17 - 9:20
    Even when admissions
    operates by screening,
  • 9:20 - 9:23
    the data can be arranged
    so as to mimic an RCT.
  • 9:28 - 9:31
    A few of the questions I've studied
    are more controversial
  • 9:31 - 9:34
    than the question of access
    to public exam schools,
  • 9:34 - 9:36
    like the Boston Latin School,
  • 9:36 - 9:40
    Chicago's Payton and Northside
    selective enrollment high schools,
  • 9:40 - 9:42
    and New York's legendary
    Brooklyn Tech,
  • 9:42 - 9:43
    Bronx Science,
  • 9:43 - 9:46
    and Stuyvesant
    specialized high schools,
  • 9:46 - 9:49
    which have graduated
    14 Nobel laureates between them.
  • 9:49 - 9:52
    Townsend Harris, the school
    we started with today,
  • 9:52 - 9:56
    graduated three Nobels,
    including economist Ken Arrow.
  • 9:56 - 9:57
    Exam school proponents
  • 9:57 - 9:59
    see the opportunities
    these schools provide
  • 9:59 - 10:02
    as democratizing public education.
  • 10:02 - 10:04
    "Wealthy families," they argue,
  • 10:04 - 10:07
    can access exam school curricula
    in the private sector.
  • 10:08 - 10:10
    Shouldn't ambitious
    low-income students
  • 10:10 - 10:12
    be afforded the same chance
    at elite education?
  • 10:13 - 10:15
    Critics of selective
    enrollment schools
  • 10:15 - 10:18
    argue that rather than
    expanding equity,
  • 10:18 - 10:20
    exam schools are inherently biased
  • 10:20 - 10:22
    against the Black
    and Hispanic students
  • 10:22 - 10:25
    that make up the bulk
    of America's urban districts,
  • 10:25 - 10:28
    New York's super selective
    Stuyvesant, for example,
  • 10:28 - 10:31
    enrolled only
    seven Black students in 2019,
  • 10:31 - 10:34
    out of an incoming class of 895.
  • 10:34 - 10:38
    But are exam school seats
    really worth fighting for?
  • 10:39 - 10:42
    My collaborators and I
    have repeatedly used
  • 10:42 - 10:45
    RD empirical strategies
    to study the causal effects
  • 10:45 - 10:47
    of attendance at exam schools
  • 10:47 - 10:49
    like Townsend Harris
    and Boston Latin.
  • 10:49 - 10:51
    Our first exam school study,
  • 10:51 - 10:54
    which looks at schools
    in Boston and New York
  • 10:54 - 10:56
    encapsulates
    these findings in its title:
  • 10:56 - 10:57
    "The Elite Illusion."
  • 10:58 - 11:00
    "The Elite Illusion"
    refers to the fact
  • 11:00 - 11:04
    that while exam school students
    undoubtedly have high test scores
  • 11:04 - 11:05
    and other good outcomes,
  • 11:05 - 11:08
    this is sot a causal effect
    of exam School attendance.
  • 11:09 - 11:11
    Our estimates consistently suggest
  • 11:11 - 11:14
    that the causal effects
    of exam school attendance
  • 11:14 - 11:17
    on their students learning
    and college-going are 0.
  • 11:17 - 11:20
    Maybe even negative.
  • 11:20 - 11:22
    The good performance
    of exam school students
  • 11:22 - 11:24
    reflect selection bias --
  • 11:24 - 11:27
    that is, the process by which
    these students are chosen,
  • 11:27 - 11:29
    rather than causal effects.
  • 11:30 - 11:32
    Data from Chicago's large exam
    school sector
  • 11:32 - 11:34
    illustrate the elite illusion.
  • 11:34 - 11:37
    This figure plots
    peer mean achievement --
  • 11:37 - 11:42
    that is, the 6th grade test scores
    of my 9th grade classmates
  • 11:42 - 11:44
    against the admissions tiebreaker
  • 11:44 - 11:47
    for a subset of applicants
  • 11:47 - 11:49
    to any one of Chicago's
    nine exam schools.
  • 11:49 - 11:52
    Applicants to these schools
    rank up to 6,
  • 11:52 - 11:54
    while the exam schools
    prioritize their applicants
  • 11:54 - 11:57
    using a common composite index,
  • 11:57 - 11:59
    formed from an admissions test,
  • 11:59 - 12:02
    GPAs, and grade 7
    standardized scores.
  • 12:03 - 12:06
    This composite tiebreaker
    is the running variable
  • 12:06 - 12:09
    for an RD design
    that reveals what happens
  • 12:09 - 12:10
    when any applicant is offered
  • 12:10 - 12:12
    an exam school seat.
  • 12:12 - 12:15
    In Chicago's exam school match,
  • 12:15 - 12:16
    which is actually an application
  • 12:16 - 12:19
    of the celebrated Gale and Shapley
    matching algorithm
  • 12:19 - 12:22
    exam school applicants are sure
    to be offered a seat somewhere
  • 12:23 - 12:26
    when they clear the lowest
    in their set of cutoffs
  • 12:26 - 12:28
    among the schools they rank.
  • 12:28 - 12:31
    We call this lowest cutoff
    the "qualifying cutoff."
  • 12:32 - 12:35
    The figure shows a sharp jump
    in peer mean achievement
  • 12:35 - 12:37
    for Chicago exam school applicants
  • 12:37 - 12:40
    who clear their qualifying cutoff.
  • 12:40 - 12:41
    This jump reflects the fact
  • 12:41 - 12:44
    that most applicants
    offered an exam school seat take it,
  • 12:44 - 12:46
    and applicants who enroll
  • 12:46 - 12:48
    at one of Chicago's selective
    enrollment high schools
  • 12:48 - 12:51
    are sure to be seated
    in a 9th grade classroom
  • 12:51 - 12:53
    filled with academically
    precocious peers,
  • 12:53 - 12:57
    because only the relatively
    precocious make it in.
  • 12:57 - 13:00
    The increase in peer achievement
    across the qualifying cutoff
  • 13:00 - 13:03
    amounts to almost
    half a standard deviation --
  • 13:03 - 13:04
    a very large effect.
  • 13:04 - 13:07
    And yet, precocious peers,
    notwithstanding,
  • 13:07 - 13:11
    the offer of an exam school seat
    does not appear to increase learning.
  • 13:12 - 13:16
    Let's plot applicants ACT scores
    against their tiebreaker values.
  • 13:16 - 13:19
    This plot shows that exam
    school applicants
  • 13:19 - 13:21
    who clear their qualifying cutoff
  • 13:21 - 13:24
    perform sharply worse on the ACT.
  • 13:24 - 13:25
    What explains this?
  • 13:26 - 13:29
    It takes a tale of IV and RD
    to untangle the forces
  • 13:29 - 13:33
    behind this intriguing
    and unexpected negative effect.
  • 13:33 - 13:35
    But first, some IV theory,
  • 13:40 - 13:43
    Guido Imbens and I
    developed theoretical tools
  • 13:43 - 13:46
    that enhance economists'
    understanding of empirical strategies
  • 13:46 - 13:48
    involving IV and RD.
  • 13:49 - 13:52
    The prize we share
    is in recognition of this work.
  • 13:52 - 13:55
    Guido and I overlapped for
    only one year at Harvard,
  • 13:55 - 13:58
    where we had both taken
    our first jobs post PhD.
  • 13:58 - 14:01
    I welcomed Guido
    to Cambridge, Massachusetts
  • 14:01 - 14:04
    with a pair of interesting
    instrumental variables.
  • 14:04 - 14:06
    I had used
    the draft lottery instrument
  • 14:06 - 14:08
    in my PhD thesis
  • 14:08 - 14:11
    to estimate the long-run
    economic consequences
  • 14:11 - 14:13
    of serving in the Armed Forces
  • 14:13 - 14:15
    for soldiers who were drafted.
  • 14:15 - 14:17
    The draft lottery instrument
    relies on the fact
  • 14:17 - 14:20
    that lottery numbers
    randomly assigned to birthdays
  • 14:20 - 14:23
    determined Vietnam-era
    conscription risk.
  • 14:24 - 14:27
    Yet, even then, most soldiers
    were volunteers,
  • 14:27 - 14:28
    as they are today,
  • 14:28 - 14:29
    The quarter birth instrument
  • 14:29 - 14:32
    is used in my 1991 paper
    with Alan Krueger
  • 14:32 - 14:35
    to estimate the economic
    returns to schooling.
  • 14:35 - 14:36
    This instrument uses the fact
  • 14:36 - 14:39
    that men who are born earlier
    in the year
  • 14:39 - 14:40
    are allowed to drop out
    of high school
  • 14:40 - 14:42
    on their 16th birthday
  • 14:42 - 14:45
    with less schooling completed
    than those born later.
  • 14:45 - 14:48
    Guido and I soon
    began asking each other,
  • 14:48 - 14:51
    "What really do we learn
    from the draft eligibility
  • 14:51 - 14:53
    and quarter of birth
    natural experiments?"
  • 14:54 - 14:57
    An early result in our quest
    for a new understanding of IV
  • 14:57 - 15:00
    was a solution to the problem
    of selection bias
  • 15:00 - 15:02
    in an RCT with partial compliance.
  • 15:03 - 15:05
    Even in a randomized clinical trial,
  • 15:05 - 15:08
    some of the people assigned
    to treatment may opt out.
  • 15:08 - 15:11
    This fact has long vexed trialists
  • 15:11 - 15:15
    because decisions to opt out
    are not made by random assignment.
  • 15:15 - 15:17
    Our first manuscript together
  • 15:17 - 15:20
    shows that in a randomized trial
    with partial compliance,
  • 15:20 - 15:22
    you can use IV
  • 15:22 - 15:24
    to estimate the effect
    of treatment on the treated,
  • 15:24 - 15:27
    even when some offered
    treatment decline it.
  • 15:27 - 15:28
    This works in spite of the fact
  • 15:28 - 15:31
    that those who comply
    with treatment
  • 15:31 - 15:33
    may be a very select group.
  • 15:33 - 15:36
    Unfortunately, for us,
    we were late to the party.
  • 15:36 - 15:39
    Not long after releasing
    our first working paper,
  • 15:39 - 15:42
    we learned of a concise
    contribution from Howard Bloom
  • 15:42 - 15:44
    that includes this theoretical result.
  • 15:44 - 15:47
    Remarkably, Bloom had derived
    this from first principles
  • 15:48 - 15:50
    without making a connection to IV.
  • 15:50 - 15:52
    So Guido and I went back
    to the drawing board.
  • 15:52 - 15:55
    And a few months later, we had LATE --
  • 15:55 - 15:56
    a theorem showing how to estimate
  • 15:56 - 15:59
    the local average treatment effect.
  • 15:59 - 16:02
    The LATE theorem generalizes
    the Bloom theorem
  • 16:02 - 16:05
    and establishes the connection
    between compliance and IV.
  • 16:06 - 16:08
    Maintaining the clinical trials analogy,
  • 16:08 - 16:12
    let "Zi" indicate whether subject "i"
    is offered treatment.
  • 16:12 - 16:14
    This is randomly assigned.
  • 16:14 - 16:16
    Also, let "D1i" indicate
    subject "i's" treatment status
  • 16:16 - 16:18
    when assigned to treatment,
  • 16:18 - 16:21
    and let the "0i" indicate
    subject "i's" treatment status
  • 16:21 - 16:23
    when assigned to control.
  • 16:23 - 16:25
    I'll use this formal notation
  • 16:25 - 16:27
    to give a clear statement
    of the late result,
  • 16:27 - 16:29
    and then follow up with examples.
  • 16:30 - 16:31
    A key piece of the late framework
  • 16:31 - 16:34
    pioneered by statistician
    Don Rubin,
  • 16:34 - 16:36
    is the pair of potential outcomes.
  • 16:36 - 16:38
    As is customary,
  • 16:38 - 16:40
    I denote potential outcomes
    for subject "i"
  • 16:40 - 16:42
    in the treated and untreated states
  • 16:42 - 16:45
    by "Y1i" and "Y0i" respectively,
  • 16:46 - 16:49
    The observed outcome
    is "Y1i" for the treated
  • 16:49 - 16:51
    and "Y0i" for those not treated.
  • 16:52 - 16:54
    "Y1i" minus "Y0i"
  • 16:54 - 16:57
    is the causal effect
    of treatment on individual "i",
  • 16:57 - 16:59
    but we can never see.
  • 16:59 - 17:03
    We try, therefore, to estimate
    some kind of average causal effect.
  • 17:03 - 17:06
    The late framework allows us
    to do that in an RCT
  • 17:06 - 17:08
    where some controls are treated.
  • 17:08 - 17:09
    The theorem says
  • 17:09 - 17:10
    that the average causal
    effect on people,
  • 17:10 - 17:12
    whose treatment status
    can be changed
  • 17:12 - 17:14
    by the offer of treatment
  • 17:14 - 17:17
    is the ratio of ITT
    to the treatment control difference
  • 17:17 - 17:18
    in compliance rates.
  • 17:19 - 17:21
    A mathematical statement of
    this result appears here,
  • 17:22 - 17:24
    where Greek letter Delta
    symbolizes the ITT effect
  • 17:24 - 17:29
    and Greek symbols pi1 and pi0
  • 17:29 - 17:31
    are compliance rates
    in the group assigned to treatment
  • 17:31 - 17:34
    and the group assigned
    to control, respectively,
  • 17:35 - 17:36
    The print version of this lecture
  • 17:36 - 17:39
    delves deeper
    into LATE intellectual history,
  • 17:39 - 17:41
    highlighting key contributions
    made with Rubin.
  • 17:42 - 17:45
    For now, though, I'd like to make
    the late theorem concrete for you
  • 17:45 - 17:48
    by sharing one of
    my favorite applications of it.
  • 17:53 - 17:54
    I'll explain the late framework
  • 17:54 - 17:57
    through a research question
    that has fascinated me
  • 17:57 - 17:58
    for almost two decades.
  • 17:59 - 18:00
    What is the causal effect
  • 18:00 - 18:02
    of charter school attendance
    on learning?
  • 18:02 - 18:04
    Charter schools are public schools
  • 18:04 - 18:07
    that operate independently
  • 18:07 - 18:10
    of traditional American
    public school districts.
  • 18:10 - 18:12
    A charter, the right
    to operate a public school
  • 18:12 - 18:14
    is typically awarded
    for a limited period
  • 18:15 - 18:18
    subject to renewal, conditional
    on good school performance.
  • 18:19 - 18:20
    Charter schools are free
  • 18:20 - 18:22
    to structure their curriculum
    and school environment.
  • 18:22 - 18:24
    The most controversial difference
  • 18:24 - 18:26
    between charters
    and traditional public schools
  • 18:27 - 18:29
    is the fact that the teachers
    and staff who work at charter schools
  • 18:29 - 18:32
    rarely belonged to labor unions.
  • 18:32 - 18:35
    By contrast, most
    big city public school teachers
  • 18:35 - 18:37
    work under union contracts.
  • 18:38 - 18:41
    The 2010 documentary film
    "Waiting for Superman"
  • 18:41 - 18:43
    feature schools belonging to
    the Knowledge is Power Program,
  • 18:43 - 18:44
    KIPP.
  • 18:44 - 18:49
    KIPP schools are emblematic
    of the high expectations,
  • 18:49 - 18:53
    sometimes also called "no excuses"
    approach to public education.
  • 18:53 - 18:57
    The "no excuses" model features
  • 18:57 - 18:59
    a long school day
    and extended school year,
  • 18:59 - 19:00
    selective teacher hiring
  • 19:00 - 19:01
    and focuses on traditional
    reading and math skills.
  • 19:01 - 19:05
    The American debate
    over education reform
  • 19:05 - 19:08
    often focuses on the achievement gap --
  • 19:08 - 19:09
    that's shorthand
    for large test score differences
  • 19:09 - 19:10
    by race and ethnicity.
  • 19:10 - 19:14
    Because of its focus
    on minority students,
  • 19:14 - 19:17
    KIPP is often central
    in this debate
  • 19:17 - 19:19
    with supporters pointing to the fact
  • 19:19 - 19:24
    that non-White KIPP students
    have markedly higher test scores
  • 19:24 - 19:25
    than non-White students
    from nearby schools.
  • 19:25 - 19:27
    KIPP skeptics on the other hand,
  • 19:27 - 19:30
    argue that KIPP's apparent success,
  • 19:30 - 19:33
    reflects the fact that KIPP
    attracts families
  • 19:33 - 19:35
    whose children would be
    more likely to succeed, anyway.
  • 19:35 - 19:36
    Who's right?
  • 19:37 - 19:39
    As you've probably guessed by now,
  • 19:39 - 19:41
    a randomized trial
    might prove decisive
  • 19:41 - 19:43
    in the debate
    over schools like KIPP.
  • 19:44 - 19:46
    Like Nobel Prize is, though,
  • 19:46 - 19:48
    seats at KIPP
    are not randomly assigned.
  • 19:48 - 19:50
    Well, at least, not entirely.
  • 19:51 - 19:52
    In fact,
  • 19:52 - 19:55
    Massachusetts charter schools
    with more applicants than seats
  • 19:55 - 19:57
    must offer their seats by lottery.
  • 19:57 - 20:00
    Sounds like a good natural experiment.
  • 20:00 - 20:02
    A little over a decade ago,
  • 20:02 - 20:04
    my collaborators and I
    collected data
  • 20:04 - 20:06
    on KIPP admissions lotteries
  • 20:06 - 20:09
    laying the foundation
    for two pioneering charter studies,
  • 20:09 - 20:12
    the first to use lotteries to study KIPP.
  • 20:12 - 20:15
    Our KIPP analysis is a classic IV story
  • 20:16 - 20:18
    because many students
    offered a seat in the KIPP lottery
  • 20:19 - 20:20
    failed to show up in the fall,
  • 20:20 - 20:24
    while a few not offered a seat,
    nevertheless, find their way in.
  • 20:24 - 20:27
    This graphic shows KIPP
    middle school applicants math scores
  • 20:27 - 20:30
    one year after applying to KIPP.
  • 20:30 - 20:32
    The entries above the line
  • 20:32 - 20:34
    show that Kip applicants
    who were offered a seat
  • 20:34 - 20:36
    have standardized
    math scores close to zero --
  • 20:36 - 20:39
    that is near the state average.
  • 20:39 - 20:42
    As before, we're working
    with standardized score data
  • 20:42 - 20:45
    that has a mean of 0 and
    a standard deviation of 1.
  • 20:46 - 20:48
    because KIPP applicants
    start with 4th grade scores
  • 20:48 - 20:53
    that are roughly .3 standard
    deviations below the state mean,
  • 20:53 - 20:57
    achievement at the level
    of the state average is impressive.
  • 20:57 - 21:00
    By contrast, the average math score
    among those not offered a seat
  • 21:01 - 21:03
    is about minus .36 sigma,
  • 21:03 - 21:07
    that is, .36 standard deviations
    below the state mean --
  • 21:07 - 21:10
    a result typical for urban
    students in Massachusetts.
  • 21:11 - 21:13
    Since lottery offers
    are randomly assigned,
  • 21:13 - 21:16
    we could say with confidence
    that the offer of a seated KIPP
  • 21:16 - 21:18
    boost math scores by
  • 21:18 - 21:20
    an average of .36 sigma,
  • 21:20 - 21:24
    a large effect that's
    also statistically precise.
  • 21:24 - 21:26
    We can be confident
    this is.n't a chance finding
  • 21:27 - 21:30
    What does an offer effect
    .36 sigma
  • 21:30 - 21:33
    tell us about the effects
    of actually going to KIPP?
  • 21:34 - 21:38
    IV methods convert KIPP offer
    effects into KIPP attendance effects.
  • 21:38 - 21:43
    I'll use this brief clip from my Marginal
    Revolution University short course
  • 21:43 - 21:46
    to quickly review the key
    assumptions behind this conversion.
  • 21:47 - 21:49
    - [Narrator] IV describes
    a chain reaction.
  • 21:50 - 21:52
    Why do offers affect achievement?
  • 21:52 - 21:55
    Probably because they affect
    charter attendance
  • 21:55 - 21:57
    and charter attendance
    improves math scores.
  • 21:58 - 22:01
    The first link in the chain
    called the "First Stage"
  • 22:01 - 22:06
    is the effect of the lottery
    on charter attendance.
  • 22:06 - 22:09
    The "Second Stage" is the link
  • 22:09 - 22:12
    between attending a charter
    and an outcome variable --
  • 22:12 - 22:14
    in this case, math scores.
  • 22:14 - 22:17
    The instrumental variable
    or "Instrument," for short,
  • 22:17 - 22:22
    is the variable that initiates
    the chain reaction.
  • 22:23 - 22:26
    The effect of the instrument
    on the outcome
  • 22:26 - 22:28
    is called the "Reduced Form."
  • 22:30 - 22:33
    This chain reaction can be
    represented mathematically.
  • 22:34 - 22:38
    We multiply the first stage --
    the effect of winning on attendance,
  • 22:38 - 22:42
    by the second stage --
    the effect of attendance on scores,
  • 22:42 - 22:45
    and we get the reduced form --
  • 22:45 - 22:47
    the effect of winning
    the lottery on scores.
  • 22:48 - 22:53
    The Reduced Form and First Stage
    are observable and easy to compute.
  • 22:54 - 22:56
    However, the effect
    of attendance on achievement
  • 22:56 - 22:59
    is not directly observed.
  • 22:59 - 23:02
    This is the causal effect.
    we're trying to determine
  • 23:03 - 23:06
    Given some important assumptions
    will discuss shortly,
  • 23:06 - 23:08
    we can find the effect
    of KIPP attendance
  • 23:08 - 23:10
    by dividing the reduced form
    by the first stage.
  • 23:10 - 23:15
    - [Joshua] IV eliminates selection bias,
  • 23:15 - 23:17
    but like all of our tools,
  • 23:17 - 23:18
    the solution builds
    on a set of assumptions
  • 23:18 - 23:21
    not to be taken for granted
  • 23:22 - 23:25
    first. There must be a
    substantial first stage.
  • 23:25 - 23:27
    That is the instrumental variable,
  • 23:27 - 23:30
    winning or losing the
    lottery must really change.
  • 23:30 - 23:34
    The variable, whose effect we're
    interested in here, KIPP attendance.
  • 23:35 - 23:37
    In this case. The first stage is
  • 23:37 - 23:41
    not really in doubt, winning the lottery,
    make skip attendance, much more likely,
  • 23:42 - 23:44
    not all IV, stories are like that.
  • 23:45 - 23:48
    S the instrument must
    be as good as randomly.
  • 23:48 - 23:52
    Signed meaning lottery winners and
    losers have similar characteristics.
  • 23:52 - 23:55
    This is the independence Assumption.
  • 23:55 - 23:56
    Of course,
  • 23:56 - 23:59
    KIPP lottery wins really
    are randomly assigned
  • 23:59 - 24:03
    still, we should check for balance
    and confirm that winners and losers
  • 24:03 - 24:07
    have similar family, backgrounds,
    similar, aptitudes, and so on,
  • 24:07 - 24:10
    in essence, we're checking to
    ensure KIPP lotteries are fair
  • 24:11 - 24:14
    with no group of applicants,
    suspiciously, likely to win.
  • 24:15 - 24:18
    Finally, we require the
    instrument change outcomes soul.
  • 24:18 - 24:21
    Through the variable of interest.
    In this case, attending camp,
  • 24:22 - 24:25
    this assumption is called
    the exclusion restriction.
  • 24:27 - 24:31
    The causal effect of KIPP attendance can
    therefore be written as the ratio of
  • 24:31 - 24:34
    the effect of offers on
    scores in the numerator,
  • 24:34 - 24:37
    over the effect of offers on KIPP
    and Roman in the denominator.
  • 24:38 - 24:40
    The numerator in this IV formula,
  • 24:40 - 24:44
    that is the direct effect of the
    instrument on outcomes. Has a special name.
  • 24:44 - 24:48
    This is called the reduced form.
    The denominator is the first step.
  • 24:48 - 24:49
    Stage
  • 24:49 - 24:53
    the exclusion restriction is often the
    trickiest or most controversial part of
  • 24:53 - 24:58
    an IV story. Here, the exclusion
    restriction amounts to the claim
  • 24:58 - 25:02
    that the .36 score, differential
    between lottery. Winners and losers
  • 25:02 - 25:08
    is entirely attributable to the .74,
    win-loss difference in attendance rates.
  • 25:08 - 25:09
    Plugging in the numbers.
  • 25:09 - 25:13
    The effect of KIPP attendance works
    out to be point four eight Sigma,
  • 25:13 - 25:18
    almost half a standard deviation gain in
    math scores. That's a remarkably large.
  • 25:18 - 25:24
    Perfect, who exactly benefits.
    So spectacularly from KIPP,
  • 25:24 - 25:27
    does everyone who applies to
    KIPP, see such large gains,
  • 25:28 - 25:29
    late answers. This question.
  • 25:30 - 25:33
    The late interpretation of the
    KIPP IV, empirical strategy,
  • 25:33 - 25:36
    is illuminated by the
    biblical story of Passover,
  • 25:37 - 25:40
    which explains that there
    are four types of children
  • 25:40 - 25:42
    each with characteristic behaviors
  • 25:42 - 25:45
    to keep track of these
    children and their behavior.
  • 25:45 - 25:48
    I'll give them a literate
    of names. Applicants like
  • 25:48 - 25:53
    Alvaro are dying to go to KIPP.
    If Alvaro loses, the KIPP lottery.
  • 25:53 - 25:56
    His mother finds a way to
    enroll him in KIPP. Anyway,
  • 25:56 - 26:02
    perhaps by reapplying applicants like
    Camilla are happy to go to Camp if they win
  • 26:02 - 26:07
    a seat in the lottery, but stoically
    accept the verdict, if they lose finally,
  • 26:07 - 26:12
    applicants, like normando worried about
    long days and lots of homework at KIPP
  • 26:12 - 26:17
    normando doesn't really want to go and
    refuses to go to KIPP when told that he won
  • 26:17 - 26:17
    the lottery.
  • 26:18 - 26:22
    That was called a never taker because
    win or lose. He doesn't go to KIPP
  • 26:22 - 26:27
    at the other end of KIPP commitment.
    Alvaro is called an always taker.
  • 26:27 - 26:31
    He'll happily take a seat. Went offered
    while his mother simply finds a way
  • 26:31 - 26:32
    to make it happen for him.
  • 26:33 - 26:37
    Even when he loses for Alvaro and
    normando. Both choice of school.
  • 26:38 - 26:41
    KIPP traditional is
    unaffected by the lottery.
  • 26:41 - 26:45
    Camilla is the type of
    applicant who gives IV its power
  • 26:45 - 26:48
    the instrument determines
    her treatment status.
  • 26:48 - 26:54
    I IV strategies depend on applicants,
    like, Camilla who are called compliers.
  • 26:54 - 26:58
    This term comes from the world of
    randomized, Trials, introduced earlier
  • 26:58 - 27:00
    as we've already discussed,
  • 27:00 - 27:05
    many randomized, trials, randomize
    only the opportunity to be treated
  • 27:05 - 27:10
    while the decision to comply with the
    treatment remains voluntary and non-random.
  • 27:11 - 27:15
    RCT. Compliers are those who take treatment
    when the offer of treatment is made?
  • 27:15 - 27:18
    But not otherwise with
    Lottery instruments.
  • 27:18 - 27:23
    Late is the effect of KIPP attendance on
    Camilla and other compliers like her.
  • 27:23 - 27:28
    Who enroll at KIPP take treatment when
    offered treatment through the lottery.
  • 27:28 - 27:29
    But not otherwise
  • 27:30 - 27:34
    IV methods are uninformed of for always
    takers like Alvaro and never takers
  • 27:34 - 27:39
    like normando because the instrument
    is unrelated to their treatment status.
  • 27:39 - 27:42
    Hey, didn't I say there
    are four types of children.
  • 27:42 - 27:48
    A fourth type of child in IV Theory
    behaves perversely every family has one.
  • 27:48 - 27:52
    These defiant children and Roland
    KIPP only when they lose the lottery.
  • 27:53 - 27:57
    Actually, the late theorem requires
    us to assume there are few defiers,
  • 27:57 - 28:00
    that seems like a reasonable assumption
    for Charter Lottery instruments.
  • 28:00 - 28:02
    If not in life.
  • 28:02 - 28:05
    The late theorem is sometimes
    seen as limiting the relevance
  • 28:05 - 28:07
    of econometric estimates
  • 28:07 - 28:10
    because it focuses attention
    on groups of compliers
  • 28:11 - 28:15
    yet. The population of compliers is a
    group. We'd very much like to learn about
  • 28:15 - 28:18
    in the KIPP example, compliers
    our children, likely to be,
  • 28:18 - 28:24
    Drawn into KIPP where the school to expand
    and offer additional seats in a lottery.
  • 28:24 - 28:27
    How relevant is this a few years ago,
  • 28:27 - 28:31
    Massachusetts indeed allowed
    thriving charter schools to expand
  • 28:31 - 28:33
    a recent study by some of my lab mates
  • 28:34 - 28:37
    shows that late estimates like
    the one we just computed for KIPP
  • 28:37 - 28:41
    predict learning gains at the
    schools created by Charter expansion.
  • 28:46 - 28:48
    Late isn't just a theorem.
  • 28:48 - 28:48
    It's a
  • 28:48 - 28:49
    framework.
  • 28:49 - 28:53
    The late framework can be used to
    estimate the entire distribution
  • 28:53 - 28:55
    of potential outcomes for compliers
  • 28:55 - 28:59
    as if we really did have a
    randomized trial for this group.
  • 28:59 - 29:03
    Although the theory behind this
    fact is necessarily technical.
  • 29:03 - 29:08
    The value of the framework is easily
    appreciated in practice by way of illustration.
  • 29:09 - 29:10
    Recall that the KIPP study
  • 29:10 - 29:14
    is motivated in part by
    differences in test scores by race.
  • 29:14 - 29:18
    Let's look at the distribution of
    4th grade scores separately by.
  • 29:18 - 29:21
    Race for applicants to Boston
    Charter, Middle Schools,
  • 29:21 - 29:25
    the two sides of this figure
    show distributions for treated
  • 29:25 - 29:30
    and untreated compliers treated.
    Compliers are compliers offered.
  • 29:30 - 29:34
    A charter seat in a lottery. While
    untreated compliers are not offered a seat
  • 29:34 - 29:39
    because these are 4th grade scores
    while middle school begins in 5th
  • 29:39 - 29:42
    or 6th grade. The two sides
    of the figure are similar.
  • 29:42 - 29:47
    Both sides show score distributions for
    black applicants shifted to the left
  • 29:47 - 29:48
    of the corresponding.
  • 29:48 - 29:52
    Gorgeous tribulations
    for Whites by 8th grade.
  • 29:52 - 29:56
    Treated compliers have completed
    Middle School at a Boston Charter.
  • 29:56 - 30:00
    Well, I'm treated compliers have
    remained in traditional Public School.
  • 30:00 - 30:01
    Remarkably.
  • 30:01 - 30:05
    This next graphic shows that the
    eighth grade score distributions
  • 30:05 - 30:08
    of black-and-white treated.
    Compliers are indistinguishable.
  • 30:09 - 30:12
    Boston charter middle schools,
    close the achievement Gap,
  • 30:13 - 30:17
    but for the untreated black and white
    score distributions remained distinct
  • 30:17 - 30:18
    with black students.
  • 30:18 - 30:22
    Hind white students as
    they were in 4th grade,
  • 30:22 - 30:23
    Boston Charters,
  • 30:23 - 30:26
    close the achievement Gap because
    those who enter Charter Schools,
  • 30:26 - 30:30
    the farthest behind tend to gain
    the most from Charter. Enrollment
  • 30:31 - 30:34
    I elaborate on this point in
    the print version of this talk.
  • 30:39 - 30:43
    Remember the puzzle of -
    Chicago exam School effects.
  • 30:43 - 30:48
    I'll finish the scientific part of my
    talk by using IV and RD to explain this.
  • 30:48 - 30:50
    This surprising finding
  • 30:50 - 30:54
    the resolution of this puzzle starts
    with the fact that economic reasoning
  • 30:55 - 30:56
    is about Alternatives.
  • 30:57 - 31:00
    So what's the alternative
    to an exam school education
  • 31:00 - 31:03
    for most applicants to
    Chicago exam schools.
  • 31:03 - 31:07
    The leading non exam. Alternative
    is a traditional public school,
  • 31:08 - 31:13
    but many of Chicago's rejected exam school
    applicants enroll in a charter school
  • 31:14 - 31:18
    exam school offers. Therefore reduce the
    likelihood of Charter School attendance.
  • 31:18 - 31:24
    Specifically exam schools divert
    applicants away from high schools
  • 31:24 - 31:26
    in the noble network of charter schools.
  • 31:27 - 31:27
    Noble
  • 31:27 - 31:33
    with pedagogy much, like KIPP is one of
    Chicago's most visible Charter providers.
  • 31:33 - 31:38
    Also like KIPP convincing evidence on
    Noble Effectiveness comes from admissions,
  • 31:38 - 31:39
    lotteries.
  • 31:39 - 31:45
    The x-axis in this graphic shows Lottery
    offer effects on years enrolled at Noble.
  • 31:46 - 31:48
    This is the noble first stage.
  • 31:48 - 31:51
    For an IV setup that uses a dummy.
  • 31:51 - 31:55
    Indicating Noble Lottery offers as
    an instrument for Noble enrollment.
  • 31:55 - 31:58
    Now, this graphic has a
    feature that distinguishes.
  • 31:58 - 32:04
    It from the simpler KIPP analysis,
    the plot shows. First stage effects.
  • 32:04 - 32:05
    For two groups.
  • 32:05 - 32:07
    One for Noble applicants who live
  • 32:07 - 32:12
    in Chicago's lowest income
    neighborhoods Tier 1 and 1/4 Noble,
  • 32:12 - 32:18
    applicants who live in higher, income
    areas, tier 3, remember the IV chain.
  • 32:19 - 32:24
    Each point in this graphic has coordinates
    given by first stage reduced form
  • 32:24 - 32:27
    and therefore implies an IV estimate.
  • 32:27 - 32:32
    The effect of noble enrollment on ACT scores
    is the ratio of reduced form coordinate
  • 32:32 - 32:34
    to First Stage coordinate.
  • 32:34 - 32:37
    The graphic shows to such ratios.
  • 32:37 - 32:41
    The relevant results for Tier 1 are .35.
  • 32:41 - 32:47
    While for tier 3. We have .33 not bad
    for Noble applicants from both tears.
  • 32:48 - 32:48
    These
  • 32:48 - 32:50
    stage in reduced form estimates
  • 32:50 - 32:55
    imply a yearly Noble and Roman effective
    about a third of a standard deviation gain,
  • 32:55 - 32:57
    in act math scores.
  • 32:58 - 33:02
    Notice. There's also a line connecting
    the two Ivy estimates in the figure
  • 33:02 - 33:05
    because this line passes
    through the origin,
  • 33:05 - 33:11
    it's slope rise. Over run is about equal
    to the to IV estimates. In this case.
  • 33:11 - 33:13
    The slope is about Point 3 for
  • 33:14 - 33:18
    the fact that the line passes
    through 0 0 is significant for
  • 33:18 - 33:23
    Reason by this fact, we've
    substantiated the exclusion restriction,
  • 33:23 - 33:26
    specifically, the exclusion restriction,
  • 33:26 - 33:31
    says that given a group for which Noble
    offers are unrelated to Noble enrollment.
  • 33:31 - 33:37
    We should expect to see Zero reduced form
    effect of these offers made to applicants
  • 33:37 - 33:38
    in that group.
  • 33:38 - 33:43
    How consistent is the evidence for a
    noble cause learning gain on the order
  • 33:43 - 33:48
    of point, three, four Sigma per year
    in this next graphic? We've added
  • 33:48 - 33:50
    L've more points to the original to
  • 33:50 - 33:55
    the red points here, show, first stage
    in reduced form, Noble offer effects
  • 33:55 - 33:58
    for 12, additional groups to more tears,
  • 33:58 - 34:04
    and 12 groups, defined by demographic,
    characteristics related to race sex,
  • 34:04 - 34:06
    family, income, and Baseline scores.
  • 34:06 - 34:08
    Although not a perfect fit
  • 34:08 - 34:12
    these points cluster around a line
    with slope Point Three, Six Sigma
  • 34:12 - 34:16
    much like the line. We saw earlier
    for applicants from tiers 1 and 3.
  • 34:17 - 34:18
    You're likely now.
  • 34:18 - 34:22
    Ring. What the noble IV estimates
    in this figure have to do
  • 34:22 - 34:24
    with exam School enrollment.
  • 34:24 - 34:25
    Here's the answer,
  • 34:26 - 34:30
    the Blue Line in this new graphic
    shows. As we should expect
  • 34:30 - 34:32
    that exam. School exposure jumps up
  • 34:32 - 34:36
    for applicants who clear their
    qualifying cutoff. At the same time.
  • 34:37 - 34:42
    The Red Line shows that Noble School
    enrollment clearly Falls at the same point.
  • 34:42 - 34:48
    This is the diversion effect of exam
    school offers on Noble enrollment.
  • 34:48 - 34:53
    Many kids offered an exam School
    seat, prefer that exam School seat,
  • 34:53 - 34:58
    to enrollment at Noble IV, affords us,
    the opportunity to go out on a limb
  • 34:58 - 35:02
    with strong claims about the
    mechanism behind the causal effect.
  • 35:02 - 35:07
    Here's a strong causal, claim Regarding why
    Chicago exam schools reduce achievement.
  • 35:08 - 35:11
    The primary force driving
    reduced form exam School.
  • 35:11 - 35:13
    Qualification effects on ACT scores.
  • 35:14 - 35:18
    I claim is the effect of exam
    school offers on Noble and
  • 35:18 - 35:20
    Vomit in support of this claim,
  • 35:20 - 35:27
    consider the points plotted here in blue
    all well to the left of 0 on the x-axis.
  • 35:27 - 35:32
    These points are negative because they Mark
    the effect of exam School qualification
  • 35:32 - 35:36
    on Noble School, enrollment for
    particular, groups of applicants.
  • 35:37 - 35:38
    Now, we've already seen
  • 35:38 - 35:41
    that Noble applicants offered a noble seen
  • 35:41 - 35:44
    realize large act math, gains as a result.
  • 35:45 - 35:48
    Now consider exam school offers as
  • 35:48 - 35:50
    An instrument for Noble enrollment,
  • 35:51 - 35:53
    as always, Ivy is a chain reaction.
  • 35:54 - 35:58
    If exam School qualification reduces
    time at Noble, by Point 37 years,
  • 35:58 - 36:00
    and each year of noble, enrollment
  • 36:00 - 36:04
    boosts act math scores by
    about Point Three, Six Sigma.
  • 36:04 - 36:08
    We should expect reduced form
    effects of exam School. Qualification
  • 36:08 - 36:12
    to reduce ACT scores by the
    product of these two numbers.
  • 36:12 - 36:15
    That is by about Point 1, 3 Sigma
  • 36:15 - 36:18
    the reduced form qualification
    effects at the left of the
  • 36:18 - 36:20
    Here are broadly consistent with this.
  • 36:21 - 36:25
    They cluster closer to -
    .16. Then 2 minus Point 1 3,
  • 36:25 - 36:28
    but that difference is well
    within the sampling variance
  • 36:28 - 36:29
    of the underlying estimates.
  • 36:30 - 36:31
    The causal story told here,
  • 36:32 - 36:35
    postulates diversion
    away from Charter Schools
  • 36:35 - 36:39
    as the mechanism by which exam
    school offers effect achievement.
  • 36:40 - 36:42
    In other words, it's Noble enrollment,
  • 36:42 - 36:45
    that's presumed to satisfy
    an exclusion restriction.
  • 36:45 - 36:48
    When we use exam school offers as an
  • 36:48 - 36:50
    Variable importantly,
  • 36:50 - 36:55
    as we saw before the line in this final
    graphic, with two sets of 14 points,
  • 36:55 - 36:57
    runs through the origin.
  • 36:57 - 37:00
    This fact supports our
    new exclusion restriction
  • 37:01 - 37:02
    for any applicant group
  • 37:02 - 37:06
    for which exam school offers have little
    or no effect on Noble School enrollment.
  • 37:06 - 37:10
    We should also see ACT scores unchanged
  • 37:10 - 37:14
    at the same time because the blue and
    red dots cluster around the same line,
  • 37:14 - 37:18
    the IV estimates of noble School,
    enrollment effects generated.
  • 37:18 - 37:23
    I both Noble and exam school
    offers are about the same.
  • 37:23 - 37:28
    I hope this empirical story convinces,
    you of the power of IV and RD
  • 37:28 - 37:31
    to generate new causal,
    knowledge for decades.
  • 37:31 - 37:35
    I've been lucky to work on many
    equally engaging empirical problems.
  • 37:40 - 37:43
    I computed the draft
    lottery IV estimates in
  • 37:43 - 37:47
    my Princeton PhD thesis on a
    big hairy Mainframe monster.
  • 37:47 - 37:51
    Using nine track tapes, and Lease
    space on a communal. Hard drive,
  • 37:52 - 37:55
    Princeton graduate students learn
    to mount and manipulate tape.
  • 37:55 - 37:57
    Reels the size of a cheesecake.
  • 37:58 - 38:02
    Thankfully empirical work today
    is a little less labor-intensive.
  • 38:02 - 38:05
    What else is improved in
    the modern empirical era
  • 38:06 - 38:09
    in a 2010 article, Steve
    Pischke, and I coined the phrase.
  • 38:10 - 38:11
    He Revolution
  • 38:11 - 38:16
    by this, we mean economic shift towards
    transparent empirical strategies.
  • 38:16 - 38:18
    Applied to concrete, causal questions,
  • 38:19 - 38:22
    like the questions. David
    Carter studied. So convincingly.
  • 38:23 - 38:28
    The econometrics of my schooldays
    focused more on models than on questions.
  • 38:28 - 38:31
    The modeling concerns of
    that era have mostly faded
  • 38:32 - 38:35
    but econometricians have since
    found much to contribute.
  • 38:35 - 38:40
    I'll save my personal lists of greatest
    hits and exciting new artists for the
  • 38:40 - 38:41
    The print version of this lecture.
  • 38:42 - 38:46
    I'll wrap up here by saying that I'm
    proud to be part of the Contemporary.
  • 38:46 - 38:48
    Empirical economics, Enterprise
  • 38:48 - 38:51
    and I'm gratified beyond words
  • 38:51 - 38:56
    to have been recognized for contributing
    to it back at Princeton in the late 80s.
  • 38:56 - 39:00
    My graduate school classmates and I
    chuckled leading Ed lemurs. Lament,
  • 39:00 - 39:04
    that no Economist takes another
    economists. Empirical work. Seriously.
  • 39:05 - 39:10
    This is no longer, true, empirical
    work today aspires to tell convincing.
  • 39:10 - 39:14
    Also stories, not that every
    effort succeeds far from it.
  • 39:14 - 39:18
    But as any economics job
    market candidate will tell you,
  • 39:18 - 39:23
    empirical work carefully executed and
    clearly explained is taken seriously. Indeed.
  • 39:24 - 39:27
    That is a measure of
    our Enterprises success.
  • 39:34 - 39:36
    If you'd like to learn more from Josh,
  • 39:36 - 39:40
    check out his free course mastering
    econometrics if you'd like to
  • 39:40 - 39:41
    Spore, Josh's research,
  • 39:41 - 39:44
    check out the links in the description
    or you can click to watch more
  • 39:44 - 39:45
    of Josh's videos.
Title:
Joshua Angrist Nobel Prize Lecture 2021
ASR Confidence:
0.85
Description:

more » « less
Video Language:
English
Team:
Marginal Revolution University
Duration:
39:55

English subtitles

Revisions Compare revisions