Return to Video

How to Read Economics Research Papers: Randomized Controlled Trials (RCTs)

  • 0:01 - 0:05
    - [Narrator] On his quest
    to master econometrics,
  • 0:05 - 0:09
    Grasshopper Kamal has
    made great progress,
  • 0:09 - 0:14
    stretching his capabilities
    and outsmarting his foes.
  • 0:14 - 0:17
    Alas, today he's despondent,
  • 0:17 - 0:19
    for one challenge remains unmet.
  • 0:19 - 0:24
    Kamal cannot yet decode
    the scriptures of academic research,
  • 0:24 - 0:27
    journals like
    "The American Economic Review"
  • 0:27 - 0:29
    and "Econometrica."
  • 0:29 - 0:34
    These seemed to him to be inscribed
    in an obscure foreign tongue.
  • 0:34 - 0:35
    - [Kamal] Ugh, what the... ?
  • 0:37 - 0:40
    - These volumes are
    opaque to the novice, Kamal,
  • 0:40 - 0:42
    but can be deciphered with study.
  • 0:42 - 0:45
    Let us learn to read them together.
  • 0:52 - 0:55
    Let's dive into the West Point study,
  • 0:55 - 0:58
    published in the "Economics
    of Education Review."
  • 0:59 - 1:02
    This paper reports
    on a randomized evaluation
  • 1:02 - 1:06
    of student electronics use
    in Economics 101 classrooms.
  • 1:06 - 1:09
    First, a quick review
    of the research design.
  • 1:09 - 1:11
    - Okay.
  • 1:12 - 1:14
    - [Josh] 'Metrics masters
    teaching at West Point,
  • 1:14 - 1:17
    the military college that trains
    American Army officers
  • 1:17 - 1:20
    designed a randomized trial
    to answer this question.
  • 1:20 - 1:23
    These masters randomly assigned
    West Point cadets
  • 1:23 - 1:26
    into Economics classes
    operating under different rules.
  • 1:27 - 1:29
    Unlike most American colleges,
  • 1:29 - 1:32
    the West Point default
    is no electronics.
  • 1:32 - 1:35
    For purposes of this experiment,
    some students were left
  • 1:35 - 1:39
    in such traditional
    technology-free classes,
  • 1:39 - 1:42
    no laptops, no tablets
    and no phones!
  • 1:42 - 1:43
    [voice echoes]
  • 1:43 - 1:46
    This is the control group,
    or baseline case.
  • 1:46 - 1:49
    Another group was allowed
    to use electronics.
  • 1:49 - 1:53
    This is the treatment group,
    subject to a changed environment.
  • 1:53 - 1:56
    The treatment in this case
    is the unrestricted use
  • 1:56 - 1:58
    of laptops or tablets in class.
  • 1:59 - 2:02
    Every causal question
    has a clear outcome,
  • 2:02 - 2:05
    the variables we hope to influence
    defined in advance of the study.
  • 2:06 - 2:08
    The outcomes in the West Point
    electronics study
  • 2:08 - 2:10
    are final exam scores.
  • 2:10 - 2:13
    The study seeks to answer
    the following question,
  • 2:13 - 2:17
    what is the causal effect
    of classroom electronics on learning
  • 2:17 - 2:20
    as measured by exam scores?
  • 2:21 - 2:24
    - Economics journal articles
    usually begin with a table
  • 2:24 - 2:27
    of descriptive statistics,
    giving key facts
  • 2:27 - 2:28
    about the study sample.
  • 2:28 - 2:32
    - Oh my gosh, I remember this table,
    so confusing!
  • 2:32 - 2:37
    - [Narrator] Columns 1 to 3 report
    mean, or average, characteristics.
  • 2:37 - 2:40
    These give a sense
    of who we're studying.
  • 2:40 - 2:44
    Let's start with column 1
    which describes covariates
  • 2:44 - 2:45
    in the control group.
  • 2:45 - 2:49
    Covariates are characteristics
    of the control and treatment groups
  • 2:49 - 2:52
    measured before
    the experiment begins.
  • 2:52 - 2:57
    For example, we see the control group
    has an average age a bit over 20.
  • 2:57 - 3:00
    Many of these covariates
    are dummy variables.
  • 3:01 - 3:06
    A dummy variable can only have
    two values, a zero or a one.
  • 3:06 - 3:10
    For example, student gender
    is captured by a dummy variable
  • 3:10 - 3:13
    that equals one for women
    and zero for men.
  • 3:13 - 3:17
    The mean of this variable
    is the proportion female.
  • 3:17 - 3:21
    We also see that the control group
    is 13% Hispanic
  • 3:21 - 3:24
    and 19% had prior military service.
  • 3:25 - 3:27
    The table notes are key.
  • 3:27 - 3:29
    Refer to these
    as you scan the table.
  • 3:29 - 3:33
    These notes explain what's shown
    in each column and panel.
  • 3:39 - 3:43
    The notes tell us, for example,
    that standard deviations
  • 3:43 - 3:45
    are reported in brackets.
  • 3:46 - 3:50
    Standard deviations tell us how
    spread out the data are.
  • 3:50 - 3:55
    For example, a standard deviation
    of 0.52 tells us that most
  • 3:55 - 3:59
    of the control group's GPAs
    fall between 2.35,
  • 3:59 - 4:03
    which is 0.52 below
    the mean GPA of 2.87,
  • 4:03 - 4:08
    and 3.39, which is 0.52 above 2.87.
  • 4:09 - 4:12
    A lower standard deviation
    would mean the GPAs were
  • 4:12 - 4:14
    more tightly clustered
    around the mean.
  • 4:15 - 4:17
    - [Kamal] Yeah, but they're missing
    for most of the variables.
  • 4:17 - 4:19
    - [Narrator] That's right.
  • 4:19 - 4:22
    Masters usually omit
    standard deviations for dummies
  • 4:22 - 4:26
    because the mean of this variable
    determines its standard deviation.
  • 4:27 - 4:31
    This study compares two treatment
    groups with the control group.
  • 4:31 - 4:36
    The first was allowed free use
    of laptops and tablets.
  • 4:36 - 4:38
    The second treatment
    was more restrictive,
  • 4:38 - 4:42
    allowing only tablets placed
    flat on the desk.
  • 4:42 - 4:45
    The treatment groups
    look much like the control group.
  • 4:46 - 4:51
    This takes us to the next feature
    of this table, columns 4 through 6
  • 4:51 - 4:55
    use statistical tests to compare
    the characteristics
  • 4:55 - 4:58
    of the treatment and control group
    before the experiment.
  • 4:58 - 5:02
    In column 4, the two treatment
    groups are combined.
  • 5:02 - 5:05
    You can see that the difference
    in proportion female
  • 5:05 - 5:10
    between the treatment
    and control group is only 0.03.
  • 5:10 - 5:14
    The difference is not
    statistically significant.
  • 5:14 - 5:17
    It is the sort of difference
    we can easily put down
  • 5:17 - 5:20
    to chance results
    in our sample selection process.
  • 5:20 - 5:22
    - [Kamal] Hmm, how do we know that?
  • 5:22 - 5:24
    - [Narrator] Remember
    the rule of thumb?
  • 5:24 - 5:27
    Statistical estimates
    that exceed the standard error
  • 5:27 - 5:30
    by a multiple of 2
    in absolute value
  • 5:30 - 5:34
    are usually said
    to be statistically significant.
  • 5:35 - 5:38
    The standard error is 0.03,
  • 5:39 - 5:41
    same as the difference
    in proportion female.
  • 5:42 - 5:46
    So the ratio of the latter
    to the former is only 1,
  • 5:46 - 5:49
    which of course is less than 2.
  • 5:49 - 5:51
    - [Kamal] Uh huh. So none
    of the treatment/control differences
  • 5:51 - 5:54
    in the table are more than twice
    their standard errors.
  • 5:54 - 5:56
    - [Narrator] Correct.
  • 5:56 - 5:59
    The random division of students
    appears to have succeeded
  • 5:59 - 6:02
    in creating groups
    that are indeed comparable.
  • 6:03 - 6:06
    We can be confident therefore
    that any later differences
  • 6:06 - 6:10
    in classroom achievement
    are the result of the experimental
  • 6:10 - 6:13
    intervention rather
    than a reflection
  • 6:13 - 6:15
    of preexisting differences.
  • 6:15 - 6:17
    Ceteris paribus achieved!
  • 6:17 - 6:21
    - [Kamal] Cool. Wait,
    what about the bottom,
  • 6:21 - 6:23
    the numbers with the stars?
  • 6:23 - 6:25
    Those differences are a lot more
    than double the standard error.
  • 6:25 - 6:27
    - [Narrator] Good eye, Kamal!
  • 6:27 - 6:29
    The table has many numbers.
  • 6:29 - 6:32
    Those in Panel B are important too.
  • 6:32 - 6:36
    This panel measures the extent
    to which students in treatment
  • 6:36 - 6:39
    and control groups actually use
    computers in class.
  • 6:39 - 6:43
    The treatment here was
    to allow computer use.
  • 6:43 - 6:46
    The researchers must show
    that students allowed
  • 6:46 - 6:49
    to use computers took advantage
    of the opportunity to do so.
  • 6:50 - 6:53
    If they didn't, then there's
    really no treatment.
  • 6:53 - 6:58
    Luckily, 81% of those
    in the first treatment group
  • 6:58 - 7:02
    used computers compared
    with none in the control group.
  • 7:02 - 7:05
    And many in the second
    tablet treatment group
  • 7:05 - 7:07
    used computers as well.
  • 7:07 - 7:10
    These differences
    in computer use are large
  • 7:10 - 7:12
    and statistically significant.
  • 7:12 - 7:15
    We also get to see
    the sample size in each group.
  • 7:15 - 7:18
    - [Kamal] The stars
    are just like decoration?
  • 7:18 - 7:22
    - [Narrator] Some academic papers
    use stars to indicate differences
  • 7:22 - 7:24
    that are statistically significant.
  • 7:24 - 7:27
    This makes them jump out at you.
  • 7:27 - 7:32
    Here three stars indicate that
    the result is statistically different
  • 7:32 - 7:35
    from zero with a p value
    less than 1%.
  • 7:35 - 7:39
    In other words, there's less
    than a 1 in 100 chance
  • 7:39 - 7:42
    this result is purely
    a chance finding.
  • 7:42 - 7:43
    [applause]
  • 7:43 - 7:49
    Two stars indicate a 1 in 20
    or 5% chance of a chance finding.
  • 7:49 - 7:53
    And one star denotes results
    we might see as often as 10%
  • 7:53 - 7:56
    of the time merely due to chance.
  • 7:56 - 8:00
    Today, stars are seen
    as a little old fashioned.
  • 8:00 - 8:01
    Some journals omit them.
  • 8:01 - 8:03
    - [Kamal] What about
    those last two columns?
  • 8:03 - 8:06
    - [Narrator] Unlike column 4,
    which combines
  • 8:06 - 8:10
    both treatment groups into one,
    these last two columns
  • 8:10 - 8:12
    look separately
    at treatment/control differences
  • 8:12 - 8:14
    for each treatment group.
  • 8:14 - 8:17
    This provides a more detailed
    analysis of balance.
  • 8:18 - 8:21
    Also, for now,
    you can ignore this row
  • 8:21 - 8:24
    which provides
    another test of significance.
  • 8:24 - 8:29
    Now we get to the article's
    punchline, table 4.
  • 8:30 - 8:33
    This table reports
    regression estimates
  • 8:33 - 8:37
    of the effects of electronics use
    on measures of student learning.
  • 8:37 - 8:40
    - [Kamal Why does the study
    report regression estimates?
  • 8:40 - 8:42
    See, that's why I was getting lost.
  • 8:42 - 8:45
    I thought one reason
    why we liked randomized trials
  • 8:45 - 8:47
    is that we use them
    to obtain causal effects
  • 8:47 - 8:50
    simply by comparing
    treatment and control groups.
  • 8:50 - 8:53
    Since these groups are balanced,
    no need to use regression.
  • 8:53 - 8:55
    - [Narrator] Well said, Kamal.
  • Not Synced
    In practice, it's customary
    to report regression estimates
  • Not Synced
    for two reasons.
  • Not Synced
    First, evidence of balance
    not withstanding, an abundance
  • Not Synced
    of caution might lead the analyst
    to allow for chance differences.
  • Not Synced
    Second, regression estimates
    are likely to be more precise.
  • Not Synced
    That is, they have lower
    standard errors than
  • Not Synced
    the simple treatment
    control comparisons.
  • Not Synced
    The dependent variable
    in this study
  • Not Synced
    is the outcome of interest.
  • Not Synced
    Since the question at hand
    is how classroom electronics
  • Not Synced
    affect learning, a good outcome
    is the economics final exam score.
  • Not Synced
    Each column reports results
    from a different regression model.
  • Not Synced
    Models are distinguished
    by the control variables
  • Not Synced
    or covariates they include
    besides treatment status.
  • Not Synced
    Estimates with no covariates
    are simple comparisons
  • Not Synced
    of treatment and control groups.
  • Not Synced
    - [Kamal] I thought
    they just forgot to fill it out.
  • Not Synced
    - [Narrator] Column 1 suggests
    electronics use reduced
  • Not Synced
    final exam scores
    by 0.28 standard deviations.
  • Not Synced
    In our last lesson, Master Joshway
    explained, we use standard deviation
  • Not Synced
    units because these units
    are easily compared across studies.
  • Not Synced
    Column 2 reports results
    from a model
  • Not Synced
    that adds demographic controls.
  • Not Synced
    Here we're comparing test scores
    but holding constant factors
  • Not Synced
    such as age and sex.
  • Not Synced
    Column 3 reports results
    from a model that adds GPA
  • Not Synced
    to the list of covariates.
  • Not Synced
    Column 4 adds ACT scores.
  • Not Synced
    Analysts often report
    results this way,
  • Not Synced
    starting with models that include
    few or no covariates
  • Not Synced
    and then reporting estimates
    from models that add more
  • Not Synced
    and more covariates
    as we move across columns.
  • Not Synced
    Looking across columns,
    what do you notice?
  • Not Synced
    - [Kamal] Well, the coefficient
    on using a computer is always
  • Not Synced
    a pretty big negative number.
  • Not Synced
    - [Narrator] That's right!
  • Not Synced
    We can also see that
    the standard errors are small enough
  • Not Synced
    to make these negative results
    statistically significant.
  • Not Synced
    In other words, the primary
    takeaway from this experiment
  • Not Synced
    is that electronics in the classroom
    reduce student learning.
  • Not Synced
    - [Kama] GPA and ACT scores
    are also significant.
  • Not Synced
    Why is that?
  • Not Synced
    - [Narrator] Good observation!
  • Not Synced
    That's not surprising.
  • Not Synced
    We expect these variables
    to predict college performance.
  • Not Synced
    - [Kamal] Oh right, of course.
  • Not Synced
    Kids who got better grades before
    are more likely to get
  • Not Synced
    a better grade in this course.
  • Not Synced
    - [Narrator] You'll also notice a lot
    of other information on this table.
  • Not Synced
    Remaining panels in the table
    report effects of electronics use
  • Not Synced
    on components of the final exam,
  • Not Synced
    such as the multiple
    choice questions.
  • Not Synced
    These results are mostly consistent
    with computer use effects
  • Not Synced
    on overall scores.
  • Not Synced
    - [Kamal] What about the rows
    not in English?
  • Not Synced
    - [Narrator] These rows give
    additional statistical information.
  • Not Synced
    R-squared is a measure
    of goodness of fit.
  • Not Synced
    This isn't too important, though
    some readers may want to know it.
  • Not Synced
    Other rows report on alternative
    tests of statistical significance
  • Not Synced
    that you can ignore for now.
  • Not Synced
    - [Kamal] Oh my gosh,
    these tables aren't that hard.
  • Not Synced
    Thank you so much.
  • Not Synced
    Next up is regression.
  • Not Synced
    See you then!
  • Not Synced
    ♪ [music] ♪
  • Not Synced
    You're on your way
    to mastering econometrics.
  • Not Synced
    Make sure this video sticks
  • Not Synced
    by taking a few
    quick practice questions.
  • Not Synced
    Or, if you're ready,
    click for the next video.
  • Not Synced
    You can also check out MRU's
    website for more courses,
  • Not Synced
    teacher resources and more.
Title:
How to Read Economics Research Papers: Randomized Controlled Trials (RCTs)
Description:

more » « less
Video Language:
English
Team:
Marginal Revolution University
Project:
Mastering Econometrics
Duration:
12:40

English subtitles

Revisions Compare revisions