Return to Video

How to Read Economics Research Papers: Randomized Controlled Trials (RCTs)

  • 0:01 - 0:05
    - [Narrator] On his quest
    to master econometrics,
  • 0:05 - 0:09
    Grasshopper Kamal has
    made great progress,
  • 0:09 - 0:14
    stretching his capabilities
    and outsmarting his foes.
  • 0:14 - 0:17
    Alas, today he's despondent,
  • 0:17 - 0:19
    for one challenge remains unmet.
  • 0:19 - 0:24
    Kamal cannot yet decode
    the scriptures of academic research,
  • 0:24 - 0:27
    journals like
    "The American Economic Review"
  • 0:27 - 0:29
    and "Econometrica."
  • 0:29 - 0:34
    These seemed to him to be inscribed
    in an obscure foreign tongue.
  • 0:34 - 0:35
    - [Kamal] Ugh, what the... ?
  • 0:37 - 0:40
    - These volumes are
    opaque to the novice, Kamal,
  • 0:40 - 0:42
    but can be deciphered with study.
  • 0:42 - 0:45
    Let us learn to read them together.
  • 0:52 - 0:55
    Let's dive into the West Point study,
  • 0:55 - 0:58
    published in the "Economics
    of Education Review."
  • Not Synced
    This paper reports
    on a randomized evaluation
  • Not Synced
    of student electronics use
    in Economics 101 classrooms.
  • Not Synced
    First, a quick review
    of the research design.
  • Not Synced
    - Okay.
  • Not Synced
    - [Josh] 'Metrics masters
    teaching at West Point,
  • Not Synced
    the military college that trains
    American Army officers
  • Not Synced
    designed a randomized trial
    to answer this question.
  • Not Synced
    These masters randomly assigned
    West Point cadets
  • Not Synced
    into Economics classes
    operating under different rules.
  • Not Synced
    Unlike most American colleges,
  • Not Synced
    the West Point default
    is no electronics.
  • Not Synced
    For purposes of this experiment,
    some students were left
  • Not Synced
    in such traditional
    technology-free classes,
  • Not Synced
    no lap tops, no tablets
    and no phones!
  • Not Synced
    [voice echoes]
  • Not Synced
    This is the control group,
    or baseline case.
  • Not Synced
    Another group was allowed
    to use electronics.
  • Not Synced
    This is the treatment group,
    subject to a changed environment.
  • Not Synced
    The treatment in this case
    is the unrestricted use
  • Not Synced
    of laptops or tablets in class.
  • Not Synced
    Every causal question
    has a clear outcome,
  • Not Synced
    the variables we hope to influence
    defined in advance of the study.
  • Not Synced
    The outcomes in the West Point
    electronics study
  • Not Synced
    are final exam scores.
  • Not Synced
    The study seeks to answer
    the following question,
  • Not Synced
    what is the causal effect
    of classroom electronics on learning
  • Not Synced
    as measured by exam scores?
  • Not Synced
    - Economics journal articles
    usually begin with a table
  • Not Synced
    of descriptive statistics,
    giving key facts
  • Not Synced
    about the study sample.
  • Not Synced
    - Oh my gosh, I remember this table,
    so confusing!
  • Not Synced
    Columns 1 to 3 report mean,
    or average, characteristics.
  • Not Synced
    These give a sense
    of who we're studying.
  • Not Synced
    Let's start with column 1
    which describes covariates
  • Not Synced
    in a control group.
  • Not Synced
    Covariates are characteristics
    of the control and treatment groups
  • Not Synced
    measured before
    the experiment begins.
  • Not Synced
    For example, we see the control group
    has an average age a bit over 20.
  • Not Synced
    Many of these covariates
    are dummy variables.
  • Not Synced
    A dummy variable can only have
    two values, a zero or a one.
  • Not Synced
    For example, student gender
    is captured by a dummy variable
  • Not Synced
    that equals one for woman
    and zero for man.
  • Not Synced
    The mean of this variable
    is the proportion female.
  • Not Synced
    We also see that the control group
    is 13% Hispanic
  • Not Synced
    and 19% had prior military service.
  • Not Synced
    The table notes are key.
  • Not Synced
    Refer to these
    as you scan the table.
  • Not Synced
    These notes explain what's shown
    in each column and panel.
  • Not Synced
    The notes tell us, for example,
    that standard deviations
  • Not Synced
    are reported in brackets.
  • Not Synced
    Standard deviations tell us how
    spread out the data are.
  • Not Synced
    For example, a standard deviation
    of 0.52 tells us that most
  • Not Synced
    of the control group's GPAs
    fall between 2.35,
  • Not Synced
    which is 0.52 below
    the mean GPA of 2.87,
  • Not Synced
    and 3.39, which is 0.52 above 2.87.
  • Not Synced
    A lower standard deviation
    would mean the GPAs were
  • Not Synced
    more tightly clustered
    around the mean.
  • Not Synced
    - [Kamal] Yeah, but they're missing
    from most of the variables.
  • Not Synced
    - [Narrator] That's right.
  • Not Synced
    Masters usually omit
    standard deviations for dummies
  • Not Synced
    because the mean of this variable
    determines its standard deviaton.
  • Not Synced
    This study compares two treatment
    groups with the control group.
  • Not Synced
    The first was allowed free use
    of laptops and tablets.
  • Not Synced
    The second treatment
    was more restrictive,
  • Not Synced
    allowing only tablets placed
    flat on the desk.
  • Not Synced
    The treatment groups
    look much like the control group.
  • Not Synced
    Hmmm...
  • Not Synced
    This takes us to the next feature
    of this table, columns 4 through 6
  • Not Synced
    use statistical tests to compare
    the characteristics
  • Not Synced
    of the treatment and control group
    before the experiment.
  • Not Synced
    In column 4, the two treatment
    groups are combined.
  • Not Synced
    You can see that the difference
    in proportion female
  • Not Synced
    between the treatment
    and control group is only 0.03.
  • Not Synced
    The difference is not
    statistically significant.
  • Not Synced
    It is the sort of difference
    we can easily put down
  • Not Synced
    to chance results
    in our sample selection process.
  • Not Synced
    - [Kamal] Hmm, how do we know that?
  • Not Synced
    - [Narrator] Remember
    the rule of thumb?
  • Not Synced
    Statistical estimates
    that exceed the standard
  • Not Synced
    by a multiple of 2
    in absolute value
  • Not Synced
    are usually said
    to be statistically significant.
  • Not Synced
    The standard error is 0.03,
  • Not Synced
    same as the difference
    in proportion female.
  • Not Synced
    So the ratio of the latter
    to the former is only 1,
  • Not Synced
    which of course is less than 2.
  • Not Synced
    - [Kamal] Uh huh. So none
    of the treatment/control differences
  • Not Synced
    in the table are more than twice
    their standard errors.
  • Not Synced
    - [Narrator] Correct.
  • Not Synced
    The random division of students
    appears to have succeeded
  • Not Synced
    in creating groups
    that are indeed comparable.
  • Not Synced
    We can be confident therefore
    that any later differences
  • Not Synced
    in classroom achievement
    are the result of the experimental
  • Not Synced
    intervention rather
    than a reflection
  • Not Synced
    of preexisting differences.
  • Not Synced
    Ceteris paribus achieved!
  • Not Synced
    - [Kamal] Cool. Wait,
    what about the bottom,
  • Not Synced
    the numbers with the stars?
  • Not Synced
    Those differences are a lot more
    than double the standard error.
  • Not Synced
    - [Narrator] Good eye, Kamal!
  • Not Synced
    The table has many numbers.
  • Not Synced
    Those in Panel B are important too.
  • Not Synced
    This panel measures the extent
    to which students in treatment
  • Not Synced
    and control groups actually use
    computers in class.
  • Not Synced
    The treatment here was
    to allow computer use.
  • Not Synced
    The researchers must show
    that students allowed
  • Not Synced
    to use computers took advantage
    of the opportunity to do so.
  • Not Synced
    If they didn't, then there's
    really no treatment.
  • Not Synced
    Luckily, 81% of those
    in the first treatment group
  • Not Synced
    used computers compared
    with none in the control group.
  • Not Synced
    And many in the second
    tablet treatment group
  • Not Synced
    used computers as well.
  • Not Synced
    These differences
    in computer use are large
  • Not Synced
    and statistically significant.
  • Not Synced
    We also get to see
    the sample size in each group.
  • Not Synced
    - [Kamal] The stars
    are just like decoration?
  • Not Synced
    - [Narrator] Some academic papers
    use stars to indicate differences
  • Not Synced
    that are statistically significant.
  • Not Synced
    This makes them jump out at you.
  • Not Synced
    Here three stars indicate that
    the result is statistically different
  • Not Synced
    from zero with a p value
    less than 1%.
  • Not Synced
    In other words, there's less
    than a 1 in 100 chance
  • Not Synced
    this result is purely
    a chance finding.
  • Not Synced
    [applause]
  • Not Synced
    Two stars indicate a 1 in 20
    or 5% chance of a chance finding.
  • Not Synced
    And one star denotes results
    we might see as often as 10%
  • Not Synced
    of the time merely due to chance.
  • Not Synced
    Today, stars are seen
    as a little old fashioned.
  • Not Synced
    Some journals omit them.
  • Not Synced
    - [Kamal] What about
    those last two columns?
  • Not Synced
    - [Narrator] Unlike column 4,
    which combines
  • Not Synced
    both treatment groups into one,
    these last two columns
  • Not Synced
    look separately
    at treatment/control differences
  • Not Synced
    for each treatment group.
  • Not Synced
    This provides a more detailed
    analysis of balance.
  • Not Synced
    Also, for now,
    you can ignore this row
  • Not Synced
    which provides
    another test of significance.
  • Not Synced
    Now we get to the article's
    punchline, table 4.
  • Not Synced
    This table reports
    regression estimates
  • Not Synced
    of the effects of electronics use
    on measures of student learning.
  • Not Synced
    - [Kamal Why does the study
    report regression estimates?
  • Not Synced
    See, that's why I was getting lost.
  • Not Synced
    I thought one reason
    why we liked randomized trials
  • Not Synced
    is that we use them
    to obtain causal effects
  • Not Synced
    simply by comparing
    treatment and control groups.
  • Not Synced
    Since these groups are balanced,
    no need to use regression.
  • Not Synced
    - [Narrator] Well said, Kamal.
  • Not Synced
    In practice, it's customary
    to report regression estimates
  • Not Synced
    for two reasons.
  • Not Synced
    First, evidence of balance
    not withstanding, an abundance
  • Not Synced
    of caution might lead the analyst
    to allow for chance differences.
  • Not Synced
    Second, regression estimates
    are likely to be more precise.
  • Not Synced
    That is, they have lower
    standard errors than
  • Not Synced
    the simple treatment
    control comparisons.
  • Not Synced
    The dependent variable
    in this study
  • Not Synced
    is the outcome of interest.
  • Not Synced
    Since the question at hand
    is how classroom electronics
  • Not Synced
    affect learning, a good outcome
    is the economics final exam score.
  • Not Synced
    Each column reports results
    from a different regression model.
  • Not Synced
    Models are distinguished
    by the control variables
  • Not Synced
    or covariates they include
    besides treatment status.
  • Not Synced
    Estimates with no covariates
    are simple comparisons
  • Not Synced
    of treatment and control groups.
  • Not Synced
    - [Kamal] I thought
    they just forgot to fill it out.
  • Not Synced
    - [Narrator] Column 1 suggests
    electronics use reduced
  • Not Synced
    final exam scores
    by 0.28 standard deviations.
  • Not Synced
    In our last lesson, Master Joshway
    explained, we use standard deviation
  • Not Synced
    units because these units
    are easily compared across studies.
  • Not Synced
    Column 2 reports results
    from a model
  • Not Synced
    that adds demographic controls.
  • Not Synced
    Here we're comparing test scores
    but holding constant factors
  • Not Synced
    such as age and sex.
  • Not Synced
    Column 3 reports results
    from a model that adds GPA
  • Not Synced
    to the list of covariates.
  • Not Synced
    Column 4 adds ACT scores.
  • Not Synced
    Analysts often report
    results this way,
  • Not Synced
    starting with models that include
    few or no covariates
  • Not Synced
    and then reporting estimates
    from models that add more
  • Not Synced
    and more covariates
    as we move across columns.
  • Not Synced
    Looking across columns,
    what do you notice?
  • Not Synced
    - [Kamal] Well, the coefficient
    on using a computer is always
  • Not Synced
    a pretty big negative number.
  • Not Synced
    - [Narrator] That's right!
  • Not Synced
    We can also see that
    the standard errors are small enough
  • Not Synced
    to make these negative results
    statistically significant.
  • Not Synced
    In other words, the primary
    takeaway from this experiment
  • Not Synced
    is that electronics in the classroom
    reduce student learning.
  • Not Synced
    - [Kama] GPA and ACT scores
    are also significant.
  • Not Synced
    Why is that?
  • Not Synced
    - [Narrator] Good observation!
  • Not Synced
    That's not surprising.
  • Not Synced
    We expect these variables
    to predict college performance.
  • Not Synced
    - [Kamal] Oh right, of course.
  • Not Synced
    Kids who got better grades before
    are more likely to get
  • Not Synced
    a better grade in this course.
  • Not Synced
    - [Narrator] You'll also notice a lot
    of other information on this table.
  • Not Synced
    Remaining panels in the table
    report effects of electronics use
  • Not Synced
    on components of the final exam,
  • Not Synced
    such as the multiple
    choice questions.
  • Not Synced
    These results are mostly consistent
    with computer use effects
  • Not Synced
    on overall scores.
  • Not Synced
    - [Kamal] What about the rows
    not in English?
  • Not Synced
    - [Narrator] These rows give
    additional statistical information.
  • Not Synced
    R-squared is a measure
    of goodness of fit.
  • Not Synced
    This isn't too important, though
    some readers may want to know it.
  • Not Synced
    Other rows report on alternative
    tests of statistical significance
  • Not Synced
    that you can ignore for now.
  • Not Synced
    - [Kamal] Oh my gosh,
    these tables aren't that hard.
  • Not Synced
    Thank you so much.
  • Not Synced
    Next up is regression.
  • Not Synced
    See you then!
  • Not Synced
    ♪ [music] ♪
  • Not Synced
    You're on your way
    to mastering econometrics.
  • Not Synced
    Make sure this video sticks
  • Not Synced
    by taking a few
    quick practice questions.
  • Not Synced
    Or, if you're ready,
    click for the next video.
  • Not Synced
    You can also check out MRU's
    website for more courses,
  • Not Synced
    teacher resources and more.
Title:
How to Read Economics Research Papers: Randomized Controlled Trials (RCTs)
Description:

more » « less
Video Language:
English
Team:
Marginal Revolution University
Project:
Mastering Econometrics
Duration:
12:40

English subtitles

Revisions Compare revisions