-
- [Narrator] On his quest
to master econometrics,
-
Grasshopper Kamal has
made great progress,
-
stretching his capabilities
and outsmarting his foes.
-
Alas, today he's despondent,
-
for one challenge remains unmet.
-
Kamal cannot yet decode
the scriptures of academic research,
-
journals like
"The American Economic Review"
-
and "Econometrica."
-
These seemed to him to be inscribed
in an obscure foreign tongue.
-
- [Kamal] Ugh, what the... ?
-
- These volumes are
opaque to the novice, Kamal,
-
but can be deciphered with study.
-
Let us learn to read them together.
-
Let's dive into the West Point study,
-
published in the "Economics
of Education Review."
-
This paper reports
on a randomized evaluation
-
of student electronics use
in Economics 101 classrooms.
-
First, a quick review
of the research design.
-
- Okay.
-
- [Josh] 'Metrics masters
teaching at West Point,
-
the military college that trains
American Army officers
-
designed a randomized trial
to answer this question.
-
These masters randomly assigned
West Point cadets
-
into Economics classes
operating under different rules.
-
Unlike most American colleges,
-
the West Point default
is no electronics.
-
For purposes of this experiment,
some students were left
-
in such traditional
technology-free classes,
-
no laptops, no tablets
and no phones!
-
[voice echoes]
-
This is the control group,
or baseline case.
-
Another group was allowed
to use electronics.
-
This is the treatment group,
subject to a changed environment.
-
The treatment in this case
is the unrestricted use
-
of laptops or tablets in class.
-
Every causal question
has a clear outcome,
-
the variables we hope to influence
defined in advance of the study.
-
The outcomes in the West Point
electronics study
-
are final exam scores.
-
The study seeks to answer
the following question:
-
what is the causal effect
of classroom electronics on learning
-
as measured by exam scores?
-
- Economics journal articles
usually begin with a table
-
of descriptive statistics,
giving key facts
-
about the study sample.
-
- Oh my gosh, I remember this table,
so confusing!
-
- [Narrator] Columns 1 to 3 report
mean, or average, characteristics.
-
These give a sense
of who we're studying.
-
Let's start with column 1
which describes covariates
-
in the control group.
-
Covariates are characteristics
of the control and treatment groups
-
measured before
the experiment begins.
-
For example, we see the control group
has an average age a bit over 20.
-
Many of these covariates
are dummy variables.
-
A dummy variable can only have
two values, a zero or a one.
-
For example, student gender
is captured by a dummy variable
-
that equals one for women
and zero for men.
-
The mean of this variable
is the proportion female.
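The dummy-variable arithmetic can be sketched in a few lines of Python (the data here are made up for illustration, not taken from the study):

```python
# Made-up sample: 1 = woman, 0 = man.
female = [1, 0, 0, 1, 0, 1, 0, 0, 0, 0]

# The mean of a 0/1 dummy is just the share of ones.
proportion_female = sum(female) / len(female)
print(proportion_female)  # 0.3, i.e. 30% women
```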
-
We also see that the control group
is 13% Hispanic
-
and 19% had prior military service.
-
The table notes are key.
-
Refer to these
as you scan the table.
-
These notes explain what's shown
in each column and panel.
-
The notes tell us, for example,
that standard deviations
-
are reported in brackets.
-
Standard deviations tell us how
spread out the data are.
-
For example, a standard deviation
of 0.52 tells us that most
-
of the control group's GPAs
fall between 2.35,
-
which is 0.52 below
the mean GPA of 2.87,
-
and 3.39, which is 0.52 above 2.87.
-
A lower standard deviation
would mean the GPAs were
-
more tightly clustered
around the mean.
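The one-standard-deviation band the narrator describes can be checked directly, using the mean GPA of 2.87 and standard deviation of 0.52 quoted from the table:

```python
# Mean and standard deviation of control-group GPA, as quoted above.
mean_gpa, sd_gpa = 2.87, 0.52

low = round(mean_gpa - sd_gpa, 2)   # one SD below the mean
high = round(mean_gpa + sd_gpa, 2)  # one SD above the mean
print(low, high)  # 2.35 3.39
```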
-
- [Kamal] Yeah, but they're missing
for most of the variables.
-
- [Narrator] That's right.
-
Masters usually omit
standard deviations for dummies
-
because the mean of this variable
determines its standard deviation.
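This point can be verified in one line: for a 0/1 dummy with mean p, the standard deviation is the square root of p(1 - p), so reporting it would add no information. A minimal sketch (the proportion here is illustrative):

```python
import math

p = 0.17  # illustrative proportion of ones, e.g. share female

# For a 0/1 dummy, variance = p * (1 - p), so the mean pins down the SD.
sd = math.sqrt(p * (1 - p))
print(round(sd, 3))
```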
-
This study compares two treatment
groups with the control group.
-
The first was allowed free use
of laptops and tablets.
-
The second treatment
was more restrictive,
-
allowing only tablets placed
flat on the desk.
-
The treatment groups
look much like the control group.
-
This takes us to the next feature
of this table: columns 4 through 6
-
use statistical tests to compare
the characteristics
-
of the treatment and control group
before the experiment.
-
In column 4, the two treatment
groups are combined.
-
You can see that the difference
in proportion female
-
between the treatment
and control group is only 0.03.
-
The difference is not
statistically significant.
-
It is the sort of difference
we can easily put down
-
to chance results
in our sample selection process.
-
- [Kamal] Hmm, how do we know that?
-
- [Narrator] Remember
the rule of thumb?
-
Statistical estimates
that exceed the standard error
-
by a multiple of 2
in absolute value
-
are usually said
to be statistically significant.
-
The standard error is 0.03,
-
same as the difference
in proportion female.
-
So the ratio of the latter
to the former is only 1,
-
which of course is less than 2.
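The rule-of-thumb arithmetic can be sketched as follows, using the numbers just quoted from column 4:

```python
difference = 0.03      # treatment-control difference in proportion female
standard_error = 0.03  # its standard error, from the table

# Rule of thumb: |estimate / SE| > 2 suggests statistical significance.
t_ratio = difference / standard_error
print(abs(t_ratio) > 2)  # False: the difference is not significant
```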
-
- [Kamal] Uh huh. So none
of the treatment/control differences
-
in the table are more than twice
their standard errors.
-
- [Narrator] Correct.
-
The random division of students
appears to have succeeded
-
in creating groups
that are indeed comparable.
-
We can be confident therefore
that any later differences
-
in classroom achievement
are the result of the experimental
-
intervention rather
than a reflection
-
of preexisting differences.
-
Ceteris paribus achieved!
-
- [Kamal] Cool. Wait,
what about the bottom,
-
the numbers with the stars?
-
Those differences are a lot more
than double the standard error.
-
- [Narrator] Good eye, Kamal!
-
The table has many numbers.
-
Those in Panel B are important too.
-
This panel measures the extent
to which students in treatment
-
and control groups actually use
computers in class.
-
The treatment here was
to allow computer use.
-
The researchers must show
that students allowed
-
to use computers took advantage
of the opportunity to do so.
-
If they didn't, then there's
really no treatment.
-
Luckily, 81% of those
in the first treatment group
-
used computers compared
with none in the control group.
-
And many in the second
tablet treatment group
-
used computers as well.
-
These differences
in computer use are large
-
and statistically significant.
-
We also get to see
the sample size in each group.
-
- [Kamal] The stars
are just like decoration?
-
- [Narrator] Some academic papers
use stars to indicate differences
-
that are statistically significant.
-
This makes them jump out at you.
-
Here three stars indicate that
the result is statistically different
-
from zero with a p-value
less than 1%.
-
In other words, there's less
than a 1 in 100 chance
-
this result is purely
a chance finding.
-
[applause]
-
Two stars indicate a 1 in 20
or 5% chance of a chance finding.
-
And one star denotes results
we might see as often as 10%
-
of the time merely due to chance.
-
Today, stars are seen
as a little old fashioned.
-
Some journals omit them.
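The star convention just described can be sketched as a small helper function (a sketch of the common convention, not the journal's own code):

```python
def stars(p):
    """Map a p-value to significance stars, per the common convention."""
    if p < 0.01:
        return "***"  # significant at the 1% level
    if p < 0.05:
        return "**"   # significant at the 5% level
    if p < 0.10:
        return "*"    # significant at the 10% level
    return ""         # not statistically significant

print(stars(0.004))  # ***
```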
-
- [Kamal] What about
those last two columns?
-
- [Narrator] Unlike column 4,
which combines
-
both treatment groups into one,
these last two columns
-
look separately
at treatment/control differences
-
for each treatment group.
-
This provides a more detailed
analysis of balance.
-
Also, for now,
you can ignore this row
-
which provides
another test of significance.
-
Now we get to the article's
punchline, table 4.
-
This table reports
regression estimates
-
of the effects of electronics use
on measures of student learning.
-
- [Kamal] Why does the study
report regression estimates?
-
See, that's why I was getting lost.
-
I thought one reason
why we liked randomized trials
-
is that we use them
to obtain causal effects
-
simply by comparing
treatment and control groups.
-
Since these groups are balanced,
no need to use regression.
-
- [Narrator] Well said, Kamal.
-
In practice, it's customary
to report regression estimates
-
for two reasons.
-
First, evidence of balance
notwithstanding, an abundance
-
of caution might lead the analyst
to allow for chance differences.
-
Second, regression estimates
are likely to be more precise.
-
That is, they have lower
standard errors than
-
the simple treatment
control comparisons.
-
The dependent variable
in this study
-
is the outcome of interest.
-
Since the question at hand
is how classroom electronics
-
affect learning, a good outcome
is the economics final exam score.
-
Each column reports results
from a different regression model.
-
Models are distinguished
by the control variables
-
or covariates they include
besides treatment status.
-
Estimates with no covariates
are simple comparisons
-
of treatment and control groups.
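This equivalence can be sketched with made-up scores: with no covariates, the regression coefficient on the treatment dummy is exactly the treatment-control difference in group means (the numbers below are illustrative, not the study's data):

```python
from statistics import mean

# Made-up final exam scores in standard-deviation units.
control = [0.1, -0.2, 0.3, 0.0]     # no-electronics classes
treated = [-0.3, -0.1, -0.4, -0.2]  # laptop/tablet classes

# With no covariates, regressing scores on a treatment dummy
# recovers this simple difference in means.
effect = mean(treated) - mean(control)
print(round(effect, 2))  # -0.3
```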
-
- [Kamal] I thought
they just forgot to fill it out.
-
- [Narrator] Column 1 suggests
electronics use reduced
-
final exam scores
by 0.28 standard deviations.
-
In our last lesson, Master Joshway
explained that we use standard deviation
-
units because these units
are easily compared across studies.
-
Column 2 reports results
from a model
-
that adds demographic controls.
-
Here we're comparing test scores
but holding constant factors
-
such as age and sex.
-
Column 3 reports results
from a model that adds GPA
-
to the list of covariates.
-
Column 4 adds ACT scores.
-
Analysts often report
results this way,
-
starting with models that include
few or no covariates
-
and then reporting estimates
from models that add more
-
and more covariates
as we move across columns.
-
Looking across columns,
what do you notice?
-
- [Kamal] Well, the coefficient
on using a computer is always
-
a pretty big negative number.
-
- [Narrator] That's right!
-
We can also see that
the standard errors are small enough
-
to make these negative results
statistically significant.
-
In other words, the primary
takeaway from this experiment
-
is that electronics in the classroom
reduce student learning.
-
- [Kamal] GPA and ACT scores
are also significant.
-
Why is that?
-
- [Narrator] Good observation!
-
That's not surprising.
-
We expect these variables
to predict college performance.
-
- [Kamal] Oh right, of course.
-
Kids who got better grades before
are more likely to get
-
a better grade in this course.
-
- [Narrator] You'll also notice a lot
of other information on this table.
-
Remaining panels in the table
report effects of electronics use
-
on components of the final exam,
-
such as the multiple
choice questions.
-
These results are mostly consistent
with computer use effects
-
on overall scores.
-
- [Kamal] What about the rows
not in English?
-
- [Narrator] These rows give
additional statistical information.
-
R-squared is a measure
of goodness of fit.
-
This isn't too important, though
some readers may want to know it.
-
Other rows report on alternative
tests of statistical significance
-
that you can ignore for now.
-
- [Kamal] Oh my gosh,
these tables aren't that hard.
-
Thank you so much.
-
Next up is regression.
-
See you then!
-
♪ [music] ♪
-
You're on your way
to mastering econometrics.
-
Make sure this video sticks
-
by taking a few
quick practice questions.
-
Or, if you're ready,
click for the next video.
-
You can also check out MRU's
website for more courses,
-
teacher resources and more.