Intro to Linear Regression

0:00 - 0:04

♪ [music] ♪
0:21 - 0:22

- [Thomas Stratmann] Hi!
0:22 - 0:24

In the upcoming series of videos,
0:24 - 0:27

we're going to give you
a shiny new tool
0:27 - 0:30

to put into your
Understanding Data toolbox:
0:30 - 0:32

linear regression.
0:33 - 0:35

Say you've got this theory.
0:35 - 0:37

You've witnessed
how good-looking people
0:37 - 0:39

seem to get special perks.
0:40 - 0:41

You're wondering,
0:41 - 0:44

"Where else might we see
this phenomenon?"
0:44 - 0:46

What about for professors?
0:46 - 0:48

Is it possible
good-looking professors
0:48 - 0:50

might get special perks too?
0:50 - 0:54

Is it possible
students treat them better
0:54 - 0:57

by showering them
with better student evaluations?
0:58 - 1:00

If so, is the effect of looks
1:00 - 1:04

on evaluations really big
or really small?
1:04 - 1:08

And say there is a new professor
starting at a university.
1:09 - 1:12

What can we predict
about his evaluation
1:12 - 1:13

simply by his looks?
1:14 - 1:17

Given that these evaluations
can determine pay raises,
1:18 - 1:22

if this theory were true,
we might see professors resort
1:22 - 1:25

to some surprising tactics
to boost their scores.
1:25 - 1:27

Suppose you wanted to find out
1:27 - 1:31

if evaluations really improve
with better looks.
1:31 - 1:34

How would you go about
testing this hypothesis?
1:35 - 1:37

You could collect data.
1:37 - 1:40

First you would have students rate
on a scale from 1 to 10
1:40 - 1:42

how good-looking a professor was,
1:42 - 1:45

which gives you
an average beauty score.
1:45 - 1:49

Then you could retrieve
the teacher's teaching evaluations
1:49 - 1:50

from twenty-five students.
1:50 - 1:53

Let's look at these two variables
at the same time
1:53 - 1:55

by using a scatterplot.
1:55 - 1:57

We'll put beauty
on the horizontal axis,
1:58 - 2:01

and teacher evaluations
on the vertical axis.
2:01 - 2:03

For example, this dot
represents Professor Peate,
2:03 - 2:06

- ["Star Wars"]
2:06 - 2:09

who received a beauty score of 3
2:09 - 2:12

and an evaluation of 8.425.
2:12 - 2:15

This one way out here
is Professor Helmchen.
2:15 - 2:17

- [Ben Stiller, "Zoolander"]
Ridiculously good-looking!
2:17 - 2:19

- [Thomas] Who got
a very high beauty score,
2:19 - 2:21

but not such a good evaluation.
2:21 - 2:22

Can you see a trend?
2:22 - 2:26

As we move from left to right
on the horizontal axis,
2:26 - 2:28

from the ugly to the gorgeous,
2:28 - 2:31

we see a trend upwards
in evaluation scores.
2:32 - 2:35

By the way, the data
we're exploring in this series
2:35 - 2:39

is not made up --
it comes from a real study
2:39 - 2:41

done at the University of Texas.
2:41 - 2:46

If you're wondering, "pulchritude"
is just the fancy academic way
2:46 - 2:48

of saying beauty.
2:48 - 2:51

With scatterplots
it can sometimes be hard
2:51 - 2:56

to make out the exact relationship
between two variables --
2:56 - 2:59

especially when the values
bounce around quite a bit
2:59 - 3:01

as we go from left to right.
3:02 - 3:05

One way to cut through
this bounciness
3:05 - 3:08

is to draw a straight line
through the data cloud
3:08 - 3:11

in such a way that this line
summarizes the data
3:11 - 3:13

as closely as possible.
3:13 - 3:17

The technical term for this
is "linear regression."
3:18 - 3:21

Later on we'll talk about
how this line is created,
3:21 - 3:24

but for now we can assume
that the line fits the data
3:24 - 3:26

as closely as possible.
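
A minimal sketch of how such a line could be computed. It assumes that "as closely as possible" means minimizing the squared vertical distances between the dots and the line (ordinary least squares), which the video has not spelled out at this point. The data values and the `fit_line` helper are made up for illustration and are not the numbers from the study.

```python
# Illustrative (made-up) data, NOT the values from the study.
beauty = [2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
evaluation = [6.5, 8.4, 7.0, 7.6, 7.4, 8.3, 8.0]


def fit_line(xs, ys):
    """Return (intercept, slope) of the ordinary least-squares line."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-deviations and sum of squared deviations in x.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


intercept, slope = fit_line(beauty, evaluation)
print(f"evaluation = {intercept:.2f} + {slope:.2f} * beauty")
```

A positive `slope` corresponds to a line that rises from left to right, and a negative one to a line that falls.
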
3:27 - 3:30

So, what can this line tell us?
3:30 - 3:33

First, we immediately see
3:33 - 3:35

if the line is sloping
upward or downward.
3:36 - 3:40

In our data set we see
the [fitted] line slopes upward.
3:41 - 3:44

It thus confirms what
we have conjectured earlier
3:44 - 3:46

by just looking at the scatterplot.
3:46 - 3:50

The upward slope means
that there is a positive association
3:50 - 3:53

between looks
and evaluation scores.
3:54 - 3:56

In other words, on average,
3:56 - 3:59

better-looking professors
are getting better evaluations.
4:00 - 4:04

For other data sets we might see
a stronger positive association.
4:04 - 4:07

Or, you might see
a negative association.
4:08 - 4:11

Or perhaps no association at all.
4:11 - 4:14

And our lines
don't have to be straight.
4:14 - 4:17

They can curve to fit the data
when necessary.
4:18 - 4:21

This line also gives us
a way to predict outcomes.
4:22 - 4:26

We can simply take a beauty score
and read off the line
4:26 - 4:28

what the predicted
evaluation score would be.
4:29 - 4:31

So, back to our new professor.
4:31 - 4:34

We can precisely predict
his evaluation score.
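
Reading a prediction off the line amounts to plugging a beauty score into the line's equation. The sketch below assumes a hypothetical intercept and slope chosen only for illustration; they are not the estimates from the study, and `predict_evaluation` is an invented helper name.

```python
# Hypothetical coefficients for illustration only; not the study's estimates.
INTERCEPT = 6.0
SLOPE = 0.25


def predict_evaluation(beauty_score, intercept=INTERCEPT, slope=SLOPE):
    """Read the predicted evaluation off the line at a given beauty score."""
    return intercept + slope * beauty_score


new_professor_beauty = 8  # hypothetical score for the new professor
predicted = predict_evaluation(new_professor_beauty)
print(f"predicted evaluation: {predicted:.2f}")
```

The number returned is the height of the line at that beauty score. An individual professor's actual evaluation can sit above or below it, just as the dots in the scatterplot do.
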
4:35 - 4:37

"But wait! Wait!" you might say.
4:37 - 4:39

"Can we trust this prediction?"
4:39 - 4:42

How well does
this one beauty variable
4:42 - 4:44

really predict evaluations?
4:45 - 4:48

Linear regression gives us
some useful measures
4:48 - 4:50

to answer those questions
4:50 - 4:52

which we'll cover
in a future video.
4:53 - 4:55

We also have to be aware
of other pitfalls
4:55 - 4:58

before we draw
any definite conclusions.
4:59 - 5:00

You could imagine a scenario
5:00 - 5:04

where what is driving
the association we see
5:04 - 5:07

is really a third variable
that we have left out.
5:07 - 5:10

For example,
the difficulty of the course
5:10 - 5:12

might be behind
the positive association
5:12 - 5:16

between beauty ratings
and evaluation scores.
5:16 - 5:19

Easy intro. courses
get good evaluations.
5:19 - 5:23

Harder, more advanced courses
get bad evaluations.
5:24 - 5:28

And younger professors might
get assigned to intro. courses.
5:28 - 5:32

Then, if students judge
younger professors more attractive,
5:32 - 5:34

you will find
a positive association
5:34 - 5:37

between beauty ratings
and evaluation scores.
5:38 - 5:40

But it's really
the difficulty of the course,
5:40 - 5:44

the variable that we've left out,
not beauty,
5:44 - 5:46

that is driving evaluation scores.
5:46 - 5:50

In that case, all the primping
would be for naught --
5:50 - 5:54

a case of mistaken correlation
for causation,
5:55 - 5:58

something we'll talk about further
in a later video.
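
The left-out-variable scenario can be sketched with simulated data. Everything here is an assumption made for illustration: the group averages, the noise, and the rule that evaluations depend only on course difficulty. Beauty has no effect on evaluations in this simulation, yet the fitted line across all courses still slopes upward.

```python
import random


def slope(xs, ys):
    """Slope of the ordinary least-squares line of ys on xs."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    return sxy / sxx


random.seed(0)  # fixed seed so the run is repeatable
rows = []
for _ in range(2000):
    intro = random.random() < 0.5
    # Assumption: intro-course professors are younger and rated more attractive.
    beauty = (6.0 if intro else 4.0) + random.gauss(0, 1)
    # Assumption: evaluations depend only on course difficulty, never on beauty.
    evaluation = (8.0 if intro else 6.0) + random.gauss(0, 1)
    rows.append((intro, beauty, evaluation))

overall = slope([b for _, b, _ in rows], [e for _, _, e in rows])
intro_only = slope([b for i, b, _ in rows if i],
                   [e for i, _, e in rows if i])
advanced_only = slope([b for i, b, _ in rows if not i],
                      [e for i, _, e in rows if not i])

print(f"slope, all courses:      {overall:.2f}")
print(f"slope, intro courses:    {intro_only:.2f}")
print(f"slope, advanced courses: {advanced_only:.2f}")
```

Across all courses the slope is clearly positive. Within intro courses alone, or advanced courses alone, it is close to zero, because course difficulty rather than beauty is doing the work.
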
5:59 - 6:02

And what if there were
other important variables
6:02 - 6:06

that affect both beauty ratings
and evaluation scores?
6:07 - 6:10

You might want to add
considerations like skill,
6:10 - 6:15

race, sex, and whether English
is the teacher's native language
6:15 - 6:19

to isolate more cleanly the effect
of beauty on evaluations.
6:19 - 6:22

When we get
into multiple regression
6:22 - 6:24

we will be able to measure
the impact of beauty
6:24 - 6:26

on teacher evaluations
6:26 - 6:28

while accounting
for other variables
6:28 - 6:31

that might confound
this association.
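
A rough sketch of what "accounting for other variables" can look like with one extra variable. It assumes ordinary least squares with two predictors, solved from the centered normal equations. The data and the `fit_two_predictors` helper are made up for illustration and are not from the study.

```python
def fit_two_predictors(x1, x2, y):
    """OLS for y = b0 + b1*x1 + b2*x2, via the centered normal equations."""
    n = len(y)
    m1, m2, my = sum(x1) / n, sum(x2) / n, sum(y) / n
    d1 = [v - m1 for v in x1]
    d2 = [v - m2 for v in x2]
    dy = [v - my for v in y]
    s11 = sum(a * a for a in d1)
    s22 = sum(b * b for b in d2)
    s12 = sum(a * b for a, b in zip(d1, d2))
    s1y = sum(a * c for a, c in zip(d1, dy))
    s2y = sum(b * c for b, c in zip(d2, dy))
    det = s11 * s22 - s12 ** 2  # zero if the predictors are perfectly collinear
    b1 = (s22 * s1y - s12 * s2y) / det
    b2 = (s11 * s2y - s12 * s1y) / det
    b0 = my - b1 * m1 - b2 * m2
    return b0, b1, b2


# Made-up data: beauty score, intro-course flag (1 = intro), evaluation.
beauty = [3, 4, 5, 6, 5, 6, 7, 8]
intro = [0, 0, 0, 0, 1, 1, 1, 1]
evaluation = [6.1, 5.9, 6.0, 6.2, 8.0, 7.9, 8.1, 8.2]

b0, b_beauty, b_intro = fit_two_predictors(beauty, intro, evaluation)
print(f"beauty coefficient, holding course type fixed: {b_beauty:.2f}")
print(f"intro-course coefficient, holding beauty fixed: {b_intro:.2f}")
```

In this invented data set, most of the difference in evaluations is attributed to course type once both variables are in the model, and the beauty coefficient is small.
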
6:32 - 6:36

Next up, we'll get our hands dirty
by playing with this data
6:36 - 6:39

to gain a better understanding
of what this line can tell us.
6:41 - 6:42

- [Narrator] Congratulations!
6:42 - 6:45

You're one step closer
to being a data ninja!
6:46 - 6:47

However, to master this
6:47 - 6:49

you'll need
to strengthen your skills
6:49 - 6:50

with some practice questions.
6:51 - 6:54

Ready for your next mission?
Click "Next Video."
6:54 - 6:55

Still here?
6:56 - 6:58

Move from understanding data
to understanding your world
6:58 - 7:02

by checking out MRU's
other popular economics videos.
7:02 - 7:04

♪ [music] ♪

Title:: Intro to Linear Regression
Video Language:: English
Team:: Marginal Revolution University
Project:: Understanding Data
Duration:: 07:05

	Marilia_PM approved English subtitles for Intro to Linear Regression
	Kirstin Cosper accepted English subtitles for Intro to Linear Regression
	Kirstin Cosper edited English subtitles for Intro to Linear Regression
	Retired user edited English subtitles for Intro to Linear Regression