< Return to Video

Intro to Linear Regression

  • 0:00 - 0:04
    ♪ [music] ♪
  • 0:21 - 0:22
    - [Thomas Stratmann] Hi!
  • 0:22 - 0:24
    In the upcoming series of videos,
  • 0:24 - 0:27
    we're going to give you
    a shiny new tool
  • 0:27 - 0:30
    to put into your
    Understanding Data toolbox:
  • 0:30 - 0:32
    linear regression.
  • 0:33 - 0:35
    Say you've got this theory.
  • 0:35 - 0:37
    You've witnessed
    how good-looking people
  • 0:37 - 0:39
    seem to get special perks.
  • 0:40 - 0:41
    You're wondering,
  • 0:41 - 0:44
    "Where else might we see
    this phenomenon?"
  • 0:44 - 0:46
    What about for professors?
  • 0:46 - 0:48
    Is it possible
    good-looking professors
  • 0:48 - 0:50
    might get special perks too?
  • 0:50 - 0:54
    Is it possible
    students treat them better
  • 0:54 - 0:57
    by showering them
    with better student evaluations?
  • 0:58 - 1:00
    If so, is the effect of looks
  • 1:00 - 1:04
    on evaluations really big
    or really small?
  • 1:04 - 1:08
    And say there is a new professor
    starting at a university.
  • 1:09 - 1:12
    What can we predict
    about his evaluation
  • 1:12 - 1:13
    simply by his looks?
  • 1:14 - 1:17
    Given that these evaluations
    can determine pay raises,
  • 1:18 - 1:22
    if this theory were true,
    we might see professors resort
  • 1:22 - 1:25
    to some surprising tactics
    to boost their scores.
  • 1:25 - 1:27
    Suppose you wanted to find out
  • 1:27 - 1:31
    if evaluations really improve
    with better looks.
  • 1:31 - 1:34
    How would you go about
    testing this hypothesis?
  • 1:35 - 1:37
    You could collect data.
  • 1:37 - 1:40
    First you would have students rate
    on a scale from 1 to 10
  • 1:40 - 1:42
    how good-looking a professor was,
  • 1:42 - 1:45
    which gives you
    an average beauty score.
  • 1:45 - 1:49
    Then you could retrieve
    the teacher's teaching evaluations
  • 1:49 - 1:50
    from twenty-five students.
  • 1:50 - 1:53
    Let's look at these two variables
    at the same time
  • 1:53 - 1:55
    by using a scatterplot.
  • 1:55 - 1:57
    We'll put beauty
    on the horizontal axis,
  • 1:58 - 2:01
    and teacher evaluations
    on the vertical axis.
  • 2:01 - 2:03
    For example, this dot
    represents Professor Peate,
  • 2:03 - 2:06
    - ["Star Wars"
  • 2:06 - 2:09
    who received a beauty score of 3
  • 2:09 - 2:12
    and an evaluation of 8.425.
  • 2:12 - 2:15
    This one way out here
    is Professor Helmchen.
  • 2:15 - 2:17
    - [Ben Stiller, "Zoolander"]
    Ridiculously good-looking!
  • 2:17 - 2:19
    - [Thomas] Who got
    a very high beauty score,
  • 2:19 - 2:21
    but not such a good evaluation.
  • 2:21 - 2:22
    Can you see a trend?
  • 2:22 - 2:26
    As we move from left to right
    on the horizontal axis,
  • 2:26 - 2:28
    from the ugly to the gorgeous,
  • 2:28 - 2:31
    we see a trend upwards
    in evaluation scores.
  • 2:32 - 2:35
    By the way, the data
    we're exploring in this series
  • 2:35 - 2:39
    is not made up --
    it comes from a real study
  • 2:39 - 2:41
    done at the University of Texas.
  • 2:41 - 2:46
    If you're wondering, "pulchritude"
    is just the fancy academic way
  • 2:46 - 2:48
    of saying beauty.
  • 2:48 - 2:51
    With scatterplots
    it can sometimes be hard
  • 2:51 - 2:56
    to make out the exact relationship
    between two variables --
  • 2:56 - 2:59
    especially when the values
    bounce around quite a bit
  • 2:59 - 3:01
    as we go from left to right.
  • 3:02 - 3:05
    One way to cut through
    this bounciness
  • 3:05 - 3:08
    is to draw a straight line
    through the data cloud
  • 3:08 - 3:11
    in such a way that this line
    summarizes the data
  • 3:11 - 3:13
    as closely as possible.
  • 3:13 - 3:17
    The technical term for this
    is "linear regression."
  • 3:18 - 3:21
    Later on we'll talk about
    how this line is created,
  • 3:21 - 3:24
    but for now we can assume
    that the line fits the data
  • 3:24 - 3:26
    as closely as possible.
  • 3:27 - 3:30
    So, what can this line tell us?
  • 3:30 - 3:33
    First, we immediately see
  • 3:33 - 3:35
    if the line is sloping
    upward or downward.
  • 3:36 - 3:40
    In our data set we see
    the [fitted] line slopes upward.
  • 3:41 - 3:44
    It thus confirms what
    we have conjectured earlier
  • 3:44 - 3:46
    by just looking at the scatterplot.
  • 3:46 - 3:50
    The upward slope means
    that there is a positive association
  • 3:50 - 3:53
    between looks
    and evaluation scores.
  • 3:54 - 3:56
    In other words, on average,
  • 3:56 - 3:59
    better-looking professors
    are getting better evaluations.
  • 4:00 - 4:04
    For other data sets we might see
    a stronger positive association.
  • 4:04 - 4:07
    Or, you might see
    a negative association.
  • 4:08 - 4:11
    Or perhaps no association at all.
  • 4:11 - 4:14
    And our lines
    don't have to be straight.
  • 4:14 - 4:17
    They can curve to fit the data
    when necessary.
  • 4:18 - 4:21
    This line also gives us
    a way to predict outcomes.
  • 4:22 - 4:26
    We can simply take a beauty score
    and read off the line
  • 4:26 - 4:28
    what the predicted
    evaluation score would be.
  • 4:29 - 4:31
    So, back to our new professor.
  • 4:31 - 4:34
    We can precisely predict
    his evaluation score.
  • 4:35 - 4:37
    "But wait! Wait!" you might say.
  • 4:37 - 4:39
    "Can we trust this prediction?"
  • 4:39 - 4:42
    How well does
    this one beauty variable
  • 4:42 - 4:44
    really predict evaluations?
  • 4:45 - 4:48
    Linear regression gives us
    some useful measures
  • 4:48 - 4:50
    to answer those questions
  • 4:50 - 4:52
    which we'll cover
    in a future video.
  • 4:53 - 4:55
    We also have to be aware
    of other pitfalls
  • 4:55 - 4:58
    before we draw
    any definite conclusions.
  • 4:59 - 5:00
    You could imagine a scenario
  • 5:00 - 5:04
    where what is driving
    the association we see
  • 5:04 - 5:07
    is really a third variable
    that we have left out.
  • 5:07 - 5:10
    For example,
    the difficulty of the course
  • 5:10 - 5:12
    might be behind
    the positive association
  • 5:12 - 5:16
    between beauty ratings
    and evaluation scores.
  • 5:16 - 5:19
    Easy intro. courses
    get good evaluations.
  • 5:19 - 5:23
    Harder, more advanced courses
    get bad evaluations.
  • 5:24 - 5:28
    And younger professors might
    get assigned to intro. courses.
  • 5:28 - 5:32
    Then, if students judge
    younger professors more attractive,
  • 5:32 - 5:34
    you will find
    a positive association
  • 5:34 - 5:37
    between beauty ratings
    and evaluation scores.
  • 5:38 - 5:40
    But it's really
    the difficulty of the course,
  • 5:40 - 5:44
    the variable that we've left out,
    not beauty,
  • 5:44 - 5:46
    that is driving evaluation scores.
  • 5:46 - 5:50
    In that case, all the primping
    would be for naught --
  • 5:50 - 5:54
    a case of mistaken correlation
    for causation,
  • 5:55 - 5:58
    something we'll talk about further
    in a later video.
  • 5:59 - 6:02
    And what if there were
    other important variables
  • 6:02 - 6:06
    that affect both beauty ratings
    and evaluation scores?
  • 6:07 - 6:10
    You might want to add
    considerations like skill,
  • 6:10 - 6:15
    race, sex, and whether English
    is the teacher's native language
  • 6:15 - 6:19
    to isolate more cleanly the effect
    of beauty on evaluations.
  • 6:19 - 6:22
    When we get
    into multiple regression
  • 6:22 - 6:24
    we will be able to measure
    the impact of beauty
  • 6:24 - 6:26
    on teacher evaluations
  • 6:26 - 6:28
    while accounting
    for other variables
  • 6:28 - 6:31
    that might confound
    this association.
  • 6:32 - 6:36
    Next up, we'll get our hands dirty
    by playing with this data
  • 6:36 - 6:39
    to gain a better understanding
    of what this line can tell us.
  • 6:41 - 6:42
    - [Narrator] Congratulations!
  • 6:42 - 6:45
    You're one step closer
    to being a data ninja!
  • 6:46 - 6:47
    However, to master this
  • 6:47 - 6:49
    you'll need
    to strengthen your skills
  • 6:49 - 6:50
    with some practice questions.
  • 6:51 - 6:54
    Ready for your next mission?
    Click "Next Video."
  • 6:54 - 6:55
    Still here?
  • 6:56 - 6:58
    Move from understanding data
    to understanding your world
  • 6:58 - 7:02
    by checking out MRU's
    other popular economics videos.
  • 7:02 - 7:04
    ♪ [music] ♪
Title:
Intro to Linear Regression
Description:

more » « less
Video Language:
English
Team:
Marginal Revolution University
Project:
Understanding Data
Duration:
07:05
Marilia_PM approved English subtitles for Intro to Linear Regression
Kirstin Cosper accepted English subtitles for Intro to Linear Regression
Kirstin Cosper edited English subtitles for Intro to Linear Regression
Kirstin Cosper edited English subtitles for Intro to Linear Regression
Kirstin Cosper edited English subtitles for Intro to Linear Regression
Retired user edited English subtitles for Intro to Linear Regression
Retired user edited English subtitles for Intro to Linear Regression

English subtitles

Revisions Compare revisions