< Return to Video

Margin of Error 2

  • 0:00 - 0:01
  • 0:01 - 0:03
    Where we left off in the
    last video I kind
  • 0:03 - 0:03
    of gave you a question.
  • 0:03 - 0:06
    Find an interval so that we're
    reasonably confident-- we'll
  • 0:06 - 0:08
    talk a little bit more about why
    I have to give this kind
  • 0:08 - 0:13
    of vague wording right here--
    reasonably confident that
  • 0:13 - 0:19
    there's a 95% chance that the
    true population mean, which is
  • 0:19 - 0:22
    p, which is the same thing as
    the mean of the sampling
  • 0:22 - 0:24
    distribution of the
    sampling mean.
  • 0:24 - 0:27
    So there's a 95% chance that
    the true mean-- and
  • 0:27 - 0:27
    let me put this here.
  • 0:27 - 0:30
    This is also the same thing as
    the mean of the sampling
  • 0:30 - 0:33
    distribution of the sampling
    mean is in that interval.
  • 0:33 - 0:37
    And to do that let me just
    throw out a few ideas.
  • 0:37 - 0:46
    What is the probability that if
    I take a sample and I were
  • 0:46 - 0:49
    to take a mean of that sample,
    so the probability that a
  • 0:49 - 0:59
    random sample mean is within two
    standard deviations of the
  • 0:59 - 1:04
    sampling mean, of
    our sample mean?
  • 1:04 - 1:08
    So what is this probability
    right over here?
  • 1:08 - 1:10
    Let's just look at our
    actual distribution.
  • 1:10 - 1:13
    So this is our distribution,
    this right here is our
  • 1:13 - 1:14
    sampling mean.
  • 1:14 - 1:16
    Maybe I should do it in
    blue because that's
  • 1:16 - 1:19
    the color up here.
  • 1:19 - 1:20
    This is our sampling mean.
  • 1:20 - 1:23
    And so what is the probability
    that a random sampling mean is
  • 1:23 - 1:25
    going to be two standard
    deviations?
  • 1:25 - 1:28
    Well a random sampling is a
    sample from this distribution.
  • 1:28 - 1:31
    It is a sample from the sampling
    distribution of the
  • 1:31 - 1:32
    sample mean.
  • 1:32 - 1:35
    So it's literally what is the
    probability of finding a
  • 1:35 - 1:37
    sample within two standard
    deviations of the mean?
  • 1:37 - 1:40
    That's one standard deviation,
    that's another standard
  • 1:40 - 1:43
    deviation right over there.
  • 1:43 - 1:45
    In general, if you haven't
    committed this to memory
  • 1:45 - 1:48
    already, it's not a bad thing
    to commit to memory, is that
  • 1:48 - 1:51
    if you have a normal
    distribution the probability
  • 1:51 - 1:55
    of taking a sample within two
    standard deviations is 95--
  • 1:55 - 1:57
    and if you want to get
    a little bit more
  • 1:57 - 2:00
    accurate it's 95.4%.
  • 2:00 - 2:04
    But you could say it's roughly--
    or maybe I could
  • 2:04 - 2:06
    write it like this--
    it's roughly 95%.
  • 2:06 - 2:08
    And really that's all that
    matters because we have this
  • 2:08 - 2:11
    little funny language here
    called reasonably confident,
  • 2:11 - 2:14
    and we have to estimate the
    standard deviation anyway.
  • 2:14 - 2:16
    In fact, we could say if we
    want, I could say that it's
  • 2:16 - 2:21
    going to be exactly
    equal to 95.4%.
  • 2:21 - 2:24
    But in general, two standard
    deviations, 95%, that's what
  • 2:24 - 2:26
    people equate with each other.
  • 2:26 - 2:29
    Now this statement is the
    exact same thing as the
  • 2:29 - 2:36
    probability that the sample
    mean, that the sampling mean--
  • 2:36 - 2:38
    not the sample mean, the
    probability of the mean of the
  • 2:38 - 2:47
    sampling distribution is within
    two standard deviations
  • 2:47 - 2:51
    of the sampling distribution of
    x is also going to be the
  • 2:51 - 2:55
    same number, is also going
    to be equal to 95.4%.
  • 2:55 - 2:56
    These are the exact
    same statements.
  • 2:56 - 3:00
    If x is within two standard
    deviations of this, then this,
  • 3:00 - 3:03
    then the mean, is within two
    standard deviations of x.
  • 3:03 - 3:05
    These are just two ways of
    phrasing the same thing.
  • 3:05 - 3:09
    Now we know that the mean of the
    sampling distribution, the
  • 3:09 - 3:12
    same thing as a mean of the
    population distribution, which
  • 3:12 - 3:15
    is the same thing as the
    parameter p-- the proportion
  • 3:15 - 3:20
    of people or the proportion of
    the population that is a 1.
  • 3:20 - 3:23
    So this right here is the same
    thing as the population mean.
  • 3:23 - 3:27
    So this statement right here
    we can switch this with p.
  • 3:27 - 3:32
    So the probability that p is
    within two standard deviations
  • 3:32 - 3:37
    of the sampling distribution
    of x is 95.4%.
  • 3:37 - 3:42
    Now we don't know what this
    number right here is.
  • 3:42 - 3:44
    But we have estimated it.
  • 3:44 - 3:49
    Remember, our best estimate of
    this is the true standard, or
  • 3:49 - 3:51
    it is the true standard
    deviation of the population
  • 3:51 - 3:52
    divided by 10.
  • 3:52 - 3:54
    We can estimate the true
    standard deviation of the
  • 3:54 - 3:57
    population with our sampling
    standard deviation, which was
  • 3:57 - 4:00
    0.5, 0.5 divided by 10.
  • 4:00 - 4:04
    Our best estimate of the
    standard deviation of the
  • 4:04 - 4:08
    sampling distribution of the
    sample mean is 0.05.
  • 4:08 - 4:11
    So now we can say-- and I'll
    switch colors-- the
  • 4:11 - 4:15
    probability that the parameter
    p, the proportion of the
  • 4:15 - 4:22
    population saying 1, is within
    two times-- remember, our best
  • 4:22 - 4:29
    estimate of this right here is
    0.05 of a sample mean that we
  • 4:29 - 4:34
    take is equal to 95.4%.
  • 4:34 - 4:41
    And so we could say the
    probability that p is within 2
  • 4:41 - 4:47
    times 0.05 is going to be equal
    to-- 2.0 is going to be
  • 4:47 - 4:53
    0.10 of our mean is equal to
    95-- and actually let me be a
  • 4:53 - 4:54
    little careful here.
  • 4:54 - 4:58
    I can't say the equal now,
    because over here if we knew
  • 4:58 - 5:01
    this, if we knew this parameter
    of the sampling
  • 5:01 - 5:03
    distribution of the sample
    mean, we could
  • 5:03 - 5:05
    say that it is 95.4%.
  • 5:05 - 5:06
    We don't know it.
  • 5:06 - 5:09
    We are just trying to find our
    best estimator for it.
  • 5:09 - 5:11
    So actually what I'm going to
    do here is actually just say
  • 5:11 - 5:14
    is roughly-- and just to show
    that we don't even have that
  • 5:14 - 5:18
    level of accuracy, I'm going
    to say roughly 95%.
  • 5:18 - 5:21
    We're reasonably confident that
    it's about 95% because
  • 5:21 - 5:24
    we're using this estimator that
    came out of our sample,
  • 5:24 - 5:26
    and if the sample is really
    skewed this is going to be a
  • 5:26 - 5:27
    really weird number.
  • 5:27 - 5:30
    So this is why we just have to
    be a little bit more exact
  • 5:30 - 5:31
    about what we're doing.
  • 5:31 - 5:32
    But this is the tool
    for at least saying
  • 5:32 - 5:34
    how good is our result.
  • 5:34 - 5:38
    So this is going to
    be about 95%.
  • 5:38 - 5:46
    Or we could say that the
    probability that p is within
  • 5:46 - 5:50
    0.10 of our sample mean
    that we actually got.
  • 5:50 - 5:52
    So what was the sample mean
    that we actually got?
  • 5:52 - 5:53
    It was 0.43.
  • 5:53 - 6:00
    So if we're within 0.1 of 0.43,
    that means we are within
  • 6:00 - 6:08
    0.43 plus or minus 0.1 is
    also, roughly, we're
  • 6:08 - 6:12
    reasonably confident
    it's about 95%.
  • 6:12 - 6:13
    And I want to be very clear.
  • 6:13 - 6:15
    Everything that I started all
    the way from up here in brown
  • 6:15 - 6:18
    to yellow and all this magenta,
    I'm just restating
  • 6:18 - 6:19
    the same thing inside of this.
  • 6:19 - 6:22
    It became a little bit more
    loosey-goosey once I went from
  • 6:22 - 6:26
    the exact standard deviation of
    the sampling distribution
  • 6:26 - 6:27
    to an estimator for it.
  • 6:27 - 6:30
    And that's why this is just
    becoming-- I kind of put the
  • 6:30 - 6:33
    squiggly equal signs there
    to say we're reasonably
  • 6:33 - 6:35
    confident-- and I even got rid
    of some of the precision.
  • 6:35 - 6:37
    But we just found
    our interval.
  • 6:37 - 6:39
    An interval that we can be
    reasonably confident that
  • 6:39 - 6:42
    there's a 95% probability that
    p is within that, is going to
  • 6:42 - 6:45
    be 0.43 plus or minus 0.1.
  • 6:45 - 6:48
    Or an interval of-- we have
    a confidence interval.
  • 6:48 - 7:00
    We have a 95% confidence
    interval of, and we could say,
  • 7:00 - 7:04
    0.43 minus 0.1 is 0.33.
  • 7:04 - 7:09
    If we write that as a percent
    we could say 33% to-- and if
  • 7:09 - 7:17
    we add the 0.1, 0.43 plus
    0.1 we get 53%-- to 53%.
  • 7:17 - 7:21
    So we are 95% confident.
  • 7:21 - 7:24
    So we're not saying kind of
    precisely that the probability
  • 7:24 - 7:29
    of the actual proportion is 95%,
    but we're 95% confident
  • 7:29 - 7:36
    that the true proportion
    is between 33% and 55%.
  • 7:36 - 7:38
    That p is in this
    range over here.
  • 7:38 - 7:41
    Or another way, and you'll see
    this in a lot of surveys that
  • 7:41 - 7:45
    have been done, people will say
    we did a survey and we got
  • 7:45 - 7:55
    43% will vote for number one,
    and number one in this case is
  • 7:55 - 7:56
    candidate B.
  • 7:56 - 8:02
  • 8:02 - 8:04
    And then the other side, since
    everyone else voted for
  • 8:04 - 8:13
    candidate A, 57% will
    vote for A.
  • 8:13 - 8:15
    And then they're going to
    put on margin of error.
  • 8:15 - 8:18
    And you'll see this in any
    survey that you see on TV.
  • 8:18 - 8:22
    They'll put a margin of error.
  • 8:22 - 8:25
    And the margin of error is just
    another way of describing
  • 8:25 - 8:26
    this confidence interval.
  • 8:26 - 8:29
    And they'll say that the margin
    of error in this case
  • 8:29 - 8:37
    is 10%, which means that there's
    a 95% confidence
  • 8:37 - 8:42
    interval, if you go plus or
    minus 10% from that value
  • 8:42 - 8:43
    right over there.
  • 8:43 - 8:45
    And I really want to emphasize,
    you can't say with
  • 8:45 - 8:49
    certainty that there is a 95%
    chance that the true result
  • 8:49 - 8:52
    will be within 10% of this,
    because we had to estimate the
  • 8:52 - 8:55
    standard deviation of
    the sampling mean.
  • 8:55 - 8:58
    But this is the best measure
    we can with the information
  • 8:58 - 9:00
    you have. If you're going to
    do a survey of 100 people,
  • 9:00 - 9:04
    this is the best kind of
    confidence that we can get.
  • 9:04 - 9:06
    And this number is actually
    fairly big.
  • 9:06 - 9:09
    So if you were to look at this
    you would say, roughly there's
  • 9:09 - 9:12
    a 95% chance that the true
    value of this number is
  • 9:12 - 9:15
    between 33% and 53%.
  • 9:15 - 9:18
    So there's actually still a
    chance that candidate B can
  • 9:18 - 9:21
    win, even though only
    43% of your 100 are
  • 9:21 - 9:22
    going to vote for him.
  • 9:22 - 9:25
    If you wanted to make it a
    little bit more precise you
  • 9:25 - 9:27
    would want to take
    more samples.
  • 9:27 - 9:28
    You can imagine.
  • 9:28 - 9:32
    Instead of taking 100 samples,
    instead of n being 100, if you
  • 9:32 - 9:35
    made n equal 1,000, then you
    would take this number over
  • 9:35 - 9:38
    here, you would take this number
    here and divide by the
  • 9:38 - 9:41
    square root of 1,000 instead
    of the square root of 100.
  • 9:41 - 9:43
    So you'd be dividing
    by 33 or whatever.
  • 9:43 - 9:49
    And so then the size of the
    standard deviation of your
  • 9:49 - 9:51
    sampling distribution
    will go down.
  • 9:51 - 9:53
    And so the distance of two
    standard deviations will be a
  • 9:53 - 9:55
    smaller number, and so
    then you will have a
  • 9:55 - 9:57
    smaller margin of error.
  • 9:57 - 10:00
    And maybe you want to get the
    margin of error small enough
  • 10:00 - 10:03
    so that you can figure out
    decisively who's going to win
  • 10:03 - 10:04
    the election.
  • 10:04 - 10:05
Title:
Margin of Error 2
Description:

more » « less
Video Language:
English
Team:
Khan Academy
Duration:
10:05
Fran Ontanaya edited English subtitles for Margin of Error 2
Amara Bot edited English subtitles for Margin of Error 2

English subtitles

Revisions Compare revisions