< Return to Video

Statistics intro: mean, median and mode

  • 0:01 - 0:07
    We will now begin our journey
    into the world of statistics,
  • 0:07 - 0:10
    which is really a way
    to understand or get
  • 0:10 - 0:12
    our head around data.
  • 0:12 - 0:15
    So statistics is all about data.
  • 0:15 - 0:19
    And as we begin our journey
    into the world of statistics,
  • 0:19 - 0:21
    we will be doing
    a lot of what we
  • 0:21 - 0:23
    can call descriptive statistics.
  • 0:23 - 0:25
    So if we have a bunch
    of data, and if we
  • 0:25 - 0:28
    want to tell something
    about all of that data
  • 0:28 - 0:30
    without giving them
    all of the data,
  • 0:30 - 0:34
    can we somehow describe it
    with a smaller set of numbers?
  • 0:34 - 0:36
    So that's what we're
    going to focus on.
  • 0:36 - 0:37
    And then once we
    build our toolkit
  • 0:37 - 0:39
    on the descriptive
    statistics, then we
  • 0:39 - 0:42
    can start to make
    inferences about that data,
  • 0:42 - 0:44
    start to make conclusions,
    start to make judgments.
  • 0:44 - 0:49
    And we'll start to do a lot
    of inferential statistics,
  • 0:49 - 0:51
    make inferences.
  • 0:51 - 0:53
    So with that out of
    the way, let's think
  • 0:53 - 0:56
    about how we can describe data.
  • 0:56 - 1:01
    So let's say we have
    a set of numbers.
  • 1:01 - 1:02
    We can consider this to be data.
  • 1:02 - 1:05
    Maybe we're measuring
    the heights of our plants
  • 1:05 - 1:06
    in our garden.
  • 1:06 - 1:07
    And let's say we
    have six plants.
  • 1:07 - 1:14
    And the heights are 4 inches,
    3 inches, 1 inch, 6 inches,
  • 1:14 - 1:18
    and another one's 1 inch,
    and another one is 7 inches.
  • 1:18 - 1:21
    And let's say someone just
    said-- in another room, not
  • 1:21 - 1:22
    looking at your
    plants, just said,
  • 1:22 - 1:25
    well, you know, how
    tall are your plants?
  • 1:25 - 1:26
    And they only want
    to hear one number.
  • 1:26 - 1:31
    They want to somehow
    have one number that
  • 1:31 - 1:33
    represents all of these
    different heights of plants.
  • 1:33 - 1:37
    How would you do that?
  • 1:37 - 1:39
    Well, you'd say, well,
    how can I find something
  • 1:39 - 1:41
    that-- maybe I want
    a typical number.
  • 1:41 - 1:44
    Maybe I want some number that
    somehow represents the middle.
  • 1:44 - 1:46
    Maybe I want the
    most frequent number.
  • 1:46 - 1:49
    Maybe I want the number
    that somehow represents
  • 1:49 - 1:51
    the center of all
    of these numbers.
  • 1:51 - 1:53
    And if you said any
    of those things,
  • 1:53 - 1:55
    you would actually have
    done the same things
  • 1:55 - 1:58
    that the people who first came
    up with descriptive statistics
  • 1:58 - 1:58
    said.
  • 1:58 - 2:00
    They said, well,
    how can we do it?
  • 2:00 - 2:05
    And we'll start by thinking
    of the idea of average.
  • 2:05 - 2:08
    And in every day
    terminology, average
  • 2:08 - 2:10
    has a very particular
    meaning, as we'll see.
  • 2:10 - 2:12
    When many people
    talk about average,
  • 2:12 - 2:13
    they're talking
    about the arithmetic
  • 2:13 - 2:15
    mean, which we'll see shortly.
  • 2:15 - 2:18
    But in statistics, average
    means something more general.
  • 2:18 - 2:23
    It really means
    give me a typical,
  • 2:23 - 2:30
    or give me a middle number,
    or-- and these are or's.
  • 2:30 - 2:32
    And really it's
    an attempt to find
  • 2:32 - 2:33
    a measure of central tendency.
  • 2:39 - 2:41
    So once again, you have
    a bunch of numbers.
  • 2:41 - 2:43
    You're somehow trying
    to represent these
  • 2:43 - 2:46
    with one number we'll call
    the average, that's somehow
  • 2:46 - 2:49
    typical, or middle,
    or the center somehow
  • 2:49 - 2:50
    of these numbers.
  • 2:50 - 2:54
    And as we'll see, there's
    many types of averages.
  • 2:54 - 2:57
    The first is the one that you're
    probably most familiar with.
  • 2:57 - 2:58
    It's the one-- and
    people talk about hey,
  • 2:58 - 3:01
    the average on this exam
    or the average height.
  • 3:01 - 3:03
    And that's the arithmetic mean.
  • 3:03 - 3:05
    Just let me write it in.
  • 3:05 - 3:13
    I'll write in yellow,
    arithmetic mean.
  • 3:13 - 3:16
    When arithmetic is a noun,
    we call it arithmetic.
  • 3:16 - 3:20
    When it's an adjective like
    this, we call it arithmetic,
  • 3:20 - 3:22
    arithmetic mean.
  • 3:22 - 3:25
    And this is really just the
    sum of all the numbers divided
  • 3:25 - 3:28
    by-- this is a human-constructed
    definition that we've
  • 3:28 - 3:32
    found useful-- the sum of
    all these numbers divided
  • 3:32 - 3:34
    by the number of
    numbers we have.
  • 3:34 - 3:37
    So given that, what
    is the arithmetic mean
  • 3:37 - 3:39
    of this data set?
  • 3:39 - 3:40
    Well, let's just compute it.
  • 3:40 - 3:46
    It's going to be 4 plus
    3 plus 1 plus 6 plus 1
  • 3:46 - 3:51
    plus 7 over the number
    of data points we have.
  • 3:51 - 3:53
    So we have six data points.
  • 3:53 - 3:55
    So we're going to divide by 6.
  • 3:55 - 4:02
    And we get 4 plus 3 is 7,
    plus 1 is 8, plus 6 is 14,
  • 4:02 - 4:05
    plus 1 is 15, plus 7.
  • 4:05 - 4:08
    15 plus 7 is 22.
  • 4:08 - 4:09
    Let me do that one more time.
  • 4:09 - 4:15
    You have 7, 8, 14, 15,
    22, all of that over 6.
  • 4:15 - 4:17
    And we could write
    this as a mixed number.
  • 4:17 - 4:21
    6 goes into 22 three times
    with a remainder of 4.
  • 4:21 - 4:25
    So it's 3 and 4/6, which is
    the same thing as 3 and 2/3.
  • 4:25 - 4:29
    We could write this as a
    decimal with 3.6 repeating.
  • 4:29 - 4:32
    So this is also 3.6 repeating.
  • 4:32 - 4:34
    We could write it any
    one of those ways.
  • 4:34 - 4:37
    But this is kind of a
    representative number.
  • 4:37 - 4:40
    This is trying to get
    at a central tendency.
  • 4:40 - 4:42
    Once again, these are
    human-constructed.
  • 4:42 - 4:44
    No one ever-- it's
    not like someone just
  • 4:44 - 4:46
    found some religious
    document that said,
  • 4:46 - 4:48
    this is the way that
    the arithmetic mean
  • 4:48 - 4:49
    must be defined.
  • 4:49 - 4:53
    It's not as pure
    of a computation
  • 4:53 - 4:55
    as, say, finding the
    circumference of the circle,
  • 4:55 - 4:57
    which there really is--
    that was kind of-- we
  • 4:57 - 4:58
    studied the universe.
  • 4:58 - 5:01
    And that just fell out of
    our study of the universe.
  • 5:01 - 5:02
    It's a human-constructed
    definition
  • 5:02 - 5:04
    that we found useful.
  • 5:04 - 5:07
    Now there are other ways
    to measure the average
  • 5:07 - 5:10
    or find a typical
    or middle value.
  • 5:10 - 5:14
    The other very typical
    way is the median.
  • 5:14 - 5:16
    And I will write median.
  • 5:16 - 5:17
    I'm running out of colors.
  • 5:17 - 5:19
    I will write median in pink.
  • 5:19 - 5:21
    So there is the median.
  • 5:21 - 5:25
    And the median is literally
    looking for the middle number.
  • 5:25 - 5:27
    So if you were to order
    all the numbers in your set
  • 5:27 - 5:31
    and find the middle one,
    then that is your median.
  • 5:31 - 5:34
    So given that, what's the
    median of this set of numbers
  • 5:34 - 5:36
    going to be?
  • 5:36 - 5:37
    Let's try to figure it out.
  • 5:37 - 5:38
    Let's try to order it.
  • 5:38 - 5:40
    So we have 1.
  • 5:40 - 5:41
    Then we have another 1.
  • 5:41 - 5:43
    Then we have a 3.
  • 5:43 - 5:47
    Then we have a 4, a 6, and a 7.
  • 5:47 - 5:49
    So all I did is
    I reordered this.
  • 5:49 - 5:51
    And so what's the middle number?
  • 5:51 - 5:52
    Well, you look here.
  • 5:52 - 5:55
    Since we have an even number of
    numbers, we have six numbers,
  • 5:55 - 5:57
    there's not one middle number.
  • 5:57 - 6:00
    You actually have two
    middle numbers here.
  • 6:00 - 6:02
    You have two middle
    numbers right over here.
  • 6:02 - 6:03
    You have the 3 and the 4.
  • 6:03 - 6:06
    And in this case, when you
    have two middle numbers,
  • 6:06 - 6:10
    you actually go halfway
    between these two numbers.
  • 6:10 - 6:12
    You're essentially taking the
    arithmetic mean of these two
  • 6:12 - 6:14
    numbers to find the median.
  • 6:14 - 6:16
    So the median is going
    to be halfway in-between
  • 6:16 - 6:19
    3 and 4, which is
    going to be 3.5.
  • 6:19 - 6:24
    So the median in
    this case is 3.5.
  • 6:24 - 6:27
    So if you have an even
    number of numbers, the median
  • 6:27 - 6:29
    or the middle two, the--
    essentially the arithmetic
  • 6:29 - 6:31
    mean of the middle two, or
    halfway between the middle two.
  • 6:31 - 6:33
    If you have an odd
    number of numbers,
  • 6:33 - 6:34
    it's a little bit
    easier to compute.
  • 6:34 - 6:36
    And just so that
    we see that, let
  • 6:36 - 6:37
    me give you another data set.
  • 6:37 - 6:39
    Let's say our data
    set-- and I'll
  • 6:39 - 6:42
    order it for us--
    let's say our data set
  • 6:42 - 6:56
    was 0, 7, 50, I don't know,
    10,000, and 1 million.
  • 6:56 - 6:57
    Let's say that is our data set.
  • 6:57 - 6:58
    Kind of a crazy data set.
  • 6:58 - 7:02
    But in this situation,
    what is our median?
  • 7:02 - 7:04
    Well, here we have five numbers.
  • 7:04 - 7:05
    We have an odd
    number of numbers.
  • 7:05 - 7:07
    So it's easier to
    pick out a middle.
  • 7:07 - 7:12
    The middle is the number that is
    greater than two of the numbers
  • 7:12 - 7:14
    and is less than
    two of the numbers.
  • 7:14 - 7:15
    It's exactly in the middle.
  • 7:15 - 7:19
    So in this case,
    our median is 50.
  • 7:19 - 7:21
    Now, the third measure
    of central tendency,
  • 7:21 - 7:22
    and this is the
    one that's probably
  • 7:22 - 7:26
    used least often in
    life, is the mode.
  • 7:26 - 7:28
    And people often
    forget about it.
  • 7:28 - 7:30
    It sounds like
    something very complex.
  • 7:30 - 7:31
    But what we'll see
    is it's actually
  • 7:31 - 7:33
    a very straightforward idea.
  • 7:33 - 7:36
    And in some ways, it
    is the most basic idea.
  • 7:36 - 7:41
    So the mode is actually the most
    common number in a data set,
  • 7:41 - 7:42
    if there is a most
    common number.
  • 7:42 - 7:44
    If all of the numbers
    are represented equally,
  • 7:44 - 7:46
    if there's no one single
    most common number,
  • 7:46 - 7:47
    then you have no mode.
  • 7:47 - 7:50
    But given that
    definition of the mode,
  • 7:50 - 7:54
    what is the single most common
    number in our original data
  • 7:54 - 7:58
    set, in this data
    set right over here?
  • 7:58 - 8:00
    Well, we only have one 4.
  • 8:00 - 8:01
    We only have one 3.
  • 8:01 - 8:03
    But we have two 1's.
  • 8:03 - 8:05
    We have one 6 and one 7.
  • 8:05 - 8:09
    So the number that shows up
    the most number of times here
  • 8:09 - 8:11
    is our 1.
  • 8:11 - 8:14
    So the mode, the most typical
    number, the most common number
  • 8:14 - 8:18
    here is a 1.
  • 8:18 - 8:20
    So, you see, these
    are all different ways
  • 8:20 - 8:23
    of trying to get at a typical,
    or middle, or central tendency.
  • 8:23 - 8:26
    But they do it in very,
    very different ways.
  • 8:26 - 8:27
    And as we study more
    and more statistics,
  • 8:27 - 8:30
    we'll see that they're
    good for different things.
  • 8:30 - 8:32
    This is used very frequently.
  • 8:32 - 8:35
    The median is really good if you
    have some kind of crazy number
  • 8:35 - 8:36
    out here that could
    have otherwise
  • 8:36 - 8:38
    skewed the arithmetic mean.
  • 8:38 - 8:41
    The mode could also be useful
    in situations like that,
  • 8:41 - 8:43
    especially if you do
    have one number that's
  • 8:43 - 8:46
    showing up a lot
    more frequently.
  • 8:46 - 8:48
    Anyway, I'll leave you there.
  • 8:48 - 8:52
    And we'll-- the next few videos,
    we will explore statistics even
  • 8:52 - 8:53
    deeper.
Title:
Statistics intro: mean, median and mode
Description:

more » « less
Video Language:
English
Team:
Khan Academy
Duration:
08:54

English subtitles

Revisions Compare revisions