Standard Error of the Mean

0:00 - 0:01
0:01 - 0:03

We've seen in the last several
videos you start off with
0:03 - 0:05

any crazy distribution.
0:05 - 0:07

It doesn't have to be
crazy, it could be a nice
0:07 - 0:08

normal distribution.
0:08 - 0:10

But to really make the point
that you don't have to have
0:10 - 0:12

a normal distribution I
like to use crazy ones.
0:12 - 0:15

So let's say you have some kind
of crazy distribution that
0:15 - 0:16

looks something like that.
0:16 - 0:17

It could look like anything.
0:17 - 0:19

So we've seen multiple times
you take samples from
0:19 - 0:21

this crazy distribution.
0:21 - 0:28

So let's say you were to take
samples of n is equal to 10.
0:28 - 0:33

So we take 10 instances of this
random variable, average them
0:33 - 0:35

out, and then plot our average.
0:35 - 0:36

We plot our average.
0:36 - 0:38

We get 1 instance there.
0:38 - 0:39

We keep doing that.
0:39 - 0:39

We do that again.
0:39 - 0:43

We take 10 samples from this
random variable, average
0:43 - 0:43

them, plot them again.
0:43 - 0:47

You plot again and eventually
you do this a gazillion times--
0:47 - 0:50

in theory an infinite number of
times-- and you're going to
0:50 - 0:53

approach the sampling
distribution of the sample
0:53 - 0:56

mean. n equal 10 is not going
to be a perfect normal
0:56 - 0:59

distribution but it's
going to be close.
0:59 - 1:01

It'd be perfect only
if n was infinity.
1:01 - 1:06

But let's say we eventually--
all of our samples we get a lot
1:06 - 1:08

of averages that are there that
stacks up, that stacks up
1:08 - 1:10

there, and eventually will
approach something that
1:10 - 1:13

looks something like that.
1:13 - 1:16

And we've seen from the last
video that one-- if let's say
1:16 - 1:19

we were to do it again and this
time let's say that n is equal
1:19 - 1:23

to 20-- one, the distribution
that we get is going
1:23 - 1:25

to be more normal.
1:25 - 1:27

And maybe in future videos
we'll delve even deeper into
1:27 - 1:30

things like kurtosis and skew.
1:30 - 1:31

But it's going to
be more normal.
1:31 - 1:34

But even more important here or
I guess even more obviously
1:34 - 1:36

to us, we saw that in the
experiment it's going to have
1:36 - 1:37

a lower standard deviation.
1:37 - 1:39

So they're all going to
have the same mean.
1:39 - 1:42

Let's say the mean here is,
I don't know, let's say
1:42 - 1:43

the mean here is 5.
1:43 - 1:45

Then the mean here is
also going to be 5.
1:45 - 1:48

The mean of our sampling
distribution of the sample
1:48 - 1:49

mean is going to be 5.
1:49 - 1:50

It doesn't matter
what our n is.
1:50 - 1:52

If our n is 20 it's
still going to be 5.
1:52 - 1:54

But our standard deviation
is going to be less than
1:54 - 1:56

either of these scenarios.
1:56 - 1:57

And we saw that just
by experimenting.
1:57 - 1:58

It might look like this.
1:58 - 2:00

It's going to be more normal
but it's going to have a
2:00 - 2:01

tighter standard deviation.
2:01 - 2:03

So maybe it'll look like that.
2:03 - 2:07

And if we did it with an even
larger sample size-- let me do
2:07 - 2:10

that in a different color-- if
we did that with an even larger
2:10 - 2:13

sample size, n is equal to 100,
what we're going to get is
2:13 - 2:17

something that fits the normal
distribution even better.
2:17 - 2:20

We take a hundred instances
of this random variable,
2:20 - 2:21

average them, plot it.
2:21 - 2:23

A hundred instances of
this random variable,
2:23 - 2:24

average them, plot it.
2:24 - 2:25

And we just keep doing that.
2:25 - 2:28

If we keep doing that, what
we're going to have is
2:28 - 2:30

something that's even more
normal than either of these.
2:30 - 2:32

So it's going to be a much
closer fit to a true
2:32 - 2:33

normal distribution.
2:33 - 2:36

But even more obvious to
the human, it's going
2:36 - 2:38

to be even tighter.
2:38 - 2:41

So it's going to be a very
low standard deviation.
2:41 - 2:42

It's going to look
something like that.
2:42 - 2:47

And I'll show you on the
simulation app in the next or
2:47 - 2:49

probably later in this video.
2:49 - 2:50

So two things happen.
2:50 - 2:52

As you increase your sample
size for every time you
2:52 - 2:54

do the average, two
things are happening.
2:54 - 2:57

You're becoming more normal
and your standard deviation
2:57 - 2:58

is getting smaller.
2:58 - 3:01

So the question might
arise is there a formula?
3:01 - 3:05

So if I know the standard
deviation-- so this is my
3:05 - 3:08

standard deviation of just my
original probability density
3:08 - 3:11

function, this is the mean of
my original probability
3:11 - 3:12

density function.
3:12 - 3:15

So if I know the standard
deviation and I know n-- n is
3:15 - 3:17

going to change depending on
how many samples I'm taking
3:17 - 3:21

every time I do a sample mean--
if I know that my standard
3:21 - 3:24

deviation, or maybe if I
know my variance, right?
3:24 - 3:26

The variance to just the
standard deviation squared.
3:26 - 3:28

If you don't remember
that you might want to
3:28 - 3:30

review those videos.
3:30 - 3:34

But if I know the variance of
my original distribution and if
3:34 - 3:39

I know what my n is-- how many
samples I'm going to take every
3:39 - 3:42

time before I average them in
order to plot one thing in my
3:42 - 3:47

sampling distribution of my
sample mean-- is there a way to
3:47 - 3:51

predict what the mean of
these distributions are?
3:51 - 3:53

And so-- I'm sorry, the
standard deviation of
3:53 - 3:54

these distributions.
3:54 - 3:56

And so you don't get confused
between that and that,
3:56 - 3:57

let me say the variance.
3:57 - 3:59

If you know the variance
you can figure out the
3:59 - 4:00

standard deviation.
4:00 - 4:01

One is just the square
root of the other.
4:01 - 4:06

So this is the variance of
our original distribution.
4:06 - 4:09

Now to show that this is the
variance of our sampling
4:09 - 4:12

distribution of our sample mean
we'll write it right here.
4:12 - 4:16

This is the variance of our
mean of our sample mean.
4:16 - 4:19

Remember the sample--
our true mean is this.
4:19 - 4:22

The Greek letter Mu
is our true mean.
4:22 - 4:27

This is equal to the mean,
while an x a line over
4:27 - 4:28

it means sample mean.
4:28 - 4:31
4:31 - 4:34

So here what we're saying is
this is the variance of our
4:34 - 4:37

sample mean, that this is going
to be true distribution.
4:37 - 4:38

This isn't an estimate.
4:38 - 4:43

There's some-- you know, if we
magically knew distribution--
4:43 - 4:45

there's some true
variance here.
4:45 - 4:49

And of course the mean-- so
this has a mean-- this right
4:49 - 4:51

here, we can just get our
notation right, this is the
4:51 - 4:55

mean of the sampling
distribution of the
4:55 - 4:56

sampling mean.
4:56 - 4:58

So this is the mean
of our means.
4:58 - 5:00

It just happens to
be the same thing.
5:00 - 5:03

This is the mean of
our sample means.
5:03 - 5:05

It's going to be the same thing
as that, especially if we do
5:05 - 5:07

the trial over and over again.
5:07 - 5:09

But anyway, the point of this
video, is there any way to
5:09 - 5:14

figure out this variance given
the variance of the original
5:14 - 5:16

distribution and your n?
5:16 - 5:17

And it turns out there is.
5:17 - 5:18

And I'm not going to
do a proof here.
5:18 - 5:20

I really want to give you
the intuition of it.
5:20 - 5:23

I think you already do have the
sense that every trial you
5:23 - 5:26

take-- if you take a hundred,
you're much more likely when
5:26 - 5:29

you average those out, to get
close to the true mean than if
5:29 - 5:31

you took an n of
2 or an n of 5.
5:31 - 5:34

You're just very unlikely to be
far away, right, if you took
5:34 - 5:37

100 trials as opposed
to taking 5.
5:37 - 5:39

So I think you know that
in some way it should be
5:39 - 5:41

inversely proportional to n.
5:41 - 5:44

The larger your n the smaller
a standard deviation.
5:44 - 5:46

And actually it turns out it's
about as simple as possible.
5:46 - 5:48

It's one of those magical
things about mathematics.
5:48 - 5:50

And I'll prove it
to you one day.
5:50 - 5:52

I want to give you
working knowledge first.
5:52 - 5:54

In statistics, I'm always
struggling whether I should be
5:54 - 5:57

formal in giving you rigorous
proofs but I've kind of come to
5:57 - 5:59

the conclusion that it's more
important to get the working
5:59 - 6:02

knowledge first in statistics
and then later, once you've
6:02 - 6:05

gotten all of that down, we can
get into the real deep math
6:05 - 6:06

of it and prove it to you.
6:06 - 6:09

But I think experimental proofs
are kind of all you need for
6:09 - 6:11

right now, using those
simulations to show that
6:11 - 6:12

they're really true.
6:12 - 6:15

So it turns out that the
variance of your sampling
6:15 - 6:18

distribution of your sample
mean is equal to the
6:18 - 6:21

variance of your original
distribution-- that guy
6:21 - 6:23

right there-- divided by n.
6:23 - 6:24

That's all it is.
6:24 - 6:30

So if this up here has a
variance of-- let's say this up
6:30 - 6:34

here has a variance of 20-- I'm
just making that number up--
6:34 - 6:36

then let's say your n is 20.
6:36 - 6:39

Then the variance of your
sampling distribution of your
6:39 - 6:41

sample mean for an n of 20,
well you're just going to take
6:41 - 6:44

that, the variance up here--
your variance is 20--
6:44 - 6:46

divided by your n, 20.
6:46 - 6:50

So here your variance is
going to be 20 divided by
6:50 - 6:51

20 which is equal to 1.
6:51 - 6:53

This is the variance of
your original probability
6:53 - 6:56

distribution and
this is your n.
6:56 - 6:57

What's your standard
deviation going to be?
6:57 - 7:00

What's going to be the
square root of that, right?
7:00 - 7:01

Standard deviation is going
to be square root of 1.
7:01 - 7:02

Well that's also going to be 1.
7:02 - 7:04

So we could also write this.
7:04 - 7:07

We could take the square root
of both sides of this and say
7:07 - 7:11

the standard deviation of the
sampling distribution
7:11 - 7:14

standard-- the standard
deviation of the sampling
7:14 - 7:17

distribution of the sample mean
is often called the standard
7:17 - 7:19

deviation of the mean.
7:19 - 7:20

And it's also called-- I'm
going to write this down-- the
7:20 - 7:22

standard error of the mean.
7:22 - 7:28
7:28 - 7:30

All of these things that I just
mentioned, they all just mean
7:30 - 7:33

the standard deviation of the
sampling distribution
7:33 - 7:34

of the sample mean.
7:34 - 7:37

That's why this is confusing
because you use the word mean
7:37 - 7:38

and sample over and over again.
7:38 - 7:40

And if it confuses
you let me know.
7:40 - 7:42

I'll do another video or pause
and repeat or whatever.
7:42 - 7:44

But if we just take the square
root of both sides, the
7:44 - 7:47

standard error of the mean or
the standard deviation of the
7:47 - 7:50

sampling distribution of the
sample mean is equal to the
7:50 - 7:54

standard deviation of your
original function-- of your
7:54 - 7:57

original probability density
function-- which could be very
7:57 - 8:00

non-normal, divided by
the square root of n.
8:00 - 8:03

I just took the square root of
both sides of this equation.
8:03 - 8:07

I personally like to remember
this: that the variance is just
8:07 - 8:09

inversely proportional to n.
8:09 - 8:10

And then I like to
go back to this.
8:10 - 8:12

Because this is very
simple in my head.
8:12 - 8:14

You just take the
variance, divide it by n.
8:14 - 8:16

Oh and if I want the standard
deviation, I just take the
8:16 - 8:18

square roots of both sides
and I get this formula.
8:18 - 8:22

So here the standard
deviation-- when n is 20-- the
8:22 - 8:26

standard deviation of the
sampling distribution of the
8:26 - 8:27

sample mean is going to be 1.
8:27 - 8:32

Here when n is 100, our
variance here when
8:32 - 8:32

n is equal to 100.
8:32 - 8:35

So our variance of the sampling
mean of the sample distribution
8:35 - 8:38

or our variance of the mean--
of the sample mean, we
8:38 - 8:40

could say-- is going to be
equal to 20-- this guy's
8:40 - 8:43

variance-- divided by n.
8:43 - 8:47

So it equals-- n is
100-- so it equals 1/5.
8:47 - 8:51

Now this guy's standard
deviation or the standard
8:51 - 8:54

deviation of the sampling
distribution of the sample mean
8:54 - 8:56

or the standard error of the
mean is going to be the
8:56 - 8:56

square root of that.
8:56 - 8:59

So 1 over the square root of 5.
8:59 - 9:03

And so this guy's will be a
little bit under 1/2 the
9:03 - 9:05

standard deviation while
this guy had a standard
9:05 - 9:06

deviation of 1.
9:06 - 9:07

So you see, it's
definitely thinner.
9:07 - 9:08

Now I know what you're saying.
9:08 - 9:10

Well, Sal, you just gave
a formula, I don't
9:10 - 9:11

necessarily believe you.
9:11 - 9:14

Well let's see if we can
prove it to ourselves
9:14 - 9:16

using the simulation.
9:16 - 9:20

So just for fun let me make
a-- I'll just mess with this
9:20 - 9:22

distribution a little bit.
9:22 - 9:23

So that's my new distribution.
9:23 - 9:25

And let me take an n of-- let
me take two things that's easy
9:25 - 9:28

to take the square root of
because we're looking at
9:28 - 9:28

standard deviations.
9:28 - 9:34

So we take an n of
16 and an n of 25.
9:34 - 9:35

Let's do 10,000 trials.
9:35 - 9:37

So in this case every one of
the trials we're going to take
9:37 - 9:40

16 samples from here, average
them, plot it here, and
9:40 - 9:42

then do a frequency plot.
9:42 - 9:45

Here we're going to do 25 at a
time and then average them.
9:45 - 9:47

I'll do it once animated
just to remember.
9:47 - 9:51

So I'm taking 16
samples, plot it there.
9:51 - 9:54

I take 16 samples as described
by this probability density
9:54 - 9:57

function-- or 25 now,
plot it down here.
9:57 - 10:03

Now if I do that 10,000
times, what do I get?
10:03 - 10:07

All right, so here, just
visually you can tell just when
10:07 - 10:09

n was larger, the standard
deviation here is smaller.
10:09 - 10:10

This is more squeezed together.
10:10 - 10:12

But actually let's
write this stuff down.
10:12 - 10:14

Let's see if I can
remember it here.
10:14 - 10:17

So in this random distribution
I made my standard
10:17 - 10:19

deviation was 9.3.
10:19 - 10:20

I'm going to remember these.
10:20 - 10:24

Our standard deviation for
the original thing was 9.3.
10:24 - 10:28

And so standard deviation here
was 2.3 and the standard
10:28 - 10:30

deviation here is 1.87.
10:30 - 10:33

Let's see if it conforms
to our formula.
10:33 - 10:35

So I'm going to take this off
screen for a second and I'm
10:35 - 10:39

going to go back and
do some mathematics.
10:39 - 10:41

So I have this on my
other screen so I can
10:41 - 10:43

remember those numbers.
10:43 - 10:47

So in the trial we just did,
my wacky distribution had a
10:47 - 10:53

standard deviation of 9.3.
10:53 - 10:58

When n is equal to-- let me do
this in another color-- when n
10:58 - 11:02

was equal to 16, just doing the
experiment, doing a bunch of
11:02 - 11:04

trials and averaging and doing
all the things, we got the
11:04 - 11:08

standard deviation of the
sampling distribution of the
11:08 - 11:10

sample mean or the standard
error of the mean, we
11:10 - 11:16

experimentally determined
it to be 2.33.
11:16 - 11:22

And then when n is equal to 25
we got the standard error of
11:22 - 11:25

the mean being equal to 1.87.
11:25 - 11:28

Let's see if it conforms
to our formulas.
11:28 - 11:33

So we know that the variance or
we could almost say the
11:33 - 11:36

variance of the mean or the
standard error-- the variance
11:36 - 11:39

of the sampling distribution of
the sample mean is equal to the
11:39 - 11:42

variance of our original
distribution divided by n, take
11:42 - 11:45

the square roots of both sides,
and then you get the standard
11:45 - 11:49

error of the mean is equal to
the standard deviation of your
11:49 - 11:52

original distribution divided
by the square root of n.
11:52 - 11:54

So let's see if this works
out for these two things.
11:54 - 11:59

So if I were to take 9.3--
so let me do this case.
11:59 - 12:04

So 9.3 divided by the
square root of 16, right?
12:04 - 12:05

N is 16.
12:05 - 12:07

So divided by the square
root of 16, which is
12:07 - 12:09

4, what do I get?
12:09 - 12:12

So 9.3 divided by 4.
12:12 - 12:15

Let me get a little
calculator out here.
12:15 - 12:16

Let's see.
12:16 - 12:19

We have-- let me clear it
out-- we want to divide
12:19 - 12:21

9.3 divided by 4.
12:21 - 12:25

9.3 three divided by our
square root of n. n was 16.
12:25 - 12:32

So divided by 4 is
equal to 2.32.
12:32 - 12:42

So this is equal to 2.32 which
is pretty darn close to 2.33.
12:42 - 12:43

This was after 10,000 trials.
12:43 - 12:46

Maybe right after this I'll see
what happens if we did 20,000
12:46 - 12:49

or 30,000 trials where we take
samples of 16 and average them.
12:49 - 12:50

Now let's look at this.
12:50 - 12:55

Here we would take 9.3-- so let
me draw a little line here.
12:55 - 12:57

Let me scroll over,
that might be better.
12:57 - 13:00

So we take our standard
deviation of our
13:00 - 13:02

original distribution.
13:02 - 13:05

So just that formula that we've
derived right here would tell
13:05 - 13:09

us that our standard error
should be equal to the standard
13:09 - 13:13

deviation of our original
distribution, 9.3, divided by
13:13 - 13:15

the square root of n, divided
by the square root
13:15 - 13:16

of 25, right?
13:16 - 13:18

4 was just the
square root of 16.
13:18 - 13:22

So this is equal to
9.3 divided by 5.
13:22 - 13:24

And let's see if it's 1.87.
13:24 - 13:28

So let me get my
calculator back.
13:28 - 13:36

So if I take 9.3 divided
by 5, what do I get?
13:36 - 13:42

1.86 which is very
close to 1.87.
13:42 - 13:49

So we got in this case 1.86.
13:49 - 13:53

So as you can see what we got
experimentally was almost
13:53 - 13:56

exactly-- and this was after
10,000 trials-- of what
13:56 - 13:57

you would expect.
13:57 - 13:59

Let's do another 10,000.
13:59 - 14:00

So you've got another
10,000 trials.
14:00 - 14:02

Well we're still
in the ballpark.
14:02 - 14:05

We're not going to-- maybe I
can't hope to get the exact
14:05 - 14:07

number rounded or whatever.
14:07 - 14:11

But as you can see, hopefully
that'll be pretty satisfying to
14:11 - 14:14

you, that the variance of the
sampling distribution of the
14:14 - 14:22

sample mean is just going to be
equal to the variance of your
14:22 - 14:24

original distribution, no
matter how wacky that
14:24 - 14:27

distribution might be, divided
by your sample size-- by the
14:27 - 14:34

number of samples you take for
every basket that you average I
14:34 - 14:35

guess is the best way
to think about it.
14:35 - 14:38

You know, sometimes this can
get confusing because you are
14:38 - 14:40

taking samples of averages
based on samples.
14:40 - 14:43

So when someone says sample
size, you're like, is sample
14:43 - 14:47

size the number of times I
took averages or the number
14:47 - 14:49

of things I'm taking
averages of each time?
14:49 - 14:51

And you know, it doesn't
hurt to clarify that.
14:51 - 14:53

Normally when they talk
about sample size
14:53 - 14:54

they're talking about n.
14:54 - 14:58

And, at least in my head, when
I think of the trials as you
14:58 - 15:01

take a sample size of 16, you
average it, that's the one
15:01 - 15:02

trial, and then you plot it.
15:02 - 15:04

Then you do it again and
you do another trial.
15:04 - 15:05

And you do it over
and over again.
15:05 - 15:07

But anyway, hopefully this
makes everything clear and then
15:07 - 15:11

you now also understand how to
get to the standard
15:11 - 15:14

error of the mean.
15:14 - 15:15

Title:: Standard Error of the Mean
Description:: Standard Error of the Mean (a.k.a. the standard deviation of the sampling distribution of the sample mean!)

more » « less
Video Language:: English
Duration:: 15:15

	brettle edited English subtitles for Standard Error of the Mean
	brettle edited English subtitles for Standard Error of the Mean
	brettle edited English subtitles for Standard Error of the Mean
	brettle edited English subtitles for Standard Error of the Mean
	brettle edited English subtitles for Standard Error of the Mean
	brettle edited English subtitles for Standard Error of the Mean
	booksforlife edited English subtitles for Standard Error of the Mean
	booksforlife edited English subtitles for Standard Error of the Mean

Show all

English subtitles

Revisions

Revision 4

brettle

Standard Error of the Mean

Revisions

Our website uses cookies

Operating cookies (Required)