The Joy of Stats
-
0:03 - 0:10The world we live in is awash with
data that comes pouring in
from everywhere around us. -
0:10 - 0:15On its own this data
is just noise and confusion. -
0:15 - 0:23To make sense of data, to find the
meaning in it, we need the powerful
branch of science - statistics. -
0:23 - 0:26Believe me there's nothing
boring about statistics. -
0:26 - 0:29Especially not today
when we can make the data sing. -
0:29 - 0:33With statistics we can
really make sense of the world. -
0:33 - 0:35And there's more.
-
0:35 - 0:40With statistics, the data deluge, as
it's being called, is leading us -
0:40 - 0:46to an ever greater understanding
of life on Earth
and the universe beyond. -
0:46 - 0:51And thanks to the incredible
power of today's computers, -
0:51 - 0:57it may fundamentally transform the
process of scientific discovery. -
0:57 - 1:03I kid you not, statistics is
now the sexiest subject around. -
1:23 - 1:26Did you know that there is
one million boats in Sweden? -
1:26 - 1:28That's one boat per nine people!
-
1:28 - 1:31It's the highest number of
boats per person in Europe! -
1:41 - 1:46Being a statistician,
you don't like telling
your profession at dinner parties. -
1:46 - 1:48But really,
statisticians shouldn't be shy -
1:48 - 1:51because everyone wants to
understand what's going on. -
1:51 - 1:56And statistics gives us a
perspective on the world we live in -
1:56 - 1:59that we can't get in any other way.
-
2:04 - 2:09Statistics tells us whether
the things we think
and believe are actually true. -
2:20 - 2:25And statistics are far more useful
than we usually like to admit. -
2:25 - 2:30In the last recession there
was this famous call-in
to a talk radio station. -
2:30 - 2:37The man complained, "In times like
this when unemployment rates are up
to 13%, income has fallen by 5%, -
2:37 - 2:41"and suicide rates are climbing, and
I get so angry that the government -
2:41 - 2:46"is wasting money on things like
collection of statistics." -
2:48 - 2:50I'm not officially a statistician.
-
2:50 - 2:55Strictly speaking,
my field is global health. -
2:58 - 3:03But I got really obsessed with stats
when I realised how much people -
3:03 - 3:06in Sweden just don't know
about the rest of the world. -
3:06 - 3:11I started in our medical
university, Karolinska Institutet, -
3:11 - 3:14an undergraduate course
called Global Health. -
3:14 - 3:17These students coming to us actually
have the highest grade you can get -
3:17 - 3:19in the Swedish college system,
-
3:19 - 3:22so I thought, "Maybe they know
everything I'm going to teach them." -
3:22 - 3:26So I did a pre-test when they came,
and one of the questions -
3:26 - 3:28from which I learned a lot
was this one - -
3:28 - 3:32which country has the highest
child mortality of these five pairs? -
3:32 - 3:35I won't put you at test here,
but it is Turkey -
3:35 - 3:37which is highest there, Poland,
-
3:37 - 3:41Russia, Pakistan, and South Africa.
-
3:41 - 3:43And these were the result of
the Swedish students. -
3:43 - 3:45A 1.8 right answer
out of five possible. -
3:45 - 3:50And that means there was a place for
a professor of International Health
and for my course. -
3:50 - 3:56But one late night
when I was compiling the report,
I really realised my discovery. -
3:56 - 4:01I had shown that Swedish
top students know statistically -
4:01 - 4:04significantly less about
the world than the chimpanzees. -
4:06 - 4:10Because the chimpanzees
would score half right. -
4:10 - 4:12If I gave them two bananas
with Sri Lanka and Turkey, -
4:12 - 4:16they would be right
half of the cases,
but the students are not there. -
4:16 - 4:20I did also an unethical study
of the professors of
the Karolinska Institutet, -
4:20 - 4:26that hands out the Nobel Prize
for medicine, and they are on par
with the chimpanzees there. -
4:28 - 4:33Today there's more information
accessible than ever before. -
4:33 - 4:36'And I work with my team at
the Gapminder Foundation -
4:36 - 4:42'using new tools that help everyone
make sense of the changing world. -
4:42 - 4:45'We draw on the masses of data
that's now freely available -
4:45 - 4:50'from international institutions
like the UN and the World Bank. -
4:50 - 4:54'And it's become my mission to
share the insights -
4:54 - 5:00'from this data with anyone who'll
listen, and to reveal how statistics
is nothing to be frightened of.' -
5:02 - 5:05I'm going to provide you a view of
-
5:05 - 5:09the global health situation
across mankind. -
5:09 - 5:14And I'm going to do that in
hopefully an enjoyable way,
so relax. -
5:14 - 5:17So we did this software
which displays it like this. -
5:17 - 5:19Every bubble here is a country -
-
5:19 - 5:21this is China, this is India.
-
5:21 - 5:24The size of the bubble
is the population. -
5:24 - 5:28I'm going to stage a race between
this sort of yellowish Ford here -
5:28 - 5:33and the red Toyota down there
and the brownish Volvo. -
5:33 - 5:36The Toyota has a very bad start
down here, and United States, -
5:36 - 5:38Ford is going off-road there,
-
5:38 - 5:40and the Volvo is doing quite fine,
this is the war. -
5:40 - 5:44The Toyota got off track, now Toyota
is on the healthier side of Sweden. -
5:44 - 5:47That's about where I sold
the Volvo and bought the Toyota. -
5:47 - 5:48AUDIENCE LAUGH
-
5:48 - 5:51This is the great leap forward,
when China fell down. -
5:51 - 5:53It was the central planning
by Mao Zedong. -
5:53 - 5:57China recovered and said, "Never
more stupid central planning," -
5:57 - 5:58but they went up here.
-
5:58 - 6:03No, there is one more inequity,
look there - United States -
6:03 - 6:07They broke my frame. Washington DC
is so rich over there, -
6:07 - 6:13but they are
not as healthy as Kerala in India.
It's quite interesting, isn't it? -
6:13 - 6:15LAUGHTER AND APPLAUSE
-
6:20 - 6:26Welcome to the USA,
world leaders in big cars -
6:26 - 6:28and free data.
-
6:28 - 6:36There are many here who share
my vision of making public data
accessible and useful for everyone. -
6:36 - 6:43The city of San Francisco
is in the lead, opening up
its data on everything. -
6:43 - 6:47Even the police department is
releasing all its crime reports. -
6:47 - 6:51This official
crime data has been turned -
6:51 - 6:56into a wonderful interactive map by
two of the city's computer whizzes. -
6:56 - 6:59It's community statistics in action.
-
7:09 - 7:13Crimespotting is
a map of crime reports from the
San Francisco Police Department -
7:13 - 7:16showing dots on maps
for citizens to be able to see -
7:16 - 7:19patterns of crime around their
neighbourhoods in San Francisco. -
7:19 - 7:25The map is not just about individual
crimes but about broader patterns
that show you where crime is -
7:25 - 7:28clustered around the city, which
areas have high crime, -
7:28 - 7:30and which areas have
relatively low crime. -
7:37 - 7:41We're here at the top of
Jones Street on Nob Hill... -
7:43 - 7:45..quite a nice neighbourhood.
-
7:45 - 7:50What the crime maps show us
is the relationship between -
7:50 - 7:51topography and crime.
-
7:51 - 7:55Basically the higher up the hill,
the less crime there is. -
7:56 - 7:59You cross over the border
-
7:59 - 8:00into the flats...
-
8:03 - 8:09Essentially as soon as you get
into the lower lying areas of Jones
Street the crime just skyrockets. -
8:20 - 8:24We're here in
the uptown Tenderloin district. -
8:26 - 8:30It's one of the oldest and densest
neighbourhoods in San Francisco. -
8:30 - 8:32This is where you go to buy drugs.
-
8:32 - 8:34Right around here.
-
8:37 - 8:42We see lots of aggravated assaults,
lots of auto thefts. -
8:42 - 8:49Basically a huge part of the crime
that happens in the city happens
in this five or six block radius. -
8:56 - 8:59If you've been hearing police sirens
in your neighbourhood, -
8:59 - 9:02you can use the map to find out why.
-
9:02 - 9:06If you're out at night in
an unfamiliar part of town, -
9:06 - 9:09you can check the map
for streets to avoid. -
9:09 - 9:12If a neighbour gets burgled,
you can see - -
9:12 - 9:17is it a one-off or has there been
a spike in local crime? -
9:17 - 9:19If you commute through a
neighbourhood and you're worried -
9:19 - 9:23about its safety, the fact that we
have the ability to turn off all -
9:23 - 9:25the night-time
and middle-of-the-day crimes -
9:25 - 9:28and show you just the things that are
happening during the commute, -
9:28 - 9:33it is a statistical operation.
But I think to people that are
interacting with the thing -
9:33 - 9:38it feels very much more like they're
just sort of browsing a website
or shopping on Amazon. -
9:38 - 9:44They're looking at data
and they don't realise
they're doing statistics. -
9:44 - 9:48What's most exciting for me
is that public statistics -
9:48 - 9:53is making citizens more powerful and
the authorities more accountable. -
10:02 - 10:05We have community meetings that
the police attend -
10:05 - 10:09and what citizens are
now doing are bringing printouts -
10:09 - 10:12of the maps that show where crimes
are taking place, -
10:12 - 10:16and they're demanding services
from the police department -
10:16 - 10:21and the police department is now
having to change how they police, -
10:21 - 10:23how they provide policing services,
-
10:23 - 10:27because the data is showing
what is working and what is not. -
10:29 - 10:32People in San Francisco
are also using public data -
10:32 - 10:36to map social inequalities
and see how to improve society. -
10:36 - 10:40And the possibilities are endless.
-
10:40 - 10:43I think our dream
government data analysis project -
10:43 - 10:46would really be focused on
live information, -
10:46 - 10:51on stuff that was being reported
and pushed out to the world over
the internet as it was happening. -
10:51 - 10:55You know, trash pickups,
traffic accidents, buses, -
10:55 - 10:58and I think through the kind of
stats-gathering power -
10:58 - 11:03of the internet
it's possible to really begin
to see the workings of the city -
11:03 - 11:05displayed as a unified interface.
-
11:07 - 11:10So that's where we are heading.
-
11:10 - 11:15Towards a world of free data
with all the statistical
insights that come from it, -
11:15 - 11:22accessible to everyone, empowering
us as citizens and letting us
hold our rulers to account. -
11:22 - 11:27It's a long way from
where statistics began. -
11:27 - 11:33Statistics
are essential to us to monitor
our governments and our societies. -
11:33 - 11:37But it was our rulers up
there who started -
11:37 - 11:41the collection of statistics in the
first place in order to monitor us! -
11:47 - 11:51In fact the word 'statistics'
comes from 'the state'. -
11:51 - 11:56Modern statistics
began two centuries ago. -
11:56 - 11:59Once it got going,
it spread and never stopped. -
11:59 - 12:02And guess who was first!
-
12:03 - 12:08The Chinese have Confucius,
the Italians have da Vinci, -
12:08 - 12:10and the British have Shakespeare.
-
12:10 - 12:12And we have the Tabellverket -
-
12:12 - 12:16the first ever systematic
collection of statistics! -
12:16 - 12:22Since the year 1749
we have collected data -
12:22 - 12:27on every birth, marriage and death,
and we are proud of it! -
12:29 - 12:32The Tabellverket recorded
information -
12:32 - 12:34from every parish in Sweden.
-
12:34 - 12:39It was a huge quantity of data and
it was the first time any government -
12:39 - 12:42could get an accurate
picture of its people. -
12:49 - 12:53Sweden had been the greatest
military power in Northern Europe, -
12:53 - 12:58but by 1749 our star
was really fading -
12:58 - 13:01and other countries
were growing stronger. -
13:01 - 13:04At least we were a large power,
-
13:04 - 13:10thought to have 20 million people,
enough to rival Britain and France. -
13:13 - 13:18But we were in for a nasty surprise.
-
13:18 - 13:21The first analysis
of the Tabellverket -
13:21 - 13:24revealed that Sweden
only had two million inhabitants. -
13:24 - 13:31Sweden was not just a power
in decline, it also had
a very small population. -
13:31 - 13:36The government was horrified
by this finding -
what if the enemy found out? -
13:38 - 13:45But the Tabellverket also showed
that many women died in childbirth
and many children died young. -
13:45 - 13:49So government took action
to improve the health of the people. -
13:49 - 13:52This was the beginning
of modern Sweden. -
13:54 - 13:59It took more than 50 years before
the Austrians, Belgians, Danes, -
13:59 - 14:02Dutch, French, Germans, Italians
-
14:02 - 14:09and, finally, the British,
caught up with Sweden
in collecting and using statistics. -
14:25 - 14:30It was called political arithmetic.
It was a lovely phrase
that was used for statistics. -
14:30 - 14:33Governments could have much more
control and understanding of -
14:33 - 14:37the society - how it was working,
how it was developing -
14:37 - 14:40and essentially
so they could control it better. -
14:43 - 14:48It wasn't just governments who
woke up to the power of statistics. -
14:48 - 14:55Right across Europe, 19th
century society went mad for facts. -
14:55 - 14:58And, despite its late start,
Britain, -
14:58 - 15:01with its Royal Statistical Society
in London, -
15:01 - 15:04was soon a statisticians' nirvana.
-
15:06 - 15:10I love looking at old copies of
the Royal Statistical Society journal -
15:10 - 15:12because it's full of such odd stuff.
-
15:12 - 15:15There's a wonderful paper
from the 1840s -
15:15 - 15:19which shows a map of England and
the rates of bastardy in each county. -
15:19 - 15:24So you can identify very quickly the
areas with high rates of bastardy. -
15:24 - 15:27Being in East Anglia it always
makes me slightly laugh that Norfolk -
15:27 - 15:31seems to top the "bastardy league"
in the 1840s. -
15:31 - 15:37One of the founders of
the Royal Statistical Society -
15:37 - 15:42was the great
Victorian mathematician
and inventor Charles Babbage. -
15:42 - 15:50In 1842 he read the latest
poem by an equally great Victorian,
Alfred Tennyson. -
15:50 - 15:53Vision of Sin contained the lines:
-
15:53 - 15:56"Fill the cup, and fill the can
-
15:56 - 15:58"Have a rouse before the morn
-
15:58 - 16:04"Every moment dies a man
Every moment one is born." -
16:04 - 16:07So keen a statistician was Babbage
that he could not contain himself. -
16:07 - 16:09He dashed off a letter to Tennyson
-
16:09 - 16:12explaining that because of
population growth, -
16:12 - 16:14the line should read,
-
16:14 - 16:19"Every moment dies a man
and one and a 16th is born." -
16:19 - 16:22I may add that
the exact figure is 1.067, -
16:22 - 16:27but something must be
conceded to the laws of metre. -
16:32 - 16:37In the 19th century, scholars all
over Europe did amazing work -
16:37 - 16:39in measuring their societies.
-
16:39 - 16:43They were hoovering up
data on almost everything. -
16:43 - 16:46But numbers alone
don't tell you anything. -
16:46 - 16:51You have to analyse them,
and that's what makes statistics. -
16:56 - 16:59When the first statisticians
began to get to grips with -
16:59 - 17:00analysing their data
-
17:00 - 17:06they seized upon the average, and
they took the average of everything. -
17:10 - 17:14What's so great
about an average is that -
17:14 - 17:19you can take a whole mass of data
and reduce it to a single number. -
17:22 - 17:26And though each of us is unique,
our collective lives produce -
17:26 - 17:30averages that can
characterise whole populations. -
17:41 - 17:45I looked in my local newspaper
one week and saw a pensioner -
17:45 - 17:49had accidentally put her foot on
the accelerator -
17:49 - 17:53and crushed her friend
against a wall. -
17:53 - 17:56Devastating, hideous,
horrible thing to happen. -
17:56 - 18:01And then there was a second one about
a young man who didn't have -
18:01 - 18:07a driving licence, was driving a car
under the influence of drugs
and alcohol -
18:07 - 18:10and he bashed into a pedestrian
and killed him. -
18:10 - 18:16What's remarkable, absolutely
remarkable, if you look at the number -
18:16 - 18:23of people who die each year
in traffic crashes,
it's nearly a constant. -
18:23 - 18:24What?
-
18:24 - 18:32All these individual events,
somehow when you sum them all up
there's the same number every year. -
18:32 - 18:35And every year, two and a half
times as many men -
18:35 - 18:39die in traffic crashes
as women, and it's a constant. -
18:39 - 18:44And every year the rate in Belgium
is double the rate in England. -
18:44 - 18:47There are these
remarkable regularities. -
18:47 - 18:55So that these individual
particular events sum up
into a social phenomenon. -
18:57 - 18:58Let's see what Sweden have done.
-
18:58 - 19:02We used to boast about fast social
progress, that's where we were.... -
19:02 - 19:05'In my lectures, to tell stories
about the changing world, -
19:05 - 19:08'I use the averages
from entire countries, -
19:08 - 19:12'whether the average of income,
child mortality, family size -
19:12 - 19:13'or carbon output.'
-
19:13 - 19:16OK, I give you Singapore.
The year I was born, -
19:16 - 19:21Singapore had twice the child
mortality of Sweden, the most
tropical country in the world, -
19:21 - 19:23a marshland on
the Equator, and here we go. -
19:23 - 19:25It took a little time for them
to get independent, -
19:25 - 19:27but then they started to grow
their economy, -
19:27 - 19:30and they made the social investment,
they got away malaria, -
19:30 - 19:33they got a magnificent health system
that beat both US and Sweden. -
19:33 - 19:38We never thought it would happen
that they would win over Sweden! -
19:38 - 19:41LAUGHTER AND APPLAUSE
-
19:41 - 19:46But useful as averages are,
they don't tell you the whole story. -
19:49 - 19:53On average, Swedish people have
slightly less than two legs. -
19:53 - 19:58This is because few people
only have one leg or no legs, -
19:58 - 20:00and no-one has three legs.
-
20:00 - 20:06So almost everybody in Sweden
has more than
the average number of legs. -
20:06 - 20:11The variation in data is just
as important as the average. -
20:17 - 20:19But how do you get
a handle on variation? -
20:19 - 20:23For this, you transform
numbers into shapes. -
20:23 - 20:26Let's look again at the number of
adult women in Sweden -
20:26 - 20:28for different heights.
-
20:28 - 20:32Plotting the data as a shape
shows how much their heights -
20:32 - 20:36vary from the average
and how wide that variation is. -
20:36 - 20:42The shape a set of data makes
is called its distribution. -
20:42 - 20:46This is the income distribution
of China, 1970. -
20:46 - 20:51This is the income distribution
of the United States, 1970. -
20:51 - 20:54Almost no overlap,
and what has happened? -
20:54 - 20:57China is growing,
it's not so equal any longer, -
20:57 - 21:01and it's appearing here
overlooking the United States. -
21:01 - 21:03Almost like a ghost, isn't it?
-
21:03 - 21:05It's pretty scary.
-
21:05 - 21:07Rrrr!
-
21:07 - 21:08LAUGHTER
-
21:17 - 21:21The statisticians
who first explored distribution -
21:21 - 21:26discovered one shape
that turned up again and again. -
21:26 - 21:28The Victorian scholar
Francis Galton -
21:28 - 21:32was so fascinated he built
a machine that could reproduce it, -
21:32 - 21:36and he found it fitted so many
different sets of measurements -
21:36 - 21:39that he named it
the normal distribution. -
21:39 - 21:46Whether it was people's arm spans,
lung capacities, -
21:46 - 21:47or even their exam results,
-
21:47 - 21:51the normal distribution shape
recurred time and time again. -
21:51 - 21:56Other statisticians soon found
many other regular shapes, -
21:56 - 22:01each produced by particular kinds
of natural or social processes. -
22:01 - 22:05And every statistician
has their favourite. -
22:05 - 22:09The Poisson distribution, the Poisson
shape is my favourite distribution. -
22:09 - 22:11I think it's an absolute cracker.
-
22:16 - 22:19The Poisson shape
describes how likely it is -
22:19 - 22:22that out-of-the-ordinary things
will happen. -
22:22 - 22:25Imagine a London bus stop where
we know that on average -
22:25 - 22:26we'll get three buses in an hour.
-
22:26 - 22:29We won't always get
three buses, of course. -
22:29 - 22:33Amazingly, the Poisson shape will
show us the probability -
22:33 - 22:37that in any given hour we will get
four, five, or six buses, -
22:37 - 22:39or no buses at all.
-
22:41 - 22:43The exact shape changes
with the average. -
22:43 - 22:47But whether it's how many people
will win the lottery jackpot -
22:47 - 22:48each week,
-
22:48 - 22:51or how many people will phone
a call centre each minute, -
22:51 - 22:54the Poisson shape
will give the probabilities. -
22:57 - 23:01The wonderful example where this was
applied to in the late 19th century -
23:01 - 23:04was to count each year the number of
Prussian officers, -
23:04 - 23:08cavalry officers, who were kicked
to death by their horses. -
23:08 - 23:10Now, some years there were none,
some years there were one, -
23:10 - 23:14some years there were two,
up to seven, I think,
one particularly bad year. -
23:14 - 23:17But with this distribution,
however many years there were -
23:17 - 23:20with nought, one, two, three,
four Prussian cavalry officers -
23:20 - 23:24kicked to death by their horses,
beautifully obeyed
the Poisson distribution. -
23:43 - 23:49So statisticians use shapes to
reveal the patterns in the data. -
23:49 - 23:51But we also use images of all kinds
-
23:51 - 23:54to communicate statistics
to a wider public. -
23:54 - 23:57Because if the story in the numbers
-
23:57 - 24:03is told by a beautiful and clever
image, then everyone understands. -
24:03 - 24:10Of the pioneers
of statistical graphics,
my favourite is Florence Nightingale. -
24:24 - 24:27There are not many people who realise
that she was known -
24:27 - 24:31as a passionate statistician
and not just the Lady of the Lamp. -
24:31 - 24:35She said that "to understand God's
thoughts, we must study statistics, -
24:35 - 24:37"for these are
the measure of His purpose." -
24:37 - 24:41Statistics was for her a religious
duty and moral imperative. -
24:42 - 24:45When Florence was nine years old
she started collecting data. -
24:45 - 24:48Her data was different
fruits and vegetables she found. -
24:48 - 24:50Put them into different tables.
-
24:50 - 24:53Trying to organise them
in some standard form. -
24:53 - 24:56And so we have one of Nightingale's
first statistical tables -
24:56 - 24:57at the age of nine.
-
25:04 - 25:11In the mid 1850s Florence
Nightingale went to the Crimea to
care for British casualties of war. -
25:11 - 25:14She was horrified by
what she discovered. -
25:14 - 25:20For all the soldiers being blown
to bits on the battlefield,
there were many, many more soldiers -
25:20 - 25:25dying from diseases they caught
in the army's filthy hospitals. -
25:25 - 25:29So Florence Nightingale
began counting the dead. -
25:29 - 25:35For two years she recorded
mortality data in meticulous detail. -
25:35 - 25:39When the war was over she persuaded
the government to set up -
25:39 - 25:41a Royal Commission of Inquiry,
-
25:41 - 25:45and gathered her data
in a devastating report. -
25:45 - 25:48What has cemented her place in
the statistical history books -
25:48 - 25:50are the graphics she used.
-
25:50 - 25:54And one in particular,
the polar area graph. -
25:54 - 25:59For each month of the war,
a huge blue wedge represented -
25:59 - 26:02the soldiers who had died
from preventable diseases. -
26:02 - 26:06The much smaller red wedges were
deaths from wounds, -
26:06 - 26:11and the black wedges were deaths
from accidents and other causes. -
26:11 - 26:17Nightingale's graphics were so clear
they were impossible to ignore. -
26:17 - 26:19The usual thing around
Florence Nightingale's time -
26:19 - 26:24was just to produce tables and
tables of figures - absolutely
really tedious stuff that, -
26:24 - 26:26unless you're an absolutely dedicated
statistician, -
26:26 - 26:29it's really quite difficult to spot
the patterns quite naturally. -
26:29 - 26:33But visualisations, they tell a
story, they tell a story immediately. -
26:33 - 26:38And the use of colour
and the use of shape can
really tell a powerful story. -
26:38 - 26:41And nowadays of course
we can make things move as well. -
26:41 - 26:44Florence Nightingale would have
loved to have played with... -
26:44 - 26:49She would have
produced wonderful animations,
I'm absolutely certain of it. -
26:51 - 26:55Today, 150 years on,
Nightingale's graphics -
26:55 - 26:58are rightly regarded as a classic.
-
26:58 - 27:01They led to a revolution
in nursing, health care -
27:01 - 27:06and hygiene in hospitals worldwide,
which saved innumerable lives. -
27:07 - 27:11And statistical graphics has
become an art form of its very own, -
27:11 - 27:16led by designers who are
passionate about visualising data. -
27:25 - 27:27This is the Billion Pound-O-Gram.
-
27:27 - 27:29This image arose out of frustration
-
27:29 - 27:32with the reporting of billion pound
amounts in the media. -
27:32 - 27:34£500 billion pounds for this war.
-
27:34 - 27:36£50 billion for this oil spill.
-
27:36 - 27:39It doesn't make sense -
the numbers are too enormous
to get your mind round. -
27:39 - 27:44So I scraped all this data
from various news sources
and created this diagram. -
27:44 - 27:49So the
squares here are scaled according
to the billion pound amounts. -
27:49 - 27:52When you see numbers visualised
like this -
27:52 - 27:54you start to have a different
relationship with them. -
27:54 - 27:57You can start to see the patterns,
and the scale of them. -
27:57 - 28:00Here in the corner,
this little square - £37 billion. -
28:00 - 28:03This was the predicted cost
of the Iraq war in 2003. -
28:03 - 28:06As you can see it's grown
exponentially over the last few years -
28:06 - 28:11and the total cost now is
around about £2,500 billion. -
28:11 - 28:13It's funny because when
you visualise statistics -
28:13 - 28:15you understand them,
and when you understand them -
28:15 - 28:18you can really start to put things
in perspective. -
28:24 - 28:28Visualisation is right at
the heart of my own work too. -
28:28 - 28:30I teach global health.
-
28:30 - 28:34And I know having the data
is not enough - -
28:34 - 28:39I have to show it in ways people
both enjoy and understand. -
28:39 - 28:43Now I'm going to try something
I've never done before. -
28:43 - 28:46Animating the data in real space,
-
28:46 - 28:50with a bit of technical
assistance from the crew. -
28:50 - 28:52So here we go.
-
28:52 - 28:54First, an axis for health.
-
28:54 - 28:59Life expectancy
from 25 years to 75 years. -
28:59 - 29:01And down here an axis for wealth.
-
29:01 - 29:07Income per person -
400, 4,000, 40,000. -
29:07 - 29:10So down here is poor and sick.
-
29:10 - 29:14And up here is rich and healthy.
-
29:14 - 29:18Now I'm going to show you the world
-
29:18 - 29:21200 years ago, in 1810.
-
29:21 - 29:23Here come all the countries.
-
29:23 - 29:26Europe, brown;
Asia, red; Middle East, green; -
29:26 - 29:29Africa south of the Sahara,
blue; and the Americas, yellow. -
29:29 - 29:34And the size of the country bubble
shows the size of the population. -
29:34 - 29:38In 1810, it was pretty crowded
down there, wasn't it? -
29:38 - 29:40All countries were sick and poor.
-
29:40 - 29:43Life expectancy
was below 40 in all countries. -
29:43 - 29:49And only UK and the Netherlands were
slightly better off. But not much. -
29:49 - 29:53And now I start the world.
-
29:53 - 29:57The industrial revolution makes
countries in Europe and elsewhere -
29:57 - 29:59move away from the rest.
-
29:59 - 30:02But the colonized countries
in Asia and Africa, -
30:02 - 30:04they are stuck down there.
-
30:04 - 30:08And eventually the Western countries
get healthier and healthier. -
30:08 - 30:13And now we slow down to show
the impact of the First World War -
30:13 - 30:16and the Spanish flu epidemic.
-
30:16 - 30:18What a catastrophe!
-
30:18 - 30:23And now I speed up through
the 1920s and the 1930s and, -
30:23 - 30:24in spite of the Great Depression,
-
30:24 - 30:28Western countries forge on towards
greater wealth and health. -
30:28 - 30:30Japan and some others try to follow.
-
30:30 - 30:33But most countries stay down here.
-
30:33 - 30:36And after the tragedies
of the Second World War, -
30:36 - 30:39we stop a bit to look
at the world in 1948. -
30:39 - 30:421948 was a great year.
-
30:42 - 30:43The war was over,
-
30:43 - 30:48Sweden topped the medal table at
the Winter Olympics and I was born. -
30:48 - 30:51But the differences between
the countries of the world -
30:51 - 30:53was wider than ever.
-
30:53 - 30:55United States was in the front.
-
30:55 - 30:57Japan was catching up.
-
30:57 - 30:58Brazil was way behind,
-
30:58 - 31:03Iran was getting a little richer
from oil but still had short lives. -
31:03 - 31:05And the Asian giants...
-
31:05 - 31:09China, India, Pakistan, Bangladesh,
and Indonesia, -
31:09 - 31:11they were still
poor and sick down here. -
31:11 - 31:14But look what was about to happen!
Here we go again. -
31:14 - 31:19In my lifetime, former colonies
gained independence and then finally -
31:19 - 31:23they started to get healthier
and healthier and healthier. -
31:23 - 31:26And in the 1970s, then countries
in Asia and Latin America -
31:26 - 31:29started to catch up
with the Western countries. -
31:29 - 31:31They became the emerging economies.
-
31:31 - 31:33Some in Africa follows,
-
31:33 - 31:36some Africans were stuck in civil
war, and others were hit by HIV. -
31:36 - 31:42And now we can see the world
in the most up-to-date statistics. -
31:43 - 31:45Most people today
live in the middle. -
31:45 - 31:48But there is huge difference
at the same time -
31:48 - 31:52between the best-off countries
and the worst-off countries. -
31:52 - 31:55And there are also huge
inequalities within countries. -
31:55 - 31:59These bubbles show country averages
but I can split them. -
31:59 - 32:02Take China. I can split it
into provinces. -
32:02 - 32:05There goes Shanghai...
-
32:05 - 32:08It has the same health
and wealth as Italy today. -
32:08 - 32:11And there
is the poor inland province Guizhou, -
32:11 - 32:13it is like Pakistan.
-
32:13 - 32:19And if I split it further, the rural
parts are like Ghana in Africa. -
32:20 - 32:23And yet, despite the enormous
disparities today, -
32:23 - 32:27we have seen 200 years
of remarkable progress! -
32:27 - 32:32That huge historical gap between
the west and the rest is now closing. -
32:32 - 32:36We have become an entirely
new, converging world. -
32:36 - 32:38And I see a clear trend
into the future. -
32:38 - 32:41With aid, trade, green
technology and peace, -
32:41 - 32:44it's fully possible
that everyone can make it -
32:44 - 32:46to the healthy, wealthy corner.
-
32:48 - 32:51Well, what you've just seen
in the last few minutes -
32:51 - 32:57is a story of 200 countries
shown over 200 years and beyond. -
32:57 - 33:01It involved plotting
120,000 numbers. -
33:01 - 33:03Pretty neat, huh?
-
33:08 - 33:13So, with statistics, we can begin
to see things as they really are. -
33:13 - 33:18From tables of data to averages,
distributions and visualisations, -
33:18 - 33:23statistics gives us a
clear description of the world. -
33:23 - 33:28But, with statistics, we can
not only discover WHAT is happening -
33:28 - 33:31but also explore WHY,
-
33:31 - 33:34by using the powerful analytical
method - correlation. -
33:35 - 33:38Just looking at one thing at a
time doesn't tell you very much. -
33:38 - 33:41You've got to look at the
relationships between things, -
33:41 - 33:43how they change,
how they vary together. -
33:43 - 33:45That's what correlation is about.
-
33:45 - 33:48That's how you start trying
to understand the processes -
33:48 - 33:51that are really going on
in the world and society. -
33:52 - 33:57Most of us today would recognise
that crime correlates to poverty, -
33:57 - 34:00that infection correlates
to poor sanitation, -
34:00 - 34:03and that knowledge of statistics
correlates -
34:03 - 34:05to being great at dancing!
-
34:07 - 34:10Correlations can be very tricky.
-
34:10 - 34:13I got a joke about
silly correlations. -
34:13 - 34:16There was this American who
was afraid of heart attack. -
34:16 - 34:20He found out that
the Japanese ate very little fat -
34:20 - 34:22and almost didn't drink wine,
-
34:22 - 34:26but they had much less
heart attacks than the Americans. -
34:26 - 34:29But, on the other hand,
he also found out that the French -
34:29 - 34:35eat as much fat as the Americans
and they drink much more wine but
they also have less heart attacks. -
34:35 - 34:41So he concluded that what kills you
is speaking English. -
34:41 - 34:44# Smoke, smoke,
smoke that cigarette -
34:44 - 34:48# Puff, puff, puff and if you
smoke yourself to death... # -
34:48 - 34:52The time, the pace,
the cigarette. Weights Tilt. -
34:52 - 34:56The best example of a really
ground-breaking correlation -
34:56 - 35:02is the link that was established
in the 1950s between
smoking and lung cancer. -
35:02 - 35:07Not long after the Second World War,
a British doctor, Richard Doll, -
35:07 - 35:11investigated lung cancer patients
in 20 London hospitals. -
35:11 - 35:15And he became certain
that the only thing they had
in common was smoking. -
35:15 - 35:18So certain,
that he stopped smoking himself. -
35:18 - 35:22But other people weren't so sure.
-
35:22 - 35:25A lot of the discussion
of the early data, -
35:25 - 35:29linking smoking to lung cancer, said,
"It's not the smoking, surely, -
35:29 - 35:33"that thing we've done all our lives,
that can't be bad for you. -
35:33 - 35:35"Maybe it's genes.
-
35:35 - 35:39"Maybe people who are genetically
predisposed to get lung cancer -
35:39 - 35:44"are also genetically
predisposed to smoke." -
35:44 - 35:47"Maybe it's not the smoking,
maybe it's air pollution - -
35:47 - 35:53"that smokers are somehow
more exposed to air pollution
than non-smokers. -
35:53 - 35:56"Maybe it's not smoking,
maybe it's poverty." -
35:56 - 36:01So now we've got three alternative
explanations, apart from chance. -
36:02 - 36:07To verify his correlation
did imply cause and effect. -
36:07 - 36:11Richard Doll created the biggest
statistical study of smoking yet. -
36:11 - 36:15He began tracking the lives
of 40,000 British doctors, -
36:15 - 36:17some of whom smoked
and some of whom didn't, -
36:17 - 36:19and gathered enough data
-
36:19 - 36:22to correlate the amount
the doctors smoked -
36:22 - 36:25with their likelihood
of getting cancer. -
36:25 - 36:30Eventually, he not only
showed a correlation between
smoking and lung cancer, -
36:30 - 36:36but also a correlation
between stopping smoking
and reducing the risk. -
36:36 - 36:38This was science at its best.
-
36:40 - 36:44What correlations do not replace
is human thought. -
36:44 - 36:47You've got to think
about what it means. -
36:47 - 36:50What a good scientist does,
if he comes with a correlation, -
36:50 - 36:56is try as hard as she or he
possibly can to disprove it, -
36:56 - 37:00to break it down, to get rid of it,
to try and refute it. -
37:00 - 37:05And if it withstands
all those efforts at demolishing it -
37:05 - 37:11and it is still standing up then,
cautiously, you say, "We really
might have something here." -
37:27 - 37:33However brilliant the scientist,
data is still the oxygen of science. -
37:33 - 37:39The good news is that the more we
have, the more correlations we'll
find, the more theories we'll test, -
37:39 - 37:42and the more discoveries
we're likely to make. -
37:46 - 37:53And history shows how our total sum
of information grows in huge leaps
as we develop new technologies. -
37:53 - 38:00The invention of the
printing press kicked off the first
data and information explosion. -
38:00 - 38:06If you piled up all the books that
had been printed by the year 1700, -
38:06 - 38:11they would make 60 stacks
each as high as Mount Everest. -
38:13 - 38:15Then, starting in the 19th century,
-
38:15 - 38:20there came a second information
revolution with the telegraph, -
38:20 - 38:24gramophone and camera.
And later radio and TV. -
38:24 - 38:28The total amount
of information exploded. -
38:28 - 38:35And by the 1950s
the information available to us all
had multiplied 6,000 times. -
38:35 - 38:41Then, thanks to the computer and
later the internet, we went digital. -
38:41 - 38:47And the amount of data we have now
is unimaginably vast. -
38:50 - 38:55A single letter printed in a book
is equivalent to a byte of data. -
38:55 - 38:59A printed page
equals a kilobyte or two. -
39:02 - 39:06Five megabytes is enough for
the complete works of Shakespeare. -
39:08 - 39:1210 gigabytes - that's a DVD movie.
-
39:17 - 39:23Two terabytes
is the tens of millions of photos
added to Facebook every day. -
39:25 - 39:32Ten petabytes is the data recorded
every second by the world's
largest particle accelerator. -
39:32 - 39:36So much
only a tiny fraction is kept. -
39:36 - 39:43Six exabytes is what you'd have
if you sequenced the genomes
of every single person on Earth. -
39:49 - 39:51But really, that's nothing.
-
39:51 - 39:55In 2009, the internet
added up to 500 exabytes. -
39:55 - 40:02In 2010, in just one year, that will
double to more than one zettabyte! -
40:06 - 40:14Back in the real world, if we
turned all this data into print
it would make 90 stacks of books, -
40:14 - 40:19each reaching from here
all the way to the sun! -
40:19 - 40:24The data deluge is staggering,
but, with today's computers -
40:24 - 40:28and statistics,
I'm confident we can handle it. -
40:28 - 40:31When it comes to all the data
on the internet, -
40:31 - 40:34the powerhouse
of statistical analysis -
40:34 - 40:38is the Silicon Valley giant Google.
-
40:44 - 40:51The average person over their
lifetime is exposed to about 100
million words of conversation. -
40:51 - 40:55And so if you multiple that by the
six billion people on the planet, -
40:55 - 40:58that amount of words is about
equal to the number of words -
40:58 - 41:01that Google has available
at any one instant in time. -
41:03 - 41:09Google's computers hoover up
and file away every document,
web page, and image they can find. -
41:09 - 41:15They then hunt for patterns and
correlations in all this data, -
41:15 - 41:18doing statistics on a massive scale.
-
41:18 - 41:26And, for me, Google has one project
that's particularly exciting -
statistical language translation. -
41:26 - 41:31We wanted to provide access
to all the web's information,
no matter what language you spoke. -
41:31 - 41:34There's just so much information
on the internet, -
41:34 - 41:38you couldn't hope to translate it all
by hand into every possible language. -
41:38 - 41:42We figured we'd have to be able
to do machine translation. -
41:44 - 41:47In the past, programmers
tried to teach their computers -
41:47 - 41:53to see each language as a set of
grammatical rules - much like the
way languages are taught at school. -
41:53 - 41:59But this didn't work because no set
of rules could capture a language -
41:59 - 42:01in all its subtlety and ambiguity.
-
42:01 - 42:06"Having eaten our lunch
the coach departed." -
42:06 - 42:08Well, that's obviously incorrect.
-
42:08 - 42:12Written like that it would imply
that the coach has eaten the lunch. -
42:12 - 42:15It would be far better to say...
-
42:15 - 42:20"having eaten our lunch
we departed in the coach." -
42:20 - 42:26Those rules are helpful and they are
useful most of time, but they don't
turn out to be true all the time. -
42:26 - 42:30And the insight of using statistical
machine translation is saying, -
42:30 - 42:35"If you've got to have all these
exceptions anyways, maybe you can get
by without having any of the rules. -
42:35 - 42:39"Maybe you can treat everything
as an exception." And that's
essentially what we've done. -
42:49 - 42:53What the computer is doing when
he's learning how to translate -
42:53 - 42:55is to learn correlations
between words -
42:55 - 42:57and correlations between phrases.
-
42:57 - 43:01So we feed the system very large
amounts of data -
43:01 - 43:05and then the system is seeing that
a certain word or a certain phrase -
43:05 - 43:08correlates very often
to the other language. -
43:10 - 43:16Google's website currently
offers translation between
any of 57 different languages. -
43:16 - 43:23It does this purely statistically,
having correlated a huge collection
of multilingual texts. -
43:23 - 43:26The people that built the system
don't need to know Chinese -
43:26 - 43:30in order to build the
Chinese-to-English system,
or they don't need to know Arabic. -
43:30 - 43:33But the expertise that's needed is
basically knowledge of statistics, -
43:33 - 43:36knowledge of computer science,
knowledge of infrastructure -
43:36 - 43:41to build those very large
computational systems
that we are building for doing that. -
43:43 - 43:48I hooked up with Google
from my office in Stockholm
to try the translator for myself. -
43:48 - 43:52'I will type...
some Swedish sentences.' -
43:52 - 43:53OK.
-
43:53 - 43:55Sveriges...
-
43:55 - 43:59..guldring i orat.
-
44:01 - 44:07OK. So it says, "Sweden's finance
minister has a ponytail
and a gold ring in your ear." -
44:07 - 44:12I guess it probably means
in his ear. 'That's exactly
correct, it's amazing! -
44:12 - 44:15'He comes from the Conservative
party, that's the kind
of Sweden we have today. -
44:15 - 44:19'I will type one more sentence.'
-
44:19 - 44:22'I sitt samkonade...'
-
44:22 - 44:26partnerskap...
-
44:26 - 44:28nya biskop.
-
44:28 - 44:35"In his same-sex partnership
has Stockholm's new bishop
and his partners a three-year son." -
44:35 - 44:38It's almost perfect,
there's one important thing - -
44:38 - 44:42it's HER,
it's a lesbian partnership. -
44:42 - 44:47OK, so those kinds of words his
and her are one of the challenges -
44:47 - 44:49in translation
to get really those right. -
44:49 - 44:52Especially when it comes
to bishops one can excuse it! -
44:52 - 44:54'Right, right.'
-
44:54 - 44:59I guess more often than not
it would probably be a "his".
'I will write one more sentence.' -
44:59 - 45:02Nar Sverige deltar
I olympiader ar malet -
45:02 - 45:04'inte att vinna
utan att sla Norge.' -
45:06 - 45:12OK. "When Sweden is taking part
in Olympic goal is not
to win but to beat Norway." -
45:12 - 45:14'Yes! This is what it is!
-
45:14 - 45:18'But they are very good
in Winter Olympics, so we
can't make it, but we are trying.' -
45:18 - 45:20Ah, very good, very good.
-
45:20 - 45:25'This is absolutely amazing, you
know, and I was especially impressed -
45:25 - 45:31'that it picks up words like
"same-sex partnership"
which are very new to the language." -
45:31 - 45:37'The translator is good, but
if they succeed with what's next,
that'll be remarkable.' -
45:37 - 45:38One of the exciting possibilities
-
45:38 - 45:43is combining the machine
translation technology with
the speech recognition technology. -
45:43 - 45:45Now, both of these
are statistical in nature. -
45:45 - 45:51The machine translation relies
on the statistics of mapping
from one language to another, -
45:51 - 45:58and similarly speech recognition
relies on the statistics of mapping
from a sound form to the words. -
45:58 - 46:00When we put them together,
-
46:00 - 46:03now we have the capability
of having instant conversation -
46:03 - 46:07between two people
that don't speak a common language. -
46:07 - 46:09I can talk to you in my language,
-
46:09 - 46:12you hear me in your language
and you can answer back. -
46:12 - 46:15And in real time we can
make that translation, -
46:15 - 46:19we can bring two people together
and allow them to speak. -
46:31 - 46:39The internet is just one
of many technologies created
to gather massive amounts of data. -
46:39 - 46:44Scientists studying
our earth and our environment -
46:44 - 46:47now use an incredible range
of instruments -
46:47 - 46:51to measure the processes
of our planet. -
46:53 - 47:00All around us are sensors
continuously measuring temperature,
water flow, and ocean currents. -
47:00 - 47:07And high in orbit are satellites
busy imaging cloud formations,
forest growth and snow cover. -
47:07 - 47:11Scientists speak
of "instrumenting the earth". -
47:13 - 47:20And pointing up to the skies
above are powerful new telescopes
mapping the universe. -
47:30 - 47:35What's happening in astronomy
is typical of how profoundly -
47:35 - 47:40this new torrent of data
is transforming science. -
47:40 - 47:45Astronomers are now addressing many
enduring mysteries of the cosmos -
47:45 - 47:50by applying statistical methods
to all this new data. -
48:00 - 48:03The galaxy is a very big place and
it's got billions of stars in it, -
48:03 - 48:09and so to put together a coherent
picture of the whole galaxy requires
having an enormous amount of data. -
48:09 - 48:14And before you could do
a large sky survey with
sensitive, digital detectors -
48:14 - 48:17that meant that you could map many,
many stars all at once, -
48:17 - 48:21it was very difficult to build up
enough data on enough of the galaxy. -
48:25 - 48:29In the past, large surveys
of the night sky had to be done -
48:29 - 48:32by exposing thousands
of large photographic plates. -
48:32 - 48:37But these surveys could take
25 years or more to complete. -
48:39 - 48:45Then, in the 1990s, came digital
astronomy and a huge increase -
48:45 - 48:50in both the amount
and the accessibility of data. -
48:50 - 48:56The Sloan Sky Survey
is the world's biggest yet,
using a massive digital sensor -
48:56 - 49:01mounted on the back
of a custom-built telescope
in New Mexico. -
49:01 - 49:05It's scanned the sky night
after night for eight years, -
49:05 - 49:10building up a composite picture
in unprecedented resolution. -
49:10 - 49:15The Sloan is some of the best,
deepest survey data
that we have in astronomy. -
49:15 - 49:19Both on our own galaxy and
on galaxies further away from ours. -
49:24 - 49:27All the Sloan data
is on the internet, -
49:27 - 49:34and with it astronomers
have identified millions of hitherto
unknown stars and galaxies. -
49:34 - 49:37They also comb the database
for statistical patterns -
49:37 - 49:43which will prove, disprove,
or even suggest new theories. -
49:43 - 49:49So we have this idea that galaxies
grow, they become large galaxies like
the one we live in, the milky way, -
49:49 - 49:56not all at once, or not smoothly,
but by continuously incorporating, -
49:56 - 49:59basically cannibalising,
smaller galaxies. -
49:59 - 50:04They dissolve them
and they become part
of the bigger galaxy as it grows. -
50:06 - 50:13It's a startling idea,
and, in the Sloan data,
is the evidence to support it. -
50:13 - 50:16Groups of stars that came
from cannibalised galaxies -
50:16 - 50:21stand out in the Sloan data
as statistically different
from other stars -
50:21 - 50:24because they move
at a different velocity. -
50:24 - 50:29Each big spike
on one of these distribution graphs -
50:29 - 50:35means Professor Rockosi has found
a group of stars all travelling
in a different way to the rest. -
50:35 - 50:38They are the telltale
patterns she's looking for. -
50:40 - 50:45The evidence is accumulating
that, in fact, this really is
how galaxies grow, -
50:45 - 50:47or an important way
in which how galaxies grow. -
50:47 - 50:53And so this is an important part
of understanding how galaxies form,
not only ours but every galaxy. -
50:56 - 51:00The more data there is,
the more discoveries can be made. -
51:00 - 51:03And the technology
is getting better all the time. -
51:03 - 51:08The next big survey telescope
starts its work in 2015. -
51:08 - 51:11It will leave Sloan in the dust!
-
51:11 - 51:16Sloan has taken eight years to cover
one quarter of the night sky. -
51:18 - 51:26The new telescope will scan
the entire sky, in even greater
resolution, every three days! -
51:34 - 51:41The vast amounts of data
we have today allows researchers
in all sorts of fields -
51:41 - 51:46to test their theories
on a previously unimaginable scale. -
51:46 - 51:54But more than this,
it may even change
the fundamental way science is done. -
51:54 - 51:59With the power of today's computers
applied to all this data, -
51:59 - 52:04the machines might even be able
to guide the researchers. -
52:15 - 52:18We're at a potentially
profoundly important -
52:18 - 52:23and potentially one of the most
significant points in science, -
52:23 - 52:25and certainly one of
the most exciting, -
52:25 - 52:32where the potential to transform
not just how scientists do science
but even what science is possible. -
52:32 - 52:35And what will power
that transformation -
52:35 - 52:38of both how science is done
and even what science is possible -
52:38 - 52:40is going to be computation.
-
52:42 - 52:49Many of the dynamics of the natural
world, like the interplay between
the rainforests and the atmosphere, -
52:49 - 52:54are so complex that we don't
as yet really understand them. -
52:54 - 52:59But now computers are generating
literally tens of thousands
of different simulations -
52:59 - 53:03of how these
biological systems might work. -
53:03 - 53:08It's like creating thousands
of hypothetical parallel worlds. -
53:08 - 53:11Each and every one
of these simulations -
53:11 - 53:18is analysed with statistics
to see if any are a good match
for what is observed in nature. -
53:18 - 53:22The computers can now
automatically generate, -
53:22 - 53:26test and discard hypotheses
with scarcely a human in sight. -
53:28 - 53:35This new application of statistics
will become absolutely vital
for the future of science. -
53:35 - 53:39It's creating a new paradigm,
if you like, -
53:39 - 53:43in science, in the way
in which we can do science, -
53:43 - 53:45which is increasingly...
-
53:45 - 53:51Which one might characterise as...
data-centric or data driven -
53:51 - 53:55rather than being hypothesis-driven
or experimentally-driven. -
53:55 - 53:58So, it's exciting times
in terms of the science, -
53:58 - 54:02in terms of the computation
and in terms of the statistics. -
54:09 - 54:15Now, if all that sounds a bit
abstract and theoretical to you,
how about one final frontier? -
54:15 - 54:19Could statistics even make
sense of your feelings? -
54:21 - 54:26In California - where else? -
one computer scientist -
54:26 - 54:33is harvesting the internet to try
to divine the patterns of our
innermost thoughts and emotions. -
54:45 - 54:46This is the madness movement.
-
54:46 - 54:51The madness movement represents
a skyscraper view of the world. -
54:51 - 54:55Each of these brightly coloured dots
is an individual feeling -
54:55 - 54:59expressed by someone out there
in a blog or a tweet. -
54:59 - 55:04And when you click on the dot
it explodes to reveal the
underlying feeling of that person. -
55:04 - 55:07This is what people say
they're feeling today. -
55:08 - 55:10Better...safe...
-
55:10 - 55:12crappy...
-
55:12 - 55:15well...
-
55:15 - 55:18pretty...special...
-
55:18 - 55:21sorry...alone...
-
55:26 - 55:29So, every minute, We Feel Fine
crawls the world's blogs, -
55:29 - 55:34takes all the sentences
that start with the words
"I feel" or "I am feeling", -
55:34 - 55:36and puts them in a database.
-
55:36 - 55:40We collect all the feelings
and we count the most common. -
55:40 - 55:43They are better...bad...
-
55:43 - 55:46good...right...
-
55:46 - 55:49guilty...sick...
-
55:49 - 55:52the same...like shit...
-
55:52 - 55:55sorry...well...
-
55:55 - 55:56and so on.
-
55:58 - 56:02And we can take a look at any
one feeling and analyse it. -
56:02 - 56:05Right now a lot of people
are feeling happy. -
56:05 - 56:11We can take a look at all the
people who are happy and break it
down by age, gender or location. -
56:11 - 56:17Since bloggers have public profiles
we have that information and
so we can ask questions like, -
56:17 - 56:21"Are women happier than men?"
or, "Is England happier
than the United States?" -
56:30 - 56:33We find that, as people get older,
they get happier. -
56:33 - 56:41And, moreover, we find that
for younger people they associate
happiness more with excitement, -
56:41 - 56:47and, as people get older,
they associate happiness
more with peacefulness. -
56:51 - 56:58And we also find that women feel
loved more often than men,
but also more guilty. -
56:58 - 57:02While men feel good more often
than women, but also more alone. -
57:07 - 57:12As people lead more and
more of their lives online,
they leave behind digital traces, -
57:12 - 57:20and with these digital traces
we can begin to statistically analyse
what it means to be human. -
57:51 - 57:54So where does all of this leave us?
-
57:54 - 58:00We generate unimaginable
quantities of data
about everything you can think of. -
58:00 - 58:03We analyse it to reveal
the patterns. -
58:03 - 58:10And now not only experts
but all of us can understand
the stories in the numbers. -
58:18 - 58:21Instead of being
led astray by prejudice, -
58:21 - 58:28with statistics at our fingertips,
our eyes can be open
for a fact-based view of the world. -
58:28 - 58:34So, more than ever before, we can
become authors of our own destiny. -
58:34 - 58:37And that's pretty
exciting isn't it?! -
58:38 - 58:44# 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11,
12, 13, 14, 15, 16, 17, 18, 19, 20 -
58:44 - 58:51# 1, 22, 3, 24, 25, 26, 27, 28, 9,
30, 31, 32, 3, 34, 35, 36, 7 -
58:51 - 58:54# 38, 39, 40, 41, 42, 3,
44, 45, 46, 47 -
58:54 - 58:59LYRICS DEGENERATE INTO GIBBERISH
-
59:09 - 59:13GIBBERISH DEGENERATES INTO NOISE
-
59:13 - 59:14# 100. #
- Title:
- The Joy of Stats
- Description:
-
Documentary which takes viewers on a rollercoaster ride through the wonderful world of statistics to explore the remarkable power they have to change our understanding of the world, presented by superstar boffin Professor Hans Rosling, whose eye-opening, mind-expanding and funny online lectures have made him an international internet legend.
- Video Language:
- English
- Duration:
- 59:13
ettorerizza edited English, British subtitles for The Joy of Stats | ||
Adam Biernacki edited English, British subtitles for The Joy of Stats | ||
Adam Biernacki added a translation |