The Joy of Stats
-
Not SyncedThe world we live in is awashed with data
-
Not Syncedthat comes pouring in from everywhere around us.
-
Not SyncedOn it own, this data is just noise and confusion.
-
Not SyncedTo make sense of data, to find the meaning in it,
-
Not Syncedwe need a powerful branch of science: statistics.
-
Not SyncedBelieve me, there's nothing boring about statistics
-
Not Syncedespecially not today, when we can make the data sing.
-
Not SyncedWith statistics we can really make sense of the world.
-
Not SyncedAre statistics, the data diluge as it's been called,
-
Not Syncedleading us to a greater understanding
-
Not Syncedof the life on Earth and the world beyond?
-
Not SyncedThanks to incredible power of today's computers
-
Not Syncedit may fundamentally transform the process of scientific discovery.
-
Not SyncedI kid you not, statistics is now the sexiest subject around.
-
Not SyncedDid you know that there's is one million boats in Sweden?
-
Not SyncedThat's one boat per nine people.
-
Not SyncedIt's the highest number of boats per person in Europe.
-
Not SyncedBeing statistician, you don't like telling your profession at dinner parties,
-
Not Syncedbut really, statisticians shouldn't be shy
-
Not Syncedbecause they always want to understand what's going on.
-
Not SyncedStastistics gives us a persperctive of the world we live in
-
Not Syncedthat we can't get in any other way.
-
Not SyncedStatistics tells us whether the things we think and believe are actually true.
-
Not SyncedStatistics are far more useful than we usually like to admit.
-
Not SyncedIn the last recession, there was this famous call into Talk Radio Station.
-
Not SyncedThe man complained: "in times like this, when unemployment rates are up to 13%,
-
Not Syncedand income has fallen by 5%, and suicide rates are climbing,
-
Not SyncedI get so angry that the government is wasting money on things like correctional statistics."
-
Not SyncedI'm not oficially a statistician, strictly speaking my field is global health.
-
Not SyncedBut I got really obsessed with stats, when I realised how many people in Sweden
-
Not Synceddon't know anything about the rest of the world.
-
Not SyncedI started in our Medical University in Karolinksa Institute,
-
Not Syncedan ungraduate course called Global Health.
-
Not SyncedThese students coming to us have actually the highest grades you can get in theSwedish college system.
-
Not SyncedSo I thought maybe they know everything I'm going to teach them.
-
Not SyncedSo I did a pre-test when they came.
-
Not SyncedOne of the questions, from which I learnt a lot, was:
-
Not SyncedWhich country has the highest child mortality of these five pairs?
-
Not SyncedI won't put you at test here, but it's Turkey which is higher there,
-
Not SyncedPoland, Russia, Pakistan and South Africa.
-
Not SyncedAnd these were the results of the Swedish students.
-
Not Synced1.8 answers right out of 5 possible,
-
Not Syncedthat means that there was a place for a professor in International Health
-
Not Syncedand for my course.
-
Not SyncedBut one late night when I was compiling my report
-
Not SyncedI really realise my discovery.
-
Not SyncedI have shown that Swedish top students know
-
Not Syncedstatistically significantly less about the world than the chimpanzees.
-
Not SyncedBeacuse the chimpanzee would score half right.
-
Not SyncedIf I gave them two bananas with Sri Lanka and Turkey
-
Not Syncedthey would be right half of the cases.
-
Not SyncedBut the students are not there.
-
Not SyncedI did also an unethical study of the professors of the Karolinska Institute
-
Not Syncedthat hands out the Nobel Prize in Medicine, and they aren't on par with the chimpanzee.
-
Not SyncedToday, there's more information accesible than ever before,
-
Not Syncedand I work with my team at the Gapminder Foundation
-
Not Syncedusing new tools that help everyone make sense of the changing world.
-
Not SyncedWe draw on the masses of data that are now free available
-
Not Syncedfrom international institutions like the UN and the World Bank.
-
Not SyncedIt's become my mission to share my insights from this data
-
Not Syncedwith anyone who listen, and to reveal how statistics
is nothing to be frightened of. -
Not SyncedI'm going to provide you a view
of the global health situation across mankind, -
Not Syncedand I'm going to do that in a hopefully enjoyable way.
So relax. -
Not SyncedWe did this software which displays it like this,
-
Not Syncedevery bubble here is a country, this is China, this is India.
-
Not SyncedThe size of the bubble is the population.
-
Not SyncedAnd I'm going to stage a race here
-
Not Syncedbetween this sort of yellow Ford here, and the red Toyota down there,
-
Not Syncedand the brownish Volvo.
-
Not SyncedThe Toyota has a very bad start down here,
-
Not Syncedand United States' Ford is going off road there,
-
Not Syncedand the Volvo is doing quite fine, this is the war,
-
Not Syncedthey Toyota got off crack, now Toyota is coming on the healthier side of Sweden.
-
Not SyncedThat's the point when I sold the Volvo and bought the Toyota.
-
Not SyncedThis is the Great Leap Forward when China fell down,
-
Not Syncedit was central planning by Mao Tse Tung,
-
Not SyncedChina recovered and said "never more stupid central planning", but they went up here.
-
Not SyncedNo, there was one more inequity, look there! United States!
-
Not SyncedOh, they broke my frame!
-
Not SyncedWashington D.C. is so rich over there, but it's not as healthy as Kerala, India.
-
Not SyncedIt's quite interesting, isn't it?
-
Not SyncedWelcome to the USA, world leaders in big cars
-
Not Syncedand free data.
-
Not SyncedThere are many here who share my vision
-
Not Syncedof making public data accesible and useful for everyone.
-
Not SyncedThe city of San Francisco is in the lead, opening up it's data on everything.
-
Not SyncedEven the Police Dept. is releasing all it's crime reports.
-
Not SyncedThis official crime data has been turned into a wonderful inteactive map
-
Not Syncedby two of the cities computer whizzes.
-
Not SyncedIt's community statistics in action.
-
Not SyncedCrimespotting is a map of crime reports
-
Not Syncedfrom the San Francisco Police Dept.
-
Not Syncedshowing dots on maps for citizens to be able to see patterns of crime
-
Not Syncedin their neighbourhoods in San Francisco.
-
Not SyncedThe map is not just about individual crimes
-
Not Syncedbut about broader patterns that show
you where crime is clustered around the city, -
Not Syncedwhich have high crime,
which areas have relatively low crime. -
Not SyncedWe're here at top of Jones Street, on uphill,
quite a nice neighbourhood -
Not Syncedwhat the crime maps show us is the relationship between typography and crime.
-
Not SyncedThe higher up the hill, the less crime there is.
-
Not SyncedWe crossed over the border into the flats.
-
Not SyncedEssentially, as soon as you get into the kind of lower line areas of Jones street,
-
Not Syncedthe crime just skyrockets.
-
Not SyncedSo we're in the uptown Tenderloin District,
-
Not Syncedit's one of the oldest and most dangerous
neighbourhoods in San Francisco. -
Not SyncedThis is where you go to buy drugs,
right around here. -
Not SyncedYou see lots of aggreviated assault,
lots of thefts. -
Not SyncedBasically, the huge part of the crime of the city
happens right in these four or six block areas. -
Not SyncedIf you've been hearing police sirens
in your neighbourhood, -
Not Syncedyou can use the map to find out why.
-
Not SyncedIf you are out at night in an unfamiliar part of town
-
Not Syncedyou can check the map for streets to avoid.
-
Not SyncedIf a neighbour gets burglared, you can see,
-
Not Syncedis it the one off or has there been a spike in local crime?
-
Not SyncedIf you commute through a neighbourhood and you're worried about its safety
-
Not Syncedthe fact that we have the ability to turn off all the night time and middle-of-the-day crimes
-
Not Syncedand show you just the things that are happening during your commute,
-
Not Syncedis a statistical operation but I think to the people
that are interacting with the thing -
Not Syncedit feels very much more like
they just are sort of browsing a website -
Not Syncedor shopping on Amazon.
They're looking at data, -
Not Syncedand they don't realise that they're doing statistics.
-
Not SyncedWhat's most exciting for me is that public statistics
-
Not Syncedis making citizens more powerful
and the authorities more accountable. -
Not SyncedWe have community meetings that the police attend
-
Not Syncedand what citizens are now doing,
they're bringing printouts of the maps -
Not Syncedto show where crimes are taking place,
-
Not Syncedand they're demanding services
from the police department, -
Not Syncedwhich is now having to change how they please,
-
Not Syncedhow they provide policing services,
-
Not Syncedbecause the data is showing
what is working and what is not. -
Not SyncedPeople in San Francisco are also using public data
-
Not Syncedto map social inequalities,
and see how to improve society -
Not Syncedand the possibilities are endless.
-
Not SyncedOur dream would be that the government announced that
-
Not Syncedthis data project would really focus on live information
-
Not Syncedon stuff that was being reported
and pushed out into the world as it was happening. -
Not SyncedTrash pickup, traffic accidents, buses,
-
Not Syncedand through the kind of the stats gathering power on the internet
-
Not Syncedit's posible to really see the workings of the city
-
Not Synceddisplayed as a unified interface.
-
Not SyncedThat's where we are heading,
-
Not Syncedtowards a world of free data
with all the statistical insights that come from it -
Not Syncedaccesible to everyone, empowering us as citizens
-
Not Syncedand letting hold our rulers to account.
-
Not SyncedIt's a long way from where statistics began.
-
Not SyncedStatistics are essential to monitor our government in our societies.
-
Not SyncedBut, it was our rulers out there who started the collection of statistics
-
Not Syncedin first place in order to monitor us.
-
Not SyncedIn fact the word statistics comes from state.
-
Not SyncedModern statistics began two centuries ago.
-
Not SyncedOnce it got going it spread and never stopped.
-
Not SyncedAnd guess who was first.
-
Not SyncedThe Chinese have Confucious,
the Italians have Da Vinci, -
Not Syncedand the British have Shakespeare, and we have the Tabellverket
-
Not Syncedthe first ever systematic collection of statistics.
-
Not SyncedSince the year 1749 we have collected data on every birth, marriage and death
-
Not Syncedand we are proud of it.
-
Not SyncedThe Tabellverket recorded information from every parish in Sweden.
-
Not SyncedIt was a huge quantity of data
and it was the first time any goverment -
Not Syncedcould get any accurate picture of its people.
-
Not SyncedSweden had been the greatest military power in Northern Europe
-
Not Syncedbut by 1749 our star was really fading and other countries were growing stronger.
-
Not SyncedAt least though, we were a large power, thought to have 20 million people
-
Not Syncedenough to rival Britain and France.
-
Not SyncedBut we were in for a nasty surprise.
-
Not SyncedThe first analysis of Tabellverket revealed that Sweden only had 2 million inhabitants.
-
Not SyncedSweden was not only a power in decline, it also had a very small popoulation.
-
Not SyncedThe government was horrified by this finding.
-
Not SyncedWhat if the enemy found out?
-
Not SyncedBut the Tabellverket also showed that many women die in childbirth.
-
Not SyncedAnd many children died young, and government took action to improve the health of the people.
-
Not SyncedThat was the beginning of modern Sweden.
-
Not SyncedIt took more than 50 years before the Austrians, Belgiums, Danes, Dutch,
-
Not SyncedGermans, Italians and finally the British caught up with Sweden
-
Not Syncedin collecting and using statistics.
-
Not SyncedIt was called political arithmethic,
and it was a lovely phrase as use for statistics. -
Not SyncedGovernments could have much more control
and understanding of the society -
Not Syncedhow it's working, how it's developing,
-
Not Syncedand essentially, so they could control it better.
-
Not SyncedIt wasn't just governments
who woke up to the power of statistics. -
Not SyncedRight across Europe, 19th century society went mad for facts.
-
Not SyncedAnd despite its late start, Britain with its Royal Statistical Society in London
-
Not Syncedwas soon a statisticians' nirvana.
-
Not SyncedI love looking at old copies of the Royal Statistical Society,
-
Not Syncedbecause is full of this stuff.
-
Not SyncedThere's a wonderful paper from the 1840s
-
Not Syncedwhich shows a map of England
and the rates of bastardy of each county -
Not Synced
-
Not Syncedso you can identify very quickly the areas
with high areas of bastardy. -
Not SyncedBeing in East Anglia makes me slightly laugh
-
Not Syncedthat Norfolk was on top of the bastardy league in the 1840s.
-
Not SyncedOne of the founders of the Royal Statistical Society
-
Not Syncedwas the great victorian mathematician and inventor Charles Babbage.
-
Not SyncedIn 1842 he read the latest poem
by a equally great victorian -
Not SyncedAlfred Tennyson.
-
Not Synced"Vision of Sin" contained the lines:
-
Not Synced"Fill the cup and fill the can,
Have a rouse before the morn. -
Not SyncedEvery moment dies a man,
Every moment one is born." -
Not SyncedSo keen statistician was Babbage
that he could not contain himself. -
Not SyncedHe dashed a letter to Tennyson
explaining that because of population growth -
Not Syncedthe line should read:
-
Not Synced"Every moment dies a man,
And 11/16 is born." -
Not Synced"I may add that the exact figure is 1.167
-
Not Syncedbut something must be conceded
to the laws of metre." -
Not SyncedIn the 19th century scholars all over Europe
-
Not Synceddid an amazing work in measuring the societies.
-
Not SyncedThey hovered up data in almost everything
-
Not Syncedbut numbers alone don't tell you anything
-
Not Syncedyou have to analyse them, and that's what makes statistics.
-
Not SyncedWhen the first statisticians began
to get to grips with analysing their data -
Not Syncedthey seized upon the average,
and they took the average of everything. -
Not SyncedWhat's so great about an average
-
Not Syncedis that you can take a whole mass of data and reduce it to a single number.
-
Not SyncedThough each of us is unique,
our collective lives produce averages -
Not Syncedthat characterise whole populations.
-
Not SyncedI look to my local newspaper one week
-
Not Syncedand saw that a pensioner had accidently
put a foot on the accelerator -
Not Syncedand crashed her friend against the wall.
-
Not SyncedDevastating, hideous, horrible thing to happen.
-
Not SyncedAnd there was a second one about a young man
who didn't have a driving licence -
Not Syncedwho was driving a car under the influence
of drugs and alcohol -
Not Syncedand crashed into a pedestrian and killed him.
-
Not SyncedWhat is remarkable,
absolutely remarkable, -
Not Syncedif you look at the number of people
who die each year -
Not Syncedin traffic accidents,
it's nearly a constant. -
Not SyncedWhat?
-
Not SyncedAll these individual events,
somehow when you sum them all up -
Not Syncedit's the same number every year,
-
Not Syncedand every year two and a half times
as many men die -
Not Syncedin traffic accidents as women,
and it's a constant. -
Not SyncedAn every year the rate in Belgium is double
-
Not Syncedthe rate in England,
there are these remarkable regularities -
Not Syncedso that these individual particular events
sum up into a social phenomenon. -
Not Synced(Lecture) Let's see what Sweden has done
-
Not Syncedwe used to boast of fast social progress.
-
Not Synced(Narration) In my lectures,
to tell stories about the changing world -
Not SyncedI use averages for entire countries,
whether the average for income, -
Not Syncedchild mortality, family size or carbon output.
-
Not Synced(Lecture) OK, I give you Singapore,
the year I was born. -
Not SyncedSingapore had twice the child mortality of Sweden.
-
Not SyncedThe most tropical country in the world.
A marshland on the Equator. -
Not SyncedAnd here we go. It took a little time for them to get independence
-
Not Syncedbut they started to grow their economy,
and they made the social investments, -
Not Syncedthey got away malaria,
they got a magnificient health system -
Not Syncedthat beats both UkKs and Sweden's.
-
Not SyncedWe thought it would never happened
but they would win over Sweden! -
Not SyncedBut useful as averages are
they don't tell you the whole story. -
Not SyncedOn average, Swedish people have slightly
less than two legs. -
Not SyncedThat is because a few people have one leg
or no legs, and no one has three legs -
Not Syncedso almost everybody in Sweden
has more than the average number of legs. -
Not SyncedThe variation in data is just
as important as the average. -
Not SyncedBut how do you get the handle on variation?
-
Not SyncedFor this you transform numbers into shapes.
-
Not SyncedLet's llok again at the number of adult women
in Sweden for different heights. -
Not SyncedPlotting the data as a shape shows us
how much their heights vary from the average -
Not Syncedand how wide that variation is.
-
Not SyncedThe shape a set of data makes
is called its distribution. -
Not Synced(Lecture) This is the income distribution
of China 1970 -
Not SyncedThis is the income distribution
of the United States 1970. -
Not SyncedAlmost no overlap. And what has happened?
-
Not SyncedChina is growing. It's not so equal any longer.
-
Not SyncedAnd it's appearing here,
overlooking the United States -
Not Syncedalmost like a ghost, isn't it? It's scary!
-
Not SyncedThat statistician who first explored distribution
-
Not Synceddiscovered one shape that turned up
again and again -
Not Syncedthe victorian scholar Francis Goldtone
was so fascinated -
Not Syncedhe built a machine that could reproduce it
-
Not Syncedand he found it fitted so many different
sets of measurements -
Not Syncedthat he named it the Normal Distribution.
-
Not SyncedWhether it was people's arm spans, land capacity or even their exam results
-
Not Syncedthe Normal Distribution shape recurred
time and time again. -
Not SyncedAnd the statisticians soon found
many other regular shapes -
Not Syncedeach produced by a certain kind of natural or social processes.
-
Not SyncedAnd every statistician has their favourite.
-
Not SyncedThe Poisson distribution, I think it's my favourite,
it's absolute crack. -
Not SyncedThe Poisson shape, describes how likely it is
that out-of-the-ordinary things will happen. -
Not SyncedImagine a London bus stop that we know
that on average will get three buses an hour. -
Not SyncedWe won't always get three buses of course.
-
Not SyncedAmazingly the Poisson shape will show us
the probability that in any given hour -
Not Syncedwill get 4, 5 or 6 buses or no buses at all.
-
Not SyncedThe exact shape changes with the average
-
Not Syncedbut whether it is how many people will
win the lottery jackpot each week -
Not Syncedor how many people will phone
a call centre each minute -
Not Syncedthe Poisson shape will give the probabilities.
-
Not SyncedThe wonderful example where this does apply
is in the late 19th century -
Not Syncedwas to count each year the number
of Prussian officers -
Not Syncedcavalry officers that had be kicked
to death by their horses -
Not SyncedSome year there were none, some years one,
some years two,... up to seven. -
Not SyncedOne particularly bad year.
-
Not SyncedBut with this distribution, how many years they go, one, two three, four,
-
Not SyncedPrussian cavalry officers kicked to death
by their horses -
Not Syncedbeautifully obbey the Poisson distribution.
-
Not SyncedSo statisticians use shapes so we wield the patterns in the data
-
Not Syncedbut we also use images of all kinds to communicate statistics to a wider public
-
Not Syncedbecause if the story in the numbers is told by a beautiful and clever image
-
Not Syncedthen everyone understands.
-
Not SyncedOf the pioneers of statiscal graphics, my favourite is Florence Nightingale.
-
Not SyncedThere are not many people who realise that actually she was known as a passionate statistician
-
Not Syncedand not just the Lady of the Lamp.
-
Not SyncedShe said that to understand God's thoughts we must study statistics
-
Not Syncedfor these are the measure of His purpose.
-
Not SyncedStatistics must reserve a religious studio moral imperative.
-
Not SyncedWhen Florence was nine years old,
she started collecting data. -
Not SyncedHer data was different fruits and vegetables she found.
-
Not SyncedPut them into different tables,
trying to organise them in some standard form, -
Not Syncedso we have one of the Nightgale's first
statistical tables at the age of nine. -
Not SyncedIn the mid-1850s, Florence Nightingale went to Crimea
-
Not Syncedto care for British casualties at war.
-
Not SyncedShe was horrified by what she discovered.
-
Not SyncedFor all the soldiers being blown to bits on the battlefield
-
Not Syncedthere were many many more soldiers
dying from diseases -
Not Syncedcaught in the army's filthy hospitals.
-
Not SyncedSo Florence Nightingale bagan counting the dead.
-
Not SyncedFor two years she recorded mortality data
in meticulous detail. -
Not SyncedWhen the war was over,
she persuaded the government -
Not Syncedto set up a Royal Comission of Enquiry.
-
Not SyncedAnd gathered her data in a devastating report.
-
Not SyncedWhat has amended her place in the statistically
history books is the graphics she used. -
Not SyncedAnd one in particular, the Polar Area Graph.
-
Not SyncedFor each month of the war,
a huge blue wedge represented the soldiers -
Not Syncedwho had died of preventable diseases.
-
Not SyncedThe much smaller red wedges
were deaths from wounds, -
Not Syncedand the black wedges deaths
from accidents and other causes. -
Not SyncedNightingale graphics were so clear,
they were impossible to ignore. -
Not SyncedThe usual thing around Florence Nightingale's time
-
Not Syncedwas just to produce tables and tables of figures.
Absolutely tedious stuff. -
Not SyncedUnless you are a dedicated statistician,
it's quite difficult to spot the patterns naturally. -
Not SyncedBut visualisations tell a story.
They tell a story immediately. -
Not SyncedThe use of colour, the use of shape,
can really tell a powerful story. -
Not SyncedAnd these days, we can make things move as well.
-
Not SyncedFlorence Nightingale would've loved to play with it,
-
Not Syncedshe would've produced wonderful animations,
I'm absolutely certain about it. -
Not SyncedToday, a hundred and fifty years on,
-
Not SyncedNightingale's graphics are rightly
regarded as a classic. -
Not SyncedThey led to a revolution in nursing and health care,
in hygiene in hospitals worldwide. -
Not SyncedWe've saved innumerable lives.
-
Not SyncedStatistical graphics has become
an art of its very own. -
Not SyncedLed by designers who are passionate
about visualising data. -
Not SyncedThis is the Billion Pound O Gram.
-
Not SyncedThis image arouse out of the frustration
with the reporting -
Not Syncedof billion-pounds amounts in the media.
-
Not Synced500 trillion pounds for this war,
50 million pounds for this hospital, -
Not Syncedthis does not make sense,
these figures are too enormous to get your mind around. -
Not SyncedSo I squailed to this data from various news sources
and created this diagram -
Not Syncedso the squares here are scaled
according the the billion-pound amounts. -
Not SyncedWhen you see numbers visualised like this,
-
Not Syncedyou start to have a different
kind of relationship with them. -
Not SyncedYou can see patterns, see the scale of them.
-
Not SyncedHere, this little square, 37 billion,
this was the predicted cost of the Iraq war in 2003. -
Not SyncedAs you can see it has grown exponentially
over the last few years -
Not Syncedto the total cost of about 2,500 billion.
-
Not SyncedIt's funny because when you visualise statistics
like this, you undestand them. -
Not SyncedAnd when you understand them,
you can put things into perspective. -
Not SyncedVisualisation is right at the heart of my own work too.
-
Not SyncedI teach Global Health.
-
Not SyncedI know that having the data is not enough,
-
Not SyncedI have to show it in ways people
both enjoy and undestand. -
Not SyncedNow I'm going to try something
I've never done before. -
Not SyncedAnimating the data in real space.
-
Not SyncedWith a bit of technical assistance
from the crew. -
Not SyncedSo here we go!
-
Not SyncedFirst an axis for health,
life expectancy from 25 years to 75 years. -
Not SyncedDown here an axis for wealth,
income per person, $400, $4,000 and $40,000. -
Not SyncedSo down here is poor and sick.
And up here is rich and healthy. -
Not SyncedNow I'm going to show you the world
200 years ago, in 1810. -
Not SyncedHere come all the countries:
Europe brown, Asia red, -
Not SyncedMiddle East green, Africa South-of-Sahara blue,
and America is yellow. -
Not SyncedAnd the size of the country bubble
shows the size of the population. -
Not SyncedAnd in 1810 it was pretty crowded down there, isn't it?
-
Not SyncedAll countries were sick and poor,
life expectancy would be below 40 in all countries. -
Not SyncedOnly the UK and the Netherlands
were slightly better off, but not much. -
Not SyncedAnd now, I'll start the world!
-
Not SyncedThe Industrial Revolution makes countries in Europe and elsewhere move away from the rest.
-
Not SyncedBut the colonised countries in Asia and Africa
are stuck down there. -
Not SyncedEventually the Western countries
get healthier and healthier. -
Not SyncedNow we slow down to see the impact
of the First World War and the Spanish Flu Epidemy. -
Not SyncedWhat a catastrophe!
-
Not SyncedNow I'll speed up through the 1920s and 1930s
-
Not Syncedand spite of the Great Depression, Western countries fueled on towards greater wealth and health.
-
Not SyncedJapan and some others try to follow
but most countries stay down here. -
Not SyncedAfter the tragedies of the Second World War
-
Not Syncedwe stop a bit to look at the world in 1948.
-
Not Synced1948 was a great year, the war was over,
Sweden topped the medal table at the Winter Olympics, -
Not Syncedand I was born, but the differences between
the countries of the world was wider than ever. -
Not SyncedUnited States was in the front,
Japan was catching up, Brasil was way behind, -
Not SyncedIran was getting a little richer from oil,
but still had short lives. -
Not SyncedThe Asian giants, China, India,
Pakistan, Bangladesh and Indonesia, -
Not Syncedthey were still poor and sit down here.
-
Not SyncedBut look what is about to happen. In my lifetime,
former colonies gained independence -
Not Syncedand finally they started to get healthier,
and healthier, and healthier. -
Not SyncedAnd in the 1970s,
countries in Asia and Latin America -
Not Syncedstarted to catch up with the Western countries.
-
Not SyncedThey became the emerging economies.
-
Not SyncedSome in Africa follow, some in Africa
are stuck in civil wars, and others are hit by HIV. -
Not SyncedAnd now we can see the world today,
in the most up-to-date statistics. -
Not SyncedMost people today live in the middle,
-
Not Syncedbut here are huge differences at the same time
-
Not Syncedbetween the best of countries
and the worst of countries -
Not Syncedand there are also huge inequalities within countries.
-
Not SyncedThese bubbles show country averages,
but I can split them. -
Not SyncedTake China, I can split it into provinces.
-
Not SyncedThere goes Shanghai,
it has the same health and wealth as Italy today. -
Not SyncedAnd then there's the poor inland province of Guizhou. It's like Pakistan.
-
Not SyncedAnd if I split it further,
the rural parts are like Ghana in Africa. -
Not SyncedAnd yet, despite the enormous disparities today,
we have seen 200 years of remarkable progress. -
Not SyncedThat huge historical gap between
the West and the rest is now closing. -
Not SyncedWe have become an entirely new converging world.
-
Not SyncedAnd I see a clear trend into the future,
with aid, trade, green technology and peace. -
Not SyncedIt's fully possible that everyone
can make it to the healthy-wealthy corner. -
Not SyncedWhat you've just seen in the last few minutes
is a story of 200 countries -
Not Syncedshown over 200 years and beyond.
It involved plotting 120,000 numbers. -
Not SyncedPretty neat, eh?
-
Not SyncedWith statistics we can start to see things
as they really are. -
Not SyncedFrom tables of data, to averages,
distributions and visualisations, -
Not Syncedstatistics gives us a clear description of the world.
-
Not SyncedBut with statistics we can not only
discover what is happening -
Not Syncedbut also explore why, by using
the powerful analytical method of correlation. -
Not SyncedJust looking at one thing at a time
doesn't tell you very much. -
Not SyncedYou have to look at the relationships between things.
-
Not SyncedHow they change. How they vary together.
That's what correlation is about. -
Not SyncedThat's how we start to understand
the processes that are really going on -
Not Syncedin the world and in socierty.
-
Not SyncedMost of us would recognise today that crime
correlates to poverty, -
Not Syncedthat infection correlates to poor sanitasion,
-
Not Syncedand that knowledge of statistics correlates
to being great at dancing. -
Not SyncedCorrelations can be very tricky.
-
Not SyncedI've got a joke about silly correlations.
-
Not SyncedThis was this American
who was afraid of heart attack. -
Not SyncedHe found out that the Japanese ate very little fat,
and almost didn't drink wine, -
Not Syncedand have much less heart attacks than the American.
-
Not SyncedBut on the other hand, he found out that the French
eat as much fat as the Americans -
Not Syncedand they drink much more wine, but they also have less heart attacks.
-
Not Syncedso he concluded that what kills you
is speaking English. -
Not SyncedThe best example of
a really ground-breaking correlation -
Not Syncedwas the link that was established in the 1950s
between smoking and lung cancer. -
Not SyncedNot long after the Second World War,
a British doctor, Richard Doll, -
Not Syncedinvestigated lung cancer patients
in twenty London hospitals, -
Not Syncedand he became certain that
the only thing they had in common was smoking -
Not Syncedso certain that he stopped smoking himself.
-
Not SyncedBut other people weren't so sure.
-
Not SyncedLots of the discussion of early data
linking smoking and lung cancer -
Not Syncedit can't be smoking, surely, that thing
we've done all our lives, that can't be bad for you. -
Not SyncedMaybe it's genes, maybe people
who are genetically predisposed to get lung cancer -
Not Syncedare also genetically predisposed to smoke.
-
Not SyncedMaybe it's not the smoking,
maybe it's air pollution, -
Not Syncedthat smokers and somehow more exposed to air pollution than non-smokers.
-
Not SyncedMaybe it's not smoking, maybe it's poverty.
-
Not SyncedSo now we have three possible explanations
apart from chance. -
Not SyncedTo verify his correlation did imply cause and effect
-
Not SyncedRichard Doll created
the biggest statistical study of smoking yet -
Not SyncedHe began tracking the lives of 40,000 British doctors
-
Not Syncedsome of whom smoked, some of whom didn't.
-
Not SyncedAnd gathered enough data to correlate
the amount of doctors who smoked -
Not Syncedwith their likelihood of getting cancer.
-
Not SyncedEventually, he did not only show a correlation
between smoking and lung cancer -
Not Syncedbut also a correlation between stopping smoking
and reducing the risk. -
Not SyncedThis was science at its best.
-
Not SyncedWhat correlations do not replace
is human thought. -
Not SyncedWe could think about what it means.
-
Not SyncedWhat a good scientist does
if he comes up with a correlation -
Not Syncedis try as hard as he or she possibly can
to disprove it -
Not Syncedto break it down, to get rid of it,
to try to refute it, -
Not Syncedand if it withstands all those efforts
at demolishing it, and it still standing out, -
Not Syncedthen we might really have something here.
-
Not SyncedHowever brilliants the scientists,
data is still the oxygen of science. -
Not SyncedThe good news is that the more we have,
the more correlations we'll find, -
Not Syncedthe more theories we'll test,
and the more discoveries we are likely to make. -
Not SyncedAnd history shows how our total sum of information
grows in huge leaps -
Not Syncedas we develop new technologies.
-
Not SyncedThe invention of the printing press kicked off
the first data and information explosion -
Not SyncedIf you piled up all the books that have been printed
by the year 1700 -
Not Syncedthey would make sixty stacks,
each as high as Mount Everest. -
Not SyncedThen, starting in the 19th century,
there came a second information revolution. -
Not SyncedWith the telegraph, gramophone, camera,
and later radio and TV. -
Not SyncedThe total amount of information exploded.
-
Not SyncedAnd by the 1950s the information available to us all
had multiplied six thousend times. -
Not SyncedThen, thanks to the computer,
and later the Internet, we went digital, -
Not Syncedand the amount of data we have now,
is unimaginably vast. -
Not SyncedA single letter printed in a book
is the equivalent to a byte of data. -
Not SyncedA single page
equals a kilobyte or two. -
Not SyncedFive megabytes is enough
for the complete works of Shakespeare. -
Not Synced10 gigabytes, that's a DVD movie.
-
Not Synced2 terabytes is the tens of millions of photos
added to Facebook everyday. -
Not Synced10 petabytes is the data recorded every second
by the world's largest particle accelerator, -
Not Syncedso much only a tiny fraction is kept.
-
Not Synced6 exabytes is what you'd have if you sequenced
the genomes of every single person on Earth. -
Not SyncedBut really, that's nothing.
In 2009, the Internet added up to 600 exabytes, -
Not Syncedand in 2010, in just one year, that will double to more than one zettabyte.
-
Not SyncedBut in the real world,
if we turned all this data into print -
Not Syncedit would make ninety stacks of books,
each reaching from here all the way to the Sun. -
Not SyncedThe data deluge is staggering.
But with today's computers and statistics, -
Not SyncedI'm confident we can handle it.
-
Not SyncedWhen it comes to all the data on the Internet,
-
Not Syncedthe powerhouse of statistical analysis
is the Sillicon Valley giant Google. -
Not SyncedThe average person over their lifetime
-
Not Syncedis exposed to about a hundred million words
of conversation. -
Not SyncedSo if you multiply that
by the six billion people on the planet -
Not Syncedthat amount of words is equal to the amount of words
-
Not Syncedthat Google has available at any one instant of time.
-
Not SyncedGoogle's computers hoover up and file away
-
Not Syncedevery document, web page
and image they can find. -
Not SyncedThen they hunt for patterns and correlations
in all this data -
Not Synceddoing statistics on a massive scale.
-
Not SyncedAnd for me, Google has one project
that is particularly exciting: -
Not Syncedstatistical language translation.
-
Not SyncedIf you do want to provide access
to all the web's information -
Not Syncedno matter what language is spoken.
-
Not SyncedThere's so much information on the Internet,
you can not hope to tranlate it all by hand -
Not Syncedinto every possible language, we figured
we have to be able to do machine translation. -
Not SyncedIn the past, programmers tried to teach their computers to see each language as a set of grammatical rules.
-
Not SyncedMuch like languages are taught at school.
-
Not SyncedBut this didn't work, because no set of rules
could capture language in all its subtlety and ambiguity, -
Not SyncedHaving eaten out lunch,
the coach departed. -
Not SyncedThat's obviously incorrect. Written like that,
it would imply that the coach has eaten the lunch. -
Not SyncedIt would be far better to say: Having eaten our lunch,
we departed in the coach. -
Not SyncedThose rules are helpful,
they are useful most of the time, -
Not Syncedbut they don't turn out to be true
all the time. -
Not SyncedAnd the insight of using
statistical machine translation -
Not Syncedis saying: if we have all these exceptions anyways, maybe you can get by without having any rules,
-
Not Syncedmaybe we can treat everything as an exception,
and that's essentially what we've done. -
Not SyncedWhat the computer is doing
when it's learning how to translate -
Not Syncedis to learn correlations between words
and between phrases -
Not Syncedso we feed the system
very large amounts of data -
Not Syncedand the the system sees if a certain word or phrase
correlates very often to the other language. -
Not SyncedGoogle's website currently offers translation between any of 57 different languages.
-
Not SyncedIt does this purely statistically,having correlated
the huge collection of multilingual texts. -
Not SyncedThe people who built he system
don't need to know Chinese -
Not Syncedin order to build the Chinese system.
They dont need to know Arabic. -
Not SyncedThe expertise that is needed is basically knowledge of statistics, of computer science,
-
Not Syncedof infrastructure,
-
Not Syncedto build these very large computer systems we are building for doing that.
-
Not SyncedI hooked up with Google from my office in Stockholm,
to try the translator by myself. -
Not SyncedI will type some Swedish sentences.
-
Not Synced(Types in Swedish)
-
Not Synced(Reads on the screen) Sweden's finance minister
has a ponytail and a gold ring in your ear. -
Not SyncedIt's almost exactly correct, it's amazing.
-
Not SyncedHe comes from the conservative party,
that's the kind of Sweden we have today. -
Not SyncedI will type one more sentence.
-
Not SyncedIn his same-sex parnertships has Stockholm's
new bishop and his partners a three-year son. -
Not SyncedIt's almost perfect,
there's one important thing, it's "her". -
Not SyncedIt's a lesbian partnership.
-
Not SyncedOK, those kinds of words like "her"
are one of the challenges in translation, -
Not Syncedto get those right.
-
Not SyncedWhen it comes to bishops,
one can excuse it. -
Not SyncedRight, I think that more often than not
it would be probably a "his". -
Not SyncedI will write one more sentence.
(Reads aloud in Swedish) -
Not SyncedWhen Sweden is taking part in Olympic gold,
is not to win but to beat Norway. -
Not SyncedBut they are very good in Winter Olympics,
so we can't make it, but we are trying. -
Not SyncedVery good, very good.
-
Not SyncedThis is absolutely amazing,
-
Not Syncedand I'm impressed that it picked up
words like "same-sex partnerships" -
Not Syncedwhich are very due to the language.
-
Not SyncedThe translator is good, but if it succeeds,
what will be next, that'll be remarkable. -
Not SyncedOne of the exciting possibilities is combining
the machine translation technology -
Not Synced
with the speech recognition technology. -
Not SyncedBoth of these are statistically neutre.
-
Not SyncedThe machine translation relies on the statistics
of mapping from one language to another, -
Not Syncedand similarly speech recognition relies on the statistics
of mapping from a sound form to the words. -
Not SyncedWhen we put them together,
now we have the capability -
Not Syncedof having instant conversations between two people who don't speak a common language.
-
Not SyncedI can talk to you in my language,
you hear me in your language, -
Not Syncedand you can answer back in real time,
we can make that translation, -
Not Syncedwe can bring people together
and allow them to speak. -
Not SyncedThe Internet is just one of many technologies
created to gather massives amount of data. -
Not SyncedScientists studying our Earth
and our environment -
Not Syncednow use an incredible range of instruments
to measure the processes of our planet. -
Not SyncedAll around us our sensors are continously measuring
temperature, water flow and ocean currents. -
Not SyncedHigh in orbit our satellite is busy imaging cloud formations, forest growth and snow cover.
-
Not SyncedScientists speak of instrumenting the Earth.
-
Not SyncedAnd pointing up to the skies above,
-
Not Syncedour powerful new telescopes are mapping the Universe.
-
Not SyncedWhat's happening in astronomy,
is tipically how profoundly this torrent of data -
Not Syncedis transforming science.
-
Not SyncedAstronomers are now addressing
many enduring misteries of the cosmos -
Not Syncedby applying statistical methods
to all this new data. -
Not SyncedThe galaxy is a very big place
and it has billions of starts in it -
Not Syncedso to put toghether a coherent picture
of the whole galaxy requires -
Not Syncedhaving enourmous amounts of data,
and before you can do a large sky survey -
Not Syncedwith sensitive digital detectors,
that you can map many stars at once, -
Not Syncedit's very difficult to gather enough data
of enough of the galaxy. -
Not SyncedIn the past, large surveys of the night sky
had to be done -
Not Syncedby exposing thousands
of large photographic plates, -
Not Syncedbut these surveys could take 25 years
or more to complete. -
Not SyncedThen, in the 1990s, came digital astronomy,
-
Not Syncedand a huge increase in both the amount
and the accesibility of data. -
Not SyncedThe Sloan Sky Survey is the world's biggest yet
-
Not Syncedusing a massive digital sensor mounted
-
Not Syncedon the back of a custom built telescope in New Mexico.
-
Not SyncedIt's scanned the sky night after night for eight years
-
Not Syncedbuilding up a composite picture
in unprecedented resolution. -
Not SyncedThe Sloan's is some of the best deepest survey data
-
Not Syncedwe have in astronomy,
-
Not Syncedboth in our galaxy and galaxies away from ours.
-
Not SyncedAll the Sloan data is on the Internet
-
Not Syncedand with it astronomers have identified
-
Not Syncedmillions of hidden unknown stars and galaxies.
-
Not SyncedThey also comb the database for statistical patterns
-
Not Syncedwhich will prove, disprove or suggest new theories.
-
Not SyncedSo we have this idea that galaxies grow
-
Not Syncedthey become large galaxies
-
Not Syncedlike the one we live in, the Milky Way.
-
Not SyncedNot all at once, not smoothly
-
Not Syncedbut by continously incorporating
-
Not Syncedcannibalising smaller galaxies
-
Not Syncedthey dissolve them and become
part of the bigger galaxy -
Not SyncedIt's a startling idea
-
Not Syncedand in the Sloan data there's the evidence to support it.
-
Not SyncedGroups of starts that came from cannibalised galaxies
-
Not Syncedstand out in the Sloan data statistically
different from other stars. -
Not Syncedbecause they move at a different velocity.
-
Not SyncedEach big spike of one of these distribution graphs
-
Not Syncedmeans professor Rockossi has found a group of stars
-
Not Syncedall travelling in a different way to the rest.
-
Not SyncedThey are the telltale patterns she's looking for.
-
Not SyncedThe evidence is accumulating that in fact
-
Not Syncedthis really is how galaxies grow
-
Not Syncedor an important way of how galaxies grow
-
Not Syncedthis is important to understand how galaxies form
-
Not Syncednot only ours but every galaxy.
-
Not SyncedThe more data there is
the more discoveries can be made -
Not Syncedand the technology is getting better all the time.
-
Not SyncedThe next big survey telescope starts its work in 2015.
-
Not SyncedIt will leave Sloan in the dust.
-
Not SyncedSloan has taken 8 eight years
to cover one quarter of the nightsky. -
Not SyncedThe new telescope will scan the entire sky
in even greater resolution -
Not Syncedevery three days.
-
Not SyncedThe vast amounts of data we have today
-
Not Syncedallows researchers in all sorts of fields
-
Not Syncedto test their theories in a previously unimaginable scale
-
Not Syncedbut it may even change the fundamental way
science is done. -
Not SyncedWith the power of todays' computers
applied to all this data -
Not Syncedthe machines might be able to guide the researchers.
-
Not SyncedThere is a profoundly important,
one of the most significant points in science -
Not Syncedcertainly one of the most exciting
-
Not Syncedthe potential to transform not only
how scientists do science -
Not Syncedbut what science is possibly.
-
Not SyncedWhat will power that transformation
of how science is done -
Not Syncedis going to be computation.
-
Not SyncedMany of the dynamics of the natual world
-
Not Syncedlike the interplay
between the rainforest and the atmosphere -
Not Syncedare so complex, that we don't yet
really understand. -
Not SyncedBut now computers are generating
tens of thousands of simulations -
Not Syncedof how these biological systems might work.
-
Not SyncedIs like creating thousands of
hypothetical parellel worlds. -
Not SyncedEach of these simulations is analysed with statistics
-
Not Syncedto see if any are a good match
of what is observed in each. -
Not SyncedThe computers can now automatically generate,
-
Not Syncedtest and discard hypothesis
with scarcely human insight. -
Not SyncedThis new application statistics will become
-
Not Syncedabsolutely vital for the future of science.
-
Not SyncedIt's creating a new paradigm in the way we do science
-
Not Syncedwhich is characterised as data-centric or data-driven
-
Not Syncedrather than hypothesis- or experiment-driven.
-
Not SyncedIt's an exciting time in terms of science,
computation and statistics. -
Not SyncedIf all this sounds a bit abstract to you
-
Not Syncedhow about one final frontier?
-
Not SyncedCould statistics make sense of your feelings?
-
Not SyncedIn California, (where else!), one computer scientist
-
Not Syncedis harvesting the Internet to try to define the patterns
-
Not Syncedof our innermost thoughts and emotions.
-
Not SyncedThis is the Madness Movement
-
Not Syncedit represents a skyscrapper's view of the world.
-
Not SyncedEach brightly coloured dot is an individual feeling
-
Not Syncedexpressed by someone out there in a blog or a tweet
-
Not Syncedand when you click on the dot
-
Not Syncedit explodes to reveal
the underlying feeling of that person. -
Not SyncedThis is what people say they're feeling today:
-
Not Syncedbetter
-
Not Syncedsafe
-
Not Syncedcrappy
-
Not Syncedwell
-
Not Syncedpretty
-
Not Syncedspecial
-
Not Syncedsorry
-
Not Syncedalone
-
Not SyncedEvery minute WeFeelFine crosses the world's blogs
-
Not Syncedtakes all the sentences that start
with the words "I feel" or "I'm feeling" -
Not Syncedand push them into a database.
-
Not SyncedWe collect all the feelings
and we count the most common -
Not Syncedbetter
-
Not Syncedbad
-
Not Syncedgood
-
Not Syncedright
-
Not Syncedguilty
-
Not Syncedsick
-
Not Syncedthe same
-
Not Syncedlike shit
-
Not Syncedsorry
-
Not Syncedwell
-
Not SyncedWe can take a look at any one feeling and analyse it.
-
Not SyncedRight now a lot of people are feeling happy.
-
Not SyncedWe can take a look at these people,
and break them down by age, gender or location. -
Not SyncedSince bloggers have public profiles,
we have that information -
Not Syncedand we can ask questions like,
"Are women happier than men?" -
Not Syncedor "Is England happier
than the United States?" -
Not SyncedWe find that as people get older, they get happier.
-
Not SyncedFor younger people,
happiness associates with excitement -
Not Syncedwhereas older people associate happiness
more with peacefulness. -
Not SyncedWe also find than women feel loved
more often than men, -
Not Syncedbut also more guilty.
-
Not SyncedWhile men feel good more often than women,
but also more alone. -
Not SyncedAs people live more and more of their lives online
-
Not Syncedthey leave behind digital traces
-
Not Syncedwith which we can statistically analyse
-
Not Syncedwhat it means to be human.
-
Not SyncedWhere does all this leave us?
-
Not SyncedWe generate unimaginable quantities of data
-
Not SyncedAbout everything you can think of
-
Not Syncedand we analyse it to reveal the patterns.
-
Not SyncedNow not only experts but all of us can understand
-
Not Syncedthe stories in the numbers.
-
Not SyncedInstead of being led astray by prejudice
-
Not Syncedwith statistics at our fingertips, our eyes can be open
-
Not Syncedfor a facts-based view of the world.
-
Not SyncedMore than ever before we can become
authors of our own destiny. -
Not SyncedAnd that's pretty exciting isn't it?
-
Not Synced(Music)
- Title:
- The Joy of Stats
- Description:
-
Documentary which takes viewers on a rollercoaster ride through the wonderful world of statistics to explore the remarkable power they have to change our understanding of the world, presented by superstar boffin Professor Hans Rosling, whose eye-opening, mind-expanding and funny online lectures have made him an international internet legend.
- Video Language:
- English
- Duration:
- 59:13
Robert Pearce Rodríguez edited English subtitles for The Joy of Stats | ||
Robert Pearce Rodríguez edited English subtitles for The Joy of Stats | ||
Robert Pearce Rodríguez edited English subtitles for The Joy of Stats | ||
Robert Pearce Rodríguez edited English subtitles for The Joy of Stats | ||
Robert Pearce Rodríguez edited English subtitles for The Joy of Stats | ||
Robert Pearce Rodríguez edited English subtitles for The Joy of Stats | ||
Robert Pearce Rodríguez edited English subtitles for The Joy of Stats | ||
Robert Pearce Rodríguez edited English subtitles for The Joy of Stats |