< Return to Video

The Joy of Stats

  • Not Synced
    The world we live in is awashed with data
  • Not Synced
    that comes pouring in from everywhere around us.
  • Not Synced
    On it own, this data is just noise and confusion.
  • Not Synced
    To make sense of data, to find the meaning in it,
  • Not Synced
    we need a powerful branch of science: statistics.
  • Not Synced
    Believe me, there's nothing boring about statistics
  • Not Synced
    especially not today, when we can make the data sing.
  • Not Synced
    With statistics we can really make sense of the world.
  • Not Synced
    Are statistics, the data diluge as it's been called,
  • Not Synced
    leading us to a greater understanding
  • Not Synced
    of the life on Earth and the world beyond?
  • Not Synced
    Thanks to incredible power of today's computers
  • Not Synced
    it may fundamentally transform the process of scientific discovery.
  • Not Synced
    I kid you not, statistics is now the sexiest subject around.
  • Not Synced
    Did you know that there's is one million boats in Sweden?
  • Not Synced
    That's one boat per nine people.
  • Not Synced
    It's the highest number of boats per person in Europe.
  • Not Synced
    Being statistician, you don't like telling your profession at dinner parties,
  • Not Synced
    but really, statisticians shouldn't be shy
  • Not Synced
    because they always want to understand what's going on.
  • Not Synced
    Stastistics gives us a persperctive of the world we live in
  • Not Synced
    that we can't get in any other way.
  • Not Synced
    Statistics tells us whether the things we think and believe are actually true.
  • Not Synced
    Statistics are far more useful than we usually like to admit.
  • Not Synced
    In the last recession, there was this famous call into Talk Radio Station.
  • Not Synced
    The man complained: "in times like this, when unemployment rates are up to 13%,
  • Not Synced
    and income has fallen by 5%, and suicide rates are climbing,
  • Not Synced
    I get so angry that the government is wasting money on things like correctional statistics."
  • Not Synced
    I'm not oficially a statistician, strictly speaking my field is global health.
  • Not Synced
    But I got really obsessed with stats, when I realised how many people in Sweden
  • Not Synced
    don't know anything about the rest of the world.
  • Not Synced
    I started in our Medical University in Karolinksa Institute,
  • Not Synced
    an ungraduate course called Global Health.
  • Not Synced
    These students coming to us have actually the highest grades you can get in theSwedish college system.
  • Not Synced
    So I thought maybe they know everything I'm going to teach them.
  • Not Synced
    So I did a pre-test when they came.
  • Not Synced
    One of the questions, from which I learnt a lot, was:
  • Not Synced
    Which country has the highest child mortality of these five pairs?
  • Not Synced
    I won't put you at test here, but it's Turkey which is higher there,
  • Not Synced
    Poland, Russia, Pakistan and South Africa.
  • Not Synced
    And these were the results of the Swedish students.
  • Not Synced
    1.8 answers right out of 5 possible,
  • Not Synced
    that means that there was a place for a professor in International Health
  • Not Synced
    and for my course.
  • Not Synced
    But one late night when I was compiling my report
  • Not Synced
    I really realise my discovery.
  • Not Synced
    I have shown that Swedish top students know
  • Not Synced
    statistically significantly less about the world than the chimpanzees.
  • Not Synced
    Beacuse the chimpanzee would score half right.
  • Not Synced
    If I gave them two bananas with Sri Lanka and Turkey
  • Not Synced
    they would be right half of the cases.
  • Not Synced
    But the students are not there.
  • Not Synced
    I did also an unethical study of the professors of the Karolinska Institute
  • Not Synced
    that hands out the Nobel Prize in Medicine, and they aren't on par with the chimpanzee.
  • Not Synced
    Today, there's more information accesible than ever before,
  • Not Synced
    and I work with my team at the Gapminder Foundation
  • Not Synced
    using new tools that help everyone make sense of the changing world.
  • Not Synced
    We draw on the masses of data that are now free available
  • Not Synced
    from international institutions like the UN and the World Bank.
  • Not Synced
    It's become my mission to share my insights from this data
  • Not Synced
    with anyone who listen, and to reveal how statistics
    is nothing to be frightened of.
  • Not Synced
    I'm going to provide you a view
    of the global health situation across mankind,
  • Not Synced
    and I'm going to do that in a hopefully enjoyable way.
    So relax.
  • Not Synced
    We did this software which displays it like this,
  • Not Synced
    every bubble here is a country, this is China, this is India.
  • Not Synced
    The size of the bubble is the population.
  • Not Synced
    And I'm going to stage a race here
  • Not Synced
    between this sort of yellow Ford here, and the red Toyota down there,
  • Not Synced
    and the brownish Volvo.
  • Not Synced
    The Toyota has a very bad start down here,
  • Not Synced
    and United States' Ford is going off road there,
  • Not Synced
    and the Volvo is doing quite fine, this is the war,
  • Not Synced
    they Toyota got off crack, now Toyota is coming on the healthier side of Sweden.
  • Not Synced
    That's the point when I sold the Volvo and bought the Toyota.
  • Not Synced
    This is the Great Leap Forward when China fell down,
  • Not Synced
    it was central planning by Mao Tse Tung,
  • Not Synced
    China recovered and said "never more stupid central planning", but they went up here.
  • Not Synced
    No, there was one more inequity, look there! United States!
  • Not Synced
    Oh, they broke my frame!
  • Not Synced
    Washington D.C. is so rich over there, but it's not as healthy as Kerala, India.
  • Not Synced
    It's quite interesting, isn't it?
  • Not Synced
    Welcome to the USA, world leaders in big cars
  • Not Synced
    and free data.
  • Not Synced
    There are many here who share my vision
  • Not Synced
    of making public data accesible and useful for everyone.
  • Not Synced
    The city of San Francisco is in the lead, opening up it's data on everything.
  • Not Synced
    Even the Police Dept. is releasing all it's crime reports.
  • Not Synced
    This official crime data has been turned into a wonderful inteactive map
  • Not Synced
    by two of the cities computer whizzes.
  • Not Synced
    It's community statistics in action.
  • Not Synced
    Crimespotting is a map of crime reports
  • Not Synced
    from the San Francisco Police Dept.
  • Not Synced
    showing dots on maps for citizens to be able to see patterns of crime
  • Not Synced
    in their neighbourhoods in San Francisco.
  • Not Synced
    The map is not just about individual crimes
  • Not Synced
    but about broader patterns that show
    you where crime is clustered around the city,
  • Not Synced
    which have high crime,
    which areas have relatively low crime.
  • Not Synced
    We're here at top of Jones Street, on uphill,
    quite a nice neighbourhood
  • Not Synced
    what the crime maps show us is the relationship between typography and crime.
  • Not Synced
    The higher up the hill, the less crime there is.
  • Not Synced
    We crossed over the border into the flats.
  • Not Synced
    Essentially, as soon as you get into the kind of lower line areas of Jones street,
  • Not Synced
    the crime just skyrockets.
  • Not Synced
    So we're in the uptown Tenderloin District,
  • Not Synced
    it's one of the oldest and most dangerous
    neighbourhoods in San Francisco.
  • Not Synced
    This is where you go to buy drugs,
    right around here.
  • Not Synced
    You see lots of aggreviated assault,
    lots of thefts.
  • Not Synced
    Basically, the huge part of the crime of the city
    happens right in these four or six block areas.
  • Not Synced
    If you've been hearing police sirens
    in your neighbourhood,
  • Not Synced
    you can use the map to find out why.
  • Not Synced
    If you are out at night in an unfamiliar part of town
  • Not Synced
    you can check the map for streets to avoid.
  • Not Synced
    If a neighbour gets burglared, you can see,
  • Not Synced
    is it the one off or has there been a spike in local crime?
  • Not Synced
    If you commute through a neighbourhood and you're worried about its safety
  • Not Synced
    the fact that we have the ability to turn off all the night time and middle-of-the-day crimes
  • Not Synced
    and show you just the things that are happening during your commute,
  • Not Synced
    is a statistical operation but I think to the people
    that are interacting with the thing
  • Not Synced
    it feels very much more like
    they just are sort of browsing a website
  • Not Synced
    or shopping on Amazon.
    They're looking at data,
  • Not Synced
    and they don't realise that they're doing statistics.
  • Not Synced
    What's most exciting for me is that public statistics
  • Not Synced
    is making citizens more powerful
    and the authorities more accountable.
  • Not Synced
    We have community meetings that the police attend
  • Not Synced
    and what citizens are now doing,
    they're bringing printouts of the maps
  • Not Synced
    to show where crimes are taking place,
  • Not Synced
    and they're demanding services
    from the police department,
  • Not Synced
    which is now having to change how they please,
  • Not Synced
    how they provide policing services,
  • Not Synced
    because the data is showing
    what is working and what is not.
  • Not Synced
    People in San Francisco are also using public data
  • Not Synced
    to map social inequalities,
    and see how to improve society
  • Not Synced
    and the possibilities are endless.
  • Not Synced
    Our dream would be that the government announced that
  • Not Synced
    this data project would really focus on live information
  • Not Synced
    on stuff that was being reported
    and pushed out into the world as it was happening.
  • Not Synced
    Trash pickup, traffic accidents, buses,
  • Not Synced
    and through the kind of the stats gathering power on the internet
  • Not Synced
    it's posible to really see the workings of the city
  • Not Synced
    displayed as a unified interface.
  • Not Synced
    That's where we are heading,
  • Not Synced
    towards a world of free data
    with all the statistical insights that come from it
  • Not Synced
    accesible to everyone, empowering us as citizens
  • Not Synced
    and letting hold our rulers to account.
  • Not Synced
    It's a long way from where statistics began.
  • Not Synced
    Statistics are essential to monitor our government in our societies.
  • Not Synced
    But, it was our rulers out there who started the collection of statistics
  • Not Synced
    in first place in order to monitor us.
  • Not Synced
    In fact the word statistics comes from state.
  • Not Synced
    Modern statistics began two centuries ago.
  • Not Synced
    Once it got going it spread and never stopped.
  • Not Synced
    And guess who was first.
  • Not Synced
    The Chinese have Confucious,
    the Italians have Da Vinci,
  • Not Synced
    and the British have Shakespeare, and we have the Tabellverket
  • Not Synced
    the first ever systematic collection of statistics.
  • Not Synced
    Since the year 1749 we have collected data on every birth, marriage and death
  • Not Synced
    and we are proud of it.
  • Not Synced
    The Tabellverket recorded information from every parish in Sweden.
  • Not Synced
    It was a huge quantity of data
    and it was the first time any goverment
  • Not Synced
    could get any accurate picture of its people.
  • Not Synced
    Sweden had been the greatest military power in Northern Europe
  • Not Synced
    but by 1749 our star was really fading and other countries were growing stronger.
  • Not Synced
    At least though, we were a large power, thought to have 20 million people
  • Not Synced
    enough to rival Britain and France.
  • Not Synced
    But we were in for a nasty surprise.
  • Not Synced
    The first analysis of Tabellverket revealed that Sweden only had 2 million inhabitants.
  • Not Synced
    Sweden was not only a power in decline, it also had a very small popoulation.
  • Not Synced
    The government was horrified by this finding.
  • Not Synced
    What if the enemy found out?
  • Not Synced
    But the Tabellverket also showed that many women die in childbirth.
  • Not Synced
    And many children died young, and government took action to improve the health of the people.
  • Not Synced
    That was the beginning of modern Sweden.
  • Not Synced
    It took more than 50 years before the Austrians, Belgiums, Danes, Dutch,
  • Not Synced
    Germans, Italians and finally the British caught up with Sweden
  • Not Synced
    in collecting and using statistics.
  • Not Synced
    It was called political arithmethic,
    and it was a lovely phrase as use for statistics.
  • Not Synced
    Governments could have much more control
    and understanding of the society
  • Not Synced
    how it's working, how it's developing,
  • Not Synced
    and essentially, so they could control it better.
  • Not Synced
    It wasn't just governments
    who woke up to the power of statistics.
  • Not Synced
    Right across Europe, 19th century society went mad for facts.
  • Not Synced
    And despite its late start, Britain with its Royal Statistical Society in London
  • Not Synced
    was soon a statisticians' nirvana.
  • Not Synced
    I love looking at old copies of the Royal Statistical Society,
  • Not Synced
    because is full of this stuff.
  • Not Synced
    There's a wonderful paper from the 1840s
  • Not Synced
    which shows a map of England
    and the rates of bastardy of each county
  • Not Synced
  • Not Synced
    so you can identify very quickly the areas
    with high areas of bastardy.
  • Not Synced
    Being in East Anglia makes me slightly laugh
  • Not Synced
    that Norfolk was on top of the bastardy league in the 1840s.
  • Not Synced
    One of the founders of the Royal Statistical Society
  • Not Synced
    was the great victorian mathematician and inventor Charles Babbage.
  • Not Synced
    In 1842 he read the latest poem
    by a equally great victorian
  • Not Synced
    Alfred Tennyson.
  • Not Synced
    "Vision of Sin" contained the lines:
  • Not Synced
    "Fill the cup and fill the can,
    Have a rouse before the morn.
  • Not Synced
    Every moment dies a man,
    Every moment one is born."
  • Not Synced
    So keen statistician was Babbage
    that he could not contain himself.
  • Not Synced
    He dashed a letter to Tennyson
    explaining that because of population growth
  • Not Synced
    the line should read:
  • Not Synced
    "Every moment dies a man,
    And 11/16 is born."
  • Not Synced
    "I may add that the exact figure is 1.167
  • Not Synced
    but something must be conceded
    to the laws of metre."
  • Not Synced
    In the 19th century scholars all over Europe
  • Not Synced
    did an amazing work in measuring the societies.
  • Not Synced
    They hovered up data in almost everything
  • Not Synced
    but numbers alone don't tell you anything
  • Not Synced
    you have to analyse them, and that's what makes statistics.
  • Not Synced
    When the first statisticians began
    to get to grips with analysing their data
  • Not Synced
    they seized upon the average,
    and they took the average of everything.
  • Not Synced
    What's so great about an average
  • Not Synced
    is that you can take a whole mass of data and reduce it to a single number.
  • Not Synced
    Though each of us is unique,
    our collective lives produce averages
  • Not Synced
    that characterise whole populations.
  • Not Synced
    I look to my local newspaper one week
  • Not Synced
    and saw that a pensioner had accidently
    put a foot on the accelerator
  • Not Synced
    and crashed her friend against the wall.
  • Not Synced
    Devastating, hideous, horrible thing to happen.
  • Not Synced
    And there was a second one about a young man
    who didn't have a driving licence
  • Not Synced
    who was driving a car under the influence
    of drugs and alcohol
  • Not Synced
    and crashed into a pedestrian and killed him.
  • Not Synced
    What is remarkable,
    absolutely remarkable,
  • Not Synced
    if you look at the number of people
    who die each year
  • Not Synced
    in traffic accidents,
    it's nearly a constant.
  • Not Synced
    What?
  • Not Synced
    All these individual events,
    somehow when you sum them all up
  • Not Synced
    it's the same number every year,
  • Not Synced
    and every year two and a half times
    as many men die
  • Not Synced
    in traffic accidents as women,
    and it's a constant.
  • Not Synced
    An every year the rate in Belgium is double
  • Not Synced
    the rate in England,
    there are these remarkable regularities
  • Not Synced
    so that these individual particular events
    sum up into a social phenomenon.
  • Not Synced
    (Lecture) Let's see what Sweden has done
  • Not Synced
    we used to boast of fast social progress.
  • Not Synced
    (Narration) In my lectures,
    to tell stories about the changing world
  • Not Synced
    I use averages for entire countries,
    whether the average for income,
  • Not Synced
    child mortality, family size or carbon output.
  • Not Synced
    (Lecture) OK, I give you Singapore,
    the year I was born.
  • Not Synced
    Singapore had twice the child mortality of Sweden.
  • Not Synced
    The most tropical country in the world.
    A marshland on the Equator.
  • Not Synced
    And here we go. It took a little time for them to get independence
  • Not Synced
    but they started to grow their economy,
    and they made the social investments,
  • Not Synced
    they got away malaria,
    they got a magnificient health system
  • Not Synced
    that beats both UkKs and Sweden's.
  • Not Synced
    We thought it would never happened
    but they would win over Sweden!
  • Not Synced
    But useful as averages are
    they don't tell you the whole story.
  • Not Synced
    On average, Swedish people have slightly
    less than two legs.
  • Not Synced
    That is because a few people have one leg
    or no legs, and no one has three legs
  • Not Synced
    so almost everybody in Sweden
    has more than the average number of legs.
  • Not Synced
    The variation in data is just
    as important as the average.
  • Not Synced
    But how do you get the handle on variation?
  • Not Synced
    For this you transform numbers into shapes.
  • Not Synced
    Let's llok again at the number of adult women
    in Sweden for different heights.
  • Not Synced
    Plotting the data as a shape shows us
    how much their heights vary from the average
  • Not Synced
    and how wide that variation is.
  • Not Synced
    The shape a set of data makes
    is called its distribution.
  • Not Synced
    (Lecture) This is the income distribution
    of China 1970
  • Not Synced
    This is the income distribution
    of the United States 1970.
  • Not Synced
    Almost no overlap. And what has happened?
  • Not Synced
    China is growing. It's not so equal any longer.
  • Not Synced
    And it's appearing here,
    overlooking the United States
  • Not Synced
    almost like a ghost, isn't it? It's scary!
  • Not Synced
    That statistician who first explored distribution
  • Not Synced
    discovered one shape that turned up
    again and again
  • Not Synced
    the victorian scholar Francis Goldtone
    was so fascinated
  • Not Synced
    he built a machine that could reproduce it
  • Not Synced
    and he found it fitted so many different
    sets of measurements
  • Not Synced
    that he named it the Normal Distribution.
  • Not Synced
    Whether it was people's arm spans, land capacity or even their exam results
  • Not Synced
    the Normal Distribution shape recurred
    time and time again.
  • Not Synced
    And the statisticians soon found
    many other regular shapes
  • Not Synced
    each produced by a certain kind of natural or social processes.
  • Not Synced
    And every statistician has their favourite.
  • Not Synced
    The Poisson distribution, I think it's my favourite,
    it's absolute crack.
  • Not Synced
    The Poisson shape, describes how likely it is
    that out-of-the-ordinary things will happen.
  • Not Synced
    Imagine a London bus stop that we know
    that on average will get three buses an hour.
  • Not Synced
    We won't always get three buses of course.
  • Not Synced
    Amazingly the Poisson shape will show us
    the probability that in any given hour
  • Not Synced
    will get 4, 5 or 6 buses or no buses at all.
  • Not Synced
    The exact shape changes with the average
  • Not Synced
    but whether it is how many people will
    win the lottery jackpot each week
  • Not Synced
    or how many people will phone
    a call centre each minute
  • Not Synced
    the Poisson shape will give the probabilities.
  • Not Synced
    The wonderful example where this does apply
    is in the late 19th century
  • Not Synced
    was to count each year the number
    of Prussian officers
  • Not Synced
    cavalry officers that had be kicked
    to death by their horses
  • Not Synced
    Some year there were none, some years one,
    some years two,... up to seven.
  • Not Synced
    One particularly bad year.
  • Not Synced
    But with this distribution, how many years they go, one, two three, four,
  • Not Synced
    Prussian cavalry officers kicked to death
    by their horses
  • Not Synced
    beautifully obbey the Poisson distribution.
  • Not Synced
    So statisticians use shapes so we wield the patterns in the data
  • Not Synced
    but we also use images of all kinds to communicate statistics to a wider public
  • Not Synced
    because if the story in the numbers is told by a beautiful and clever image
  • Not Synced
    then everyone understands.
  • Not Synced
    Of the pioneers of statiscal graphics, my favourite is Florence Nightingale.
  • Not Synced
    There are not many people who realise that actually she was known as a passionate statistician
  • Not Synced
    and not just the Lady of the Lamp.
  • Not Synced
    She said that to understand God's thoughts we must study statistics
  • Not Synced
    for these are the measure of His purpose.
  • Not Synced
    Statistics must reserve a religious studio moral imperative.
  • Not Synced
    When Florence was nine years old,
    she started collecting data.
  • Not Synced
    Her data was different fruits and vegetables she found.
  • Not Synced
    Put them into different tables,
    trying to organise them in some standard form,
  • Not Synced
    so we have one of the Nightgale's first
    statistical tables at the age of nine.
  • Not Synced
    In the mid-1850s, Florence Nightingale went to Crimea
  • Not Synced
    to care for British casualties at war.
  • Not Synced
    She was horrified by what she discovered.
  • Not Synced
    For all the soldiers being blown to bits on the battlefield
  • Not Synced
    there were many many more soldiers
    dying from diseases
  • Not Synced
    caught in the army's filthy hospitals.
  • Not Synced
    So Florence Nightingale bagan counting the dead.
  • Not Synced
    For two years she recorded mortality data
    in meticulous detail.
  • Not Synced
    When the war was over,
    she persuaded the government
  • Not Synced
    to set up a Royal Comission of Enquiry.
  • Not Synced
    And gathered her data in a devastating report.
  • Not Synced
    What has amended her place in the statistically
    history books is the graphics she used.
  • Not Synced
    And one in particular, the Polar Area Graph.
  • Not Synced
    For each month of the war,
    a huge blue wedge represented the soldiers
  • Not Synced
    who had died of preventable diseases.
  • Not Synced
    The much smaller red wedges
    were deaths from wounds,
  • Not Synced
    and the black wedges deaths
    from accidents and other causes.
  • Not Synced
    Nightingale graphics were so clear,
    they were impossible to ignore.
  • Not Synced
    The usual thing around Florence Nightingale's time
  • Not Synced
    was just to produce tables and tables of figures.
    Absolutely tedious stuff.
  • Not Synced
    Unless you are a dedicated statistician,
    it's quite difficult to spot the patterns naturally.
  • Not Synced
    But visualisations tell a story.
    They tell a story immediately.
  • Not Synced
    The use of colour, the use of shape,
    can really tell a powerful story.
  • Not Synced
    And these days, we can make things move as well.
  • Not Synced
    Florence Nightingale would've loved to play with it,
  • Not Synced
    she would've produced wonderful animations,
    I'm absolutely certain about it.
  • Not Synced
    Today, a hundred and fifty years on,
  • Not Synced
    Nightingale's graphics are rightly
    regarded as a classic.
  • Not Synced
    They led to a revolution in nursing and health care,
    in hygiene in hospitals worldwide.
  • Not Synced
    We've saved innumerable lives.
  • Not Synced
    Statistical graphics has become
    an art of its very own.
  • Not Synced
    Led by designers who are passionate
    about visualising data.
  • Not Synced
    This is the Billion Pound O Gram.
  • Not Synced
    This image arouse out of the frustration
    with the reporting
  • Not Synced
    of billion-pounds amounts in the media.
  • Not Synced
    500 trillion pounds for this war,
    50 million pounds for this hospital,
  • Not Synced
    this does not make sense,
    these figures are too enormous to get your mind around.
  • Not Synced
    So I squailed to this data from various news sources
    and created this diagram
  • Not Synced
    so the squares here are scaled
    according the the billion-pound amounts.
  • Not Synced
    When you see numbers visualised like this,
  • Not Synced
    you start to have a different
    kind of relationship with them.
  • Not Synced
    You can see patterns, see the scale of them.
  • Not Synced
    Here, this little square, 37 billion,
    this was the predicted cost of the Iraq war in 2003.
  • Not Synced
    As you can see it has grown exponentially
    over the last few years
  • Not Synced
    to the total cost of about 2,500 billion.
  • Not Synced
    It's funny because when you visualise statistics
    like this, you undestand them.
  • Not Synced
    And when you understand them,
    you can put things into perspective.
  • Not Synced
    Visualisation is right at the heart of my own work too.
  • Not Synced
    I teach Global Health.
  • Not Synced
    I know that having the data is not enough,
  • Not Synced
    I have to show it in ways people
    both enjoy and undestand.
  • Not Synced
    Now I'm going to try something
    I've never done before.
  • Not Synced
    Animating the data in real space.
  • Not Synced
    With a bit of technical assistance
    from the crew.
  • Not Synced
    So here we go!
  • Not Synced
    First an axis for health,
    life expectancy from 25 years to 75 years.
  • Not Synced
    Down here an axis for wealth,
    income per person, $400, $4,000 and $40,000.
  • Not Synced
    So down here is poor and sick.
    And up here is rich and healthy.
  • Not Synced
    Now I'm going to show you the world
    200 years ago, in 1810.
  • Not Synced
    Here come all the countries:
    Europe brown, Asia red,
  • Not Synced
    Middle East green, Africa South-of-Sahara blue,
    and America is yellow.
  • Not Synced
    And the size of the country bubble
    shows the size of the population.
  • Not Synced
    And in 1810 it was pretty crowded down there, isn't it?
  • Not Synced
    All countries were sick and poor,
    life expectancy would be below 40 in all countries.
  • Not Synced
    Only the UK and the Netherlands
    were slightly better off, but not much.
  • Not Synced
    And now, I'll start the world!
  • Not Synced
    The Industrial Revolution makes countries in Europe and elsewhere move away from the rest.
  • Not Synced
    But the colonised countries in Asia and Africa
    are stuck down there.
  • Not Synced
    Eventually the Western countries
    get healthier and healthier.
  • Not Synced
    Now we slow down to see the impact
    of the First World War and the Spanish Flu Epidemy.
  • Not Synced
    What a catastrophe!
  • Not Synced
    Now I'll speed up through the 1920s and 1930s
  • Not Synced
    and spite of the Great Depression, Western countries fueled on towards greater wealth and health.
  • Not Synced
    Japan and some others try to follow
    but most countries stay down here.
  • Not Synced
    After the tragedies of the Second World War
  • Not Synced
    we stop a bit to look at the world in 1948.
  • Not Synced
    1948 was a great year, the war was over,
    Sweden topped the medal table at the Winter Olympics,
  • Not Synced
    and I was born, but the differences between
    the countries of the world was wider than ever.
  • Not Synced
    United States was in the front,
    Japan was catching up, Brasil was way behind,
  • Not Synced
    Iran was getting a little richer from oil,
    but still had short lives.
  • Not Synced
    The Asian giants, China, India,
    Pakistan, Bangladesh and Indonesia,
  • Not Synced
    they were still poor and sit down here.
  • Not Synced
    But look what is about to happen. In my lifetime,
    former colonies gained independence
  • Not Synced
    and finally they started to get healthier,
    and healthier, and healthier.
  • Not Synced
    And in the 1970s,
    countries in Asia and Latin America
  • Not Synced
    started to catch up with the Western countries.
  • Not Synced
    They became the emerging economies.
  • Not Synced
    Some in Africa follow, some in Africa
    are stuck in civil wars, and others are hit by HIV.
  • Not Synced
    And now we can see the world today,
    in the most up-to-date statistics.
  • Not Synced
    Most people today live in the middle,
  • Not Synced
    but here are huge differences at the same time
  • Not Synced
    between the best of countries
    and the worst of countries
  • Not Synced
    and there are also huge inequalities within countries.
  • Not Synced
    These bubbles show country averages,
    but I can split them.
  • Not Synced
    Take China, I can split it into provinces.
  • Not Synced
    There goes Shanghai,
    it has the same health and wealth as Italy today.
  • Not Synced
    And then there's the poor inland province of Guizhou. It's like Pakistan.
  • Not Synced
    And if I split it further,
    the rural parts are like Ghana in Africa.
  • Not Synced
    And yet, despite the enormous disparities today,
    we have seen 200 years of remarkable progress.
  • Not Synced
    That huge historical gap between
    the West and the rest is now closing.
  • Not Synced
    We have become an entirely new converging world.
  • Not Synced
    And I see a clear trend into the future,
    with aid, trade, green technology and peace.
  • Not Synced
    It's fully possible that everyone
    can make it to the healthy-wealthy corner.
  • Not Synced
    What you've just seen in the last few minutes
    is a story of 200 countries
  • Not Synced
    shown over 200 years and beyond.
    It involved plotting 120,000 numbers.
  • Not Synced
    Pretty neat, eh?
  • Not Synced
    With statistics we can start to see things
    as they really are.
  • Not Synced
    From tables of data, to averages,
    distributions and visualisations,
  • Not Synced
    statistics gives us a clear description of the world.
  • Not Synced
    But with statistics we can not only
    discover what is happening
  • Not Synced
    but also explore why, by using
    the powerful analytical method of correlation.
  • Not Synced
    Just looking at one thing at a time
    doesn't tell you very much.
  • Not Synced
    You have to look at the relationships between things.
  • Not Synced
    How they change. How they vary together.
    That's what correlation is about.
  • Not Synced
    That's how we start to understand
    the processes that are really going on
  • Not Synced
    in the world and in socierty.
  • Not Synced
    Most of us would recognise today that crime
    correlates to poverty,
  • Not Synced
    that infection correlates to poor sanitasion,
  • Not Synced
    and that knowledge of statistics correlates
    to being great at dancing.
  • Not Synced
    Correlations can be very tricky.
  • Not Synced
    I've got a joke about silly correlations.
  • Not Synced
    This was this American
    who was afraid of heart attack.
  • Not Synced
    He found out that the Japanese ate very little fat,
    and almost didn't drink wine,
  • Not Synced
    and have much less heart attacks than the American.
  • Not Synced
    But on the other hand, he found out that the French
    eat as much fat as the Americans
  • Not Synced
    and they drink much more wine, but they also have less heart attacks.
  • Not Synced
    so he concluded that what kills you
    is speaking English.
  • Not Synced
    The best example of
    a really ground-breaking correlation
  • Not Synced
    was the link that was established in the 1950s
    between smoking and lung cancer.
  • Not Synced
    Not long after the Second World War,
    a British doctor, Richard Doll,
  • Not Synced
    investigated lung cancer patients
    in twenty London hospitals,
  • Not Synced
    and he became certain that
    the only thing they had in common was smoking
  • Not Synced
    so certain that he stopped smoking himself.
  • Not Synced
    But other people weren't so sure.
  • Not Synced
    Lots of the discussion of early data
    linking smoking and lung cancer
  • Not Synced
    it can't be smoking, surely, that thing
    we've done all our lives, that can't be bad for you.
  • Not Synced
    Maybe it's genes, maybe people
    who are genetically predisposed to get lung cancer
  • Not Synced
    are also genetically predisposed to smoke.
  • Not Synced
    Maybe it's not the smoking,
    maybe it's air pollution,
  • Not Synced
    that smokers and somehow more exposed to air pollution than non-smokers.
  • Not Synced
    Maybe it's not smoking, maybe it's poverty.
  • Not Synced
    So now we have three possible explanations
    apart from chance.
  • Not Synced
    To verify his correlation did imply cause and effect
  • Not Synced
    Richard Doll created
    the biggest statistical study of smoking yet
  • Not Synced
    He began tracking the lives of 40,000 British doctors
  • Not Synced
    some of whom smoked, some of whom didn't.
  • Not Synced
    And gathered enough data to correlate
    the amount of doctors who smoked
  • Not Synced
    with their likelihood of getting cancer.
  • Not Synced
    Eventually, he did not only show a correlation
    between smoking and lung cancer
  • Not Synced
    but also a correlation between stopping smoking
    and reducing the risk.
  • Not Synced
    This was science at its best.
  • Not Synced
    What correlations do not replace
    is human thought.
  • Not Synced
    We could think about what it means.
  • Not Synced
    What a good scientist does
    if he comes up with a correlation
  • Not Synced
    is try as hard as he or she possibly can
    to disprove it
  • Not Synced
    to break it down, to get rid of it,
    to try to refute it,
  • Not Synced
    and if it withstands all those efforts
    at demolishing it, and it still standing out,
  • Not Synced
    then we might really have something here.
  • Not Synced
    However brilliants the scientists,
    data is still the oxygen of science.
  • Not Synced
    The good news is that the more we have,
    the more correlations we'll find,
  • Not Synced
    the more theories we'll test,
    and the more discoveries we are likely to make.
  • Not Synced
    And history shows how our total sum of information
    grows in huge leaps
  • Not Synced
    as we develop new technologies.
  • Not Synced
    The invention of the printing press kicked off
    the first data and information explosion
  • Not Synced
    If you piled up all the books that have been printed
    by the year 1700
  • Not Synced
    they would make sixty stacks,
    each as high as Mount Everest.
  • Not Synced
    Then, starting in the 19th century,
    there came a second information revolution.
  • Not Synced
    With the telegraph, gramophone, camera,
    and later radio and TV.
  • Not Synced
    The total amount of information exploded.
  • Not Synced
    And by the 1950s the information available to us all
    had multiplied six thousend times.
  • Not Synced
    Then, thanks to the computer,
    and later the Internet, we went digital,
  • Not Synced
    and the amount of data we have now,
    is unimaginably vast.
  • Not Synced
    A single letter printed in a book
    is the equivalent to a byte of data.
  • Not Synced
    A single page
    equals a kilobyte or two.
  • Not Synced
    Five megabytes is enough
    for the complete works of Shakespeare.
  • Not Synced
    10 gigabytes, that's a DVD movie.
  • Not Synced
    2 terabytes is the tens of millions of photos
    added to Facebook everyday.
  • Not Synced
    10 petabytes is the data recorded every second
    by the world's largest particle accelerator,
  • Not Synced
    so much only a tiny fraction is kept.
  • Not Synced
    6 exabytes is what you'd have if you sequenced
    the genomes of every single person on Earth.
  • Not Synced
    But really, that's nothing.
    In 2009, the Internet added up to 600 exabytes,
  • Not Synced
    and in 2010, in just one year, that will double to more than one zettabyte.
  • Not Synced
    But in the real world,
    if we turned all this data into print
  • Not Synced
    it would make ninety stacks of books,
    each reaching from here all the way to the Sun.
  • Not Synced
    The data deluge is staggering.
    But with today's computers and statistics,
  • Not Synced
    I'm confident we can handle it.
  • Not Synced
    When it comes to all the data on the Internet,
  • Not Synced
    the powerhouse of statistical analysis
    is the Sillicon Valley giant Google.
  • Not Synced
    The average person over their lifetime
  • Not Synced
    is exposed to about a hundred million words
    of conversation.
  • Not Synced
    So if you multiply that
    by the six billion people on the planet
  • Not Synced
    that amount of words is equal to the amount of words
  • Not Synced
    that Google has available at any one instant of time.
  • Not Synced
    Google's computers hoover up and file away
  • Not Synced
    every document, web page
    and image they can find.
  • Not Synced
    Then they hunt for patterns and correlations
    in all this data
  • Not Synced
    doing statistics on a massive scale.
  • Not Synced
    And for me, Google has one project
    that is particularly exciting:
  • Not Synced
    statistical language translation.
  • Not Synced
    If you do want to provide access
    to all the web's information
  • Not Synced
    no matter what language is spoken.
  • Not Synced
    There's so much information on the Internet,
    you can not hope to tranlate it all by hand
  • Not Synced
    into every possible language, we figured
    we have to be able to do machine translation.
  • Not Synced
    In the past, programmers tried to teach their computers to see each language as a set of grammatical rules.
  • Not Synced
    Much like languages are taught at school.
  • Not Synced
    But this didn't work, because no set of rules
    could capture language in all its subtlety and ambiguity,
  • Not Synced
    Having eaten out lunch,
    the coach departed.
  • Not Synced
    That's obviously incorrect. Written like that,
    it would imply that the coach has eaten the lunch.
  • Not Synced
    It would be far better to say: Having eaten our lunch,
    we departed in the coach.
  • Not Synced
    Those rules are helpful,
    they are useful most of the time,
  • Not Synced
    but they don't turn out to be true
    all the time.
  • Not Synced
    And the insight of using
    statistical machine translation
  • Not Synced
    is saying: if we have all these exceptions anyways, maybe you can get by without having any rules,
  • Not Synced
    maybe we can treat everything as an exception,
    and that's essentially what we've done.
  • Not Synced
    What the computer is doing
    when it's learning how to translate
  • Not Synced
    is to learn correlations between words
    and between phrases
  • Not Synced
    so we feed the system
    very large amounts of data
  • Not Synced
    and the the system sees if a certain word or phrase
    correlates very often to the other language.
  • Not Synced
    Google's website currently offers translation between any of 57 different languages.
  • Not Synced
    It does this purely statistically,having correlated
    the huge collection of multilingual texts.
  • Not Synced
    The people who built he system
    don't need to know Chinese
  • Not Synced
    in order to build the Chinese system.
    They dont need to know Arabic.
  • Not Synced
    The expertise that is needed is basically knowledge of statistics, of computer science,
  • Not Synced
    of infrastructure,
  • Not Synced
    to build these very large computer systems we are building for doing that.
  • Not Synced
    I hooked up with Google from my office in Stockholm,
    to try the translator by myself.
  • Not Synced
    I will type some Swedish sentences.
  • Not Synced
    (Types in Swedish)
  • Not Synced
    (Reads on the screen) Sweden's finance minister
    has a ponytail and a gold ring in your ear.
  • Not Synced
    It's almost exactly correct, it's amazing.
  • Not Synced
    He comes from the conservative party,
    that's the kind of Sweden we have today.
  • Not Synced
    I will type one more sentence.
  • Not Synced
    In his same-sex parnertships has Stockholm's
    new bishop and his partners a three-year son.
  • Not Synced
    It's almost perfect,
    there's one important thing, it's "her".
  • Not Synced
    It's a lesbian partnership.
  • Not Synced
    OK, those kinds of words like "her"
    are one of the challenges in translation,
  • Not Synced
    to get those right.
  • Not Synced
    When it comes to bishops,
    one can excuse it.
  • Not Synced
    Right, I think that more often than not
    it would be probably a "his".
  • Not Synced
    I will write one more sentence.
    (Reads aloud in Swedish)
  • Not Synced
    When Sweden is taking part in Olympic gold,
    is not to win but to beat Norway.
  • Not Synced
    But they are very good in Winter Olympics,
    so we can't make it, but we are trying.
  • Not Synced
    Very good, very good.
  • Not Synced
    This is absolutely amazing,
  • Not Synced
    and I'm impressed that it picked up
    words like "same-sex partnerships"
  • Not Synced
    which are very due to the language.
  • Not Synced
    The translator is good, but if it succeeds,
    what will be next, that'll be remarkable.
  • Not Synced
    One of the exciting possibilities is combining
    the machine translation technology
  • Not Synced

    with the speech recognition technology.
  • Not Synced
    Both of these are statistically neutre.
  • Not Synced
    The machine translation relies on the statistics
    of mapping from one language to another,
  • Not Synced
    and similarly speech recognition relies on the statistics
    of mapping from a sound form to the words.
  • Not Synced
    When we put them together,
    now we have the capability
  • Not Synced
    of having instant conversations between two people who don't speak a common language.
  • Not Synced
    I can talk to you in my language,
    you hear me in your language,
  • Not Synced
    and you can answer back in real time,
    we can make that translation,
  • Not Synced
    we can bring people together
    and allow them to speak.
  • Not Synced
    The Internet is just one of many technologies
    created to gather massives amount of data.
  • Not Synced
    Scientists studying our Earth
    and our environment
  • Not Synced
    now use an incredible range of instruments
    to measure the processes of our planet.
  • Not Synced
    All around us our sensors are continously measuring
    temperature, water flow and ocean currents.
  • Not Synced
    High in orbit our satellite is busy imaging cloud formations, forest growth and snow cover.
  • Not Synced
    Scientists speak of instrumenting the Earth.
  • Not Synced
    And pointing up to the skies above,
  • Not Synced
    our powerful new telescopes are mapping the Universe.
  • Not Synced
    What's happening in astronomy,
    is tipically how profoundly this torrent of data
  • Not Synced
    is transforming science.
  • Not Synced
    Astronomers are now addressing
    many enduring misteries of the cosmos
  • Not Synced
    by applying statistical methods
    to all this new data.
  • Not Synced
    The galaxy is a very big place
    and it has billions of starts in it
  • Not Synced
    so to put toghether a coherent picture
    of the whole galaxy requires
  • Not Synced
    having enourmous amounts of data,
    and before you can do a large sky survey
  • Not Synced
    with sensitive digital detectors,
    that you can map many stars at once,
  • Not Synced
    it's very difficult to gather enough data
    of enough of the galaxy.
  • Not Synced
    In the past, large surveys of the night sky
    had to be done
  • Not Synced
    by exposing thousands
    of large photographic plates,
  • Not Synced
    but these surveys could take 25 years
    or more to complete.
  • Not Synced
    Then, in the 1990s, came digital astronomy,
  • Not Synced
    and a huge increase in both the amount
    and the accesibility of data.
  • Not Synced
    The Sloan Sky Survey is the world's biggest yet
  • Not Synced
    using a massive digital sensor mounted
  • Not Synced
    on the back of a custom built telescope in New Mexico.
  • Not Synced
    It's scanned the sky night after night for eight years
  • Not Synced
    building up a composite picture
    in unprecedented resolution.
  • Not Synced
    The Sloan's is some of the best deepest survey data
  • Not Synced
    we have in astronomy,
  • Not Synced
    both in our galaxy and galaxies away from ours.
  • Not Synced
    All the Sloan data is on the Internet
  • Not Synced
    and with it astronomers have identified
  • Not Synced
    millions of hidden unknown stars and galaxies.
  • Not Synced
    They also comb the database for statistical patterns
  • Not Synced
    which will prove, disprove or suggest new theories.
  • Not Synced
    So we have this idea that galaxies grow
  • Not Synced
    they become large galaxies
  • Not Synced
    like the one we live in, the Milky Way.
  • Not Synced
    Not all at once, not smoothly
  • Not Synced
    but by continously incorporating
  • Not Synced
    cannibalising smaller galaxies
  • Not Synced
    they dissolve them and become
    part of the bigger galaxy
  • Not Synced
    It's a startling idea
  • Not Synced
    and in the Sloan data there's the evidence to support it.
  • Not Synced
    Groups of starts that came from cannibalised galaxies
  • Not Synced
    stand out in the Sloan data statistically
    different from other stars.
  • Not Synced
    because they move at a different velocity.
  • Not Synced
    Each big spike of one of these distribution graphs
  • Not Synced
    means professor Rockossi has found a group of stars
  • Not Synced
    all travelling in a different way to the rest.
  • Not Synced
    They are the telltale patterns she's looking for.
  • Not Synced
    The evidence is accumulating that in fact
  • Not Synced
    this really is how galaxies grow
  • Not Synced
    or an important way of how galaxies grow
  • Not Synced
    this is important to understand how galaxies form
  • Not Synced
    not only ours but every galaxy.
  • Not Synced
    The more data there is
    the more discoveries can be made
  • Not Synced
    and the technology is getting better all the time.
  • Not Synced
    The next big survey telescope starts its work in 2015.
  • Not Synced
    It will leave Sloan in the dust.
  • Not Synced
    Sloan has taken 8 eight years
    to cover one quarter of the nightsky.
  • Not Synced
    The new telescope will scan the entire sky
    in even greater resolution
  • Not Synced
    every three days.
  • Not Synced
    The vast amounts of data we have today
  • Not Synced
    allows researchers in all sorts of fields
  • Not Synced
    to test their theories in a previously unimaginable scale
  • Not Synced
    but it may even change the fundamental way
    science is done.
  • Not Synced
    With the power of todays' computers
    applied to all this data
  • Not Synced
    the machines might be able to guide the researchers.
  • Not Synced
    There is a profoundly important,
    one of the most significant points in science
  • Not Synced
    certainly one of the most exciting
  • Not Synced
    the potential to transform not only
    how scientists do science
  • Not Synced
    but what science is possibly.
  • Not Synced
    What will power that transformation
    of how science is done
  • Not Synced
    is going to be computation.
  • Not Synced
    Many of the dynamics of the natual world
  • Not Synced
    like the interplay
    between the rainforest and the atmosphere
  • Not Synced
    are so complex, that we don't yet
    really understand.
  • Not Synced
    But now computers are generating
    tens of thousands of simulations
  • Not Synced
    of how these biological systems might work.
  • Not Synced
    Is like creating thousands of
    hypothetical parellel worlds.
  • Not Synced
    Each of these simulations is analysed with statistics
  • Not Synced
    to see if any are a good match
    of what is observed in each.
  • Not Synced
    The computers can now automatically generate,
  • Not Synced
    test and discard hypothesis
    with scarcely human insight.
  • Not Synced
    This new application statistics will become
  • Not Synced
    absolutely vital for the future of science.
  • Not Synced
    It's creating a new paradigm in the way we do science
  • Not Synced
    which is characterised as data-centric or data-driven
  • Not Synced
    rather than hypothesis- or experiment-driven.
  • Not Synced
    It's an exciting time in terms of science,
    computation and statistics.
  • Not Synced
    If all this sounds a bit abstract to you
  • Not Synced
    how about one final frontier?
  • Not Synced
    Could statistics make sense of your feelings?
  • Not Synced
    In California, (where else!), one computer scientist
  • Not Synced
    is harvesting the Internet to try to define the patterns
  • Not Synced
    of our innermost thoughts and emotions.
  • Not Synced
    This is the Madness Movement
  • Not Synced
    it represents a skyscrapper's view of the world.
  • Not Synced
    Each brightly coloured dot is an individual feeling
  • Not Synced
    expressed by someone out there in a blog or a tweet
  • Not Synced
    and when you click on the dot
  • Not Synced
    it explodes to reveal
    the underlying feeling of that person.
  • Not Synced
    This is what people say they're feeling today:
  • Not Synced
    better
  • Not Synced
    safe
  • Not Synced
    crappy
  • Not Synced
    well
  • Not Synced
    pretty
  • Not Synced
    special
  • Not Synced
    sorry
  • Not Synced
    alone
  • Not Synced
    Every minute WeFeelFine crosses the world's blogs
  • Not Synced
    takes all the sentences that start
    with the words "I feel" or "I'm feeling"
  • Not Synced
    and push them into a database.
  • Not Synced
    We collect all the feelings
    and we count the most common
  • Not Synced
    better
  • Not Synced
    bad
  • Not Synced
    good
  • Not Synced
    right
  • Not Synced
    guilty
  • Not Synced
    sick
  • Not Synced
    the same
  • Not Synced
    like shit
  • Not Synced
    sorry
  • Not Synced
    well
  • Not Synced
    We can take a look at any one feeling and analyse it.
  • Not Synced
    Right now a lot of people are feeling happy.
  • Not Synced
    We can take a look at these people,
    and break them down by age, gender or location.
  • Not Synced
    Since bloggers have public profiles,
    we have that information
  • Not Synced
    and we can ask questions like,
    "Are women happier than men?"
  • Not Synced
    or "Is England happier
    than the United States?"
  • Not Synced
    We find that as people get older, they get happier.
  • Not Synced
    For younger people,
    happiness associates with excitement
  • Not Synced
    whereas older people associate happiness
    more with peacefulness.
  • Not Synced
    We also find than women feel loved
    more often than men,
  • Not Synced
    but also more guilty.
  • Not Synced
    While men feel good more often than women,
    but also more alone.
  • Not Synced
    As people live more and more of their lives online
  • Not Synced
    they leave behind digital traces
  • Not Synced
    with which we can statistically analyse
  • Not Synced
    what it means to be human.
  • Not Synced
    Where does all this leave us?
  • Not Synced
    We generate unimaginable quantities of data
  • Not Synced
    About everything you can think of
  • Not Synced
    and we analyse it to reveal the patterns.
  • Not Synced
    Now not only experts but all of us can understand
  • Not Synced
    the stories in the numbers.
  • Not Synced
    Instead of being led astray by prejudice
  • Not Synced
    with statistics at our fingertips, our eyes can be open
  • Not Synced
    for a facts-based view of the world.
  • Not Synced
    More than ever before we can become
    authors of our own destiny.
  • Not Synced
    And that's pretty exciting isn't it?
  • Not Synced
    (Music)
Title:
The Joy of Stats
Description:

Documentary which takes viewers on a rollercoaster ride through the wonderful world of statistics to explore the remarkable power they have to change our understanding of the world, presented by superstar boffin Professor Hans Rosling, whose eye-opening, mind-expanding and funny online lectures have made him an international internet legend.

more » « less
Video Language:
English
Duration:
59:13
Robert Pearce Rodríguez edited English subtitles for The Joy of Stats
Robert Pearce Rodríguez edited English subtitles for The Joy of Stats
Robert Pearce Rodríguez edited English subtitles for The Joy of Stats
Robert Pearce Rodríguez edited English subtitles for The Joy of Stats
Robert Pearce Rodríguez edited English subtitles for The Joy of Stats
Robert Pearce Rodríguez edited English subtitles for The Joy of Stats
Robert Pearce Rodríguez edited English subtitles for The Joy of Stats
Robert Pearce Rodríguez edited English subtitles for The Joy of Stats
Show all

English subtitles

Incomplete

Revisions

  • Revision 18 Edited (legacy editor)
    Robert Pearce Rodríguez