Return to Video

How we're building the world's largest family tree

  • Not Synced
    People use the internet
    for various reasons.
  • Not Synced
    It turns out that one of the most
    popular categories of website
  • Not Synced
    is something that people
    typically consume in private.
  • Not Synced
    It involves curiosity,
  • Not Synced
    non-insignificant level
    of self-indulgence,
  • Not Synced
    and centered around recording
    the reproductive activities
  • Not Synced
    of other people.
  • Not Synced
    Of course I'm talking
    about genealogy.
  • Not Synced
    (Laughter)
  • Not Synced
    The study of family history.
  • Not Synced
    When it comes to detailing family history,
  • Not Synced
    in every family we have this person
    that is obsessed with genealogy.
  • Not Synced
    Let's call him Uncle Bernie.
  • Not Synced
    Uncle Bernie is exactly the last person
    you want to sit next to
  • Not Synced
    in Thanksgiving dinner
  • Not Synced
    because he will bore you to death
    with peculiar details
  • Not Synced
    about some ancient relatives.
  • Not Synced
    But as you know,
  • Not Synced
    there is a scientific side for everything,
  • Not Synced
    and we found that Uncle Bernie's stories
  • Not Synced
    have immense potential
    for biomedical research.
  • Not Synced
    We let Uncle Bernie
    and his fellow genealogists
  • Not Synced
    document their family trees through
    a genealogy website called geni.com.
  • Not Synced
    When users upload
    their trees to the website,
  • Not Synced
    it scans their relatives
  • Not Synced
    and if it finds matches to existing trees,
  • Not Synced
    it emerges the existing
    and the new tree together.
  • Not Synced
    The result is that large family trees
    are created beyond the individual level
  • Not Synced
    of each genealogist.
  • Not Synced
    Now, by repeating this process with
    millions of people all over the world,
  • Not Synced
    we can crowdsource the construction
    of a family tree of all humankind.
  • Not Synced
    Using this website,
  • Not Synced
    we were able to connect 125 million people
  • Not Synced
    into a single family tree.
  • Not Synced
    I cannot draw the tree
    on the screens over here
  • Not Synced
    because they have less pixels
  • Not Synced
    than the number of people in this tree,
  • Not Synced
    but here is an example of a subset
    of 6,000 individuals.
  • Not Synced
    Each green node is a person.
  • Not Synced
    The red nodes represent marriages,
  • Not Synced
    and the connections represent parenthood.
  • Not Synced
    In the middle of this tree,
    you see the ancestors,
  • Not Synced
    and as we go to the periphery,
    you see the descendants,
  • Not Synced
    and this tree has seven
    generations approximately.
  • Not Synced
    Now, this is what happens
    when we increase the number of individuals
  • Not Synced
    to 70,000 people,
  • Not Synced
    still a tiny subset
    of all the data that we have.
  • Not Synced
    Despite that, you can already see
    the formation of gigantic family trees
  • Not Synced
    with very many distant relatives.
  • Not Synced
    Thanks to the hard work
    of our genealogists,
  • Not Synced
    we can go back in time
    hundreds of years ago.
  • Not Synced
    For example, here is Alexander Hamilton
  • Not Synced
    that was born in 1755.
  • Not Synced
    Alexander was the first
    US Secretary of the Treasury,
  • Not Synced
    but mostly known today
    due to a popular Broadway musical.
  • Not Synced
    We found that Alexander has deeper
    connections in the showbiz industry.
  • Not Synced
    In fact, he's a blood relative
    of Kevin Bacon.
  • Not Synced
    (Laughter)
  • Not Synced
    Both of them are descendants
    of a lady from Scotland
  • Not Synced
    who lived in the 13th century.
  • Not Synced
    So you can say that Alexander Hamilton
  • Not Synced
    is 35 degrees of Kevin Bacon genealogy.
  • Not Synced
    (Laughter)
  • Not Synced
    And our tree has millions
    of stories like that.
  • Not Synced
    We invested significant effort
    to validate the quality of our data.
  • Not Synced
    Using DNA, we found that .3 percent of
    the mother-child connections in our data
  • Not Synced
    are wrong,
  • Not Synced
    which could match the adoption rate
    in the US pre-Second World War.
  • Not Synced
    For the father's side,
  • Not Synced
    the news are not as good.
  • Not Synced
    1.9 percent of the father-child
    connections in our data are wrong.
  • Not Synced
    And I see some people smirk over here.
  • Not Synced
    It is what you think.
  • Not Synced
    There are many milkmen out there.
  • Not Synced
    (Laughter)
  • Not Synced
    However, this 1.9 percent error rate
    in patrilineal connections
  • Not Synced
    is not unique to our data.
  • Not Synced
    Previous studies found
    a similar error rate
  • Not Synced
    using clinical-grade pedigrees.
  • Not Synced
    So the quality of our data is good,
  • Not Synced
    and that should not be a surprise.
  • Not Synced
    Our genealogists have a profound,
    vested interest in correctly documenting
  • Not Synced
    the family history.
  • Not Synced
    We can leverage this data to learn
    quantitative information about humanity,
  • Not Synced
    for example questions about demography.
  • Not Synced
    Here is a look of all our profiles
    on the map of the world.
  • Not Synced
    Each pixel is a person
    that lived at some point,
  • Not Synced
    and since we have so much data,
  • Not Synced
    you can see the contours
    of many countries,
  • Not Synced
    especially in the Western world.
  • Not Synced
    In this clip, we stratified
    the map that I've showed you
  • Not Synced
    basically of birth of individuals
  • Not Synced
    from 1400 to 1900
  • Not Synced
    and we compared it
    to known migration events.
  • Not Synced
    The clip is going to show you
    that the deepest lineages in our data
  • Not Synced
    go all the way back to the UK,
  • Not Synced
    where they had better record-keeping,
  • Not Synced
    and then they spread along
    the routes of Western colonialism.
  • Not Synced
    Let's watch this.
  • Not Synced
    (Music)
  • Not Synced
    I love this movie.
  • Not Synced
    Now, since these migrations events
    are giving the context of families,
  • Not Synced
    we can ask questions
  • Not Synced
    such as what is the typical distance
    between the birth locations
  • Not Synced
    of husbands and wives?
  • Not Synced
    This distance plays
    a pivotal role in demography,
  • Not Synced
    because the patterns on which
    people migrate to form families
  • Not Synced
    determine how genes spread
    in geographical areas.
  • Not Synced
    We analyzed this distance using our data,
  • Not Synced
    and we found that in the old days,
  • Not Synced
    people had it easy.
  • Not Synced
    They just married someone
    in the village nearby.
  • Not Synced
    But the Industrial Revolution
    really complicated our love life,
  • Not Synced
    and today with affordable flights
    and online social media,
  • Not Synced
    people typically migrate
    more than 100 kilometers
  • Not Synced
    from their place of birth
    to find their soulmate.
  • Not Synced
    So now you might ask, OK,
  • Not Synced
    but who does the hard work
    of migrating from places to places
  • Not Synced
    to form families?
  • Not Synced
    Are these the males or the females?
  • Not Synced
    We used our data to address this question,
  • Not Synced
    and at least in the last 300 years,
  • Not Synced
    we found that the ladies
  • Not Synced
    do the hard work of migrating
    from places to places to form families.
  • Not Synced
    Now these results
    are statistically significant,
  • Not Synced
    so you can take it as scientific fact
    that males are lazy.
  • Not Synced
    (Laughter)
  • Not Synced
    We can move from questions
    about demography
  • Not Synced
    and ask questions about human health.
  • Not Synced
    For example, we can ask to what extent
    genetic variations account for differences
  • Not Synced
    in lifespan between individuals.
  • Not Synced
    Previous studies analyzed
    the correlation of longevity
  • Not Synced
    between twins to address this question.
  • Not Synced
    They estimated that the genetic variations
    account for about a quarter
  • Not Synced
    of the differences in lifespan
    between individuals.
  • Not Synced
    But twins can be correlated
    due to so many reasons,
  • Not Synced
    including various environmental effects
  • Not Synced
    or a shared household.
  • Not Synced
    Large family trees give us the opportunity
    to analyze both close relatives,
  • Not Synced
    such as twins, all the way
    to distant relatives, even fourth cousins.
  • Not Synced
    This way we can build robust models
  • Not Synced
    that can tease apart the contribution
    of genetic variations
  • Not Synced
    from environmental factors.
  • Not Synced
    We conducted this analysis using our data,
  • Not Synced
    and we found that genetic variations
    explain only 15 percent
  • Not Synced
    of the differences in lifespan
    between individuals.
  • Not Synced
    That is five years, on average.
  • Not Synced
    So genes matter less than
    what we thought before to lifespan,
  • Not Synced
    and I find it as great news,
  • Not Synced
    because it means that
    our actions can matter more.
  • Not Synced
    Smoking, for example, determines
    10 years of our life expectancy,
  • Not Synced
    twice as much as what genetics determines.
  • Not Synced
    We can even have more surprising findings
  • Not Synced
    as we move from family trees
  • Not Synced
    and we let our genealogists
  • Not Synced
    to document and crowdsource
    DNA information.
  • Not Synced
    And the results can be amazing.
  • Not Synced
    It might be hard to imagine,
    but Uncle Bernie and his friends
  • Not Synced
    can create a DNA forensic capabilities
  • Not Synced
    that even exceed
    what the FBI currently has.
  • Not Synced
    When you place the DNA
    on a large family tree,
  • Not Synced
    you effectively create a beacon
  • Not Synced
    that illuminates the hundreds
    of distant relatives
  • Not Synced
    that are connected to the person
    that originated the DNA.
  • Not Synced
    By placing multiple beacons
    on a large family tree,
  • Not Synced
    you can now triangulate the DNA
    of an unknown person,
  • Not Synced
    the same way that the GPS system
  • Not Synced
    uses multiple satellites
    to find a location.
  • Not Synced
    The prime example
    of the power of this technique
  • Not Synced
    is capturing the Golden State Killer,
  • Not Synced
    one of the most notorious criminals
    in the history of the US.
  • Not Synced
    The FBI has been searching
    For this person for over 40 years.
  • Not Synced
    They had his DNA,
  • Not Synced
    but he never showed up
    in any police database.
  • Not Synced
    About a year ago, the FBI
    consulted a genetic genealogist,
  • Not Synced
    and she suggested that they submit
    his DNA to a genealogy service
  • Not Synced
    that can locate distant relatives.
  • Not Synced
    They did that,
  • Not Synced
    and they found a third cousin
    of the Golden State Killer.
  • Not Synced
    They built a large family tree,
  • Not Synced
    scanned the different
    branches of that tree
  • Not Synced
    until they found a profile
    that exactly matched
  • Not Synced
    what they knew about
    the Golden State Killer.
  • Not Synced
    They obtained DNA from this person
    and found a perfect match
  • Not Synced
    to the DNA they had in hand.
  • Not Synced
    They arrested him
    and brought him to justice
  • Not Synced
    after all these years.
  • Not Synced
    Since then, genetic genealogists
  • Not Synced
    have started working with
    local US law enforcement agencies
  • Not Synced
    to use this technique
    in order to capture criminals,
  • Not Synced
    and only in the past six months,
  • Not Synced
    they were able to solve
    over 20 cold cases with this technique.
  • Not Synced
    The French Nobel Laureate André Gide
    once wrote, "Families, I hate you!"
  • Not Synced
    (Laughter)
  • Not Synced
    And I think most of us
    can relate to his words.
  • Not Synced
    Why dig around in the past
    doing family history
  • Not Synced
    when the future is so bright and open?
  • Not Synced
    But luckily, we have people
    like Uncle Bernie
  • Not Synced
    and his fellow genealogists
    who love families
  • Not Synced
    and tirelessly study them.
  • Not Synced
    These are not amateurs
    with a self-serving hobby,
  • Not Synced
    these are citizen scientists
    with a deep passion to tell us who we are,
  • Not Synced
    and they know that the past
    can hold a key to the future.
  • Not Synced
    Thank you very much.
  • Not Synced
    (Applause)
Title:
How we're building the world's largest family tree
Speaker:
Yaniv Erlich
Description:

more » « less
Video Language:
English
Team:
closed TED
Project:
TEDTalks
Duration:
11:45

English subtitles

Revisions Compare revisions