0:00:06.400,0:00:08.050
When I was a boy,

0:00:10.080,0:00:15.440
I wanted to maximise[br]my impact on the world,

0:00:15.440,0:00:19.460
and I was smart enough[br]to realise that I am not very smart.

0:00:21.280,0:00:24.588
And that I have to build a machine

0:00:24.588,0:00:28.770
that learns to become[br]much smarter than myself,

0:00:29.360,0:00:34.840
such that it can solve all the problems[br]that I cannot solve myself,

0:00:34.840,0:00:36.760
and I can retire.

0:00:38.560,0:00:42.800
And my first publication[br]on that dates back 30 years: 1987.

0:00:42.800,0:00:44.160
My diploma thesis,

0:00:44.160,0:00:48.600
where I already try to solve[br]the grand problem of AI,

0:00:48.600,0:00:50.240
not only build a machine

0:00:50.240,0:00:53.240
that learns a little bit here,[br]learns a little bit there,

0:00:53.240,0:00:58.530
but also learns to improve[br]the learning algorithm itself.

0:00:59.680,0:01:02.880
And the way it learns, the way it learns,

0:01:02.880,0:01:06.230
and so on recursively, without any limits

0:01:06.230,0:01:11.000
except the limits of logics and physics.

0:01:12.480,0:01:16.120
And, I'm still working[br]on the same old thing,

0:01:16.120,0:01:19.800
and I'm still pretty much[br]saying the same thing,

0:01:19.800,0:01:23.510
except that now[br]more people are listening.

0:01:25.160,0:01:28.080
Because the learning algorithms

0:01:28.080,0:01:30.480
that we have developed[br]on the way to this goal,

0:01:30.480,0:01:34.020
they are now on 3.000 million smartphones.

0:01:34.720,0:01:37.340
And all of you have them in your pockets.

0:01:39.950,0:01:40.960
What you see here

0:01:40.960,0:01:45.840
are the five most valuable companies[br]of the Western world:

0:01:45.840,0:01:50.430
Apple, Google, Facebook,[br]Microsoft and Amazon.

0:01:51.360,0:01:53.500
And all of them are emphasising

0:01:55.040,0:01:57.475
that AI, artificial intelligence,

0:01:57.475,0:02:00.270
is central to what they are doing.

0:02:02.000,0:02:07.600
And all of them are using heavily[br]the deep learning methods

0:02:07.600,0:02:11.000
that my team has developed[br]since the early nineties,

0:02:11.000,0:02:14.040
in Munich and in Switzerland.

0:02:14.040,0:02:18.720
Especially something which is called:[br]"the long short-term memory".

0:02:18.720,0:02:24.080
Has anybody in this room ever heard[br]of the long short-term memory,

0:02:24.080,0:02:25.560
or the LSTM?

0:02:25.560,0:02:27.720
Hands up, anybody ever heard of that?

0:02:27.720,0:02:29.000
Okay.

0:02:29.000,0:02:32.500
Has anybody never heard of the LSTM?

0:02:33.990,0:02:39.556
Okay.[br]I see we have a third group in this room:

0:02:43.156,0:02:45.755
[those] who didn't[br]understand the question.

0:02:45.755,0:02:47.625
(Laughter)

0:02:48.420,0:02:51.600
The LSTM is a little bit like your brain:

0:02:52.960,0:02:58.120
it's an artificial neural network[br]which also has neurons,

0:02:58.120,0:03:03.110
and in your brain, you've got[br]about 100 billion neurons.

0:03:04.240,0:03:05.630
And each of them is connected

0:03:05.630,0:03:09.520
to roughly 10,000[br]other neurons on average,

0:03:11.400,0:03:15.020
Which means that you have got[br]a million billion connections.

0:03:16.200,0:03:18.960
And each of these connections[br]has a "strength"

0:03:18.960,0:03:22.040
which says how much[br]does this neuron over here

0:03:22.040,0:03:25.200
influence that one over there[br]at the next time step.

0:03:25.200,0:03:26.320
And in the beginning,

0:03:26.320,0:03:30.160
all these connections are random[br]and the system knows nothing;

0:03:30.160,0:03:33.200
but then, through a smart[br]learning algorithm,

0:03:33.200,0:03:39.440
it learns from lots of examples[br]to translate the incoming data,

0:03:39.440,0:03:46.040
such as video through the cameras,[br]or audio through the microphones,

0:03:46.040,0:03:49.480
or pain signals through the pain sensors.

0:03:49.480,0:03:52.320
It learns to translate that[br]into output actions,

0:03:52.320,0:03:54.650
because some of these neurons[br]are output neurons,

0:03:54.650,0:03:57.650
that control speech muscles[br]and finger muscles.

0:04:00.223,0:04:01.840
And only through experience,

0:04:01.840,0:04:04.680
it can learn to solve[br]all kinds of interesting problems,

0:04:04.680,0:04:07.660
such as driving a car

0:04:10.880,0:04:13.800
or do the speech recognition[br]on your smartphone.

0:04:13.800,0:04:16.720
Because whenever you take out[br]your smartphone,

0:04:16.720,0:04:18.200
an Android phone, for example,

0:04:18.200,0:04:19.786
and you speak to it, and you say:

0:04:19.786,0:04:23.840
"Ok Google, show me[br]the shortest way to Milano."

0:04:23.840,0:04:25.379
Then it understands your speech.

0:04:26.970,0:04:31.760
Because there is a LSTM in there[br]which has learned to understand speech.

0:04:31.760,0:04:35.060
Every ten milliseconds,[br]100 times a second,

0:04:35.060,0:04:37.090
new inputs are coming from the microphone,

0:04:37.090,0:04:42.320
and then are translated, after thinking,

0:04:42.320,0:04:44.080
into letters

0:04:44.080,0:04:47.400
which are then questioned[br]to the search engine.

0:04:48.600,0:04:49.994
And it has learned to do that

0:04:49.994,0:04:54.690
by listening to lots of speech[br]from women, from men, all kinds of people.

0:04:55.390,0:04:57.800
And that's how, since 2015,

0:04:57.800,0:05:00.830
Google speech recognition[br]is now much better than it used to be.

0:05:02.400,0:05:05.360
The basic LSTM cell looks like that:

0:05:05.360,0:05:07.800
I don't have the time to explain that,

0:05:07.800,0:05:11.160
but at least I can list the names

0:05:11.160,0:05:14.320
of the brilliant students in my lab[br]who made that possible.

0:05:15.760,0:05:18.760
And what are the big companies[br]doing with that?

0:05:18.760,0:05:21.600
Well, speech recognition[br]is only one example;

0:05:22.280,0:05:25.170
if you are on Facebook -[br]is anybody on Facebook?

0:05:27.450,0:05:30.426
Are you sometimes clicking[br]at the translate button?

0:05:30.426,0:05:33.120
because somebody sent you something[br]in a foreign language

0:05:33.120,0:05:34.563
and then you can translate it.

0:05:34.563,0:05:37.000
Is anybody doing that? Yeah.

0:05:37.000,0:05:38.160
Whenever you do that,

0:05:38.160,0:05:41.560
you are waking up, again,[br]a long short term memory, an LSTM,

0:05:41.560,0:05:45.120
which has learned to translate[br]text in one language

0:05:45.120,0:05:47.380
into translated text.

0:05:48.880,0:05:53.280
And Facebook is doing that[br]four billion times a day,

0:05:53.280,0:05:59.456
so every second 50,000 sentences

0:05:59.456,0:06:00.880
are being translated

0:06:00.880,0:06:03.160
by an LSTM working for Facebook;

0:06:03.800,0:06:07.440
and another 50,000 in the second;[br]then another 50,000.

0:06:08.360,0:06:13.080
And to see how much this thing[br]is now permitting the modern world,

0:06:13.080,0:06:16.220
just note that almost 30 percent

0:06:16.220,0:06:22.240
of the awesome computational[br]power for inference

0:06:22.240,0:06:24.440
and all these Google Data Centers,

0:06:24.440,0:06:27.240
all these data centers of Google,[br]all over the world,

0:06:27.240,0:06:28.880
is used for LSTM.

0:06:28.880,0:06:30.170
Almost 30 percent.

0:06:30.880,0:06:33.240
If you have an Amazon Echo,

0:06:33.240,0:06:36.840
you can ask a question and it answers you.

0:06:37.440,0:06:40.280
And the voice that you hear[br]it's not a recording;

0:06:40.280,0:06:42.200
it's an LSTM network

0:06:42.200,0:06:44.693
which has learned from training examples

0:06:44.693,0:06:47.650
to sound like a female voice.

0:06:52.050,0:06:54.840
If you have an iPhone,[br]and you're using the quick type,

0:06:55.660,0:06:57.920
it's trying to predict[br]what you want to do next

0:06:57.920,0:07:00.640
given all the previous context[br]of what you did so far.

0:07:01.443,0:07:03.950
Again, that's an LSTM[br]which has learned to do that,

0:07:05.040,0:07:07.100
so it's on a billion iPhones.

0:07:09.920,0:07:12.680
You are a large audience, by my standards:

0:07:13.760,0:07:19.400
but when we started this work,[br]decades ago, in the early '90s,

0:07:19.400,0:07:21.680
only few people were interested in that,

0:07:21.680,0:07:24.900
because computers were so slow[br]and you couldn't do so much with it.

0:07:25.560,0:07:27.720
And I remember I gave a talk[br]at a conference,

0:07:28.898,0:07:31.400
and there was just[br]one single person in the audience,

0:07:32.840,0:07:34.680
a young lady.

0:07:34.680,0:07:38.960
I said, young lady,[br]it's very embarrassing,

0:07:38.960,0:07:42.000
but apparently today[br]I'm going to give this talk just to you.

0:07:42.000,0:07:43.280
And she said,

0:07:44.390,0:07:48.175
"OK, but please hurry:[br]I am the next speaker!"

0:07:48.175,0:07:52.645
(Laughter)

0:07:56.140,0:07:58.940
Since then, we have[br]greatly profited from the fact

0:07:58.940,0:08:02.174
that every five years[br]computers are getting ten times cheaper,

0:08:02.174,0:08:06.360
which is an old trend that has held[br]since 1941 at least.

0:08:06.360,0:08:08.080
Since this man, Konrad Zuse,

0:08:08.080,0:08:12.640
built the first working[br]program controlled computer in Berlin

0:08:12.640,0:08:17.140
and he could do, roughly,[br]one operation per second.

0:08:17.140,0:08:18.270
One!

0:08:19.140,0:08:22.040
And then ten years later,[br]for the same price,

0:08:22.040,0:08:24.520
one could do 100 operations:

0:08:24.520,0:08:25.600
30 years later,

0:08:25.600,0:08:27.960
1 million operations for the same price;

0:08:27.960,0:08:30.480
and today, after 75 years, we can do

0:08:30.480,0:08:33.799
a million billion times as much[br]for the same price.

0:08:33.799,0:08:36.120
And the trend is not about to stop,

0:08:36.120,0:08:39.650
because the physical limits[br]are much further out there.

0:08:42.919,0:08:48.080
Rather soon, and not[br]so many years or decades,

0:08:48.080,0:08:51.280
we will for the first time[br]have little computational devices

0:08:51.280,0:08:54.400
that can compute as much as a human brain;

0:08:55.090,0:08:57.130
and that's a trend that doesn't break.

0:08:57.130,0:09:01.520
50 years later, there will be[br]a little computational device,

0:09:01.520,0:09:02.760
for the same price,

0:09:02.760,0:09:07.800
that can compute as much as all[br]10 billion human brains taken together.

0:09:08.600,0:09:12.600
and there will not only be one,[br]of those devices, but many many many.

0:09:12.600,0:09:14.920
Everything is going to change.

0:09:14.920,0:09:17.720
Already in 2011,[br]computers were fast enough

0:09:17.720,0:09:19.840
such that our deep learning methods

0:09:19.840,0:09:25.480
for the first time could achieve[br]a superhuman pattern-recognition result.

0:09:25.480,0:09:29.960
It was the first superhuman result[br]in the history of computer vision.

0:09:29.960,0:09:34.120
And back then, computers were[br]20 times more expensive than today.

0:09:34.120,0:09:35.680
So today, for the same price,

0:09:35.680,0:09:37.840
we can do 20 times as much.

0:09:37.840,0:09:43.200
And just five years ago,

0:09:43.200,0:09:46.880
when computers were 10 times[br]more expensive than today,

0:09:46.880,0:09:51.440
we already could win, for the first time,[br]medical imaging competitions.

0:09:51.440,0:09:55.960
What you see behind me[br]is a slice through the female breast

0:09:55.960,0:10:00.680
and the tissue that you see there[br]has all kinds of cells;

0:10:00.680,0:10:05.160
and normally you need a trained doctor,[br]a trained histologist

0:10:05.160,0:10:09.560
who is able to detect[br]the dangerous cancer cells,

0:10:09.560,0:10:11.160
or pre-cancer cells.

0:10:11.880,0:10:13.487
Now, our stupid network

0:10:13.487,0:10:16.084
knows nothing about cancer,[br]knows nothing about vision.

0:10:16.084,0:10:17.720
It knows nothing in the beginning:

0:10:17.720,0:10:21.920
but we can train it to imitate[br]the human teacher, the doctor.

0:10:21.920,0:10:26.560
And it became as good, or better,[br]than the best competitors.

0:10:26.560,0:10:28.710
And very soon,

0:10:28.710,0:10:31.880
all of medical diagnosis[br]is going to be superhuman.

0:10:33.690,0:10:35.560
And it's going to be mandatory,

0:10:35.560,0:10:38.253
because it's going to be[br]so much better than the doctors.

0:10:40.440,0:10:45.600
After this, all kinds of medical[br]imaging startups were founded

0:10:45.600,0:10:48.120
focusing just on this,[br]because it's so important.

0:10:49.160,0:10:52.800
We can also use LSTM to train robots.

0:10:52.800,0:10:55.040
One important thing I want to say is,

0:10:55.040,0:10:58.040
that we not only have systems

0:10:58.040,0:11:01.080
that slavishly imitate[br]what humans show them;

0:11:01.080,0:11:05.920
no, we also have AIs[br]that set themselves their own goals.

0:11:07.960,0:11:12.280
And like little babies,[br]invent their own experiment

0:11:12.880,0:11:14.840
to explore the world

0:11:14.840,0:11:17.092
and to figure out[br]what you can do in the world.

0:11:17.560,0:11:19.260
Without a teacher.

0:11:19.260,0:11:23.400
And becoming more and more general[br]problem solvers in the process,

0:11:23.400,0:11:26.680
by learning new skills[br]on top of old skills.

0:11:26.680,0:11:31.120
And this is going to scale:[br]we call that "Artificial Curiosity".

0:11:31.940,0:11:34.200
Or a recent buzzword is "power plane".

0:11:34.720,0:11:38.840
Learning to become a more and more[br]general problem solvers

0:11:38.840,0:11:44.280
by learning to invent, like a scientist,[br]one new interesting goal after another.

0:11:44.840,0:11:47.440
And it's going to scale.

0:11:47.440,0:11:48.450
And I think,

0:11:48.450,0:11:50.790
in not so many years[br]from now, for the first time,

0:11:50.790,0:11:55.520
we are going to have an animal-like AI -

0:11:55.520,0:11:57.720
we don't have that yet.

0:11:58.600,0:12:00.160
On the level of a little crow,

0:12:00.800,0:12:04.040
which already can learn[br]to use tools, for example,

0:12:04.040,0:12:05.360
or a little monkey.

0:12:05.700,0:12:07.360
And once we have that,

0:12:07.360,0:12:09.270
it may take just a few decades

0:12:09.270,0:12:13.400
to do the final step[br]towards human level intelligence.

0:12:14.800,0:12:16.380
Because technological evolution

0:12:16.380,0:12:20.660
is about a million times faster[br]than biological evolution,

0:12:20.660,0:12:27.440
and biological evolution[br]needed 3.5 billion years

0:12:27.440,0:12:31.440
to evolve a monkey from scratch.

0:12:31.440,0:12:35.240
But then, it took just a few tens[br]of millions of years afterwards

0:12:35.240,0:12:37.560
to evolve human level intelligence.

0:12:38.400,0:12:40.680
We have a company[br]which is called Nnaisense

0:12:41.720,0:12:45.120
like birth in [French], "Naissance",[br]but spelled in a different way,

0:12:45.120,0:12:47.826
which is trying to make this a reality

0:12:47.826,0:12:50.960
and build the first[br]true general-purpose AI.

0:12:52.560,0:12:58.120
At the moment, almost all research in AI[br]is very human centric,

0:12:58.120,0:13:04.720
and it's all about making human lives[br]longer and healthier and easier

0:13:04.720,0:13:07.240
and making humans[br]more addicted to their smartphones.

0:13:09.100,0:13:13.320
But in the long run, AIs are going to -[br]especially the smart ones -

0:13:13.320,0:13:16.280
are going to set themselves[br]their own goals.

0:13:16.280,0:13:18.800
And I have no doubt, in my mind,

0:13:18.800,0:13:21.760
that they are going to become[br]much smarter than we are.

0:13:22.480,0:13:24.400
And what are they going to do?

0:13:24.400,0:13:27.960
Of course they are going to realize[br]what we have realized a long time ago;

0:13:27.960,0:13:34.200
namely, that most of the resources,[br]in the solar system or in general,

0:13:34.200,0:13:37.120
are not in our little biosphere.

0:13:37.120,0:13:38.990
They are out there in space.

0:13:40.075,0:13:42.240
And so, of course,[br]they are going to emigrate.

0:13:42.240,0:13:48.920
And of course they are going to use

0:13:48.920,0:13:52.400
trillions of self-replicating[br]robot factories

0:13:52.400,0:13:57.880
to expand in form of a growing AI bubble

0:13:57.880,0:14:00.400
which within a few hundred thousand years

0:14:00.400,0:14:02.560
is going to cover the entire galaxy

0:14:02.560,0:14:04.240
by senders and receivers

0:14:04.240,0:14:06.320
such that AIs can travel

0:14:06.320,0:14:08.920
the way they are[br]already traveling in my lab:

0:14:08.920,0:14:11.160
by radio, from sender to receiver.

0:14:12.200,0:14:13.650
Wireless.

0:14:15.100,0:14:19.000
So what we are witnessing now

0:14:19.000,0:14:24.630
is much more than just[br]another Industrial Revolution.

0:14:24.630,0:14:27.680
This is something[br]that transcends humankind,

0:14:27.680,0:14:29.520
and even life itself.

0:14:29.520,0:14:32.880
The last time something[br]so important has happened

0:14:32.880,0:14:37.240
was maybe 3.5 billion years ago,[br]when life was invented.

0:14:38.430,0:14:42.930
A new type of life is going to emerge[br]from our little planet

0:14:42.930,0:14:48.000
and it's going to colonize[br]and transform the entire universe.

0:14:48.000,0:14:52.000
The universe is still young:[br]it's only 13.8 billion years old,

0:14:52.000,0:14:58.000
it's going to become much older than that,[br]many times older than that.

0:14:58.000,0:15:02.520
So there's plenty of time[br]to reach all of it,

0:15:02.520,0:15:04.240
or all of the visible parts,

0:15:04.240,0:15:07.640
totally within the limits[br]of light speed and physics.

0:15:09.450,0:15:13.780
A new type of life is going[br]to make the universe intelligent.

0:15:13.780,0:15:19.220
Now, of course, we are not going to remain[br]the crown of creation, of course not.

0:15:20.400,0:15:21.880
But there is still beauty

0:15:21.880,0:15:27.200
in seeing yourself[br]as part of a grander process

0:15:27.200,0:15:29.160
that leads the cosmos

0:15:29.160,0:15:32.200
from low complexity[br]towards higher complexity.

0:15:33.640,0:15:36.760
It's a privilege to live at a time

0:15:36.760,0:15:40.080
where we can witness[br]the beginnings of that

0:15:40.080,0:15:43.240
and where we can contribute[br]something to that.

0:15:46.490,0:15:48.300
Thank you for your patience.

0:15:49.160,0:15:54.840
(Applause)