1
00:00:00,000 --> 00:00:09,044
preroll music
2
00:00:09,044 --> 00:00:14,049
Herald: Our next talk is going to be about AI and
it's going to be about proper AI.
3
00:00:14,049 --> 00:00:17,730
It's not going to be about
deep learning or buzz word bingo.
4
00:00:17,730 --> 00:00:22,590
It's going to be about actual psychology.
It's going to be about computational metapsychology.
5
00:00:22,590 --> 00:00:25,750
And now please welcome Joscha!
6
00:00:25,750 --> 00:00:33,050
applause
7
00:00:33,050 --> 00:00:35,620
Joscha: Thank you.
8
00:00:35,620 --> 00:00:37,710
I'm interested in understanding
how the mind works,
9
00:00:37,710 --> 00:00:42,640
and I believe that the most foolproof perspective
at looking ... of looking at minds is to understand
10
00:00:42,640 --> 00:00:46,600
that they are systems that if you saw patterns
at them you find meaning.
11
00:00:46,600 --> 00:00:51,700
And you find meaning in those in very particular
ways and this is what makes us who we are.
12
00:00:51,700 --> 00:00:55,239
So they way to study and understand who we
are in my understanding is
13
00:00:55,239 --> 00:01:01,149
to build models of information processing
that constitutes our minds.
14
00:01:01,149 --> 00:01:05,640
Last year about the same time, I've answered
the four big questions of philosophy:
15
00:01:05,640 --> 00:01:08,510
"Whats the nature of reality?", "What can
be known?", "Who are we?",
16
00:01:08,510 --> 00:01:14,650
"What should we do?"
So now, how can I top this?
17
00:01:14,650 --> 00:01:18,720
applause
18
00:01:18,720 --> 00:01:22,849
I'm going to give you the drama
that divided a planet.
19
00:01:22,849 --> 00:01:26,470
Some of a very, very big events,
that happened in the course of last year,
20
00:01:26,470 --> 00:01:30,080
so I couldn't tell you about it before.
21
00:01:30,080 --> 00:01:38,489
What color is the dress
laughsapplause
22
00:01:38,489 --> 00:01:44,720
I mean ahmm... If you have.. do not have any
mental defects you can clearly see it's white
23
00:01:44,720 --> 00:01:46,550
and gold. Right?
24
00:01:46,550 --> 00:01:48,720
[voices from audience]
25
00:01:48,720 --> 00:01:53,009
Turns out, ehmm.. most people seem to have
mental defects and say it is blue and black.
26
00:01:53,009 --> 00:01:57,500
I have no idea why. Well Ok, I have an idea,
why that is the case.
27
00:01:57,500 --> 00:02:01,170
Ehmm, I guess that you got too, it has to
do with color renormalization
28
00:02:01,170 --> 00:02:04,720
and color renormalization happens differently
apparently in different people.
29
00:02:04,720 --> 00:02:09,000
So we have different wireing to renormalize
the white balance.
30
00:02:09,000 --> 00:02:12,650
And it seems to work in real world
situations in pretty much the same way,
31
00:02:12,650 --> 00:02:18,000
but not necessarily for photographs.
Which have only very small fringe around them,
32
00:02:18,000 --> 00:02:20,600
which gives you hint about the lighting situation.
33
00:02:20,600 --> 00:02:27,000
And that's why you get this huge divergencies,
which is amazing!
34
00:02:27,000 --> 00:02:29,660
So what we see that our minds can not know
35
00:02:29,660 --> 00:02:33,250
objective truths in any way. Outside of mathematics.
36
00:02:33,250 --> 00:02:36,340
They can generate meaning though.
37
00:02:36,340 --> 00:02:38,760
How does this work?
38
00:02:38,760 --> 00:02:42,010
I did robotic soccer for a while,
and there you have the situation,
39
00:02:42,010 --> 00:02:45,150
that you have a bunch of robots, that are
situated on a playing field.
40
00:02:45,150 --> 00:02:48,480
And they have a model of what goes on
in the playing field.
41
00:02:48,480 --> 00:02:52,050
Physics generates data for their sensors.
They read the bits of the sensors.
42
00:02:52,050 --> 00:02:55,900
And then they use them to.. erghmm update
the world model.
43
00:02:55,900 --> 00:02:59,020
And sometimes we didn't want
to take the whole playing field along,
44
00:02:59,020 --> 00:03:03,380
and the physical robots, because they are
expensive and heavy and so on.
45
00:03:03,380 --> 00:03:06,480
Instead if you just want to improve the learning
and the game play of the robots
46
00:03:06,480 --> 00:03:07,800
you can use the simulations.
47
00:03:07,800 --> 00:03:11,200
So we've wrote a computer simulation of the
playing field and the physics, and so on,
48
00:03:11,200 --> 00:03:15,210
that generates pretty some the same data,
and put the robot mind into the simulator
49
00:03:15,210 --> 00:03:17,040
robot body, and it works just as well.
50
00:03:17,040 --> 00:03:20,590
That is, if you the robot, because you can
not know the difference if you are the robot.
51
00:03:20,590 --> 00:03:24,460
You can not know what's out there. The only
thing that you get to see is what is the structure
52
00:03:24,460 --> 00:03:27,530
of the data at you system bit interface.
53
00:03:27,530 --> 00:03:30,090
And then you can derive model from this.
54
00:03:30,090 --> 00:03:32,960
And this is pretty much the situation
that we are in.
55
00:03:32,960 --> 00:03:38,180
That is, we are minds that are somehow computational,
56
00:03:38,180 --> 00:03:40,700
they are able to find regularity in patterns,
57
00:03:40,700 --> 00:03:44,530
and they are... we.. seem to have access to
something that is full of regularity,
58
00:03:44,530 --> 00:03:46,630
so we can make sense out of it.
59
00:03:46,630 --> 00:03:48,930
[ghulp, ghulp]
60
00:03:48,930 --> 00:03:52,800
Now, if you discover that you are in the same
situation as these robots,
61
00:03:52,800 --> 00:03:56,180
basically you discover that you are some kind
of apparently biological robot,
62
00:03:56,180 --> 00:03:58,530
that doesn't have direct access
to the world of concepts.
63
00:03:58,530 --> 00:04:02,140
That has never actually seen matter
and energy and other people.
64
00:04:02,140 --> 00:04:04,890
All it got to see was little bits of information,
65
00:04:04,890 --> 00:04:06,270
that were transmitted through the nerves,
66
00:04:06,270 --> 00:04:07,870
and the brain had to make sense of them,
67
00:04:07,870 --> 00:04:10,470
by counting them in elaborate ways.
68
00:04:10,470 --> 00:04:12,720
What's the best model of the world
that you can have with this?
69
00:04:12,720 --> 00:04:16,530
What will the state of affairs,
what's the system that you are in?
70
00:04:16,530 --> 00:04:20,920
And what are the best algorithms that you
should be using, to fix your world model.
71
00:04:20,920 --> 00:04:23,310
And this question is pretty old.
72
00:04:23,310 --> 00:04:27,750
And I think that has been answered for the
first time by Ray Solomonoff in the 1960.
73
00:04:27,750 --> 00:04:30,840
He has discovered an algorithm,
that you can apply when you discover
74
00:04:30,840 --> 00:04:33,540
that you are an robot,
and all you have is data.
75
00:04:33,540 --> 00:04:34,870
What is the world like?
76
00:04:34,870 --> 00:04:40,990
And this algorithm is basically
a combination of induction and Occam's razor.
77
00:04:40,990 --> 00:04:45,710
And we can mathematically prove that we can
not do better than Solomonoff induction.
78
00:04:45,710 --> 00:04:51,380
Unfortunately, Solomonoff induction
is not quite computable.
79
00:04:51,380 --> 00:04:54,450
But everything that we are going to do is
some... is going to be some approximation
80
00:04:54,450 --> 00:04:55,820
of Salomonoff induction.
81
00:04:55,820 --> 00:04:59,400
So our concepts can not really refer
to the facts in the world out there.
82
00:04:59,400 --> 00:05:02,380
We do not get the truth by referring
to stuff out there, in the world.
83
00:05:02,380 --> 00:05:07,960
We get meaning by suitably encoding
the patterns at our systemic interface.
84
00:05:07,960 --> 00:05:12,270
And AI has recently made a huge progress in
encoding data at perceptual interfaces.
85
00:05:12,270 --> 00:05:15,900
Deep learning is about using a stacked hierarchy
of feature detectors.
86
00:05:15,900 --> 00:05:21,280
That is, we use pattern detectors and we build
them into a networks that are arranged in
87
00:05:21,280 --> 00:05:23,030
hundreds of layers.
88
00:05:23,030 --> 00:05:26,500
And then we adjust the links
between these layers.
89
00:05:26,500 --> 00:05:29,380
Usually some kind of... using
some kind of gradient descent.
90
00:05:29,380 --> 00:05:33,220
And we can use this to classify
for instance images and parts of speech.
91
00:05:33,220 --> 00:05:37,950
So, we get to features that are more and more
complex, they started as very, very simple patterns.
92
00:05:37,950 --> 00:05:41,290
And then get more and more complex,
until we get to object categories.
93
00:05:41,290 --> 00:05:44,199
And now this systems are able
in image recognition task,
94
00:05:44,199 --> 00:05:47,480
to approach performance that is very similar
to human performance.
95
00:05:47,480 --> 00:05:52,040
Also what is nice is that it seems to be somewhat
similar to what the brain seems to be doing
96
00:05:52,040 --> 00:05:53,740
in visual processing.
97
00:05:53,740 --> 00:05:57,570
And if you take the activation in different
levels of these networks and you
98
00:05:57,570 --> 00:06:01,430
erghm... improve the... that... erghmm...
enhance this activation a little bit, what
99
00:06:01,430 --> 00:06:03,500
you get is stuff that look very psychedelic.
100
00:06:03,500 --> 00:06:09,620
Which may be similar to what happens, if you
put certain illegal substances into people,
101
00:06:09,620 --> 00:06:13,650
and enhance the activity on certain layers
of their visual processing.
102
00:06:13,650 --> 00:06:21,540
[BROKEN AUDIO]If you want to classify the
differences what we do if we want quantify
103
00:06:21,540 --> 00:06:33,030
this you filter out all the invariences in
the data.
104
00:06:33,030 --> 00:06:36,360
The pose that she has, the lighting,
the dress that she is on.. has on,
105
00:06:36,360 --> 00:06:38,020
her facial expression and so on.
106
00:06:38,020 --> 00:06:42,900
And then we go to only to this things that
is left after we've removed all the nuance data.
107
00:06:42,900 --> 00:06:47,410
But what if we... erghmm
want to get to something else,
108
00:06:47,410 --> 00:06:49,850
for instance if we want to understand poses.
109
00:06:49,850 --> 00:06:53,240
Could be for instance that we have several
dancers and we want to understand what they
110
00:06:53,240 --> 00:06:54,400
have in common.
111
00:06:54,400 --> 00:06:58,330
So our best bet is not just to have a single
classification based filtering,
112
00:06:58,330 --> 00:07:01,199
but instead what we want to have is to take
the low level input
113
00:07:01,199 --> 00:07:05,180
and get a whole universe of features,
that is interrelated.
114
00:07:05,180 --> 00:07:07,220
So we have different levels of interrelations.
115
00:07:07,220 --> 00:07:08,960
At the lowest levels we have percepts.
116
00:07:08,960 --> 00:07:11,580
On the slightly higher level we have simulations.
117
00:07:11,580 --> 00:07:16,920
And on even higher level we have concept landscape.
118
00:07:16,920 --> 00:07:19,300
How does this representation
by simulation work?
119
00:07:19,300 --> 00:07:22,229
Now imagine you want to understand sound.
120
00:07:22,229 --> 00:07:23,669
[Ghulp]
121
00:07:23,669 --> 00:07:26,710
If you are a brain and you want to understand
sound you need to model it.
122
00:07:26,710 --> 00:07:31,070
Unfortunatly we can not really model sound
with neurons, because sound goes up to 20kHz,
123
00:07:31,070 --> 00:07:36,660
or if you are old like me maybe to 12 kHz.
20 kHz is what babies could do.
124
00:07:36,660 --> 00:07:41,240
And... neurons do not want to do 20 kHz.
That's way too fast for them.
125
00:07:41,240 --> 00:07:43,250
They like something like 20 Hz.
126
00:07:43,250 --> 00:07:45,590
So what do you do? You need
to make a Fourier transform.
127
00:07:45,590 --> 00:07:49,650
The Fourier transform measures the amount
of energy at different frequencies.
128
00:07:49,650 --> 00:07:52,500
And because you can not do it with neurons,
you need to do it in hardware.
129
00:07:52,500 --> 00:07:54,180
And turns out this is exactly
what we are doing.
130
00:07:54,180 --> 00:07:59,860
We have this cochlea which is this snail like
thing in our ears,
131
00:07:59,860 --> 00:08:06,669
and what it does, it transforms energy of
sound in different frequency intervals into
132
00:08:06,669 --> 00:08:08,009
energy measurments.
133
00:08:08,009 --> 00:08:10,479
And then gives you something
like what you see here.
134
00:08:10,479 --> 00:08:12,550
And this is something that the brain can model,
135
00:08:12,550 --> 00:08:16,210
so we can get a neurosimulator that tries
to recreate this patterns.
136
00:08:16,210 --> 00:08:21,370
And we can predict the next input from the
cochlea that then understand the sound.
137
00:08:21,370 --> 00:08:23,410
Of course if you want to understand music,
138
00:08:23,410 --> 00:08:25,160
we have to go beyond understanding sound.
139
00:08:25,160 --> 00:08:29,340
We have to understand the transformations
that sound can have if you play it at different pitch.
140
00:08:29,340 --> 00:08:33,599
We have to arrange the sound in the sequence
that give you rhythms and so on.
141
00:08:33,599 --> 00:08:35,889
And then we want to identify
some kind of musical grammar
142
00:08:35,889 --> 00:08:38,799
that we can use to again control the sequencer.
143
00:08:38,799 --> 00:08:42,529
So we have stucked structures.
That simulate the world.
144
00:08:42,529 --> 00:08:44,319
And once you've learned this model of music,
145
00:08:44,319 --> 00:08:47,309
once you've learned the musical grammar,
the sequencer and the sounds.
146
00:08:47,309 --> 00:08:51,779
You can get to the structure
of the individual piece of music.
147
00:08:51,779 --> 00:08:54,399
So, if you want to model the world of music.
148
00:08:54,399 --> 00:08:58,279
You need to have the lowest level of percepts
then we have the higher level of mental simulations.
149
00:08:58,279 --> 00:09:01,910
And... which give the sequences of the music
and the grammars of music.
150
00:09:01,910 --> 00:09:05,149
And beyond this you have the conceptual landscape
that you can use
151
00:09:05,149 --> 00:09:08,249
to describe different styles of music.
152
00:09:08,249 --> 00:09:12,130
And if you go up in the hierarchy,
you get to more and more abstract models.
153
00:09:12,130 --> 00:09:13,860
More and more conceptual models.
154
00:09:13,860 --> 00:09:16,449
And more and more analytic models.
155
00:09:16,449 --> 00:09:18,160
And this are causal models at some point.
156
00:09:18,160 --> 00:09:20,999
This causal models can be weakly deterministic,
157
00:09:20,999 --> 00:09:22,980
basically associative models, which tell you
158
00:09:22,980 --> 00:09:27,339
if this state happens, it's quite probable
that this one comes afterwords.
159
00:09:27,339 --> 00:09:29,389
Or you can get to a strongly determined model.
160
00:09:29,389 --> 00:09:32,730
Strongly determined model is one which tells
you, if you are in this state
161
00:09:32,730 --> 00:09:33,879
and this condition is met,
162
00:09:33,879 --> 00:09:35,589
You are are going to go exactly in this state.
163
00:09:35,589 --> 00:09:40,110
If this condition is not met, or a different
condition is met, you are going to this state.
164
00:09:40,110 --> 00:09:41,449
And this is what we call an alghorithm.
165
00:09:41,449 --> 00:09:46,769
it's.. now we are on the domain of computation.
166
00:09:46,769 --> 00:09:48,730
Computation is slightly different from mathematics.
167
00:09:48,730 --> 00:09:51,179
It's important to understand this.
168
00:09:51,179 --> 00:09:54,699
For a long time people have thought that the
universe is written in mathematics.
169
00:09:54,699 --> 00:09:58,399
Or that.. minds are mathematical,
or anything is mathematical.
170
00:09:58,399 --> 00:10:00,439
In fact nothing is mathematical.
171
00:10:00,439 --> 00:10:04,529
Mathematics is just the domain
of formal languages. It doesn't exist.
172
00:10:04,529 --> 00:10:07,300
Mathematics starts with a void.
173
00:10:07,300 --> 00:10:11,939
You throw in a few axioms, and if you've chosen
a nice axioms, then you get infinite complexity.
174
00:10:11,939 --> 00:10:13,679
Most of which is not computable.
175
00:10:13,679 --> 00:10:16,270
In mathematics you can express arbitrary statements,
176
00:10:16,270 --> 00:10:18,269
because it's all about formal languages.
177
00:10:18,269 --> 00:10:20,369
Many of this statements will not make sense.
178
00:10:20,369 --> 00:10:22,469
Many of these statements will make sense
in some way,
179
00:10:22,469 --> 00:10:24,429
but you can not test whether they make sense,
180
00:10:24,429 --> 00:10:26,740
because they're not computable.
181
00:10:26,740 --> 00:10:29,929
Computation is different.
Computation can exist.
182
00:10:29,929 --> 00:10:32,459
It's starts with an initial state.
183
00:10:32,459 --> 00:10:34,739
And then you have a transition function.
You do the work.
184
00:10:34,739 --> 00:10:38,449
You apply the transition function,
and you get into the next state.
185
00:10:38,449 --> 00:10:41,249
Computation is always finite.
186
00:10:41,249 --> 00:10:43,689
Mathematics is the kingdom of specification.
187
00:10:43,689 --> 00:10:47,290
And computation is the kingdom of implementation.
188
00:10:47,290 --> 00:10:50,629
It's very important to understand this difference.
189
00:10:50,629 --> 00:10:55,329
All our access to mathematics of course is
because we do computation.
190
00:10:55,329 --> 00:10:57,459
We can understand mathematics,
191
00:10:57,459 --> 00:10:59,939
because our brain can compute
some parts of mathematics.
192
00:10:59,939 --> 00:11:04,439
Very, very little of it, and to
very constrained complexity.
193
00:11:04,439 --> 00:11:06,860
But enough, so we can map
some of the infinite complexity
194
00:11:06,860 --> 00:11:10,410
and noncomputability of mathematics
into computational patterns,
195
00:11:10,410 --> 00:11:12,279
that we can explore.
196
00:11:12,279 --> 00:11:14,410
So computation is about doing the work,
197
00:11:14,410 --> 00:11:16,939
it's about executing the transition function.
198
00:11:19,730 --> 00:11:22,899
Now we've seen that mental representation
is about concepts,
199
00:11:22,899 --> 00:11:25,670
mental simulations, conceptual representations
200
00:11:25,670 --> 00:11:29,110
and this conceptual representations
give us concept spaces.
201
00:11:29,110 --> 00:11:30,970
And the nice thing
about this concept spaces is
202
00:11:30,970 --> 00:11:33,399
that they give us an interface
to our mental representations,
203
00:11:33,399 --> 00:11:36,290
We can use to address and manipulate them.
204
00:11:36,290 --> 00:11:39,119
And we can share them in cultures.
205
00:11:39,119 --> 00:11:40,899
And this concepts are compositional.
206
00:11:40,899 --> 00:11:43,639
We can put them together, to create new concepts.
207
00:11:43,639 --> 00:11:48,230
And they can be described using
higher dimensional vector spaces.
208
00:11:48,230 --> 00:11:50,319
They don't do simulation
and prediction and so on,
209
00:11:50,319 --> 00:11:53,119
but we can capture regularity
in our concept wisdom.
210
00:11:53,119 --> 00:11:55,220
With this vector space
you can do amazing things.
211
00:11:55,220 --> 00:11:57,589
For instance, if you take the vector from
"King" to "Queen"
212
00:11:57,589 --> 00:12:01,009
is pretty much the same vector
as to.. between "Man" and "Woman"
213
00:12:01,009 --> 00:12:04,110
And because of this properties, because it's
really a high dimentional manifold
214
00:12:04,110 --> 00:12:07,569
this concepts faces, we can do interesting
things, like machine translation
215
00:12:07,569 --> 00:12:09,470
without understanding what it means.
216
00:12:09,470 --> 00:12:13,929
That is without doing any proper mental representation,
that predicts the world.
217
00:12:13,929 --> 00:12:16,989
So this is a type of meta representation,
that is somewhat incomplete,
218
00:12:16,989 --> 00:12:21,199
but it captures the landscape that we share
in a culture.
219
00:12:21,199 --> 00:12:25,089
And then there is another type of meta representation,
that is linguistic protocols.
220
00:12:25,089 --> 00:12:27,699
Which is basically a formal grammar and vocabulary.
221
00:12:27,699 --> 00:12:29,619
And we need this linguistic protocols
222
00:12:29,619 --> 00:12:32,869
to transfer mental representations
between people.
223
00:12:32,869 --> 00:12:36,019
And we do this by basically
scanning our mental representation,
224
00:12:36,019 --> 00:12:38,660
disassembling them in some way
or disambiguating them.
225
00:12:38,660 --> 00:12:43,040
And then we use it as discrete string of symbols
to get it to somebody else,
226
00:12:43,040 --> 00:12:46,429
and he trains an assembler,
that reverses this process,
227
00:12:46,429 --> 00:12:51,389
and build something that is pretty similar
to what we intended to convey.
228
00:12:51,389 --> 00:12:53,569
And if you look at the progression of AI models,
229
00:12:53,569 --> 00:12:55,600
it pretty much went the opposite direction.
230
00:12:55,600 --> 00:13:00,279
So AI started with linguistic protocols, which
were expressed in formal grammars.
231
00:13:00,279 --> 00:13:05,209
And then it got to concepts spaces, and now
it's about to address percepts.
232
00:13:05,209 --> 00:13:09,689
And at some point in near future it's going
to get better at mental simulations.
233
00:13:09,689 --> 00:13:11,730
And at some point after that we get to
234
00:13:11,730 --> 00:13:14,769
attention directed and
motivationally connected systems,
235
00:13:14,769 --> 00:13:16,600
that make sense of the world.
236
00:13:16,600 --> 00:13:20,290
that are in some sense able to address meaning.
237
00:13:20,290 --> 00:13:23,489
This is the hardware that we have can do.
238
00:13:23,489 --> 00:13:25,629
What kind of hardware do we have?
239
00:13:25,629 --> 00:13:28,480
That's a very interesting question.
240
00:13:28,480 --> 00:13:32,230
It could start out with a question:
How difficult is it to define a brain?
241
00:13:32,230 --> 00:13:35,439
We know that the brain must be
somewhere hidden in the genome.
242
00:13:35,439 --> 00:13:38,290
The genome fits on a CD ROM.
It's not that complicated.
243
00:13:38,290 --> 00:13:40,399
It's easier than Microsoft Windows. laughter
244
00:13:40,399 --> 00:13:45,549
And we also know, that about 2%
of the genome is coding for proteins.
245
00:13:45,549 --> 00:13:48,429
And maybe about 10% of the genome
has some kind of stuff
246
00:13:48,429 --> 00:13:51,239
that tells you when to switch protein.
247
00:13:51,239 --> 00:13:52,829
And the remainder is mostly garbage.
248
00:13:52,829 --> 00:13:57,170
It's old viruses that are left over and has
never been properly deleted and so on.
249
00:13:57,170 --> 00:14:01,420
Because there are no real
code revisions in the genome.
250
00:14:01,420 --> 00:14:08,119
So how much of this 10%
that is 75 MB code for the brain.
251
00:14:08,119 --> 00:14:09,469
We don't really know.
252
00:14:09,469 --> 00:14:13,399
What we do know is we share
almost all of this with mice.
253
00:14:13,399 --> 00:14:15,769
Genetically speaking human
is a pretty big mouse.
254
00:14:15,769 --> 00:14:21,049
With a few bits changed, so.. to fix some
of the genetic expressions
255
00:14:21,049 --> 00:14:25,879
And that is most of the stuff there is going
to code for cells and metabolism
256
00:14:25,879 --> 00:14:27,999
and how your body looks like and so on.
257
00:14:27,999 --> 00:14:33,679
But if you look at erghmm... how much is expressed
in the brain and only in the brain,
258
00:14:33,679 --> 00:14:35,170
in terms of proteins and so on.
259
00:14:35,170 --> 00:14:45,639
We find it's about... well of the 2% it's
about 5%. That is only the 5% of the 2% that
260
00:14:45,639 --> 00:14:46,799
is only in the brain.
261
00:14:46,799 --> 00:14:50,199
And another 5% of the 2% is predominantly
in the brain.
262
00:14:50,199 --> 00:14:52,069
That is more in the brain than anywhere else.
263
00:14:52,069 --> 00:14:54,249
Which gives you some kind of thing
like a lower bound.
264
00:14:54,249 --> 00:14:59,379
Which means to encode a brain genetically
base on the hardware that we are using.
265
00:14:59,379 --> 00:15:03,539
We need something like
at least 500 kB of code.
266
00:15:03,539 --> 00:15:06,670
Actually ehmm.. this... we very conservative
lower bound.
267
00:15:06,670 --> 00:15:08,720
It's going to be a little more I guess.
268
00:15:08,720 --> 00:15:11,449
But it sounds surprisingly little, right?
269
00:15:11,449 --> 00:15:13,709
But in terms of scientific theories
this is a lot.
270
00:15:13,709 --> 00:15:16,519
I mean the universe,
according to the core theory
271
00:15:16,519 --> 00:15:19,420
of the quantum mechanics and so on
is like so much of code.
272
00:15:19,420 --> 00:15:20,569
It's like half a page of code.
273
00:15:20,569 --> 00:15:23,100
That's it. That's all you need
to generate the universe.
274
00:15:23,100 --> 00:15:25,489
And if you want to understand evolution
it's like a paragraph.
275
00:15:25,489 --> 00:15:29,609
It's couple lines you need to understand
evolutionary process.
276
00:15:29,609 --> 00:15:32,199
And there is a lots, lots of details, that's
you get afterwards.
277
00:15:32,199 --> 00:15:34,220
Because this process itself doesn't define
278
00:15:34,220 --> 00:15:37,259
how the animals are going to look like,
and in similar way is..
279
00:15:37,259 --> 00:15:41,269
the code of the universe doesn't tell you
what this planet is going to look like.
280
00:15:41,269 --> 00:15:43,279
And what you guys are going to look like.
281
00:15:43,279 --> 00:15:45,949
It's just defining the rulebook.
282
00:15:45,949 --> 00:15:49,209
And in the same sense genome defines the rulebook,
283
00:15:49,209 --> 00:15:51,569
by which our brain is build.
284
00:15:51,569 --> 00:15:56,399
erghmmm,.. The brain boots itself
into developer process,
285
00:15:56,399 --> 00:15:58,119
and this booting takes some time.
286
00:15:58,119 --> 00:16:01,069
So subliminal learning in which
initial connections are forged
287
00:16:01,069 --> 00:16:04,910
And basic models are build of the world,
so we can operate in it.
288
00:16:04,910 --> 00:16:06,999
And how long does this booting take?
289
00:16:06,999 --> 00:16:09,669
I thing it's about 80 mega seconds.
290
00:16:09,669 --> 00:16:14,319
That's the time that a child is awake until
it's 2.5 years old.
291
00:16:14,319 --> 00:16:16,449
By this age you understand Star Wars.
292
00:16:16,449 --> 00:16:20,029
And I think that everything after
understanding Star Wars is cosmetics.
293
00:16:20,029 --> 00:16:26,799
laughterapplause
294
00:16:26,799 --> 00:16:32,820
You are going to be online, if you get to
arrive old age for about 1.5 giga seconds.
295
00:16:32,820 --> 00:16:37,929
And in this time I think you are going to
get not to watch more than 5 milion concepts.
296
00:16:37,929 --> 00:16:41,600
Why? I don't know real...
If you look at this child.
297
00:16:41,600 --> 00:16:45,480
If a child would be able to form a concept
let say every 5 minutes,
298
00:16:45,480 --> 00:16:48,529
then by the time it's about 4 years old,
it's going to have
299
00:16:48,529 --> 00:16:51,549
something like 250 thousands concepts.
300
00:16:51,549 --> 00:16:54,119
And... so... a quarter million.
301
00:16:54,119 --> 00:16:56,809
And if we extrapolate this into our lifetime,
302
00:16:56,809 --> 00:16:59,799
at some point it slows down,
because we have enough concepts,
303
00:16:59,799 --> 00:17:01,230
to describe the world.
304
00:17:01,230 --> 00:17:04,410
Maybe it's something... It's I think it's
less that 5 million.
305
00:17:04,410 --> 00:17:07,140
How much storage capacity does the brain has?
306
00:17:07,140 --> 00:17:12,319
I think that the... the estimates
are pretty divergent,
307
00:17:12,319 --> 00:17:14,930
The lower bound is something like a 100 GB,
308
00:17:14,930 --> 00:17:18,569
And the upper bound
is something like 2.5 PB.
309
00:17:18,569 --> 00:17:21,890
There is even...
even some higher outliers this..
310
00:17:21,890 --> 00:17:25,630
If you for instance think that we need all
those synaptic vesicle to store information,
311
00:17:25,630 --> 00:17:27,530
maybe even more fits into this.
312
00:17:27,530 --> 00:17:31,740
But the 2.5 PB is usually based
on what you need
313
00:17:31,740 --> 00:17:34,760
to code the information
that is in all the neurons.
314
00:17:34,760 --> 00:17:36,770
But maybe the neurons
do not really matter so much,
315
00:17:36,770 --> 00:17:39,930
because if the neuron dies it's not like the
word is changing dramatically.
316
00:17:39,930 --> 00:17:44,270
The brain is very resilient
against individual neurons failing.
317
00:17:44,270 --> 00:17:48,930
So the 100 GB capacity is much more
what you actually store in the neurons.
318
00:17:48,930 --> 00:17:51,380
If you look at all the redundancy
that you need.
319
00:17:51,380 --> 00:17:54,230
And I think this is much closer to the actual
Ballpark figure.
320
00:17:54,230 --> 00:17:58,130
Also if you want to store 5 hundred...
5 million concepts,
321
00:17:58,130 --> 00:18:02,330
and maybe 10 times or 100 times the number
of percepts, on top of this,
322
00:18:02,330 --> 00:18:05,490
this is roughly the Ballpark figure
that you are going to need.
323
00:18:05,490 --> 00:18:07,110
So our brain
324
00:18:07,110 --> 00:18:08,320
is a prediction machine.
325
00:18:08,320 --> 00:18:11,490
It... What it does is it reduces the entropy
of the environment,
326
00:18:11,490 --> 00:18:14,610
to solve whatever problems you are encountering,
327
00:18:14,610 --> 00:18:17,790
if you don't have a... feedback loop, to fix
them.
328
00:18:17,790 --> 00:18:20,240
So normally if something happens, we have
some kind of feedback loop,
329
00:18:20,240 --> 00:18:23,440
that regulates our temperature or that makes
problems go away.
330
00:18:23,440 --> 00:18:26,050
And only when this is not working
we employ recognition.
331
00:18:26,050 --> 00:18:29,250
And then we start this arbitrary
computational processes,
332
00:18:29,250 --> 00:18:31,830
that is facilitated by the neural cortex.
333
00:18:31,830 --> 00:18:34,940
And this.. arhmm.. neural cortex has really
do arbitrary programs.
334
00:18:34,940 --> 00:18:37,870
But it can do so
with only with very limited complexity,
335
00:18:37,870 --> 00:18:42,070
because really you just saw,
it's not that complex.
336
00:18:42,070 --> 00:18:43,900
The modeling of the world is very slow.
337
00:18:43,900 --> 00:18:46,570
And it's something
that we see in our eye models.
338
00:18:46,570 --> 00:18:48,150
To learn the basic structure of the world
339
00:18:48,150 --> 00:18:49,330
takes a very long time.
340
00:18:49,330 --> 00:18:52,650
To learn basically that we are moving in 3D
and objects are moving,
341
00:18:52,650 --> 00:18:54,030
and what they look like.
342
00:18:54,030 --> 00:18:55,130
Once we have this basic model,
343
00:18:55,130 --> 00:18:59,300
we can get to very, very quick
understanding within this model.
344
00:18:59,300 --> 00:19:02,110
Basically encoding based
on the structure of the world,
345
00:19:02,110 --> 00:19:03,610
that we've learned.
346
00:19:03,610 --> 00:19:07,100
And this is some kind of
data compression, that we are doing.
347
00:19:07,100 --> 00:19:09,740
We use this model, this grammar of the world,
348
00:19:09,740 --> 00:19:12,150
this simulation structures that we've learned,
349
00:19:12,150 --> 00:19:15,190
to encode the world very, very efficently.
350
00:19:15,190 --> 00:19:17,740
How much data compression do we get?
351
00:19:17,740 --> 00:19:19,860
Well... if you look at the retina.
352
00:19:19,860 --> 00:19:24,610
The retina get's data
in the order of about 10Gb/s.
353
00:19:24,610 --> 00:19:27,500
And the retina already compresses these data,
354
00:19:27,500 --> 00:19:31,120
and puts them into optic nerve
at the rate of about 1Mb/s
355
00:19:31,120 --> 00:19:34,030
This is what you get fed into visual cortex.
356
00:19:34,030 --> 00:19:36,370
And the visual cortex
does some additional compression,
357
00:19:36,370 --> 00:19:42,110
and by the time it gets to layer four of the
first layer of vision, to V1.
358
00:19:42,110 --> 00:19:46,880
We are down to something like 1Kb/s.
359
00:19:46,880 --> 00:19:50,720
So if we extrapolate this, and you get live
to the age of 80 years,
360
00:19:50,720 --> 00:19:54,140
and you are awake for 2/3 of your lifetime.
361
00:19:54,140 --> 00:19:56,930
That is you have your eyes open for 2/3 of
your lifetime.
362
00:19:56,930 --> 00:19:59,040
The stuff that you get into your brain,
363
00:19:59,040 --> 00:20:03,700
via your visual perception
is going to be only 2TB.
364
00:20:03,700 --> 00:20:05,370
Only 2TB of visual data.
365
00:20:05,370 --> 00:20:06,680
Throughout all your lifetime.
366
00:20:06,680 --> 00:20:09,430
That's all you are going to get ever to see.
367
00:20:09,430 --> 00:20:11,160
Isn't this depressing?
368
00:20:11,160 --> 00:20:12,790
laughter
369
00:20:12,790 --> 00:20:16,540
So I would really like to eghmm..
to tell you,
370
00:20:16,540 --> 00:20:22,750
choose wisely what you
are going to look at. laughter
371
00:20:22,750 --> 00:20:26,940
Ok. Let's look at this problem of neural compositionality.
372
00:20:26,940 --> 00:20:29,250
Our brains has this amazing thing
that they can put
373
00:20:29,250 --> 00:20:31,510
meta representation together very, very quickly.
374
00:20:31,510 --> 00:20:33,150
For instance you read a page of code,
375
00:20:33,150 --> 00:20:35,190
you compile it in you mind
into some kind of program
376
00:20:35,190 --> 00:20:37,700
it tells you what this page is going to do.
377
00:20:37,700 --> 00:20:39,110
Isn't that amazing?
378
00:20:39,110 --> 00:20:40,810
And then you can forget about this,
379
00:20:40,810 --> 00:20:43,910
disassemble it all, and use the
building blocks for something else.
380
00:20:43,910 --> 00:20:45,230
It's like legos.
381
00:20:45,230 --> 00:20:48,000
How you can do this with neurons?
382
00:20:48,000 --> 00:20:50,160
Legos can do this, because they have
a well defined interface.
383
00:20:50,160 --> 00:20:52,180
They have all this slots, you know,
that fit together
384
00:20:52,180 --> 00:20:53,600
in well defined ways.
385
00:20:53,600 --> 00:20:54,530
How can neurons do this?
386
00:20:54,530 --> 00:20:57,280
Well, neurons can maybe learn
the interface of other neurons.
387
00:20:57,280 --> 00:20:59,780
But that's difficult, because every neuron
looks slightly different,
388
00:20:59,780 --> 00:21:04,830
after all this... some kind of biologically
grown natural stuff.
389
00:21:04,830 --> 00:21:06,610
laughter
390
00:21:06,610 --> 00:21:10,620
So what you want to do is,
you want to encapsulate this erhmm...
391
00:21:10,620 --> 00:21:13,020
diversity of the neurons to make the predictable.
392
00:21:13,020 --> 00:21:14,820
To give them well defined interface.
393
00:21:14,820 --> 00:21:16,410
And I think that nature solution to this
394
00:21:16,410 --> 00:21:19,770
is cortical columns.
395
00:21:19,770 --> 00:21:24,250
Cortical column is a circuit of
between 100 and 400 neurons.
396
00:21:24,250 --> 00:21:26,860
And this circuit has some kind of neural network,
397
00:21:26,860 --> 00:21:28,650
that can learn stuff.
398
00:21:28,650 --> 00:21:31,070
And after it has learned particular function,
399
00:21:31,070 --> 00:21:35,320
and in between, it's able to link up these
other cortical columns.
400
00:21:35,320 --> 00:21:37,120
And we have about 100 million of those.
401
00:21:37,120 --> 00:21:39,770
Depending on how many neurons
you assume is in there,
402
00:21:39,770 --> 00:21:41,490
it's... erghmm we guess it's something,
403
00:21:41,490 --> 00:21:46,500
at least 20 million and maybe
something like a 100 million.
404
00:21:46,500 --> 00:21:48,330
And this cortical columns, what they can do,
405
00:21:48,330 --> 00:21:50,280
is they can link up like lego bricks,
406
00:21:50,280 --> 00:21:54,130
and then perform,
by transmitting information between them,
407
00:21:54,130 --> 00:21:55,990
pretty much arbitrary computations.
408
00:21:55,990 --> 00:21:57,540
What kind of computation?
409
00:21:57,540 --> 00:22:00,130
Well... Solomonoff induction.
410
00:22:00,130 --> 00:22:03,820
And... they have some short range links,
to their neighbors.
411
00:22:03,820 --> 00:22:05,690
Which comes almost for free, because erghmm..
412
00:22:05,690 --> 00:22:08,490
well, they are connected to them,
they are direct neighborhood.
413
00:22:08,490 --> 00:22:10,050
And they have some long range connectivity,
414
00:22:10,050 --> 00:22:13,000
so you can combine everything
in your cortex with everything.
415
00:22:13,000 --> 00:22:14,900
So you need some kind of global switchboard.
416
00:22:14,900 --> 00:22:17,630
Some grid like architecture
of long range connections.
417
00:22:17,630 --> 00:22:18,900
They are going to be more expensive,
418
00:22:18,900 --> 00:22:20,640
they are going to be slower,
419
00:22:20,640 --> 00:22:23,590
but they are going to be there.
420
00:22:23,590 --> 00:22:26,070
So how can we optimize
what these guys are doing?
421
00:22:26,070 --> 00:22:28,270
In some sense it's like an economy.
422
00:22:28,270 --> 00:22:31,460
It's not enduring based system,
as we often use in machine learning.
423
00:22:31,460 --> 00:22:32,780
It's really an economy. You have...
424
00:22:32,780 --> 00:22:35,560
The question is, you have a fixed number of
elements,
425
00:22:35,560 --> 00:22:37,970
how can you do the most valuable stuff with
them.
426
00:22:37,970 --> 00:22:41,030
Fixed resources, most valuable stuff, the
problem is economy.
427
00:22:41,030 --> 00:22:43,320
So you have an economy of information brokers.
428
00:22:43,320 --> 00:22:45,830
Every one of these guys,
this little cortical columns,
429
00:22:45,830 --> 00:22:48,150
is very simplistic information broker.
430
00:22:48,150 --> 00:22:50,950
And they trade rewards against neg entropy,
431
00:22:50,950 --> 00:22:54,140
Against reducing entropy in the...
in the world.
432
00:22:54,140 --> 00:22:55,790
And to do this, as we just saw
433
00:22:55,790 --> 00:22:58,890
that they need some kind of standardized interface.
434
00:22:58,890 --> 00:23:02,090
And internally, to use this interface
they are going to
435
00:23:02,090 --> 00:23:03,880
have some kind of state machine.
436
00:23:03,880 --> 00:23:05,660
And then they are going to pass messages
437
00:23:05,660 --> 00:23:07,400
between each other.
438
00:23:07,400 --> 00:23:08,630
And what are these messages?
439
00:23:08,630 --> 00:23:11,100
Well, it's going to be hard
to discover these messages,
440
00:23:11,100 --> 00:23:12,800
by looking at brains.
441
00:23:12,800 --> 00:23:14,800
Because it's very difficult to see in brains,
442
00:23:14,800 --> 00:23:15,450
what the are actually doing.
443
00:23:15,450 --> 00:23:17,250
you just see all these neurons.
444
00:23:17,250 --> 00:23:18,790
And if you would be waiting for neuroscience,
445
00:23:18,790 --> 00:23:20,970
to discover anything, we wouldn't even have
446
00:23:20,970 --> 00:23:22,590
gradient descent or anything else.
447
00:23:22,590 --> 00:23:23,720
We wouldn't have neuron learning.
448
00:23:23,720 --> 00:23:25,420
We wouldn't have all this advances in AI.
449
00:23:25,420 --> 00:23:28,230
Jürgen Schmidhuber said that the biggest,
450
00:23:28,230 --> 00:23:30,010
the last contribution of neuroscience to
451
00:23:30,010 --> 00:23:32,220
artificial intelligence
was about 50 years ago.
452
00:23:32,220 --> 00:23:34,280
That's depressing, and it might be
453
00:23:34,280 --> 00:23:37,870
overemphasizing the unimportance of neuroscience,
454
00:23:37,870 --> 00:23:39,490
because neuroscience is very important,
455
00:23:39,490 --> 00:23:41,090
once you know what are you looking for.
456
00:23:41,090 --> 00:23:42,510
You can actually often find this,
457
00:23:42,510 --> 00:23:44,320
and see whether you are on the right track.
458
00:23:44,320 --> 00:23:45,860
But it's very difficult to take neuroscience
459
00:23:45,860 --> 00:23:47,940
to understand how the brain is working.
460
00:23:47,940 --> 00:23:49,290
Because it's really like understanding
461
00:23:49,290 --> 00:23:53,230
flight by looking at birds through a microscope.
462
00:23:53,230 --> 00:23:55,150
So, what are these messages?
463
00:23:55,150 --> 00:23:57,850
You are going to need messages,
that tell these cortical columns
464
00:23:57,850 --> 00:24:00,160
to join themselves into a structure.
465
00:24:00,160 --> 00:24:01,990
And to unlink again once they're done.
466
00:24:01,990 --> 00:24:03,690
You need ways that they can request each other
467
00:24:03,690 --> 00:24:06,040
to perform computations for them.
468
00:24:06,040 --> 00:24:07,510
You need ways they can inhibit each other
469
00:24:07,510 --> 00:24:08,320
when they are linked up.
470
00:24:08,320 --> 00:24:10,990
So they don't do conflicting computations.
471
00:24:10,990 --> 00:24:12,940
Then they need to tell you whether the computation,
472
00:24:12,940 --> 00:24:14,110
the result of the computation
473
00:24:14,110 --> 00:24:16,730
that the are asked to do is probably false.
474
00:24:16,730 --> 00:24:19,340
Or whether it's probably true,
but you still need to wait for others,
475
00:24:19,340 --> 00:24:21,990
to tell you whether the details worked out.
476
00:24:21,990 --> 00:24:24,240
Or whether it's confirmed true that the concepts
477
00:24:24,240 --> 00:24:26,730
that they stand for is actually the case.
478
00:24:26,730 --> 00:24:28,150
And then you want to have learning,
479
00:24:28,150 --> 00:24:29,630
to tell you how well this worked.
480
00:24:29,630 --> 00:24:31,390
So you will have to announce a bounty,
481
00:24:31,390 --> 00:24:34,380
that tells them to link up
and kind of reward signal
482
00:24:34,380 --> 00:24:36,740
that makes do computation in the first place.
483
00:24:36,740 --> 00:24:38,680
And then you want to have
some kind of reward signal
484
00:24:38,680 --> 00:24:40,550
once you got the result as an organism.
485
00:24:40,550 --> 00:24:42,280
But you reach your goal if you made
486
00:24:42,280 --> 00:24:45,810
the disturbance go away
or what ever you consume the cake.
487
00:24:45,810 --> 00:24:47,710
And then you will have
some kind of reward signal
488
00:24:47,710 --> 00:24:49,250
that's you give everybody.
489
00:24:49,250 --> 00:24:50,650
That was involved in this.
490
00:24:50,650 --> 00:24:52,720
And this reward signal facilitates learning,
491
00:24:52,720 --> 00:24:55,230
so the.. difference between the announce reward
492
00:24:55,230 --> 00:24:57,530
and consumption reward is the learning signal
493
00:24:57,530 --> 00:24:58,740
for these guys.
494
00:24:58,740 --> 00:25:00,210
So they can learn how to play together,
495
00:25:00,210 --> 00:25:02,700
and how to do the Solomonoff induction.
496
00:25:02,700 --> 00:25:04,660
Now, I've told you that Solomonoff induction
497
00:25:04,660 --> 00:25:05,280
is not computable.
498
00:25:05,280 --> 00:25:07,630
And it's mostly because of two things,
499
00:25:07,630 --> 00:25:09,280
First of all it's needs infinite resources
500
00:25:09,280 --> 00:25:11,200
to compare all the possible models.
501
00:25:11,200 --> 00:25:13,530
And the other one is that we do not know
502
00:25:13,530 --> 00:25:15,440
the priori probability for our Bayesian model.
503
00:25:15,440 --> 00:25:19,280
If we do not know
how likely unknown stuff is in the world.
504
00:25:19,280 --> 00:25:22,520
So what we do instead is,
we set some kind of hyperparameter,
505
00:25:22,520 --> 00:25:25,050
Some kind of default
priori probability for concepts,
506
00:25:25,050 --> 00:25:28,110
that are encoded by cortical columns.
507
00:25:28,110 --> 00:25:30,580
And if we set these parameters very low,
508
00:25:30,580 --> 00:25:32,140
then we are going to end up with inferences
509
00:25:32,140 --> 00:25:35,250
that are quite probable.
510
00:25:35,250 --> 00:25:36,480
For unknown things.
511
00:25:36,480 --> 00:25:37,690
And then we can test for those.
512
00:25:37,690 --> 00:25:41,350
If we set this parameter higher, we are going
to be very, very creative.
513
00:25:41,350 --> 00:25:43,670
But we end up with many many theories,
514
00:25:43,670 --> 00:25:45,140
that are difficult to test.
515
00:25:45,140 --> 00:25:48,470
Because maybe there are
too many theories to test.
516
00:25:48,470 --> 00:25:50,650
Basically every of these cortical columns
will now tell you,
517
00:25:50,650 --> 00:25:52,240
when you ask them if they are true:
518
00:25:52,240 --> 00:25:54,960
"Yes I'm probably true,
but i still need to ask others,
519
00:25:54,960 --> 00:25:56,980
to work on the details"
520
00:25:56,980 --> 00:25:58,670
So these others are going to be get active,
521
00:25:58,670 --> 00:26:00,640
and they are being asked by the asking element:
522
00:26:00,640 --> 00:26:01,730
"Are you going to be true?",
523
00:26:01,730 --> 00:26:04,380
and they say "Yeah, probably yes,
I just have to work on the details"
524
00:26:04,380 --> 00:26:05,930
and they are going to ask even more.
525
00:26:05,930 --> 00:26:07,980
So your brain is going to light up like a
christmas tree,
526
00:26:07,980 --> 00:26:10,240
and do all these amazing computations,
527
00:26:10,240 --> 00:26:12,450
and you see connections everywhere,
most of them are wrong.
528
00:26:12,450 --> 00:26:16,310
You are basically in psychotic state
if your hyperparameter is too high.
529
00:26:16,310 --> 00:26:20,790
You're brain invents more theories
that it can disproof.
530
00:26:20,790 --> 00:26:24,550
Would it actually sometimes be good
to be in this state?
531
00:26:24,550 --> 00:26:27,850
You bet. So i think every night our brain
goes in this state.
532
00:26:27,850 --> 00:26:31,720
We turn up this hyperparameter.
We dream. We get all kinds
533
00:26:31,720 --> 00:26:34,100
weird connections, and we get to see connections,
534
00:26:34,100 --> 00:26:36,140
that otherwise we couldn't be seeing.
535
00:26:36,140 --> 00:26:38,080
Even though... because they are highly improbable.
536
00:26:38,080 --> 00:26:42,750
But sometimes they hold, and we see... "Oh
my God, DNA is organized in double helix".
537
00:26:42,750 --> 00:26:44,640
And this is what we remember in the morning.
538
00:26:44,640 --> 00:26:46,870
All the other stuff is deleted.
539
00:26:46,870 --> 00:26:48,440
So we usually don't form long term memories
540
00:26:48,440 --> 00:26:51,480
in dreams, if everything goes well.
541
00:26:51,480 --> 00:26:56,670
If you accidentally trip this up.. your modulators,
542
00:26:56,670 --> 00:26:59,100
for instance by consuming illegal substances,
543
00:26:59,100 --> 00:27:01,690
or because you just gone randomly psychotic
544
00:27:01,690 --> 00:27:04,600
you was basically entering
a dreaming state I guess.
545
00:27:04,600 --> 00:27:06,990
You get to a state
when the brain starts inventing more
546
00:27:06,990 --> 00:27:10,860
concepts that it can disproof.
547
00:27:10,860 --> 00:27:13,600
So you want to have a state
where this is well balanced.
548
00:27:13,600 --> 00:27:16,180
And the difference between
highly creative people,
549
00:27:16,180 --> 00:27:20,070
and very religious people is probably
a different setting of this hyperparameter.
550
00:27:20,070 --> 00:27:21,890
So I suspect that people that people
that are genius,
551
00:27:21,890 --> 00:27:23,880
like people like Einstein and so on,
552
00:27:23,880 --> 00:27:26,600
do not simply have better neurons than others.
553
00:27:26,600 --> 00:27:29,130
What they mostly have is a slightly hyperparameter,
554
00:27:29,130 --> 00:27:33,860
that is very finely tuned, so they can get
better balance than other people
555
00:27:33,860 --> 00:27:43,850
in finding theories that might be true,
but can still be disprooven.
556
00:27:43,850 --> 00:27:49,480
So inventiveness could be
a hyperparameter in the brain.
557
00:27:49,480 --> 00:27:54,169
If you want to measure
the quality of belief that we have
558
00:27:54,169 --> 00:27:56,370
we are going to have to have
some kind of some cost function
559
00:27:56,370 --> 00:27:58,710
which is based on motivational system.
560
00:27:58,710 --> 00:28:02,400
And to identify if belief
is good or not we can abstract criteria,
561
00:28:02,400 --> 00:28:06,440
for instance how well does it predict the
wourld, or how about does it reduce uncertainty
562
00:28:06,440 --> 00:28:07,590
in the world,
563
00:28:07,590 --> 00:28:10,020
or is it consistency and sparse.
564
00:28:10,020 --> 00:28:14,080
And then of course utility, how about does
it help me to satisfy my needs.
565
00:28:14,080 --> 00:28:18,920
And the motivational system is going
to evaluate all this things by giving a signal.
566
00:28:18,920 --> 00:28:24,200
And the first signal.. kind of signal
is the possible rewards if we are able to compute
567
00:28:24,200 --> 00:28:25,020
the task.
568
00:28:25,020 --> 00:28:27,430
And this is probably done by dopamine.
569
00:28:27,430 --> 00:28:30,350
So we have a very small area in the brain,
substantia nigra,
570
00:28:30,350 --> 00:28:33,610
and the ventral tegmental area,
and they produce dopamine.
571
00:28:33,610 --> 00:28:38,180
And this get fed into lateral frontal cortext
and the frontal lobe,
572
00:28:38,180 --> 00:28:41,920
which control attention,
and tell you what things to do.
573
00:28:41,920 --> 00:28:46,020
And if we have successfully done
what you wanted to do,
574
00:28:46,020 --> 00:28:49,300
we consume the rewards.
575
00:28:49,300 --> 00:28:51,940
And we do this with another signal
which is serotonine.
576
00:28:51,940 --> 00:28:53,480
It's also announce to motivational system,
577
00:28:53,480 --> 00:28:55,870
to this very small are the Raphe nuclei.
578
00:28:55,870 --> 00:28:58,690
And it feeds into all the areas of the brain
where learning is necessary.
579
00:28:58,690 --> 00:29:02,160
A connection is strengthen
once you get to result.
580
00:29:02,160 --> 00:29:07,559
These two substances are emitted
by the motivational system.
581
00:29:07,559 --> 00:29:09,710
The motivational system is a bunch of needs,
582
00:29:09,710 --> 00:29:11,510
essentially you regulate it below the cortext.
583
00:29:11,510 --> 00:29:14,490
They are not part of your mental representations.
584
00:29:14,490 --> 00:29:16,930
They are part of something
that is more primary than this.
585
00:29:16,930 --> 00:29:19,360
This is what makes us go,
this is what makes us human.
586
00:29:19,360 --> 00:29:22,290
This is not our rationality, this is what we want.
587
00:29:22,290 --> 00:29:27,000
And the needs are physiological,
they are social, they are cognitive.
588
00:29:27,000 --> 00:29:28,960
And you pretty much born with them.
589
00:29:28,960 --> 00:29:30,470
They can not be totally adaptive,
590
00:29:30,470 --> 00:29:33,340
because if we were adaptive,
we wouldn't be doing anything.
591
00:29:33,340 --> 00:29:35,390
The needs are resistive.
592
00:29:35,390 --> 00:29:38,290
They are pushing us against the world.
593
00:29:38,290 --> 00:29:40,170
If you wouldn't have all this needs,
594
00:29:40,170 --> 00:29:41,740
If you wouldn't have this motivational system,
595
00:29:41,740 --> 00:29:43,630
you would just be doing what best for you.
596
00:29:43,630 --> 00:29:45,150
Which means collapse on the ground,
597
00:29:45,150 --> 00:29:49,010
be a vegetable, rod, give into gravity.
598
00:29:49,010 --> 00:29:50,270
Instead you do all this unpleasant things,
599
00:29:50,270 --> 00:29:52,690
to get up in the morning,
you eat, you have sex,
600
00:29:52,690 --> 00:29:54,120
you do all this crazy things.
601
00:29:54,120 --> 00:29:58,809
And it's only because the
motivational system forces you to.
602
00:29:58,809 --> 00:30:00,850
The motivational system
takes this bunch of matter,
603
00:30:00,850 --> 00:30:02,890
and makes us to do all these strange things,
604
00:30:02,890 --> 00:30:05,940
just so genomes get replicated and so on.
605
00:30:05,940 --> 00:30:10,470
And... so to do this, we are going to build
resistance against the world.
606
00:30:10,470 --> 00:30:13,360
And the motivational system
is in a sense forcing us,
607
00:30:13,360 --> 00:30:15,470
to do all this things by giving us needs,
608
00:30:15,470 --> 00:30:18,330
and the need have some kind
of target value and current value.
609
00:30:18,330 --> 00:30:21,850
If we have a differential
between the target value and current value,
610
00:30:21,850 --> 00:30:24,590
we perceive some urgency
to do something about the need.
611
00:30:24,590 --> 00:30:26,680
And when the target value
approaches the current value
612
00:30:26,680 --> 00:30:28,660
we get the pleasure, which is a learning signal.
613
00:30:28,660 --> 00:30:30,540
If it gets away from it
we get a displeasure signal,
614
00:30:30,540 --> 00:30:31,870
which is also a learning signal.
615
00:30:31,870 --> 00:30:35,370
And we can use this to structure
our understanding of the world.
616
00:30:35,370 --> 00:30:36,870
To understand what goals are and so on.
617
00:30:36,870 --> 00:30:40,020
Goals are learned. Needs are not.
618
00:30:40,020 --> 00:30:42,780
To learn we need success
and failure in the world.
619
00:30:42,780 --> 00:30:45,940
But to do things we need anticipated reward.
620
00:30:45,940 --> 00:30:48,120
So it's dopamine that's makes brain go round.
621
00:30:48,120 --> 00:30:50,560
Dopamine makes you do things.
622
00:30:50,560 --> 00:30:52,750
But in order to do this in the right way,
623
00:30:52,750 --> 00:30:54,610
you have to make sure,
that the cells can not
624
00:30:54,610 --> 00:30:55,880
produce dopamine themselves.
625
00:30:55,880 --> 00:30:59,100
If they do this they can start
to drive others to work for them.
626
00:30:59,100 --> 00:31:01,870
You are going to get something like
bureaucracy in your neural cortext,
627
00:31:01,870 --> 00:31:05,650
where different bosses try
to set up others to they own bidding
628
00:31:05,650 --> 00:31:07,910
and pitch against other groups in nerual cortext.
629
00:31:07,910 --> 00:31:09,730
It's going to be horrible.
630
00:31:09,730 --> 00:31:12,210
So you want to have some kind of central authority,
631
00:31:12,210 --> 00:31:16,290
that make sure that the cells
do not produce dopamine themselves.
632
00:31:16,290 --> 00:31:19,679
It's only been produce in
very small area and then given out,
633
00:31:19,679 --> 00:31:21,059
and pass through the system.
634
00:31:21,059 --> 00:31:23,350
And after you're done with it's going to be gone,
635
00:31:23,350 --> 00:31:26,070
so there is no hoarding of the dopamine.
636
00:31:26,070 --> 00:31:29,770
And in our society the role of dopamine
is played by money.
637
00:31:29,770 --> 00:31:32,150
Money is not reward in itself.
638
00:31:32,150 --> 00:31:35,570
It's in some sense way
that you can trade against the reward.
639
00:31:35,570 --> 00:31:36,850
You can not eat money.
640
00:31:36,850 --> 00:31:40,500
You can take it later and take
a arbitrary reward for it.
641
00:31:40,500 --> 00:31:45,400
And in some sense money is the dopamine
that makes organizations
642
00:31:45,400 --> 00:31:48,410
and society, companies
and many individuals do things.
643
00:31:48,410 --> 00:31:50,500
They do stuff because of money.
644
00:31:50,500 --> 00:31:53,309
But money if you compare to dopamine
is pretty broken,
645
00:31:53,309 --> 00:31:54,850
because you can hoard it.
646
00:31:54,850 --> 00:31:57,400
So you are going to have this
cortical columns in the real world,
647
00:31:57,400 --> 00:31:59,670
which are individual people
or individual corporations.
648
00:31:59,670 --> 00:32:03,250
They are hoarding the dopamine,
they sit on this very big pile of dopamine.
649
00:32:03,250 --> 00:32:07,890
They are starving the rest
of the society of the dopamine.
650
00:32:07,890 --> 00:32:10,630
They don't give it away,
and they can make it do it's bidding.
651
00:32:10,630 --> 00:32:13,970
So for instance they can pitch
substantial part of society
652
00:32:13,970 --> 00:32:16,130
against understanding of global warming.
653
00:32:16,130 --> 00:32:20,110
because they profit of global warming
or of technology that leads to global warming,
654
00:32:20,110 --> 00:32:22,850
which is very bad for all of us. applause
655
00:32:22,850 --> 00:32:28,850
So our society is a nervous system
that lies to itself.
656
00:32:28,850 --> 00:32:30,429
How can we overcome this?
657
00:32:30,429 --> 00:32:32,480
Actually, we don't know.
658
00:32:32,480 --> 00:32:34,639
To do this we would need
to have some kind of centrialized,
659
00:32:34,639 --> 00:32:36,660
top-down reward motivational system.
660
00:32:36,660 --> 00:32:39,010
We have this for instance in the military,
661
00:32:39,010 --> 00:32:42,520
you have this system of
military rewards that you get.
662
00:32:42,520 --> 00:32:44,950
And this are completely
controlled from the top.
663
00:32:44,950 --> 00:32:47,260
Also within working organizations
you have this.
664
00:32:47,260 --> 00:32:49,600
In corporations you have centralized rewards,
665
00:32:49,600 --> 00:32:51,850
it's not like rewards flow bottom-up,
666
00:32:51,850 --> 00:32:55,120
they always flown top-down.
667
00:32:55,120 --> 00:32:57,850
And there was an attempt
to model society in such a way.
668
00:32:57,850 --> 00:33:03,380
That was in Chile in the early 1970,
the Allende government had the idea
669
00:33:03,380 --> 00:33:07,320
to redesign society or economy
in society using cybernetics.
670
00:33:07,320 --> 00:33:12,590
So Allende invited a bunch of cyberneticians
to redesign the Chilean economy.
671
00:33:12,590 --> 00:33:14,550
And this was meant to be the control room,
672
00:33:14,550 --> 00:33:17,460
where Allende and his chief economists
would be sitting,
673
00:33:17,460 --> 00:33:19,709
to look at what the economy is doing.
674
00:33:19,709 --> 00:33:23,880
We don't know how this would work out,
because we know how it ended.
675
00:33:23,880 --> 00:33:27,260
In 1973 there was this big putsch in Chile,
676
00:33:27,260 --> 00:33:30,290
and this experiment ended among other things.
677
00:33:30,290 --> 00:33:34,170
Maybe it would have worked, who knows?
Nobody tried it.
678
00:33:34,170 --> 00:33:38,370
So, there is something else
what is going on in people,
679
00:33:38,370 --> 00:33:40,030
beyond the motivational system.
680
00:33:40,030 --> 00:33:43,610
That is: we have social criteria, for learning.
681
00:33:43,610 --> 00:33:47,670
We also check if our ideas
are normativly acceptable.
682
00:33:47,670 --> 00:33:50,510
And this is actually a good thing,
because individual may shortcut
683
00:33:50,510 --> 00:33:52,590
the learning through communication.
684
00:33:52,590 --> 00:33:55,260
Other people have learned stuff
that we don't need to learn ourselves.
685
00:33:55,260 --> 00:33:59,800
We can build on this, so we can accelerate
learning by many order of magnitutde,
686
00:33:59,800 --> 00:34:00,970
which makes culture possible.
687
00:34:00,970 --> 00:34:04,190
And which makes many anything possible,
because if you were on your own
688
00:34:04,190 --> 00:34:06,860
you would not be going to find out
very much in your lifetime.
689
00:34:08,520 --> 00:34:11,270
You know how they say?
Everything that you do,
690
00:34:11,270 --> 00:34:14,250
you do by standing on the shoulders of giants.
691
00:34:14,250 --> 00:34:17,779
Or on a big pile of dwarfs
it works either way.
692
00:34:17,779 --> 00:34:27,089
laughterapplause
693
00:34:27,089 --> 00:34:30,379
Social learning usually outperforms
individual learning. You can test this.
694
00:34:30,379 --> 00:34:33,949
But in the case of conflict
between different social truths,
695
00:34:33,949 --> 00:34:36,659
you need some way to decide who to believe.
696
00:34:36,659 --> 00:34:39,498
So you have some kind of reputation
estimate for different authority,
697
00:34:39,498 --> 00:34:42,399
and you use this to check whom you believe.
698
00:34:42,399 --> 00:34:45,748
And the problem of course is this
in existing society, in real society,
699
00:34:45,748 --> 00:34:48,389
this reputation system is going
to reflect power structure,
700
00:34:48,389 --> 00:34:51,699
which may distort your belief systematically.
701
00:34:51,699 --> 00:34:54,759
Social learning therefore leads groups
to synchronize their opinions.
702
00:34:54,759 --> 00:34:57,220
And the opinions become ...get another role.
703
00:34:57,220 --> 00:35:02,180
They become important part
of signalling which group you belong to.
704
00:35:02,180 --> 00:35:06,630
So opinions start to signal
group loyalty in societies.
705
00:35:06,630 --> 00:35:11,170
And people in this, and that's the actual world,
they should optimize not for getting the best possible
706
00:35:11,170 --> 00:35:12,619
opinions in terms of truth.
707
00:35:12,619 --> 00:35:17,289
They should guess... they should optimize
for doing... having the best possible opinion,
708
00:35:17,289 --> 00:35:19,799
with respect to agreement with their peers.
709
00:35:19,799 --> 00:35:22,029
If you have the same opinion
as your peers, you can signal them
710
00:35:22,029 --> 00:35:24,299
that you are the part of their ingroup,
they are going to like you.
711
00:35:24,299 --> 00:35:28,160
If you don't do this, chances are
they are not going to like you.
712
00:35:28,160 --> 00:35:34,049
There is rarely any benefit in life to be
in disagreement with your boss. Right?
713
00:35:34,049 --> 00:35:39,230
So, if you evolve an opinion forming system
in these curcumstances,
714
00:35:39,230 --> 00:35:41,220
you should be ending up
with an opinion forming system,
715
00:35:41,220 --> 00:35:42,980
that leaves you with the most usefull opinion,
716
00:35:42,980 --> 00:35:45,400
which is the opinion in your environment.
717
00:35:45,400 --> 00:35:48,400
And it turns out, most people are able
to do this effortlessly.
718
00:35:48,400 --> 00:35:50,969
laughter
719
00:35:50,969 --> 00:35:55,529
They have an instinct, that makes them adapt
the dominant opinion in their social environment.
720
00:35:55,529 --> 00:35:56,599
It's amazing, right?
721
00:35:56,599 --> 00:36:01,040
And if you are nerd like me,
you don't get this.
722
00:36:01,040 --> 00:36:08,999
laugingapplause
723
00:36:08,999 --> 00:36:12,999
So in the world out there,
explanations piggyback on you group allegiance.
724
00:36:12,999 --> 00:36:15,900
For instance you will find that there is a
substantial group of people that believes
725
00:36:15,900 --> 00:36:18,380
the minimum wage is good
for the economy and for you
726
00:36:18,380 --> 00:36:20,549
and another one believes that its bad.
727
00:36:20,549 --> 00:36:23,470
And its pretty much aligned
with political parties.
728
00:36:23,470 --> 00:36:25,970
Its not aligned with different
understandings of economy,
729
00:36:25,970 --> 00:36:30,740
because nobody understands
how the economy works.
730
00:36:30,740 --> 00:36:36,330
And if you are a nerd you try to understand
the world in terms of what is true and false.
731
00:36:36,330 --> 00:36:40,680
You try to prove everything by putting it
in some kind of true and false level
732
00:36:40,680 --> 00:36:43,589
and if you are not a nerd
you try to get to right and wrong
733
00:36:43,589 --> 00:36:45,609
you try to understand
whether you are in alignment
734
00:36:45,609 --> 00:36:49,559
with what's objectively right
in your society, right?
735
00:36:49,559 --> 00:36:55,680
So I guess that nerds are people that have
a defect in there opinion forming system.
736
00:36:55,680 --> 00:36:57,069
laughing
737
00:36:57,069 --> 00:37:00,609
And usually that's maladaptive
and under normal circumstances
738
00:37:00,609 --> 00:37:03,099
nerds would mostly be filtered
from the world,
739
00:37:03,099 --> 00:37:06,529
because they don't reproduce so well,
because people don't like them so much.
740
00:37:06,529 --> 00:37:07,960
laughing
741
00:37:07,960 --> 00:37:11,119
And then something very strange happened.
The computer revolution came along and
742
00:37:11,119 --> 00:37:14,170
suddenly if you argue with the computer
it doesn't help you if you have the
743
00:37:14,170 --> 00:37:17,849
normatively correct opinion you need to
be able to understand things in terms of
744
00:37:17,849 --> 00:37:26,029
true and false, right? applause
745
00:37:26,029 --> 00:37:29,779
So now we have this strange situation that
the weird people that have this offensive,
746
00:37:29,779 --> 00:37:33,410
strange opinions and that really don't
mix well with the real normal people
747
00:37:33,410 --> 00:37:38,119
get all this high paying jobs
and we don't understand how is that happening.
748
00:37:38,119 --> 00:37:42,599
And it's because suddenly
our maladapting is a benefit.
749
00:37:42,599 --> 00:37:47,300
But out there there is this world of the
social norms and it's made of paperwalls.
750
00:37:47,300 --> 00:37:50,349
There are all this things that are true
and false in a society that make
751
00:37:50,349 --> 00:37:51,549
people behave.
752
00:37:51,549 --> 00:37:57,390
It's like this japanese wall, there.
They made palaces out of paper basically.
753
00:37:57,390 --> 00:38:00,339
And these are walls by convention.
754
00:38:00,339 --> 00:38:04,009
They exist because people agree
that this is a wall.
755
00:38:04,009 --> 00:38:06,630
And if you are a hypnotist
like Donald Trump
756
00:38:06,630 --> 00:38:11,109
you can see that these are paper walls
and you can shift them.
757
00:38:11,109 --> 00:38:14,079
And if you are a nerd like me
you can not see these paperwalls.
758
00:38:14,079 --> 00:38:20,230
If you pay closely attention you see that
people move and then suddenly middair
759
00:38:20,230 --> 00:38:22,869
they make a turn. Why would they do this?
760
00:38:22,869 --> 00:38:24,360
There must be something
that they see there
761
00:38:24,360 --> 00:38:26,549
and this is basically a normative agreement.
762
00:38:26,549 --> 00:38:29,690
And you can infer what this is
and then you can manipulate it and understand it.
763
00:38:29,690 --> 00:38:32,640
Of course you can't fix this, you can
debug yourself in this regard,
764
00:38:32,640 --> 00:38:34,690
but it's something that is hard
to see for nerds.
765
00:38:34,690 --> 00:38:38,109
So in some sense they have a superpower:
they can think straight in the presence
766
00:38:38,109 --> 00:38:39,079
of others.
767
00:38:39,079 --> 00:38:42,590
But often they end up in their living room
and people are upset.
768
00:38:42,590 --> 00:38:45,810
laughter
769
00:38:45,810 --> 00:38:49,789
Learning in a complex domain can not
guarantee that you find the global maximum.
770
00:38:49,789 --> 00:38:53,970
We know that we can not find truth
because we can not recognize whether we live
771
00:38:53,970 --> 00:38:57,059
on a plain field or on a
simulated plain field.
772
00:38:57,059 --> 00:39:00,579
But what we can do is, we can try to
approach a global maximum.
773
00:39:00,579 --> 00:39:02,339
But we don't know if that
is the global maximum.
774
00:39:02,339 --> 00:39:05,509
We will always move along
some kind of belief gradient.
775
00:39:05,509 --> 00:39:09,110
We will take certain elements of
our belief and then give them up
776
00:39:09,110 --> 00:39:12,650
for new elements of a belief based on
thinking, that this new element
777
00:39:12,650 --> 00:39:15,049
of belief is better than the one
we give up.
778
00:39:15,049 --> 00:39:17,079
So we always move along
some kind of gradient.
779
00:39:17,079 --> 00:39:19,789
and the truth does not matter,
the gradient matters.
780
00:39:19,789 --> 00:39:23,650
If you think about teaching for a moment,
when I started teaching I often thought:
781
00:39:23,650 --> 00:39:27,489
Okay, I understand the truth of the
subject, the students don't, so I have to
782
00:39:27,489 --> 00:39:30,069
give this to them
and at some point I realized:
783
00:39:30,069 --> 00:39:33,450
Oh, I changed my mind so many times
in the past and I'm probably not going to
784
00:39:33,450 --> 00:39:35,769
stop changing it in the future.
785
00:39:35,769 --> 00:39:38,710
I'm always moving along a gradient
and I keep moving along a gradient.
786
00:39:38,710 --> 00:39:43,099
So I'm not moving to truth,
I'm moving forward.
787
00:39:43,099 --> 00:39:45,230
And when we teach our kids
we should probably not think about
788
00:39:45,230 --> 00:39:46,390
how to give them truth.
789
00:39:46,390 --> 00:39:51,039
We should think about how to put them onto
an interesting gradient, that makes them
790
00:39:51,039 --> 00:39:55,079
explore the world,
world of possible beliefs.
791
00:39:55,079 --> 00:40:03,150
applause
792
00:40:03,150 --> 00:40:05,359
And this possible beliefs
lead us into local minima.
793
00:40:05,359 --> 00:40:08,150
This is inevitable. This are like valleys
and sometimes this valleys are
794
00:40:08,150 --> 00:40:11,210
neighbouring and we don't understand
what the people in the neighbouring
795
00:40:11,210 --> 00:40:15,700
valley are doing unless we are willing to
retrace the steps they have been taken.
796
00:40:15,700 --> 00:40:19,569
And if you want to get from one valley
into the next, we will have to have some kind
797
00:40:19,569 --> 00:40:21,789
of energy that moves us over the hill.
798
00:40:21,789 --> 00:40:27,739
We have to have a trajectory were every
step works by finding reason to give up
799
00:40:27,739 --> 00:40:30,380
bit of our current belief and adopt a
new belief, because it's somehow
800
00:40:30,380 --> 00:40:34,739
more useful, more relevant,
more consistent and so on.
801
00:40:34,739 --> 00:40:38,349
Now the problem is that this is not
monotonous we can not guarantee that
802
00:40:38,349 --> 00:40:40,499
we're always climbing,
because the problem is, that
803
00:40:40,499 --> 00:40:44,599
the beliefs themselfs can change
our evaluation of the belief.
804
00:40:44,599 --> 00:40:50,390
It could be for instance that you start
believing in a religion and this religion
805
00:40:50,390 --> 00:40:54,299
could tell you: If you give up the belief
in the religion, you're going to face
806
00:40:54,299 --> 00:40:56,500
eternal damnation in hell.
807
00:40:56,500 --> 00:40:59,489
As long as you believe in the religion,
it's going to be very expensive for you
808
00:40:59,489 --> 00:41:02,430
to give up the religion, right?
If you truly belief in it.
809
00:41:02,430 --> 00:41:05,109
You're now caught
in some kind of attractor.
810
00:41:05,109 --> 00:41:08,680
Before you believe the religion it is not
very dangerous but once you've gotten
811
00:41:08,680 --> 00:41:13,019
into the attractor it's very,
very hard to get out.
812
00:41:13,019 --> 00:41:16,309
So these belief attractors
are actually quite dangerous.
813
00:41:16,309 --> 00:41:19,920
You can get not only to chaotic behaviour,
where you can not guarantee that your
814
00:41:19,920 --> 00:41:23,470
current belief is better than the last one
but you can also get into beliefs that are
815
00:41:23,470 --> 00:41:26,849
almost impossible to change.
816
00:41:26,849 --> 00:41:33,739
And that makes it possible to program
people to work in societies.
817
00:41:33,739 --> 00:41:37,529
Social domains are structured by values.
Basically a preference is what makes you
818
00:41:37,529 --> 00:41:40,769
do things, because you anticipate
pleasure or displeasure,
819
00:41:40,769 --> 00:41:45,339
and values make you do things
even if you don't anticipate any pleasure.
820
00:41:45,339 --> 00:41:49,809
These are virtual rewards.
They make us do things, because we believe
821
00:41:49,809 --> 00:41:51,799
that is stuff
that is more important then us.
822
00:41:51,799 --> 00:41:55,109
This is what values are about.
823
00:41:55,109 --> 00:42:00,690
And these values are the source
of what we would call true meaning, deeper meaning.
824
00:42:00,690 --> 00:42:05,220
There is something that is more important
than us, something that we can serve.
825
00:42:05,220 --> 00:42:08,769
This is what we usually perceive as
meaningful life, it is one which
826
00:42:08,769 --> 00:42:12,759
is in the serves of values that are more
important than I myself,
827
00:42:12,759 --> 00:42:15,749
because after all I'm not that important.
I'm just this machine that runs around
828
00:42:15,749 --> 00:42:20,789
and tries to optimize its pleasure and
pain, which is kinda boring.
829
00:42:20,789 --> 00:42:26,329
So my PI has puzzled me, my principle
investigator in the Havard department,
830
00:42:26,329 --> 00:42:29,349
where I have my desk, Martin Nowak.
831
00:42:29,349 --> 00:42:33,970
He said, that meaning can not exist without
god; you are either religious,
832
00:42:33,970 --> 00:42:36,950
or you are a nihilist.
833
00:42:36,950 --> 00:42:42,789
And this guy is the head of the
department for evolutionary dynamics.
834
00:42:42,789 --> 00:42:45,769
Also he is a catholic.. chuckling
835
00:42:45,769 --> 00:42:49,729
So this really puzzled me and I tried
to understand what he meant by this.
836
00:42:49,729 --> 00:42:53,200
Typically if you are a good atheist
like me,
837
00:42:53,200 --> 00:42:57,920
you tend to attack gods that are
structured like this, religious gods,
838
00:42:57,920 --> 00:43:02,940
that are institutional, they are personal,
they are some kind of person.
839
00:43:02,940 --> 00:43:08,239
They do care about you, they prescribe
norms, for instance don't mastrubate
840
00:43:08,239 --> 00:43:10,060
it's bad for you.
841
00:43:10,060 --> 00:43:14,759
Many of this norms are very much aligned
with societal institutions, for instance
842
00:43:14,759 --> 00:43:20,799
don't questions the authorities,
god wants them to be ruling above you
843
00:43:20,799 --> 00:43:23,839
and be monogamous and so on and so on.
844
00:43:23,839 --> 00:43:28,979
So they prescribe norms that do not make
a lot of sense in terms of beings that
845
00:43:28,979 --> 00:43:31,200
creates world every now and then,
846
00:43:31,200 --> 00:43:34,619
but they make sense in terms of
what you should be doing to be a
847
00:43:34,619 --> 00:43:36,730
functioning member of society.
848
00:43:36,730 --> 00:43:40,799
And this god also does things like it
creates world, they like to manifest as
849
00:43:40,799 --> 00:43:43,660
burning shrubbery and so on. There are
many books that describe stories that
850
00:43:43,660 --> 00:43:45,700
these gods have allegedly done.
851
00:43:45,700 --> 00:43:48,819
And it's very hard to test for all these
features which makes this gods very
852
00:43:48,819 --> 00:43:54,280
improbable for us. And makes Atheist
very dissatisfied with these gods.
853
00:43:54,280 --> 00:43:56,569
But then there is a different kind of god.
854
00:43:56,569 --> 00:43:58,599
This is what we call the spiritual god.
855
00:43:58,599 --> 00:44:02,410
This spiritual god is independent of
institutions, it still does care about you.
856
00:44:02,410 --> 00:44:06,489
It's probably conscious. It might not be a
person. There are not that many stories,
857
00:44:06,489 --> 00:44:10,579
that you can consistently tell about it,
but you might be able to connect to it
858
00:44:10,579 --> 00:44:15,259
spiritually.
859
00:44:15,259 --> 00:44:19,470
Then there is a god that is even less
expensive. That is god as a transcendental
860
00:44:19,470 --> 00:44:23,489
principle and this god is simply the reason
why there is something rather then
861
00:44:23,489 --> 00:44:28,150
nothing. This god is the question the
universe is the answer to, this is the
862
00:44:28,150 --> 00:44:29,600
thing that gives meaning.
863
00:44:29,600 --> 00:44:31,489
Everything else about it is unknowable.
864
00:44:31,489 --> 00:44:34,190
This is the god of Thomas of Aquinus.
865
00:44:34,190 --> 00:44:38,089
The God that Thomas of Aquinus discovered
is not the god of Abraham this is not the
866
00:44:38,089 --> 00:44:39,180
religious god.
867
00:44:39,180 --> 00:44:43,559
It's a god that is basically a principle
that us ... the universe into existence.
868
00:44:43,559 --> 00:44:47,140
It's the one that gives
the universe it's purpose.
869
00:44:47,140 --> 00:44:50,200
And because every other property
is unknowable about this,
870
00:44:50,200 --> 00:44:52,010
this god is not that expensive.
871
00:44:52,010 --> 00:44:55,960
Unfortunately it doesn't really work.
I mean Thomas of Aquinus tried to prove
872
00:44:55,960 --> 00:45:00,049
god. He tried to prove an necessary god,
a god that has to be existing and
873
00:45:00,049 --> 00:45:02,779
I think we can only prove a possible god.
874
00:45:02,779 --> 00:45:05,339
So if you try to prove a necessary god,
this god can not exist.
875
00:45:05,339 --> 00:45:11,650
Which means your god prove is going to
fail. You can only prove possible gods.
876
00:45:11,650 --> 00:45:13,259
And then there is an even more improper god.
877
00:45:13,259 --> 00:45:15,890
And that's the god of Aristotle and he said:
878
00:45:15,890 --> 00:45:20,069
"If there is change in the universe,
something in going to have to change it."
879
00:45:20,069 --> 00:45:23,640
There must be something that moves it
along from one state to the next.
880
00:45:23,640 --> 00:45:26,289
So I would say that is the primary
computational transition function
881
00:45:26,289 --> 00:45:35,079
of the universe.
laughingapplause
882
00:45:35,079 --> 00:45:38,439
And Aristotle discovered it.
It's amazing isn't it?
883
00:45:38,439 --> 00:45:41,509
We have to have this because we
can not be conscious in a single state.
884
00:45:41,509 --> 00:45:43,279
We need to move between states
to be conscious.
885
00:45:43,279 --> 00:45:45,979
We need to be processes.
886
00:45:45,979 --> 00:45:50,859
So we can take our gods and sort them by
their metaphysical cost.
887
00:45:50,859 --> 00:45:53,290
The 1st degree god would be the first mover.
888
00:45:53,290 --> 00:45:56,069
The 2nd degree god is the god of purpose and meaning.
889
00:45:56,069 --> 00:45:59,089
3rd degree god is the spiritual god.
And the 4th degree god is this bound to
890
00:45:59,089 --> 00:46:01,229
religious institutions, right?
891
00:46:01,229 --> 00:46:03,720
So if you take this statement
from Martin Nowak,
892
00:46:03,720 --> 00:46:07,759
"You can not have meaning without god!"
I would say: yes! You need at least
893
00:46:07,759 --> 00:46:14,990
a 2nd degree god to have meaning.
So objective meaning can only exist
894
00:46:14,990 --> 00:46:19,119
with a 2nd degree god. chuckling
895
00:46:19,119 --> 00:46:22,269
And subjective meaning can exist as a
function in a cognitive system of course.
896
00:46:22,269 --> 00:46:24,180
We don't need objective meaning.
897
00:46:24,180 --> 00:46:27,410
So we can subjectively feel that there is
something more important to us
898
00:46:27,410 --> 00:46:30,509
and this makes us work in society and
makes us perceive that we have values
899
00:46:30,509 --> 00:46:34,329
and so on, but we don't need to believe
that there is something outside of the
900
00:46:34,329 --> 00:46:36,869
universe to have this.
901
00:46:36,869 --> 00:46:40,650
So the 4th degree god is the one
that is bound to religious institutions,
902
00:46:40,650 --> 00:46:45,400
it requires a belief attractor and it
enables complex norm prescriptions.
903
00:46:45,400 --> 00:46:48,430
It my theory is right then it should be
much harder for nerds to believe in
904
00:46:48,430 --> 00:46:52,039
a 4th degree god then for normal people.
905
00:46:52,039 --> 00:46:56,489
And what this god does it allows you to
have state building mind viruses.
906
00:46:56,489 --> 00:47:00,269
Basically religion is a mind virus. And
the amazing thing about these mind viruses
907
00:47:00,269 --> 00:47:02,489
is that they structure behaviour
in large groups.
908
00:47:02,489 --> 00:47:06,130
We have evolved to live in small groups
of a few 100 individuals, maybe somthing
909
00:47:06,130 --> 00:47:07,249
like a 150.
910
00:47:07,249 --> 00:47:10,059
This is roughly the level
to which reputation works.
911
00:47:10,059 --> 00:47:15,369
We can keep track of about 150 people and
after this it gets much much worse.
912
00:47:15,369 --> 00:47:18,290
So in this system where you have
reputation people feel responsible
913
00:47:18,290 --> 00:47:21,349
for each other and they can
keep track of their doings
914
00:47:21,349 --> 00:47:23,049
and society kind of sort of works.
915
00:47:23,049 --> 00:47:27,789
If you want to go beyond this, you have
to right a software that controls people.
916
00:47:27,789 --> 00:47:32,420
And religions were the first software,
that did this on a very large scale.
917
00:47:32,420 --> 00:47:35,319
And in order to keep stable they had to be
designed like operating systems
918
00:47:35,319 --> 00:47:36,039
in some sense.
919
00:47:36,039 --> 00:47:39,930
They give people different roles
like insects in a hive.
920
00:47:39,930 --> 00:47:44,529
And they have even as part of this roles is
to update this religion but it has to be
921
00:47:44,529 --> 00:47:48,380
done very carefully and centrally
because otherwise the religion will split apart
922
00:47:48,380 --> 00:47:51,719
and fall together into new religions
or be overcome by new ones.
923
00:47:51,719 --> 00:47:54,259
So there is some kind of
evolutionary dynamics that goes on
924
00:47:54,259 --> 00:47:55,930
with respect to religion.
925
00:47:55,930 --> 00:47:58,519
And if you look the religions,
there is actually a veritable evolution
926
00:47:58,519 --> 00:47:59,739
of religions.
927
00:47:59,739 --> 00:48:04,789
So we have this Israelic tradition and
the Mesoputanic mythology that gave rise
928
00:48:04,789 --> 00:48:13,019
to Judaism. applause
929
00:48:13,019 --> 00:48:16,299
It's kind of cool, right? laughing
930
00:48:16,299 --> 00:48:36,289
Also history totally repeats itself.
roaring laughterapplause
931
00:48:36,289 --> 00:48:41,889
Yeah, it totally blew my mind when
I discovered this. laughter
932
00:48:41,889 --> 00:48:45,039
Of course the real tree of programming
languages is slightly more complicated,
933
00:48:45,039 --> 00:48:48,599
And the real tree of religion is slightly
more complicated.
934
00:48:48,599 --> 00:48:51,229
But still its neat.
935
00:48:51,229 --> 00:48:54,289
So if you want to immunize yourself
against mind viruses,
936
00:48:54,289 --> 00:48:58,570
first of all you want to check yourself
whether you are infected.
937
00:48:58,570 --> 00:49:02,809
You should check: Can I let go of my
current beliefs without feeling that
938
00:49:02,809 --> 00:49:07,670
meaning departures me and I feel very
terrible, when I let go of my beliefs.
939
00:49:07,670 --> 00:49:11,279
Also you should check: All the other
people around there that don't
940
00:49:11,279 --> 00:49:17,019
share my belief, are they either stupid,
or crazy, or evil?
941
00:49:17,019 --> 00:49:19,890
If you think this chances are you are
infected by some kind of mind virus,
942
00:49:19,890 --> 00:49:23,710
because they are just part
of the out group.
943
00:49:23,710 --> 00:49:28,059
And does your god have properties that
you know but you did not observe.
944
00:49:28,059 --> 00:49:32,490
So basically you have a god
of 2nd or 3rd degree or higher.
945
00:49:32,490 --> 00:49:34,589
In this case you also probably got a mind virus.
946
00:49:34,589 --> 00:49:37,259
There is nothing wrong
with having a mind virus,
947
00:49:37,259 --> 00:49:39,920
but if you want to immunize yourself
against this people have invented
948
00:49:39,920 --> 00:49:44,059
rationalism and enlightenment,
basically to act as immunization against
949
00:49:44,059 --> 00:49:50,660
mind viruses.
loud applause
950
00:49:50,660 --> 00:49:53,869
And in some sense its what the mind does
by itself because, if you want to
951
00:49:53,869 --> 00:49:56,949
understand how you go wrong,
you need to have a mechanism
952
00:49:56,949 --> 00:49:58,839
that discovers who you are.
953
00:49:58,839 --> 00:50:03,109
Some kind of auto debugging mechanism,
that makes the mind aware of itself.
954
00:50:03,109 --> 00:50:04,779
And this is actually the self.
955
00:50:04,779 --> 00:50:08,339
So according to Robert Kegan:
"The development of ourself is a process,
956
00:50:08,339 --> 00:50:13,400
in which we learn who we are by making
thing explicit", by making processes that
957
00:50:13,400 --> 00:50:17,249
are automatic visible to us and by
conceptualize them so we no longer
958
00:50:17,249 --> 00:50:18,859
identify with them.
959
00:50:18,859 --> 00:50:22,019
And it starts out with understanding
that there is only pleasure and pain.
960
00:50:22,019 --> 00:50:25,180
If you are a baby, you have only
pleasure and pain you identify with this.
961
00:50:25,180 --> 00:50:27,869
And then you turn into a toddler and the
toddler understands that they are not
962
00:50:27,869 --> 00:50:31,059
their pleasure and pain
but they are their impulses.
963
00:50:31,059 --> 00:50:34,259
And in the next level if you grow beyond
the toddler age you actually know that
964
00:50:34,259 --> 00:50:38,880
you have goals and that your needs and
impulses are there to serve goals, but its
965
00:50:38,880 --> 00:50:40,210
very difficult to let go of the goals,
966
00:50:40,210 --> 00:50:42,789
if you are a very young child.
967
00:50:42,789 --> 00:50:46,329
And at some point you realize: Oh, the
goals don't really matter, because
968
00:50:46,329 --> 00:50:49,509
sometimes you can not reach them, but
we have preferences, we have thing that we
969
00:50:49,509 --> 00:50:52,950
want to happen and thing that we do not
want to happen. And then at some point
970
00:50:52,950 --> 00:50:55,869
we realize that other people have
preferences, too.
971
00:50:55,869 --> 00:50:58,979
And then we start to model the world
as a system where different people have
972
00:50:58,979 --> 00:51:01,940
different preferences and we have
to navigate this landscape.
973
00:51:01,940 --> 00:51:06,420
And then we realize that this preferences
also relate to values and we start
974
00:51:06,420 --> 00:51:09,700
to identify with this values as members of
society.
975
00:51:09,700 --> 00:51:13,469
And this is basically the stage if you
are an adult being, that you get into.
976
00:51:13,469 --> 00:51:16,910
And you can get to a stage beyond that,
especially if you have people this, which
977
00:51:16,910 --> 00:51:20,059
have already done this. And this means
that you understand that people have
978
00:51:20,059 --> 00:51:23,660
different values and what they do
naturally flows out of them.
979
00:51:23,660 --> 00:51:26,849
And this values are not necessarily worse
than yours they are just different.
980
00:51:26,849 --> 00:51:29,450
And you learn that you can hold different
sets of values in your mind at
981
00:51:29,450 --> 00:51:33,019
the same time, isn't that amazing?
and understand other people, even if
982
00:51:33,019 --> 00:51:36,660
they are not part of your group.
If you get that, this is really good.
983
00:51:36,660 --> 00:51:39,269
But I don't think it stops there.
984
00:51:39,269 --> 00:51:43,019
You can also learn that the stuff that
you perceive is kind of incidental,
985
00:51:43,019 --> 00:51:45,339
that you can turn it of and you can
manipulate it.
986
00:51:45,339 --> 00:51:49,940
And at some point you also can realize
that yourself is only incidental that you
987
00:51:49,940 --> 00:51:52,559
can manipulate it or turn it of.
And that your basically some kind of
988
00:51:52,559 --> 00:51:57,420
consciousness that happens to run a brain
of some kind of person, that navigates
989
00:51:57,420 --> 00:52:04,279
the world in terms to get rewards or avoid
displeasure and serve values and so on,
990
00:52:04,279 --> 00:52:05,130
but it doesn't really matter.
991
00:52:05,130 --> 00:52:08,119
There is just this consciousness which
understands the world.
992
00:52:08,119 --> 00:52:11,009
And this is the stage that we typically
call enlightenment.
993
00:52:11,009 --> 00:52:14,549
In this stage you realize that you are not
your brain, but you are a story that
994
00:52:14,549 --> 00:52:25,640
your brain tells itself.
applause
995
00:52:25,640 --> 00:52:29,630
So becoming self aware is a process of
reverse engineering your mind.
996
00:52:29,630 --> 00:52:32,890
Its a different set of stages in which
to realize what goes on.
997
00:52:32,890 --> 00:52:33,799
So isn't that amazing.
998
00:52:33,799 --> 00:52:38,930
AI is a way to get to more self awareness?
999
00:52:38,930 --> 00:52:41,319
I think that is a good point to stop here.
1000
00:52:41,319 --> 00:52:44,499
The first talk that I gave in this series
was 2 years ago. It was about
1001
00:52:44,499 --> 00:52:45,979
how to build a mind.
1002
00:52:45,979 --> 00:52:49,670
Last year I talked about how to get from
basic computation to consciousness.
1003
00:52:49,670 --> 00:52:53,709
And this year we have talked about
finding meaning using AI.
1004
00:52:53,709 --> 00:52:57,470
I wonder where it goes next.
laughter
1005
00:52:57,470 --> 00:53:22,769
applause
1006
00:53:22,769 --> 00:53:26,489
Herald: Thank you for this amazing talk!
We now have some minutes for Q&A.
1007
00:53:26,489 --> 00:53:31,190
So please line up at the microphones as
always. If you are unable to stand up
1008
00:53:31,190 --> 00:53:36,430
for some reason please very very visibly
rise your hand, we should be able to dispatch
1009
00:53:36,430 --> 00:53:40,099
an audio angle to your location
so you can have a question too.
1010
00:53:40,099 --> 00:53:44,030
And also if you are locationally
disabled, you are not actually in the room
1011
00:53:44,030 --> 00:53:49,069
if you are on the stream, you can use IRC
or twitter to also ask questions.
1012
00:53:49,069 --> 00:53:50,989
We also have a person for that.
1013
00:53:50,989 --> 00:53:53,779
We will start at microphone number 2.
1014
00:53:53,779 --> 00:53:59,940
Q: Wow that's me. Just a guess! What
would you guess, when can you discuss
1015
00:53:59,940 --> 00:54:04,559
your talk with a machine,
in how many years?
1016
00:54:04,559 --> 00:54:07,400
Joscha: I don't know! As a software
engineer I know if I don't have the
1017
00:54:07,400 --> 00:54:12,619
specification all bets are off, until I
have the implementation. laughter
1018
00:54:12,619 --> 00:54:14,509
So it can be of any order of magnitude.
1019
00:54:14,509 --> 00:54:18,249
I have a gut feeling but I also know as a
software engineer that my gut feeling is
1020
00:54:18,249 --> 00:54:23,450
usually wrong, laughter
until I have the specification.
1021
00:54:23,450 --> 00:54:28,200
So the question is if there are silver
bullets? Right now there are some things
1022
00:54:28,200 --> 00:54:30,569
that are not solved yet and it could be
that they are easier to solve
1023
00:54:30,569 --> 00:54:33,469
than we think, but it could be that
they're harder to solve than we think.
1024
00:54:33,469 --> 00:54:36,710
Before I stumbled on this cortical
self organization thing,
1025
00:54:36,710 --> 00:54:40,719
I thought it's going to be something like
maybe 60, 80 years and now I think it's
1026
00:54:40,719 --> 00:54:47,289
way less, but again this is a very
subjective perspective. I don't know.
1027
00:54:47,289 --> 00:54:49,240
Herald: Number 1, please!
1028
00:54:49,240 --> 00:54:55,589
Q: Yes, I wanted to ask a little bit about
metacognition. It seems that you kind of
1029
00:54:55,589 --> 00:55:01,329
end your story saying that it's still
reflecting on input that you get and
1030
00:55:01,329 --> 00:55:04,900
kind of working with your social norms
and this and that, but Colberg
1031
00:55:04,900 --> 00:55:11,839
for instance talks about what he calls a
postconventional universal morality
1032
00:55:11,839 --> 00:55:17,420
for instance, which is thinking about
moral laws without context, basically
1033
00:55:17,420 --> 00:55:23,069
stating that there is something beyond the
relative norm that we have to each other,
1034
00:55:23,069 --> 00:55:29,579
which would only be possible if you can do
kind of, you know, meta cognition,
1035
00:55:29,579 --> 00:55:32,599
thinking about your own thinking
and then modifying that thinking.
1036
00:55:32,599 --> 00:55:37,229
So kind of feeding back your own ideas
into your own mind and coming up with
1037
00:55:37,229 --> 00:55:43,779
stuff that actually can't get ...
well processing external inputs.
1038
00:55:43,779 --> 00:55:48,469
Joscha: Mhm! I think it's very tricky.
This project of defining morality without
1039
00:55:48,469 --> 00:55:53,119
societies exists longer than Kant of
course. And Kant tried to give this
1040
00:55:53,119 --> 00:55:56,869
internal rules and others tried to.
I find this very difficult.
1041
00:55:56,869 --> 00:56:01,069
From my perspective we are just moving
bits of rocks. And this bits of rocks they
1042
00:56:01,069 --> 00:56:07,589
are on some kind of dust mode in a galaxy
out of trillions of galaxies and how can
1043
00:56:07,589 --> 00:56:08,609
there be meaning?
1044
00:56:08,609 --> 00:56:11,180
It's very hard for me to say:
1045
00:56:11,180 --> 00:56:13,969
One chimpanzee species is better than
another chimpanzee species or
1046
00:56:13,969 --> 00:56:16,559
a particular monkey
is better than another monkey.
1047
00:56:16,559 --> 00:56:18,539
This only happens
within a certain framework
1048
00:56:18,539 --> 00:56:20,160
and we have to set this framework.
1049
00:56:20,160 --> 00:56:23,700
And I don't think that we can define this
framework outside of a context of
1050
00:56:23,700 --> 00:56:26,420
social norms, that we have to agree on.
1051
00:56:26,420 --> 00:56:29,650
So objectively I'm not sure
if we can get to ethics.
1052
00:56:29,650 --> 00:56:33,769
I only think that is possible based on
some kind of framework that people
1053
00:56:33,769 --> 00:56:38,339
have to agree on implicitly or explicitly.
1054
00:56:38,339 --> 00:56:40,630
Herald: Microphone number 4, please.
1055
00:56:40,630 --> 00:56:46,559
Q: Hi, thank you, it was a fascinating talk.
I have 2 thought that went through my mind.
1056
00:56:46,559 --> 00:56:51,589
And the first one is that it's so
convincing the models that you present,
1057
00:56:51,589 --> 00:56:56,709
but it's kind of like you present
another metaphor of understanding the
1058
00:56:56,709 --> 00:57:01,670
brain which is still something that we try
to grasp on different levels of science
1059
00:57:01,670 --> 00:57:07,469
basically. And the 2nd one is that your
definition of the nerd who walks
1060
00:57:07,469 --> 00:57:10,950
and doesn't see the walls is kind of
definition... or reminds me
1061
00:57:10,950 --> 00:57:15,229
Richard Rortys definition of the ironist
which is a person who knows that their
1062
00:57:15,229 --> 00:57:20,799
vocabulary is finite and that other people
have also a finite vocabulary and
1063
00:57:20,799 --> 00:57:24,599
then that obviously opens up the whole question
of meaning making which has been
1064
00:57:24,599 --> 00:57:28,979
discussed in so many
other disciplines and fields.
1065
00:57:28,979 --> 00:57:32,930
And I thought about Darridas
deconstruction of ideas and thoughts and
1066
00:57:32,930 --> 00:57:36,300
Butler and then down the rabbit hole to
Nietzsche and I was just wondering,
1067
00:57:36,300 --> 00:57:39,009
if you could maybe
map out other connections
1068
00:57:39,009 --> 00:57:44,430
where basically not AI helping us to
understand the mind, but where
1069
00:57:44,430 --> 00:57:49,819
already existing huge, huge fields of
science, like cognitive process
1070
00:57:49,819 --> 00:57:53,359
coming from the other end could help us
to understand AI.
1071
00:57:53,359 --> 00:57:59,680
Joscha: Thank you, the tradition that you
mentioned Rorty and Butler and so on
1072
00:57:59,680 --> 00:58:02,989
are part of a completely different belief
attractor in my current perspective.
1073
00:58:02,989 --> 00:58:06,209
That is they are mostly
social constructionists.
1074
00:58:06,209 --> 00:58:10,880
They believe that reality at least in the
domains of the mind and sociality
1075
00:58:10,880 --> 00:58:15,359
are social constructs they are part
of social agreement.
1076
00:58:15,359 --> 00:58:17,190
Personally I don't think that
this is the case.
1077
00:58:17,190 --> 00:58:19,630
I think that patterns that we refer to
1078
00:58:19,630 --> 00:58:23,890
are mostly independent of your mind.
The norms are part of social constructs,
1079
00:58:23,890 --> 00:58:28,099
but for instance our motivational
preferences that make us adapt or
1080
00:58:28,099 --> 00:58:32,719
reject norms, are something that builds up
resistance to the environment.
1081
00:58:32,719 --> 00:58:35,660
So they are probably not part
of social agreement.
1082
00:58:35,660 --> 00:58:41,569
And the only thing I can invite you to is
try to retrace both of the different
1083
00:58:41,569 --> 00:58:45,640
belief attractors, try to retrace the
different paths on the landscape.
1084
00:58:45,640 --> 00:58:48,529
All this thing that I tell you, all of
this is of course very speculative.
1085
00:58:48,529 --> 00:58:52,390
These are that seem to be logical
to me at this point in my life.
1086
00:58:52,390 --> 00:58:55,400
And I try to give you the arguments
why I think that is plausible, but don't
1087
00:58:55,400 --> 00:58:59,109
believe in them, question them, challenge
them, see if they work for you!
1088
00:58:59,109 --> 00:59:00,559
I'm not giving you any truth.
1089
00:59:00,559 --> 00:59:05,720
I'm just going to give you suitable encodings
according to my current perspective.
1090
00:59:05,720 --> 00:59:11,739
Q:Thank you!
applause
1091
00:59:11,739 --> 00:59:15,099
Herald: The internet, please!
1092
00:59:19,179 --> 00:59:26,029
Signal angel: So, someone is asking
if in this belief space you're talking about
1093
00:59:26,029 --> 00:59:30,109
how is it possible
to get out of local minima?
1094
00:59:30,109 --> 00:59:33,959
And very related question as well:
1095
00:59:33,959 --> 00:59:38,530
Should we teach some momentum method
to our children,
1096
00:59:38,530 --> 00:59:41,599
so we don't get stuck in a local minima.
1097
00:59:41,599 --> 00:59:44,829
Joscha: I believe at some level it's not
possible to get out of a local minima.
1098
00:59:44,829 --> 00:59:50,329
In an absolute sense, because you only get
to get into some kind of meta minimum,
1099
00:59:50,329 --> 00:59:56,769
but what you can do is to retrace the
path that you took whenever you discover
1100
00:59:56,769 --> 00:59:59,989
that somebody else has a fundamentally
different set of beliefs.
1101
00:59:59,989 --> 01:00:02,769
And if you realize that this person is
basically a smart person that is not
1102
01:00:02,769 --> 01:00:07,359
completely insane but has reasons to
believe in their beliefs and they seem to
1103
01:00:07,359 --> 01:00:10,579
be internally consistent it's usually
worth to retrace what they
1104
01:00:10,579 --> 01:00:12,180
have been thinking and why.
1105
01:00:12,180 --> 01:00:15,930
And this means you have to understand
where their starting point was and
1106
01:00:15,930 --> 01:00:18,279
how they moved from their current point
to their starting point.
1107
01:00:18,279 --> 01:00:22,219
You might not be able to do this
accurately and the important thing is
1108
01:00:22,219 --> 01:00:25,369
also afterwards you discover a second
valley, you haven't discovered
1109
01:00:25,369 --> 01:00:27,059
the landscape inbetween.
1110
01:00:27,059 --> 01:00:30,839
But the only way that we can get an idea
of the lay of the land is that we try to
1111
01:00:30,839 --> 01:00:33,200
retrace as many paths as possible.
1112
01:00:33,200 --> 01:00:36,339
And if we try to teach our children, what
I think what we should be doing is:
1113
01:00:36,339 --> 01:00:38,650
To tell them how to explore
this world on there own.
1114
01:00:38,650 --> 01:00:43,900
It's not that we tell them this is the
valley, basically it's given, it's
1115
01:00:43,900 --> 01:00:47,599
the truth, but instead we have to tell
them: This is the path that we took.
1116
01:00:47,599 --> 01:00:51,239
And these are the things that we saw
inbetween and it is important to be not
1117
01:00:51,239 --> 01:00:54,390
completely naive when we go into this
landscape, but we also have to understand
1118
01:00:54,390 --> 01:00:58,170
that it's always an exploration that
never stops and that might change
1119
01:00:58,170 --> 01:01:01,140
everything that you believe now
at a later point.
1120
01:01:01,140 --> 01:01:05,700
So for me it's about teaching my own
children how to be explorers,
1121
01:01:05,700 --> 01:01:10,950
how to understand that knowledge is always
changing and it's always a moving frontier.
1122
01:01:10,950 --> 01:01:17,230
applause
1123
01:01:17,230 --> 01:01:22,259
Herald: We are unfortunately out of time.
So, please once again thank Joscha!
1124
01:01:22,259 --> 01:01:24,069
applause
Joscha: Thank you!
1125
01:01:24,069 --> 01:01:28,239
applause
1126
01:01:28,239 --> 01:01:32,719
postroll music
1127
01:01:32,719 --> 01:01:40,000
subtitles created by c3subtitles.de
Join, and help us!