1 00:00:16,256 --> 00:00:19,993 Intelligence, what is it? 2 00:00:19,993 --> 00:00:24,610 If we take a look back at the history of how intelligence is being viewed, 3 00:00:24,610 --> 00:00:31,187 one seminal example has been Edsger Dijkstra's famous quote 4 00:00:31,187 --> 00:00:34,898 that the question of whether a machine can think 5 00:00:34,898 --> 00:00:37,779 is about as interesting as the question of 6 00:00:37,779 --> 00:00:40,942 whether a submarine can swim. 7 00:00:41,412 --> 00:00:46,832 Now, Edsger Dijkstra, when he wrote this, intended it as a criticism 8 00:00:46,832 --> 00:00:51,519 of early pioneers of computer science like Alan Turing. 9 00:00:52,679 --> 00:00:55,532 However, if you take a look back 10 00:00:55,532 --> 00:00:59,406 and think about what have been the most empowering innovations 11 00:00:59,406 --> 00:01:03,007 that enabled us to build artificial machines that swim 12 00:01:03,507 --> 00:01:06,278 and artificial machines that [fly], 13 00:01:06,388 --> 00:01:10,700 you find that it was only through understanding the underlying 14 00:01:10,810 --> 00:01:15,916 physical mechanisms of swimming and flight that we were able 15 00:01:15,916 --> 00:01:18,442 to build these machines. 16 00:01:18,442 --> 00:01:22,019 And so, several years ago, I undertook a program 17 00:01:22,019 --> 00:01:26,487 to try to understand the fundamental physical mechanisms 18 00:01:26,487 --> 00:01:29,065 underlying intelligence. 19 00:01:30,275 --> 00:01:32,338 Let's take a step back. 20 00:01:32,364 --> 00:01:35,613 Let's first begin with a thought experiment. 21 00:01:35,613 --> 00:01:38,019 Pretend that you're an alien race 22 00:01:38,019 --> 00:01:42,854 that doesn't know anything about Earth biology or Earth neuroscience 23 00:01:42,854 --> 00:01:46,565 or Earth intelligence, but you have amazing telescopes 24 00:01:46,655 --> 00:01:51,021 and you're able to watch the Earth and you have amazingly long lives 25 00:01:51,021 --> 00:01:55,943 so you're able to watch the Earth over millions, even billions of years. 26 00:01:55,943 --> 00:02:00,361 And you observe a really strange effect, 27 00:02:00,361 --> 00:02:03,623 you observe that over the course of the millennia, 28 00:02:03,649 --> 00:02:09,798 Earth is continually bombarded with asteroids up until a point 29 00:02:09,798 --> 00:02:13,201 and that at some point, corresponding roughly 30 00:02:13,221 --> 00:02:18,980 to our year 2000 AD, asteroids that are on a collision course with the Earth, 31 00:02:19,241 --> 00:02:23,293 that otherwise would have collided, mysteriously get deflected 32 00:02:23,893 --> 00:02:26,643 or detonate before they can hit the Earth. 33 00:02:26,863 --> 00:02:30,364 Now, of course, as Earthlings, we know the reason would be 34 00:02:30,364 --> 00:02:35,088 that we're trying to save ourselves, we're trying to prevent an impact. 35 00:02:35,088 --> 00:02:38,087 But if you're an alien race that doesn't know any of this, 36 00:02:38,087 --> 00:02:40,664 that doesn't have any concept of Earth intelligence, 37 00:02:40,664 --> 00:02:42,543 you'd be forced to put together 38 00:02:42,543 --> 00:02:46,945 a physical theory that explains how, up until a certain point in time, 39 00:02:47,925 --> 00:02:51,508 asteroids thad would demolish the surface of the planet, 40 00:02:52,258 --> 00:02:55,361 mysteriously stop doing that. 41 00:02:55,361 --> 00:02:59,610 So, I claim that this is the same question 42 00:02:59,610 --> 00:03:03,112 as understanding the physical nature of intelligence. 43 00:03:03,762 --> 00:03:08,863 So, in this program that I undertook years ago, I've looked at a variety 44 00:03:08,863 --> 00:03:13,766 of different threads in crossed science across a variety of disciplines, 45 00:03:13,766 --> 00:03:19,280 pointing, I think, towards a single underlying mechanism for intelligence. 46 00:03:19,910 --> 00:03:22,024 In cosmology, for example, 47 00:03:22,314 --> 00:03:24,968 there has been a variety of different threads of evidence 48 00:03:24,968 --> 00:03:29,695 that our universe appears to be finely tuned for the development 49 00:03:29,695 --> 00:03:33,360 of intelligence, and in particular, for the development 50 00:03:33,360 --> 00:03:38,941 of universal states that maximize the diversity of possible futures. 51 00:03:38,941 --> 00:03:44,063 In gameplay, for example in Go, everyone remembers in 1997 52 00:03:44,403 --> 00:03:48,120 when IBM's Deep Blue beat Gary Kasparov at chess. 53 00:03:48,480 --> 00:03:51,934 Fewer people are aware that in the past ten year or so, 54 00:03:51,934 --> 00:03:56,137 the game of Go, arguably a much more challenging game because it has 55 00:03:56,137 --> 00:04:00,804 a much higher branching factor, has also started to succumb to computer 56 00:04:00,804 --> 00:04:03,862 game players for the same reason. 57 00:04:03,862 --> 00:04:06,529 The best techniques, right now, for computers playing Go, 58 00:04:06,529 --> 00:04:11,651 are techniques that try to maximize future options during gameplay. 59 00:04:12,091 --> 00:04:15,693 Finally, in robotic motion planning, 60 00:04:15,693 --> 00:04:17,863 there has been a variety of recent techniques 61 00:04:17,863 --> 00:04:22,768 that have tried to take advantage of abilities of robots to maximize 62 00:04:23,018 --> 00:04:27,116 future freedom of action in order to accomplish complex tasks. 63 00:04:27,496 --> 00:04:31,340 And so, taking all of these different threads and putting them together, 64 00:04:31,730 --> 00:04:36,090 I asked, starting several years ago, is there an underlying mechanism 65 00:04:36,340 --> 00:04:40,249 for intelligence that we can factor out of all of these different threads? 66 00:04:40,509 --> 00:04:45,250 Is there, as it were, a single equation for intelligence? 67 00:04:46,990 --> 00:04:50,442 And the answer, I believe, is yes. 68 00:04:50,468 --> 00:04:57,469 What you're seeing is probably the closest equivalent to an E=mc2 for intelligence 69 00:04:57,469 --> 00:05:00,072 that I certainly have ever seen. 70 00:05:00,098 --> 00:05:02,276 So, what you're seeing here 71 00:05:02,371 --> 00:05:07,835 is a statement of correspondence that intelligence is a Force (F) 72 00:05:08,765 --> 00:05:13,390 that acts so as to maximize future freedom of action; 73 00:05:13,590 --> 00:05:17,324 It acts to maximize future freedom of action or keep options open 74 00:05:17,324 --> 00:05:19,654 with some strength (T), 75 00:05:19,904 --> 00:05:24,955 with the amount of the diversity of possible accessible futures (S), 76 00:05:24,985 --> 00:05:28,295 up to some future time horizon (Ƭ). 77 00:05:28,321 --> 00:05:30,613 In short, intelligence doesn't like 78 00:05:30,613 --> 00:05:34,498 to get trapped, intelligence tries to maximize future freedom of action 79 00:05:34,498 --> 00:05:39,526 and keep options open. And so, given this one equation 80 00:05:39,526 --> 00:05:42,445 it's natural to ask: So, what can you do with this? 81 00:05:42,445 --> 00:05:45,856 How predictive is it? Does it predict human-level intelligence? 82 00:05:45,856 --> 00:05:48,609 Does it predict artificial intelligence? 83 00:05:48,609 --> 00:05:53,726 So, I'm going to show you now a video that will, I think, demonstrate 84 00:05:54,006 --> 00:05:58,294 some of the amazing applications of just this single equation. 85 00:06:00,004 --> 00:06:03,357 (Video) Recent research in cosmology has suggested that universes 86 00:06:03,357 --> 00:06:07,531 that produce more disorder or "entropy" over their lifetimes should tend 87 00:06:07,531 --> 00:06:11,269 to have more favorable conditions for the existence of intelligent beings 88 00:06:11,589 --> 00:06:13,445 such as ourselves. 89 00:06:13,445 --> 00:06:15,763 But what if that tentative cosmological connection 90 00:06:15,763 --> 00:06:19,449 between entropy and intelligence hints at a deeper relationship? 91 00:06:19,449 --> 00:06:22,012 What if intelligent behavior doesn't just correlate 92 00:06:22,012 --> 00:06:26,226 with the production of long-term entropy, but actually emerges directly from it? 93 00:06:26,576 --> 00:06:30,114 To find out, we developed a software engine called ENTROPICA 94 00:06:30,114 --> 00:06:34,184 designed to maximize the production of long-term entropy of any system 95 00:06:34,184 --> 00:06:36,001 that it finds itself in. 96 00:06:36,001 --> 00:06:40,645 Amazingly, ENTROPICA was able to pass multiple animal intelligence tests, 97 00:06:40,645 --> 00:06:43,766 play human games and even earn money trading stocks; 98 00:06:43,766 --> 00:06:46,157 all without being instructed to do so. 99 00:06:46,157 --> 00:06:48,610 Here are some examples of ENTROPICA in action: 100 00:06:48,610 --> 00:06:52,164 just like a human standing upright without falling over, here we see 101 00:06:52,254 --> 00:06:56,225 ENTROPICA automatically balancing a pole using a cart. 102 00:06:56,225 --> 00:07:00,342 This behavior is remarkable, in part, because we never gave ENTROPICA a goal, 103 00:07:00,342 --> 00:07:03,754 it simply decided on its own to balance the pole. 104 00:07:03,754 --> 00:07:06,997 This balancing ability would have applications for humanoid robotics 105 00:07:06,997 --> 00:07:09,277 and human assistive technologies. 106 00:07:09,625 --> 00:07:12,679 Just as some animals can use objects in their environments 107 00:07:12,679 --> 00:07:15,056 as tools to reach into narrow spaces, 108 00:07:15,056 --> 00:07:18,967 here we see that ENTROPICA, again on its own initiative, 109 00:07:18,967 --> 00:07:22,192 was able to move a large disk, representing an animal, 110 00:07:22,192 --> 00:07:25,450 around so as to cause a small disk, representing a tool, 111 00:07:25,450 --> 00:07:28,346 to reach into a confined space holding a third disk 112 00:07:28,346 --> 00:07:31,953 and release the third disk from its initially fixed position. 113 00:07:31,953 --> 00:07:36,658 This tool usability would have application for smart manufacturing and agriculture. 114 00:07:37,338 --> 00:07:40,295 In addition, just as some other animals are able to cooperate 115 00:07:40,295 --> 00:07:44,043 by pulling opposite ends of a rope at the same time to release food, 116 00:07:44,043 --> 00:07:46,740 here we see that ENTROPICA is able to accomplish 117 00:07:46,740 --> 00:07:48,497 a model version of that task. 118 00:07:48,497 --> 00:07:52,136 This cooperative ability has interesting implications for economic planning 119 00:07:52,136 --> 00:07:55,450 and a variety of other fields. 120 00:07:55,450 --> 00:07:59,288 ENTROPICA is broadly applicable to a variety of domains. 121 00:07:59,288 --> 00:08:03,661 For example, here we see it successfully playing a game of pong against itself 122 00:08:04,371 --> 00:08:06,347 illustrating its potential for gaming. 123 00:08:08,103 --> 00:08:09,794 Here, we see ENTROPICA orchestrating 124 00:08:09,794 --> 00:08:13,289 new connections on a social network where friends are constantly 125 00:08:13,289 --> 00:08:17,341 falling out of touch and successfully keeping the network well connected. 126 00:08:17,671 --> 00:08:22,252 This same network orchestration ability also has applications in health care, 127 00:08:22,252 --> 00:08:25,404 energy and intelligence. 128 00:08:25,404 --> 00:08:28,816 Here we see ENTROPICA directing the paths of a fleet of ships 129 00:08:28,816 --> 00:08:33,260 successfully discovering and utilizing the Panama Canal to globally extend 130 00:08:33,260 --> 00:08:35,951 its reach from the Atlantic to the Pacific. 131 00:08:35,951 --> 00:08:39,253 By the same token, ENTROPICA is broadly applicable to problems 132 00:08:39,253 --> 00:08:43,496 in autonomous defense, logistics and transportation. 133 00:08:44,566 --> 00:08:49,370 Finally, here we see ENTROPICA spontaneously discovering and executing 134 00:08:49,370 --> 00:08:53,843 a buy low, sell high strategy on a simulated range traded stock 135 00:08:53,843 --> 00:08:57,369 successfully growing assets under management exponentially. 136 00:08:57,369 --> 00:09:00,513 This risk management ability would have broad applications 137 00:09:00,513 --> 00:09:02,911 in finance and insurance. 138 00:09:08,475 --> 00:09:12,067 AWG: So, what you've just seen is that a variety 139 00:09:12,114 --> 00:09:16,178 of signature human intelligent cognitive behavior 140 00:09:16,204 --> 00:09:18,895 such us tool use and walking upright 141 00:09:19,490 --> 00:09:24,005 and social cooperation, all follow from a single equation 142 00:09:24,255 --> 00:09:29,452 which drives a system to maximize its future freedom of action. 143 00:09:30,242 --> 00:09:33,237 Now, there's a profound irony here. 144 00:09:33,237 --> 00:09:37,663 Going back to the beginning of the usage of the term robot, 145 00:09:38,503 --> 00:09:41,499 the play RUR, 146 00:09:41,499 --> 00:09:46,515 there was always a concept that if we develop machine, intelligence, 147 00:09:47,345 --> 00:09:52,622 there will be a cybernetic revolt, that machines would rise up against us. 148 00:09:53,452 --> 00:09:58,731 One major consequence of this work is that maybe all of these decades 149 00:09:58,731 --> 00:10:02,872 we've had the whole concept of cybernetic revolt in reverse. 150 00:10:03,772 --> 00:10:06,918 It's not that machines first become intelligent 151 00:10:06,918 --> 00:10:11,283 and then megalomaniacal, and try to take over the world. 152 00:10:11,283 --> 00:10:15,621 It's quite the opposite: that the urge to take control 153 00:10:15,621 --> 00:10:19,701 of all possible futures is a more fundamental principle 154 00:10:20,071 --> 00:10:23,949 than that of intelligence; that general intelligence may, in fact, 155 00:10:23,949 --> 00:10:28,456 emerge directly from this sort of control grabbing, 156 00:10:28,456 --> 00:10:31,209 rather than vice versa. 157 00:10:32,589 --> 00:10:36,312 Another important consequence is goal seeking. 158 00:10:36,652 --> 00:10:42,443 I'm often asked how does the ability to seek goals follow from this framework 159 00:10:42,643 --> 00:10:43,747 and the answer is: 160 00:10:43,747 --> 00:10:48,203 the ability to seek goals, for example if you're playing the game of chess, 161 00:10:48,543 --> 00:10:53,252 to try to win that game of chess in order to accomplish worldly goods 162 00:10:53,252 --> 00:10:55,599 and accomplishments outside of that game, 163 00:10:55,809 --> 00:10:59,124 will follow directly from this in the following sense: 164 00:10:59,554 --> 00:11:03,855 Just like you would travel through a tunnel, a bottleneck, 165 00:11:03,855 --> 00:11:07,050 in your future path space in order to achieve many other 166 00:11:07,050 --> 00:11:11,178 diverse objectives later on or just like you would invest 167 00:11:11,178 --> 00:11:15,350 in a financial security reducing your short term liquidity 168 00:11:15,350 --> 00:11:17,825 in order to increase your wealth over the long term, 169 00:11:17,825 --> 00:11:21,613 goal seeking emerges directly from a long term drive 170 00:11:21,613 --> 00:11:25,571 to increase future freedom of action. 171 00:11:25,571 --> 00:11:29,881 Finally, the famous physicist Richard Feynman once wrote 172 00:11:30,361 --> 00:11:34,703 that if human civilization were destroyed and you could pass only a single concept 173 00:11:34,703 --> 00:11:38,164 on to our descendents to help them rebuild civilization, 174 00:11:38,524 --> 00:11:41,620 that concept should be that all matter around us 175 00:11:42,240 --> 00:11:45,506 is made out of tiny elements that attract each other 176 00:11:45,506 --> 00:11:48,101 when they're far apart, but repel each other 177 00:11:48,341 --> 00:11:50,096 when they're close together. 178 00:11:50,126 --> 00:11:53,152 My equivalent to that statement to pass on to descendents 179 00:11:53,472 --> 00:11:55,915 to help them build artificial intelligence, 180 00:11:55,915 --> 00:11:59,988 or to help them to understand human intelligence, is the following: 181 00:12:00,108 --> 00:12:03,541 Intelligence should be viewed as a physical process 182 00:12:03,541 --> 00:12:06,492 that tries to maximize future freedom of action 183 00:12:06,492 --> 00:12:09,624 and avoid constraints in its own future. 184 00:12:10,194 --> 00:12:11,452 Thank you very much. 185 00:12:11,478 --> 00:12:14,478 (Applause)