0:00:00.899,0:00:04.566
Intelligence -- what is it?

0:00:04.566,0:00:06.857
If we take a look back at the history

0:00:06.857,0:00:09.481
of how intelligence has been viewed,

0:00:09.481,0:00:13.099
one seminal example has been

0:00:13.099,0:00:16.576
Edsger Dijkstra's famous quote that

0:00:16.576,0:00:19.687
"the question of whether a machine can think

0:00:19.687,0:00:20.997
is about as interesting

0:00:20.997,0:00:23.968
as the question of whether a submarine

0:00:23.968,0:00:25.758
can swim."

0:00:25.758,0:00:29.602
Now, Edsger Dijkstra, when he wrote this,

0:00:29.602,0:00:31.656
intended it as a criticism

0:00:31.656,0:00:34.656
of the early pioneers of computer science,

0:00:34.656,0:00:36.403
like Alan Turing.

0:00:36.403,0:00:38.902
However, if you take a look back

0:00:38.902,0:00:40.867
and think about what have been

0:00:40.867,0:00:42.863
the most empowering innovations

0:00:42.863,0:00:44.742
that enabled us to build

0:00:44.742,0:00:46.976
artificial machines that swim

0:00:46.976,0:00:49.549
and artificial machines that [fly],

0:00:49.549,0:00:53.096
you find that it was only through understanding

0:00:53.096,0:00:55.704
the underlying physical mechanisms

0:00:55.704,0:00:58.483
of swimming and flight

0:00:58.483,0:01:01.655
that we were able to build these machines.

0:01:01.655,0:01:03.911
And so, several years ago,

0:01:03.911,0:01:07.160
I undertook a program to try to understand

0:01:07.160,0:01:09.794
the fundamental physical mechanisms

0:01:09.794,0:01:12.562
underlying intelligence.

0:01:12.562,0:01:14.422
Let's take a step back.

0:01:14.422,0:01:17.571
Let's first begin with a thought experiment.

0:01:17.571,0:01:20.425
Pretend that you're an alien race

0:01:20.425,0:01:23.466
that doesn't know anything about Earth biology

0:01:23.466,0:01:26.582
or Earth neuroscience or Earth intelligence,

0:01:26.582,0:01:28.774
but you have amazing telescopes

0:01:28.774,0:01:31.136
and you're able to watch the Earth,

0:01:31.136,0:01:33.468
and you have amazingly long lives,

0:01:33.468,0:01:34.967
so you're able to watch the Earth

0:01:34.967,0:01:38.409
over millions, even billions of years.

0:01:38.409,0:01:41.424
And you observe a really strange effect.

0:01:41.424,0:01:45.736
You observe that, over the course of the millennia,

0:01:45.736,0:01:50.021
Earth is continually bombarded with asteroids

0:01:50.021,0:01:52.108
up until a point,

0:01:52.108,0:01:53.639
and that at some point,

0:01:53.639,0:01:57.831
corresponding roughly to our year, 2000 AD,

0:01:57.831,0:01:59.547
asteroids that are on

0:01:59.547,0:02:01.478
a collision course with the Earth

0:02:01.478,0:02:03.453
that otherwise would have collided

0:02:03.453,0:02:05.868
mysteriously get deflected

0:02:05.868,0:02:08.940
or they detonate before they can hit the Earth.

0:02:08.940,0:02:11.023
Now of course, as earthlings,

0:02:11.023,0:02:12.567
we know the reason would be

0:02:12.567,0:02:14.323
that we're trying to save ourselves.

0:02:14.323,0:02:17.403
We're trying to prevent an impact.

0:02:17.403,0:02:19.114
But if you're an alien race

0:02:19.114,0:02:20.260
who doesn't know any of this,

0:02:20.260,0:02:22.774
doesn't have any concept of Earth intelligence,

0:02:22.774,0:02:24.502
you'd be forced to put together

0:02:24.502,0:02:27.420
a physical theory that explains how,

0:02:27.420,0:02:29.958
up until a certain point in time,

0:02:29.958,0:02:34.407
asteroids that would demolish the surface of a planet

0:02:34.407,0:02:37.638
mysteriously stop doing that.

0:02:37.638,0:02:41.842
And so I claim that this is the same question

0:02:41.842,0:02:45.840
as understanding the physical nature of intelligence.

0:02:45.840,0:02:49.722
So in this program that I[br]undertook several years ago,

0:02:49.722,0:02:52.487
I looked at a variety of different threads

0:02:52.487,0:02:55.649
across science, across a variety of disciplines,

0:02:55.649,0:02:57.541
that were pointing, I think,

0:02:57.541,0:03:00.089
towards a single, underlying mechanism

0:03:00.089,0:03:01.670
for intelligence.

0:03:01.670,0:03:04.216
In cosmology, for example,

0:03:04.216,0:03:06.963
there have been a variety of[br]different threads of evidence

0:03:06.963,0:03:10.370
that our universe appears to be finely tuned

0:03:10.370,0:03:12.523
for the development of intelligence,

0:03:12.523,0:03:14.912
and, in particular, for the development

0:03:14.912,0:03:16.798
of universal states

0:03:16.798,0:03:20.896
that maximize the diversity of possible futures.

0:03:20.896,0:03:23.240
In game play, for example, in Go --

0:03:23.240,0:03:26.265
everyone remembers in 1997

0:03:26.265,0:03:30.216
when IBM's Deep Blue beat [br]Garry Kasparov at chess --

0:03:30.216,0:03:31.739
fewer people are aware

0:03:31.739,0:03:33.757
that in the past 10 years or so,

0:03:33.757,0:03:34.955
the game of Go,

0:03:34.955,0:03:36.911
arguably a much more challenging game

0:03:36.911,0:03:39.336
because it has a much higher branching factor,

0:03:39.336,0:03:41.038
has also started to succumb

0:03:41.038,0:03:42.903
to computer game players

0:03:42.903,0:03:44.476
for the same reason:

0:03:44.476,0:03:47.276
the best techniques right now[br]for computers playing Go

0:03:47.276,0:03:50.972
are techniques that try to maximize future options

0:03:50.972,0:03:52.986
during game play.

0:03:52.986,0:03:56.567
Finally, in robotic motion planning,

0:03:56.567,0:03:58.749
there have been a variety of recent techniques

0:03:58.749,0:04:00.651
that have tried to take advantage

0:04:00.651,0:04:03.797
of abilities of robots to maximize

0:04:03.797,0:04:05.303
future freedom of action

0:04:05.303,0:04:08.400
in order to accomplish complex tasks.

0:04:08.400,0:04:10.755
And so, taking all of these different threads

0:04:10.755,0:04:12.377
and putting them together,

0:04:12.377,0:04:15.017
I asked, starting several years ago,

0:04:15.017,0:04:17.867
is there an underlying mechanism for intelligence

0:04:17.867,0:04:19.540
that we can factor out

0:04:19.540,0:04:21.314
of all of these different threads?

0:04:21.314,0:04:25.907
Is there a single equation for intelligence?

0:04:25.907,0:04:29.278
And the answer, I believe, is yes.[br]["F = T ∇ Sτ"]

0:04:29.278,0:04:31.191
What you're seeing is probably

0:04:31.191,0:04:34.485
the closest equivalent to an E = mc²

0:04:34.485,0:04:37.315
for intelligence that I've seen.

0:04:37.315,0:04:39.017
So what you're seeing here

0:04:39.017,0:04:41.686
is a statement of correspondence

0:04:41.686,0:04:46.121
that intelligence is a force, F,

0:04:46.121,0:04:50.771
that acts so as to maximize future freedom of action.

0:04:50.771,0:04:53.146
It acts to maximize future freedom of action,

0:04:53.146,0:04:54.774
or keep options open,

0:04:54.774,0:04:56.999
with some strength T,

0:04:56.999,0:05:01.776
with the diversity of possible accessible futures, S,

0:05:01.776,0:05:04.326
up to some future time horizon, tau.

0:05:04.326,0:05:07.535
In short, intelligence doesn't like to get trapped.

0:05:07.535,0:05:10.590
Intelligence tries to maximize[br]future freedom of action

0:05:10.590,0:05:13.263
and keep options open.

0:05:13.263,0:05:15.696
And so, given this one equation,

0:05:15.696,0:05:18.228
it's natural to ask, so what can you do with this?

0:05:18.228,0:05:19.579
How predictive is it?

0:05:19.579,0:05:21.714
Does it predict human-level intelligence?

0:05:21.714,0:05:24.532
Does it predict artificial intelligence?

0:05:24.532,0:05:26.574
So I'm going to show you now a video

0:05:26.574,0:05:29.994
that will, I think, demonstrate

0:05:29.994,0:05:32.282
some of the amazing applications

0:05:32.282,0:05:34.601
of just this single equation.

0:05:34.601,0:05:36.580
(Video) Narrator: Recent research in cosmology

0:05:36.580,0:05:38.627
has suggested that universes that produce

0:05:38.627,0:05:42.108
more disorder, or "entropy," over their lifetimes

0:05:42.108,0:05:44.586
should tend to have more favorable conditions

0:05:44.586,0:05:47.602
for the existence of intelligent[br]beings such as ourselves.

0:05:47.602,0:05:50.176
But what if that tentative cosmological connection

0:05:50.176,0:05:52.019
between entropy and intelligence

0:05:52.019,0:05:53.790
hints at a deeper relationship?

0:05:53.790,0:05:56.354
What if intelligent behavior doesn't just correlate

0:05:56.354,0:05:58.198
with the production of long-term entropy,

0:05:58.198,0:06:00.516
but actually emerges directly from it?

0:06:00.516,0:06:02.922
To find out, we developed a software engine

0:06:02.922,0:06:05.425
called Entropica, designed to maximize

0:06:05.425,0:06:07.193
the production of long-term entropy

0:06:07.193,0:06:09.769
of any system that it finds itself in.

0:06:09.769,0:06:11.924
Amazingly, Entropica was able to pass

0:06:11.924,0:06:15.380
multiple animal intelligence[br]tests, play human games,

0:06:15.380,0:06:17.526
and even earn money trading stocks,

0:06:17.526,0:06:19.637
all without being instructed to do so.

0:06:19.637,0:06:22.155
Here are some examples of Entropica in action.

0:06:22.155,0:06:25.360
Just like a human standing[br]upright without falling over,

0:06:25.360,0:06:26.590
here we see Entropica

0:06:26.590,0:06:29.475
automatically balancing a pole using a cart.

0:06:29.475,0:06:31.487
This behavior is remarkable in part

0:06:31.487,0:06:33.818
because we never gave Entropica a goal.

0:06:33.818,0:06:36.975
It simply decided on its own to balance the pole.

0:06:36.975,0:06:39.107
This balancing ability will have appliactions

0:06:39.107,0:06:40.504
for humanoid robotics

0:06:40.504,0:06:43.019
and human assistive technologies.

0:06:43.019,0:06:45.020
Just as some animals can use objects

0:06:45.020,0:06:46.462
in their environments as tools

0:06:46.462,0:06:48.449
to reach into narrow spaces,

0:06:48.449,0:06:50.331
here we see that Entropica,

0:06:50.331,0:06:52.169
again on its own initiative,

0:06:52.169,0:06:55.079
was able to move a large[br]disk representing an animal

0:06:55.079,0:06:57.424
around so as to cause a small disk,

0:06:57.424,0:07:00.195
representing a tool, to reach into a confined space

0:07:00.195,0:07:01.732
holding a third disk

0:07:01.732,0:07:04.704
and release the third disk[br]from its initially fixed position.

0:07:04.704,0:07:06.893
This tool use ability will have applications

0:07:06.893,0:07:09.252
for smart manufacturing and agriculture.

0:07:09.252,0:07:11.196
In addition, just as some other animals

0:07:11.196,0:07:13.892
are able to cooperate by pulling[br]opposite ends of a rope

0:07:13.892,0:07:15.945
at the same time to release food,

0:07:15.945,0:07:18.240
here we see that Entropica is able to accomplish

0:07:18.240,0:07:20.228
a model version of that task.

0:07:20.228,0:07:22.750
This cooperative ability has interesting implications

0:07:22.750,0:07:26.185
for economic planning and a variety of other fields.

0:07:26.185,0:07:28.256
Entropica is broadly applicable

0:07:28.256,0:07:30.199
to a variety of domains.

0:07:30.199,0:07:32.641
For example, here we see it successfully

0:07:32.641,0:07:35.200
playing a game of pong against itself,

0:07:35.200,0:07:37.543
illustrating its potential for gaming.

0:07:37.543,0:07:39.462
Here we see Entropica orchestrating

0:07:39.462,0:07:41.301
new connections on a social network

0:07:41.301,0:07:44.061
where friends are constantly falling out of touch

0:07:44.061,0:07:46.917
and successfully keeping[br]the network well connected.

0:07:46.917,0:07:49.215
This same network orchestration ability

0:07:49.215,0:07:51.543
also has applications in health care,

0:07:51.543,0:07:54.775
energy, and intelligence.

0:07:54.775,0:07:56.860
Here we see Entropica directing the paths

0:07:56.860,0:07:58.346
of a fleet of ships,

0:07:58.346,0:08:01.521
successfully discovering and[br]utilizing the Panama Canal

0:08:01.521,0:08:03.979
to globally extend its reach from the Atlantic

0:08:03.979,0:08:05.508
to the Pacific.

0:08:05.508,0:08:07.235
By the same token, Entropica

0:08:07.235,0:08:08.855
is broadly applicable to problems

0:08:08.855,0:08:14.157
in autonomous defense, logistics and transportation.

0:08:14.173,0:08:16.203
Finally, here we see Entropica

0:08:16.203,0:08:18.926
spontaneously discovering and executing

0:08:18.926,0:08:20.993
a buy-low, sell-high strategy

0:08:20.993,0:08:23.171
on a simulated range traded stock,

0:08:23.171,0:08:25.502
successfully growing assets under management

0:08:25.502,0:08:26.926
exponentially.

0:08:26.926,0:08:28.234
This risk management ability

0:08:28.234,0:08:30.721
will have broad applications in finance

0:08:30.721,0:08:34.049
and insurance.

0:08:34.049,0:08:36.140
Alex Wissner-Gross: So what you've just seen

0:08:36.140,0:08:40.532
is that a variety of signature human intelligent

0:08:40.532,0:08:42.289
cognitive behaviors

0:08:42.289,0:08:45.120
such as tool use and walking upright

0:08:45.120,0:08:47.149
and social cooperation

0:08:47.149,0:08:50.121
all follow from a single equation,

0:08:50.121,0:08:52.053
which drives a system

0:08:52.053,0:08:55.964
to maximize its future freedom of action.

0:08:55.964,0:08:58.971
Now, there's a profound irony here.

0:08:58.971,0:09:00.995
Going back to the beginning

0:09:00.995,0:09:04.268
of the usage of the term robot,

0:09:04.268,0:09:07.171
the play "RUR,"

0:09:07.171,0:09:09.406
there was always a concept

0:09:09.406,0:09:12.632
that if we developed machine intelligence,

0:09:12.632,0:09:15.659
there would be a cybernetic revolt.

0:09:15.659,0:09:19.210
The machines would rise up against us.

0:09:19.210,0:09:21.529
One major consequence of this work

0:09:21.529,0:09:24.298
is that maybe all of these decades,

0:09:24.298,0:09:27.274
we've had the whole concept of cybernetic revolt

0:09:27.274,0:09:29.285
in reverse.

0:09:29.285,0:09:32.564
It's not that machines first become intelligent

0:09:32.564,0:09:34.579
and then megalomaniacal

0:09:34.579,0:09:36.803
and try to take over the world.

0:09:36.803,0:09:38.237
It's quite the opposite,

0:09:38.237,0:09:41.143
that the urge to take control

0:09:41.143,0:09:43.404
of all possible futures

0:09:43.404,0:09:45.522
is a more fundamental principle

0:09:45.522,0:09:46.885
than that of intelligence,

0:09:46.885,0:09:50.585
that general intelligence may in fact emerge

0:09:50.585,0:09:54.144
directly from this sort of control-grabbing,

0:09:54.144,0:09:58.329
rather than vice versa.

0:09:58.329,0:10:02.098
Another important consequence is goal seeking.

0:10:02.098,0:10:06.458
I'm often asked, how does the ability to seek goals

0:10:06.458,0:10:08.078
follow from this sort of framework?

0:10:08.078,0:10:11.106
And the answer is, the ability to seek goals

0:10:11.106,0:10:12.988
will follow directly from this

0:10:12.988,0:10:14.822
in the following sense:

0:10:14.822,0:10:17.687
just like you would travel through a tunnel,

0:10:17.687,0:10:20.192
a bottleneck in your future path space,

0:10:20.192,0:10:22.063
in order to achieve many other

0:10:22.063,0:10:24.084
diverse objectives later on,

0:10:24.084,0:10:26.456
or just like you would invest

0:10:26.456,0:10:28.243
in a financial security,

0:10:28.243,0:10:30.480
reducing your short-term liquidity

0:10:30.480,0:10:32.880
in order to increase your wealth over the long term,

0:10:32.880,0:10:35.217
goal seeking emerges directly

0:10:35.217,0:10:36.946
from a long-term drive

0:10:36.946,0:10:40.983
to increase future freedom of action.

0:10:40.983,0:10:44.511
Finally, Richard Feynman, famous physicist,

0:10:44.511,0:10:48.183
once wrote that if human civilization were destroyed

0:10:48.183,0:10:50.076
and you could pass only a single concept

0:10:50.076,0:10:51.447
on to our descendants

0:10:51.447,0:10:53.754
to help them rebuild civilization,

0:10:53.754,0:10:55.440
that concept should be

0:10:55.440,0:10:57.292
that all matter around us

0:10:57.292,0:10:59.615
is made out of tiny elements

0:10:59.615,0:11:02.123
that attract each other when they're far apart

0:11:02.123,0:11:05.453
but repel each other when they're close together.

0:11:05.453,0:11:07.234
My equivalent of that statement

0:11:07.234,0:11:08.502
to pass on to descendants

0:11:08.502,0:11:11.214
to help them build artificial intelligences

0:11:11.214,0:11:14.163
or to help them understand human intelligence,

0:11:14.163,0:11:15.430
is the following:

0:11:15.430,0:11:17.483
Intelligence should be viewed

0:11:17.483,0:11:18.896
as a physical process

0:11:18.896,0:11:21.861
that tries to maximize future freedom of action

0:11:21.861,0:11:25.477
and avoid constraints in its own future.

0:11:25.477,0:11:26.835
Thank you very much.

0:11:26.835,0:11:30.835
(Applause)