Return to Video

How to read the genome and build a human being

  • 0:01 - 0:03
    For the next 16 minutes,
    I'm going to take you on a journey
  • 0:03 - 0:06
    that is probably
    the biggest dream of humanity:
  • 0:07 - 0:09
    to understand the code of life.
  • 0:09 - 0:12
    So for me, everything started
    many, many years ago
  • 0:12 - 0:15
    when I met the first 3D printer.
  • 0:15 - 0:16
    The concept was fascinating.
  • 0:16 - 0:18
    A 3D printer needs three elements:
  • 0:18 - 0:22
    a bit of information, some
    raw material, some energy,
  • 0:22 - 0:26
    and it can produce any object
    that was not there before.
  • 0:27 - 0:29
    I was doing physics,
    I was coming back home
  • 0:29 - 0:32
    and I realized that I actually
    always knew a 3D printer.
  • 0:32 - 0:33
    And everyone does.
  • 0:34 - 0:35
    It was my mom.
  • 0:35 - 0:36
    (Laughter)
  • 0:36 - 0:38
    My mom takes three elements:
  • 0:38 - 0:42
    a bit of information, which is between
    my father and my mom in this case,
  • 0:42 - 0:46
    raw elements and energy
    in the same media, that is food,
  • 0:46 - 0:49
    and after several months, produces me.
  • 0:49 - 0:51
    And I was not existent before.
  • 0:51 - 0:54
    So apart from the shock of my mom
    discovering that she was a 3D printer,
  • 0:54 - 0:59
    I immediately got mesmerized
    by that piece,
  • 0:59 - 1:01
    the first one, the information.
  • 1:01 - 1:03
    What amount of information does it take
  • 1:03 - 1:05
    to build and assemble a human?
  • 1:05 - 1:07
    Is it much? Is it little?
  • 1:07 - 1:09
    How many thumb drives can you fill?
  • 1:09 - 1:12
    Well, I was studying physics
    at the beginning
  • 1:12 - 1:17
    and I took this approximation of a human
    as a gigantic Lego piece.
  • 1:17 - 1:21
    So, imagine that the building
    blocks are little atoms
  • 1:21 - 1:26
    and there is a hydrogen here,
    a carbon here, a nitrogen here.
  • 1:26 - 1:27
    So in the first approximation,
  • 1:27 - 1:32
    if I can list the number of atoms
    that compose a human being,
  • 1:32 - 1:33
    I can build it.
  • 1:33 - 1:35
    Now, you can run some numbers
  • 1:35 - 1:38
    and that happens to be
    quite an astonishing number.
  • 1:38 - 1:41
    So the number of atoms,
  • 1:41 - 1:46
    the file that I will save in my thumb
    drive to assemble a little baby,
  • 1:46 - 1:51
    will actually fill an entire Titanic
    of thumb drives --
  • 1:51 - 1:53
    multiplied 2,000 times.
  • 1:54 - 1:57
    This is the miracle of life.
  • 1:57 - 2:00
    Every time you see from now on
    a pregnant lady,
  • 2:00 - 2:03
    she's assembling the biggest
    amount of information
  • 2:03 - 2:04
    that you will ever encounter.
  • 2:04 - 2:07
    Forget big data, forget
    anything you heard of.
  • 2:07 - 2:10
    This is the biggest amount
    of information that exists.
  • 2:10 - 2:14
    (Applause)
  • 2:14 - 2:19
    But nature, fortunately, is much smarter
    than a young physicist,
  • 2:19 - 2:22
    and in four billion years, managed
    to pack this information
  • 2:22 - 2:25
    in a small crystal we call DNA.
  • 2:26 - 2:30
    We met it for the first time in 1950
    when Rosalind Franklin,
  • 2:30 - 2:31
    an amazing scientist, a woman,
  • 2:32 - 2:33
    took a picture of it.
  • 2:33 - 2:38
    But it took us more than 40 years
    to finally poke inside a human cell,
  • 2:38 - 2:40
    take out this crystal,
  • 2:40 - 2:43
    unroll it, and read it for the first time.
  • 2:44 - 2:47
    The code comes out to be
    a fairly simple alphabet,
  • 2:47 - 2:51
    four letters: A, T, C and G.
  • 2:51 - 2:54
    And to build a human,
    you need three billion of them.
  • 2:55 - 2:56
    Three billion.
  • 2:56 - 2:58
    How many are three billion?
  • 2:58 - 3:01
    It doesn't really make
    any sense as a number, right?
  • 3:01 - 3:05
    So I was thinking how
    I could explain myself better
  • 3:05 - 3:08
    about how big and enormous this code is.
  • 3:08 - 3:11
    But there is -- I mean,
    I'm going to have some help,
  • 3:11 - 3:14
    and the best person to help me
    introduce the code
  • 3:14 - 3:18
    is actually the first man
    to sequence it, Dr. Craig Venter.
  • 3:18 - 3:21
    So welcome onstage, Dr. Craig Venter.
  • 3:21 - 3:28
    (Applause)
  • 3:28 - 3:30
    Not the man in the flesh,
  • 3:31 - 3:34
    but for the first time in history,
  • 3:34 - 3:37
    this is the genome of a specific human,
  • 3:37 - 3:41
    printed page-by-page, letter-by-letter:
  • 3:41 - 3:45
    262,000 pages of information,
  • 3:45 - 3:49
    450 kilograms, shipped
    from the United States to Canada
  • 3:49 - 3:54
    thanks to Bruno Bowden,
    Lulu.com, a start-up, did everything.
  • 3:54 - 3:56
    It was an amazing feat.
  • 3:56 - 4:00
    But this is the visual perception
    of what is the code of life.
  • 4:00 - 4:03
    And now, for the first time,
    I can do something fun.
  • 4:03 - 4:05
    I can actually poke inside it and read.
  • 4:05 - 4:10
    So let me take an interesting
    book ... like this one.
  • 4:13 - 4:16
    I have an annotation;
    it's a fairly big book.
  • 4:16 - 4:19
    So just to let you see
    what is the code of life.
  • 4:21 - 4:24
    Thousands and thousands and thousands
  • 4:24 - 4:27
    and millions of letters.
  • 4:27 - 4:29
    And they apparently make sense.
  • 4:29 - 4:31
    Let's get to a specific part.
  • 4:32 - 4:33
    Let me read it to you:
  • 4:33 - 4:34
    (Laughter)
  • 4:34 - 4:38
    "AAG, AAT, ATA."
  • 4:39 - 4:41
    To you it sounds like mute letters,
  • 4:41 - 4:45
    but this sequence gives
    the color of the eyes to Craig.
  • 4:46 - 4:48
    I'll show you another part of the book.
  • 4:48 - 4:50
    This is actually a little
    more complicated.
  • 4:51 - 4:54
    Chromosome 14, book 132:
  • 4:54 - 4:56
    (Laughter)
  • 4:56 - 4:57
    As you might expect.
  • 4:57 - 5:01
    (Laughter)
  • 5:03 - 5:07
    "ATT, CTT, GATT."
  • 5:08 - 5:10
    This human is lucky,
  • 5:10 - 5:15
    because if you miss just
    two letters in this position --
  • 5:15 - 5:16
    two letters of our three billion --
  • 5:16 - 5:19
    he will be condemned
    to a terrible disease:
  • 5:19 - 5:20
    cystic fibrosis.
  • 5:20 - 5:23
    We have no cure for it,
    we don't know how to solve it,
  • 5:23 - 5:27
    and it's just two letters
    of difference from what we are.
  • 5:28 - 5:30
    A wonderful book, a mighty book,
  • 5:31 - 5:33
    a mighty book that helped me understand
  • 5:33 - 5:36
    and show you something quite remarkable.
  • 5:36 - 5:41
    Every one of you -- what makes
    me, me and you, you --
  • 5:41 - 5:44
    is just about five million of these,
  • 5:44 - 5:45
    half a book.
  • 5:46 - 5:48
    For the rest,
  • 5:48 - 5:50
    we are all absolutely identical.
  • 5:51 - 5:55
    Five hundred pages
    is the miracle of life that you are.
  • 5:55 - 5:58
    The rest, we all share it.
  • 5:58 - 6:01
    So think about that again
    when we think that we are different.
  • 6:01 - 6:03
    This is the amount that we share.
  • 6:03 - 6:07
    So now that I have your attention,
  • 6:07 - 6:08
    the next question is:
  • 6:08 - 6:09
    How do I read it?
  • 6:09 - 6:11
    How do I make sense out of it?
  • 6:11 - 6:16
    Well, for however good you can be
    at assembling Swedish furniture,
  • 6:16 - 6:19
    this instruction manual
    is nothing you can crack in your life.
  • 6:19 - 6:21
    (Laughter)
  • 6:21 - 6:24
    And so, in 2014, two famous TEDsters,
  • 6:24 - 6:27
    Peter Diamandis and Craig Venter himself,
  • 6:27 - 6:29
    decided to assemble a new company.
  • 6:29 - 6:30
    Human Longevity was born,
  • 6:30 - 6:31
    with one mission:
  • 6:31 - 6:33
    trying everything we can try
  • 6:33 - 6:36
    and learning everything
    we can learn from these books,
  • 6:36 - 6:38
    with one target --
  • 6:39 - 6:42
    making real the dream
    of personalized medicine,
  • 6:42 - 6:45
    understanding what things
    should be done to have better health
  • 6:45 - 6:48
    and what are the secrets in these books.
  • 6:48 - 6:53
    An amazing team, 40 data scientists
    and many, many more people,
  • 6:53 - 6:54
    a pleasure to work with.
  • 6:54 - 6:56
    The concept is actually very simple.
  • 6:56 - 6:59
    We're going to use a technology
    called machine learning.
  • 6:59 - 7:04
    On one side, we have genomes --
    thousands of them.
  • 7:04 - 7:08
    On the other side, we collected
    the biggest database of human beings:
  • 7:08 - 7:12
    phenotypes, 3D scan, NMR --
    everything you can think of.
  • 7:12 - 7:15
    Inside there, on these two opposite sides,
  • 7:15 - 7:18
    there is the secret of translation.
  • 7:18 - 7:20
    And in the middle, we build a machine.
  • 7:21 - 7:23
    We build a machine
    and we train a machine --
  • 7:23 - 7:26
    well, not exactly one machine,
    many, many machines --
  • 7:26 - 7:31
    to try to understand and translate
    the genome in a phenotype.
  • 7:31 - 7:35
    What are those letters,
    and what do they do?
  • 7:35 - 7:37
    It's an approach that can
    be used for everything,
  • 7:37 - 7:40
    but using it in genomics
    is particularly complicated.
  • 7:41 - 7:44
    Little by little we grew and we wanted
    to build different challenges.
  • 7:44 - 7:47
    We started from the beginning,
    from common traits.
  • 7:47 - 7:49
    Common traits are comfortable
    because they are common,
  • 7:49 - 7:50
    everyone has them.
  • 7:50 - 7:53
    So we started to ask our questions:
  • 7:53 - 7:54
    Can we predict height?
  • 7:55 - 7:57
    Can we read the books
    and predict your height?
  • 7:57 - 7:58
    Well, we actually can,
  • 7:58 - 8:00
    with five centimeters of precision.
  • 8:00 - 8:03
    BMI is fairly connected to your lifestyle,
  • 8:03 - 8:07
    but we still can, we get in the ballpark,
    eight kilograms of precision.
  • 8:07 - 8:08
    Can we predict eye color?
  • 8:08 - 8:10
    Yeah, we can.
  • 8:10 - 8:11
    Eighty percent accuracy.
  • 8:11 - 8:13
    Can we predict skin color?
  • 8:13 - 8:16
    Yeah we can, 80 percent accuracy.
  • 8:16 - 8:17
    Can we predict age?
  • 8:18 - 8:22
    We can, because apparently,
    the code changes during your life.
  • 8:22 - 8:25
    It gets shorter, you lose pieces,
    it gets insertions.
  • 8:25 - 8:28
    We read the signals, and we make a model.
  • 8:28 - 8:30
    Now, an interesting challenge:
  • 8:30 - 8:32
    Can we predict a human face?
  • 8:33 - 8:34
    It's a little complicated,
  • 8:34 - 8:38
    because a human face is scattered
    among millions of these letters.
  • 8:38 - 8:40
    And a human face is not
    a very well-defined object.
  • 8:40 - 8:42
    So, we had to build an entire tier of it
  • 8:42 - 8:45
    to learn and teach
    a machine what a face is,
  • 8:45 - 8:47
    and embed and compress it.
  • 8:47 - 8:49
    And if you're comfortable
    with machine learning,
  • 8:49 - 8:52
    you understand what the challenge is here.
  • 8:52 - 8:58
    Now, after 15 years -- 15 years after
    we read the first sequence --
  • 8:58 - 9:01
    this October, we started
    to see some signals.
  • 9:01 - 9:04
    And it was a very emotional moment.
  • 9:04 - 9:07
    What you see here is a subject
    coming in our lab.
  • 9:08 - 9:10
    This is a face for us.
  • 9:10 - 9:13
    So we take the real face of a subject,
    we reduce the complexity,
  • 9:13 - 9:15
    because not everything is in your face --
  • 9:15 - 9:19
    lots of features and defects
    and asymmetries come from your life.
  • 9:19 - 9:22
    We symmetrize the face,
    and we run our algorithm.
  • 9:23 - 9:25
    The results that I show you right now,
  • 9:25 - 9:29
    this is the prediction we have
    from the blood.
  • 9:30 - 9:31
    (Applause)
  • 9:31 - 9:33
    Wait a second.
  • 9:33 - 9:37
    In these seconds, your eyes are watching,
    left and right, left and right,
  • 9:37 - 9:41
    and your brain wants
    those pictures to be identical.
  • 9:41 - 9:44
    So I ask you to do
    another exercise, to be honest.
  • 9:44 - 9:46
    Please search for the differences,
  • 9:46 - 9:47
    which are many.
  • 9:47 - 9:50
    The biggest amount of signal
    comes from gender,
  • 9:50 - 9:55
    then there is age, BMI,
    the ethnicity component of a human.
  • 9:55 - 9:59
    And scaling up over that signal
    is much more complicated.
  • 9:59 - 10:02
    But what you see here,
    even in the differences,
  • 10:02 - 10:06
    lets you understand
    that we are in the right ballpark,
  • 10:06 - 10:07
    that we are getting closer.
  • 10:07 - 10:10
    And it's already giving you some emotions.
  • 10:10 - 10:12
    This is another subject
    that comes in place,
  • 10:12 - 10:14
    and this is a prediction.
  • 10:14 - 10:18
    A little smaller face, we didn't get
    the complete cranial structure,
  • 10:18 - 10:21
    but still, it's in the ballpark.
  • 10:22 - 10:24
    This is a subject that comes in our lab,
  • 10:24 - 10:25
    and this is the prediction.
  • 10:26 - 10:31
    So these people have never been seen
    in the training of the machine.
  • 10:31 - 10:34
    These are the so-called "held-out" set.
  • 10:34 - 10:37
    But these are people that you will
    probably never believe.
  • 10:37 - 10:40
    We're publishing everything
    in a scientific publication,
  • 10:40 - 10:41
    you can read it.
  • 10:41 - 10:44
    But since we are onstage,
    Chris challenged me.
  • 10:44 - 10:47
    I probably exposed myself
    and tried to predict
  • 10:47 - 10:50
    someone that you might recognize.
  • 10:50 - 10:55
    So, in this vial of blood --
    and believe me, you have no idea
  • 10:55 - 10:58
    what we had to do to have
    this blood now, here --
  • 10:58 - 11:02
    in this vial of blood is the amount
    of biological information
  • 11:02 - 11:04
    that we need to do a full genome sequence.
  • 11:04 - 11:06
    We just need this amount.
  • 11:07 - 11:10
    We ran this sequence,
    and I'm going to do it with you.
  • 11:10 - 11:14
    And we start to layer up
    all the understanding we have.
  • 11:14 - 11:17
    In the vial of blood,
    we predicted he's a male.
  • 11:17 - 11:18
    And the subject is a male.
  • 11:19 - 11:21
    We predict that he's a meter and 76 cm.
  • 11:21 - 11:24
    The subject is a meter and 77 cm.
  • 11:24 - 11:28
    So, we predicted that he's 76;
    the subject is 82.
  • 11:29 - 11:31
    We predict his age, 38.
  • 11:31 - 11:33
    The subject is 35.
  • 11:34 - 11:36
    We predict his eye color.
  • 11:37 - 11:38
    Too dark.
  • 11:38 - 11:40
    We predict his skin color.
  • 11:40 - 11:41
    We are almost there.
  • 11:42 - 11:43
    That's his face.
  • 11:45 - 11:48
    Now, the reveal moment:
  • 11:48 - 11:50
    the subject is this person.
  • 11:50 - 11:52
    (Laughter)
  • 11:52 - 11:54
    And I did it intentionally.
  • 11:54 - 11:58
    I am a very particular
    and peculiar ethnicity.
  • 11:58 - 12:01
    Southern European, Italians --
    they never fit in models.
  • 12:01 - 12:06
    And it's particular -- that ethnicity
    is a complex corner case for our model.
  • 12:06 - 12:08
    But there is another point.
  • 12:08 - 12:11
    So, one of the things that we use
    a lot to recognize people
  • 12:11 - 12:13
    will never be written in the genome.
  • 12:13 - 12:15
    It's our free will, it's how I look.
  • 12:15 - 12:18
    Not my haircut in this case,
    but my beard cut.
  • 12:19 - 12:22
    So I'm going to show you, I'm going to,
    in this case, transfer it --
  • 12:22 - 12:25
    and this is nothing more
    than Photoshop, no modeling --
  • 12:25 - 12:27
    the beard on the subject.
  • 12:27 - 12:30
    And immediately, we get
    much, much better in the feeling.
  • 12:31 - 12:34
    So, why do we do this?
  • 12:36 - 12:41
    We certainly don't do it
    for predicting height
  • 12:41 - 12:43
    or taking a beautiful picture
    out of your blood.
  • 12:44 - 12:48
    We do it because the same technology
    and the same approach,
  • 12:48 - 12:51
    the machine learning of this code,
  • 12:51 - 12:54
    is helping us to understand how we work,
  • 12:54 - 12:56
    how your body works,
  • 12:56 - 12:57
    how your body ages,
  • 12:57 - 13:00
    how disease generates in your body,
  • 13:00 - 13:03
    how your cancer grows and develops,
  • 13:03 - 13:05
    how drugs work
  • 13:05 - 13:07
    and if they work on your body.
  • 13:08 - 13:09
    This is a huge challenge.
  • 13:10 - 13:12
    This is a challenge that we share
  • 13:12 - 13:14
    with thousands of other
    researchers around the world.
  • 13:14 - 13:16
    It's called personalized medicine.
  • 13:17 - 13:21
    It's the ability to move
    from a statistical approach
  • 13:21 - 13:23
    where you're a dot in the ocean,
  • 13:23 - 13:24
    to a personalized approach,
  • 13:25 - 13:27
    where we read all these books
  • 13:27 - 13:30
    and we get an understanding
    of exactly how you are.
  • 13:30 - 13:34
    But it is a particularly
    complicated challenge,
  • 13:34 - 13:38
    because of all these books, as of today,
  • 13:38 - 13:40
    we just know probably two percent:
  • 13:41 - 13:45
    four books of more than 175.
  • 13:46 - 13:49
    And this is not the topic of my talk,
  • 13:50 - 13:53
    because we will learn more.
  • 13:53 - 13:56
    There are the best minds
    in the world on this topic.
  • 13:57 - 13:59
    The prediction will get better,
  • 13:59 - 14:01
    the model will get more precise.
  • 14:01 - 14:03
    And the more we learn,
  • 14:03 - 14:08
    the more we will
    be confronted with decisions
  • 14:08 - 14:11
    that we never had to face before
  • 14:11 - 14:12
    about life,
  • 14:12 - 14:14
    about death,
  • 14:14 - 14:16
    about parenting.
  • 14:21 - 14:25
    So, we are touching the very
    inner detail on how life works.
  • 14:26 - 14:29
    And it's a revolution
    that cannot be confined
  • 14:29 - 14:32
    in the domain of science or technology.
  • 14:33 - 14:35
    This must be a global conversation.
  • 14:36 - 14:41
    We must start to think of the future
    we're building as a humanity.
  • 14:41 - 14:45
    We need to interact with creatives,
    with artists, with philosophers,
  • 14:45 - 14:47
    with politicians.
  • 14:47 - 14:48
    Everyone is involved,
  • 14:48 - 14:51
    because it's the future of our species.
  • 14:51 - 14:55
    Without fear, but with the understanding
  • 14:55 - 14:59
    that the decisions
    that we make in the next year
  • 14:59 - 15:03
    will change the course of history forever.
  • 15:04 - 15:05
    Thank you.
  • 15:05 - 15:09
    (Applause)
Title:
How to read the genome and build a human being
Speaker:
Riccardo Sabatini
Description:

more » « less
Video Language:
English
Team:
closed TED
Project:
TEDTalks
Duration:
15:28

English subtitles

Revisions Compare revisions