Return to Video

How to read the genome and build a human being

  • 0:01 - 0:03
    For the next 16 minutes,
    I'm going to take you on a journey
  • 0:03 - 0:06
    that is probably
    the biggest dream of humanity:
  • 0:07 - 0:09
    to understand the code of life.
  • 0:09 - 0:12
    So for me, everything started
    many, many years ago
  • 0:12 - 0:15
    when I met the first 3D printer.
  • 0:15 - 0:16
    The concept was fascinating.
  • 0:16 - 0:18
    A 3D printer needs three elements:
  • 0:18 - 0:22
    a bit of information, some
    raw material, some energy,
  • 0:22 - 0:26
    and it can produce any object
    that was not there before.
  • 0:27 - 0:29
    I was doing physics,
    I was coming back home
  • 0:29 - 0:32
    and I realized that I actually
    always knew a 3D printer.
  • 0:32 - 0:33
    And everyone does.
  • 0:34 - 0:35
    It was my mom.
  • 0:35 - 0:36
    (Laughter)
  • 0:36 - 0:38
    My mom takes three elements:
  • 0:38 - 0:42
    a bit of information, which is between
    my father and my mom in this case,
  • 0:42 - 0:46
    raw elements and energy
    in the same media, that is food,
  • 0:46 - 0:49
    and after several months, produces me.
  • 0:49 - 0:51
    And I was not existent before.
  • 0:51 - 0:54
    So apart from the shock of my mom
    discovering that she was a 3D printer,
  • 0:54 - 0:59
    I immediately got mesmerized
    by that piece,
  • 0:59 - 1:01
    the first one, the information.
  • 1:01 - 1:03
    What amount of information does it take
  • 1:03 - 1:05
    to build and assemble a human?
  • 1:05 - 1:07
    Is it much? Is it little?
  • 1:07 - 1:09
    How many thumb drives can you fill?
  • 1:09 - 1:12
    Well, I was studying physics
    at the beginning
  • 1:12 - 1:17
    and I took this approximation of a human
    as a gigantic Lego piece.
  • 1:17 - 1:21
    So, imagine that the building
    blocks are little atoms
  • 1:21 - 1:26
    and there is a hydrogen here,
    a carbon here, a nitrogen here.
  • 1:26 - 1:27
    So in the first approximation,
  • 1:27 - 1:32
    if I can list the number of atoms
    that compose a human being,
  • 1:32 - 1:33
    I can build it.
  • 1:33 - 1:35
    Now, you can run some numbers
  • 1:35 - 1:38
    and that happens to be
    quite an astonishing number.
  • 1:38 - 1:41
    So the number of atoms,
  • 1:41 - 1:46
    the file that I will save in my thumb
    drive to assemble a little baby,
  • 1:46 - 1:51
    will actually fill an entire Titanic
    of thumb drives --
  • 1:51 - 1:53
    multiplied 2,000 times.
  • 1:54 - 1:57
    This is the miracle of life.
  • 1:57 - 2:00
    Every time you see from now on
    a pregnant lady,
  • 2:00 - 2:03
    she's assembling the biggest
    amount of information
  • 2:03 - 2:04
    that you will ever encounter.
  • 2:04 - 2:07
    Forget big data, forget
    anything you heard of.
  • 2:07 - 2:10
    This is the biggest amount
    of information that exists.
  • 2:10 - 2:14
    (Applause)
  • 2:14 - 2:19
    But nature, fortunately, is much smarter
    than a young physicist,
  • 2:19 - 2:22
    and in four billion years, managed
    to pack this information
  • 2:22 - 2:25
    in a small crystal we call DNA.
  • 2:26 - 2:30
    We met it for the first time in 1950
    when Rosalind Franklin,
  • 2:30 - 2:31
    an amazing scientist, a woman,
  • 2:32 - 2:33
    took a picture of it.
  • 2:33 - 2:38
    But it took us more than 40 years
    to finally poke inside a human cell,
  • 2:38 - 2:40
    take out this crystal,
  • 2:40 - 2:43
    unroll it, and read it for the first time.
  • 2:44 - 2:47
    The code comes out to be
    a fairly simple alphabet,
  • 2:47 - 2:51
    four letters: A, T, C and G.
  • 2:51 - 2:54
    And to build a human,
    you need three billion of them.
  • 2:55 - 2:56
    Three billion.
  • 2:56 - 2:58
    How many are three billion?
  • 2:58 - 3:01
    It doesn't really make
    any sense as a number, right?
  • 3:01 - 3:05
    So I was thinking how
    I could explain myself better
  • 3:05 - 3:08
    about how big and enormous this code is.
  • 3:08 - 3:11
    But there is -- I mean,
    I'm going to have some help,
  • 3:11 - 3:14
    and the best person to help me
    introduce the code
  • 3:14 - 3:18
    is actually the first man
    to sequence it, Dr. Craig Venter.
  • 3:18 - 3:21
    So welcome onstage, Dr. Craig Venter.
  • 3:21 - 3:28
    (Applause)
  • 3:28 - 3:30
    Not the man in the flesh,
  • 3:31 - 3:34
    but for the first time in history,
  • 3:34 - 3:37
    this is the genome of a specific human,
  • 3:37 - 3:41
    printed page-by-page, letter-by-letter:
  • 3:41 - 3:45
    262,000 pages of information,
  • 3:45 - 3:49
    450 kilograms, shipped
    from the United States to Canada
  • 3:49 - 3:54
    thanks to Bruno Bowden,
    Lulu.com, a start-up, did everything.
  • 3:54 - 3:56
    It was an amazing feat.
  • 3:56 - 4:00
    But this is the visual perception
    of what is the code of life.
  • 4:00 - 4:03
    And now, for the first time,
    I can do something fun.
  • 4:03 - 4:05
    I can actually poke inside it and read.
  • 4:05 - 4:10
    So let me take an interesting
    book ... like this one.
  • 4:13 - 4:16
    I have an annotation;
    it's a fairly big book.
  • 4:16 - 4:19
    So just to let you see
    what is the code of life.
  • 4:21 - 4:24
    Thousands and thousands and thousands
  • 4:24 - 4:27
    and millions of letters.
  • 4:27 - 4:29
    And they apparently make sense.
  • 4:29 - 4:31
    Let's get to a specific part.
  • 4:32 - 4:33
    Let me read it to you:
  • 4:33 - 4:34
    (Laughter)
  • 4:34 - 4:38
    "AAG, AAT, ATA."
  • 4:39 - 4:41
    To you it sounds like mute letters,
  • 4:41 - 4:45
    but this sequence gives
    the color of the eyes to Craig.
  • 4:46 - 4:48
    I'll show you another part of the book.
  • 4:48 - 4:50
    This is actually a little
    more complicated.
  • 4:51 - 4:54
    Chromosome 14, book 132:
  • 4:54 - 4:56
    (Laughter)
  • 4:56 - 4:57
    As you might expect.
  • 4:57 - 5:01
    (Laughter)
  • 5:03 - 5:07
    "ATT, CTT, GATT."
  • 5:08 - 5:10
    This human is lucky,
  • 5:10 - 5:15
    because if you miss just
    two letters in this position --
  • 5:15 - 5:16
    two letters of our three billion --
  • 5:16 - 5:19
    he will be condemned
    to a terrible disease:
  • 5:19 - 5:20
    cystic fibrosis.
  • 5:20 - 5:23
    We have no cure for it,
    we don't know how to solve it,
  • 5:23 - 5:27
    and it's just two letters
    of difference from what we are.
  • 5:28 - 5:30
    A wonderful book, a mighty book,
  • 5:31 - 5:33
    a mighty book that helped me understand
  • 5:33 - 5:36
    and show you something quite remarkable.
  • 5:36 - 5:41
    Every one of you -- what makes
    me, me and you, you --
  • 5:41 - 5:44
    is just about five million of these,
  • 5:44 - 5:45
    half a book.
  • 5:46 - 5:48
    For the rest,
  • 5:48 - 5:50
    we are all absolutely identical.
  • 5:51 - 5:55
    Five hundred pages
    is the miracle of life that you are.
  • 5:55 - 5:58
    The rest, we all share it.
  • 5:58 - 6:01
    So think about that again
    when we think that we are different.
  • 6:01 - 6:03
    This is the amount that we share.
  • 6:03 - 6:07
    So now that I have your attention,
  • 6:07 - 6:08
    the next question is:
  • 6:08 - 6:09
    How do I read it?
  • 6:09 - 6:11
    How do I make sense out of it?
  • 6:11 - 6:16
    Well, for however good you can be
    at assembling Swedish furniture,
  • 6:16 - 6:19
    this instruction manual
    is nothing you can crack in your life.
  • 6:19 - 6:21
    (Laughter)
  • 6:21 - 6:24
    And so, in 2014, two famous TEDsters,
  • 6:24 - 6:27
    Peter Diamandis and Craig Venter himself,
  • 6:27 - 6:29
    decided to assemble a new company.
  • 6:29 - 6:30
    Human Longevity was born,
  • 6:30 - 6:31
    with one mission:
  • 6:31 - 6:33
    trying everything we can try
  • 6:33 - 6:36
    and learning everything
    we can learn from these books,
  • 6:36 - 6:38
    with one target --
  • 6:39 - 6:42
    making real the dream
    of personalized medicine,
  • 6:42 - 6:45
    understanding what things
    should be done to have better health
  • 6:45 - 6:48
    and what are the secrets in these books.
  • 6:48 - 6:53
    An amazing team, 40 data scientists
    and many, many more people,
  • 6:53 - 6:54
    a pleasure to work with.
  • 6:54 - 6:56
    The concept is actually very simple.
  • 6:56 - 6:59
    We're going to use a technology
    called machine learning.
  • 6:59 - 7:04
    On one side, we have genomes --
    thousands of them.
  • 7:04 - 7:08
    On the other side, we collected
    the biggest database of human beings:
  • 7:08 - 7:12
    phenotypes, 3D scan, NMR --
    everything you can think of.
  • 7:12 - 7:15
    Inside there, on these two opposite sides,
  • 7:15 - 7:18
    there is the secret of translation.
  • 7:18 - 7:20
    And in the middle, we build a machine.
  • 7:21 - 7:23
    We build a machine
    and we train a machine --
  • 7:23 - 7:26
    well, not exactly one machine,
    many, many machines --
  • 7:26 - 7:31
    to try to understand and translate
    the genome in a phenotype.
  • 7:31 - 7:35
    What are those letters,
    and what do they do?
  • 7:35 - 7:37
    It's an approach that can
    be used for everything,
  • 7:37 - 7:40
    but using it in genomics
    is particularly complicated.
  • 7:41 - 7:44
    Little by little we grew and we wanted
    to build different challenges.
  • 7:44 - 7:47
    We started from the beginning,
    from common traits.
  • 7:47 - 7:49
    Common traits are comfortable
    because they are common,
  • 7:49 - 7:50
    everyone has them.
  • 7:50 - 7:53
    So we started to ask our questions:
  • 7:53 - 7:54
    Can we predict height?
  • 7:55 - 7:57
    Can we read the books
    and predict your height?
  • 7:57 - 7:58
    Well, we actually can,
  • 7:58 - 8:00
    with five centimeters of precision.
  • 8:00 - 8:03
    BMI is fairly connected to your lifestyle,
  • 8:03 - 8:07
    but we still can, we get in the ballpark,
    eight kilograms of precision.
  • 8:07 - 8:08
    Can we predict eye color?
  • 8:08 - 8:10
    Yeah, we can.
  • 8:10 - 8:11
    Eighty percent accuracy.
  • 8:11 - 8:13
    Can we predict skin color?
  • 8:13 - 8:16
    Yeah we can, 80 percent accuracy.
  • 8:16 - 8:17
    Can we predict age?
  • 8:18 - 8:22
    We can, because apparently,
    the code changes during your life.
  • 8:22 - 8:25
    It gets shorter, you lose pieces,
    it gets insertions.
  • 8:25 - 8:28
    We read the signals, and we make a model.
  • 8:28 - 8:30
    Now, an interesting challenge:
  • 8:30 - 8:32
    Can we predict a human face?
  • 8:33 - 8:34
    It's a little complicated,
  • 8:34 - 8:38
    because a human face is scattered
    among millions of these letters.
  • 8:38 - 8:40
    And a human face is not
    a very well-defined object.
  • 8:40 - 8:42
    So, we had to build an entire tier of it
  • 8:42 - 8:45
    to learn and teach
    a machine what a face is,
  • 8:45 - 8:47
    and embed and compress it.
  • 8:47 - 8:49
    And if you're comfortable
    with machine learning,
  • 8:49 - 8:52
    you understand what the challenge is here.
  • 8:52 - 8:58
    Now, after 15 years -- 15 years after
    we read the first sequence --
  • 8:58 - 9:01
    this October, we started
    to see some signals.
  • 9:01 - 9:04
    And it was a very emotional moment.
  • 9:04 - 9:07
    What you see here is a subject
    coming in our lab.
  • 9:08 - 9:10
    This is a face for us.
  • 9:10 - 9:13
    So we take the real face of a subject,
    we reduce the complexity,
  • 9:13 - 9:15
    because not everything is in your face --
  • 9:15 - 9:19
    lots of features and defects
    and asymmetries come from your life.
  • 9:19 - 9:22
    We symmetrize the face,
    and we run our algorithm.
  • 9:23 - 9:25
    The results that I show you right now,
  • 9:25 - 9:29
    this is the prediction we have
    from the blood.
  • 9:30 - 9:31
    (Applause)
  • 9:31 - 9:33
    Wait a second.
  • 9:33 - 9:37
    In these seconds, your eyes are watching,
    left and right, left and right,
  • 9:37 - 9:41
    and your brain wants
    those pictures to be identical.
  • 9:41 - 9:44
    So I ask you to do
    another exercise, to be honest.
  • 9:44 - 9:46
    Please search for the differences,
  • 9:46 - 9:47
    which are many.
  • 9:47 - 9:50
    The biggest amount of signal
    comes from gender,
  • 9:50 - 9:55
    then there is age, BMI,
    the ethnicity component of a human.
  • 9:55 - 9:59
    And scaling up over that signal
    is much more complicated.
  • 9:59 - 10:02
    But what you see here,
    even in the differences,
  • 10:02 - 10:06
    lets you understand
    that we are in the right ballpark,
  • 10:06 - 10:07
    that we are getting closer.
  • 10:07 - 10:10
    And it's already giving you some emotions.
  • 10:10 - 10:12
    This is another subject
    that comes in place,
  • 10:12 - 10:14
    and this is a prediction.
  • 10:14 - 10:18
    A little smaller face, we didn't get
    the complete cranial structure,
  • 10:18 - 10:21
    but still, it's in the ballpark.
  • 10:22 - 10:24
    This is a subject that comes in our lab,
  • 10:24 - 10:25
    and this is the prediction.
  • 10:26 - 10:31
    So these people have never been seen
    in the training of the machine.
  • 10:31 - 10:34
    These are the so-called "held-out" set.
  • 10:34 - 10:37
    But these are people that you will
    probably never believe.
  • 10:37 - 10:40
    We're publishing everything
    in a scientific publication,
  • 10:40 - 10:41
    you can read it.
  • 10:41 - 10:44
    But since we are onstage,
    Chris challenged me.
  • 10:44 - 10:47
    I probably exposed myself
    and tried to predict
  • 10:47 - 10:50
    someone that you might recognize.
  • 10:50 - 10:55
    So, in this vial of blood --
    and believe me, you have no idea
  • 10:55 - 10:58
    what we had to do to have
    this blood now, here --
  • 10:58 - 11:02
    in this vial of blood is the amount
    of biological information
  • 11:02 - 11:04
    that we need to do a full genome sequence.
  • 11:04 - 11:06
    We just need this amount.
  • 11:07 - 11:10
    We ran this sequence,
    and I'm going to do it with you.
  • 11:10 - 11:14
    And we start to layer up
    all the understanding we have.
  • 11:14 - 11:17
    In the vial of blood,
    we predicted he's a male.
  • 11:17 - 11:18
    And the subject is a male.
  • 11:19 - 11:21
    We predict that he's a meter and 76 cm.
  • 11:21 - 11:24
    The subject is a meter and 77 cm.
  • 11:24 - 11:28
    So, we predicted that he's 76;
    the subject is 82.
  • 11:29 - 11:31
    We predict his age, 38.
  • 11:31 - 11:33
    The subject is 35.
  • 11:34 - 11:36
    We predict his eye color.
  • 11:37 - 11:38
    Too dark.
  • 11:38 - 11:40
    We predict his skin color.
  • 11:40 - 11:41
    We are almost there.
  • 11:42 - 11:43
    That's his face.
  • 11:45 - 11:48
    Now, the reveal moment:
  • 11:48 - 11:50
    the subject is this person.
  • 11:50 - 11:52
    (Laughter)
  • 11:52 - 11:54
    And I did it intentionally.
  • 11:54 - 11:58
    I am a very particular
    and peculiar ethnicity.
  • 11:58 - 12:01
    Southern European, Italians --
    they never fit in models.
  • 12:01 - 12:06
    And it's particular -- that ethnicity
    is a complex corner case for our model.
  • 12:06 - 12:08
    But there is another point.
  • 12:08 - 12:11
    So, one of the things that we use
    a lot to recognize people
  • 12:11 - 12:13
    will never be written in the genome.
  • 12:13 - 12:15
    It's our free will, it's how I look.
  • 12:15 - 12:18
    Not my haircut in this case,
    but my beard cut.
  • 12:19 - 12:22
    So I'm going to show you, I'm going to,
    in this case, transfer it --
  • 12:22 - 12:25
    and this is nothing more
    than Photoshop, no modeling --
  • 12:25 - 12:27
    the beard on the subject.
  • 12:27 - 12:30
    And immediately, we get
    much, much better in the feeling.
  • 12:31 - 12:34
    So, why do we do this?
  • 12:36 - 12:41
    We certainly don't do it
    for predicting height
  • 12:41 - 12:43
    or taking a beautiful picture
    out of your blood.
  • 12:44 - 12:48
    We do it because the same technology
    and the same approach,
  • 12:48 - 12:51
    the machine learning of this code,
  • 12:51 - 12:54
    is helping us to understand how we work,
  • 12:54 - 12:56
    how your body works,
  • 12:56 - 12:57
    how your body ages,
  • 12:57 - 13:00
    how disease generates in your body,
  • 13:00 - 13:03
    how your cancer grows and develops,
  • 13:03 - 13:05
    how drugs work
  • 13:05 - 13:07
    and if they work on your body.
  • 13:08 - 13:09
    This is a huge challenge.
  • 13:10 - 13:12
    This is a challenge that we share
  • 13:12 - 13:14
    with thousands of other
    researchers around the world.
  • 13:14 - 13:16
    It's called personalized medicine.
  • 13:17 - 13:21
    It's the ability to move
    from a statistical approach
  • 13:21 - 13:23
    where you're a dot in the ocean,
  • 13:23 - 13:24
    to a personalized approach,
  • 13:25 - 13:27
    where we read all these books
  • 13:27 - 13:30
    and we get an understanding
    of exactly how you are.
  • 13:30 - 13:34
    But it is a particularly
    complicated challenge,
  • 13:34 - 13:38
    because of all these books, as of today,
  • 13:38 - 13:40
    we just know probably two percent:
  • 13:41 - 13:45
    four books of more than 175.
  • 13:46 - 13:49
    And this is not the topic of my talk,
  • 13:50 - 13:53
    because we will learn more.
  • 13:53 - 13:56
    There are the best minds
    in the world on this topic.
  • 13:57 - 13:59
    The prediction will get better,
  • 13:59 - 14:01
    the model will get more precise.
  • 14:01 - 14:03
    And the more we learn,
  • 14:03 - 14:08
    the more we will
    be confronted with decisions
  • 14:08 - 14:11
    that we never had to face before
  • 14:11 - 14:12
    about life,
  • 14:12 - 14:14
    about death,
  • 14:14 - 14:16
    about parenting.
  • 14:21 - 14:25
    So, we are touching the very
    inner detail on how life works.
  • 14:26 - 14:29
    And it's a revolution
    that cannot be confined
  • 14:29 - 14:32
    in the domain of science or technology.
  • 14:33 - 14:35
    This must be a global conversation.
  • 14:36 - 14:41
    We must start to think of the future
    we're building as a humanity.
  • 14:41 - 14:45
    We need to interact with creatives,
    with artists, with philosophers,
  • 14:45 - 14:47
    with politicians.
  • 14:47 - 14:48
    Everyone is involved,
  • 14:48 - 14:51
    because it's the future of our species.
  • 14:51 - 14:55
    Without fear, but with the understanding
  • 14:55 - 14:59
    that the decisions
    that we make in the next year
  • 14:59 - 15:03
    will change the course of history forever.
  • 15:04 - 15:05
    Thank you.
  • 15:05 - 15:15
    (Applause)
Title:
How to read the genome and build a human being
Speaker:
Riccardo Sabatini
Description:

Secrets, disease and beauty are all written in the human genome, the complete set of genetic instructions needed to build a human being. Now, as scientist and entrepreneur Riccardo Sabatini shows us, we have the power to read this complex code, predicting things like height, eye color, age and even facial structure — all from a vial of blood. And soon, Sabatini says, our new understanding of the genome will allow us to personalize treatments for diseases like cancer. We have the power to change life as we know it. How will we use it?

more » « less
Video Language:
English
Team:
closed TED
Project:
TEDTalks
Duration:
15:28

English subtitles

Revisions Compare revisions