Return to Video

How to read the genome and build a human being

  • 0:01 - 0:04
    The next 60 minutes,
    I'm going to take you on a journey
  • 0:04 - 0:07
    that is probably the biggest dream
    of humanity:
  • 0:07 - 0:09
    to understand the code of life.
  • 0:09 - 0:12
    So for me, everything started
    many, many years ago
  • 0:12 - 0:15
    when I met the first 3D printer.
  • 0:15 - 0:17
    The concept was fascinating,
  • 0:17 - 0:19
    a 3D printer needed three elements:
  • 0:19 - 0:23
    a bit of information, some raw material,
    some energy,
  • 0:23 - 0:27
    and it can produce any object
    that was not there before.
  • 0:27 - 0:29
    I was doing physics, I was coming back
    home
  • 0:29 - 0:34
    and I realized that I actually always knew
    a 3D printer and everyone does.
  • 0:34 - 0:35
    It was my mom.
  • 0:35 - 0:36
    (Laughter)
  • 0:36 - 0:38
    My mom takes three elements:
  • 0:38 - 0:42
    a bit of information, which is between
    my father and my mom in this case,
  • 0:42 - 0:47
    raw elements and energy
    in the same medium, that is food.
  • 0:47 - 0:49
    And after several months, produces me,
  • 0:49 - 0:51
    and I was not existent before.
  • 0:51 - 0:55
    So a part from the shock of my mom
    discovering that she's a 3D priner,
  • 0:55 - 1:01
    I immediately got mesmerized
    by that piece, the first one, the information.
  • 1:01 - 1:05
    What amount of information takes
    to build and asemble a human?
  • 1:05 - 1:07
    Is it much? Is it little?
  • 1:07 - 1:09
    How many thumb drives can you fill?
  • 1:09 - 1:12
    Well, I was studying physics
    in the beginning
  • 1:12 - 1:15
    and I took this approximation
    of a human
  • 1:15 - 1:18
    as a gigantic lego piece.
  • 1:18 - 1:21
    You can imagine that the building blocks
    are little atoms
  • 1:21 - 1:26
    and there is a hydrogen here, a carbon here,
    a nitrogen here.
  • 1:26 - 1:28
    So in the first approximation,
  • 1:28 - 1:32
    if I can list the number of atoms
    that compose a human being,
  • 1:32 - 1:33
    I can build it.
  • 1:33 - 1:36
    Now you can round some numbers
  • 1:36 - 1:39
    and that happens to be
    quite an astonishing number.
  • 1:39 - 1:44
    So the number of atoms,
    the file that I will save in my thumb drive
  • 1:44 - 1:46
    to assemble a little baby
  • 1:46 - 1:51
    will actually fill an entire Titanic
    of thumb drives.
  • 1:51 - 1:58
    Multiplied 2,000 times, this is
    the miracle of life.
  • 1:58 - 2:00
    Every time you see from now on
    a pregnant lady,
  • 2:00 - 2:03
    she's assembling the biggest
    amount of information
  • 2:03 - 2:05
    that you will ever encounter.
  • 2:05 - 2:07
    Forget big data, forget anything
    you heard of.
  • 2:07 - 2:11
    This is the biggest amount
    of information that exists.
  • 2:11 - 2:14
    (Applause)
  • 2:14 - 2:19
    But nature, fortunately, is much smarter
    than a young physicist
  • 2:19 - 2:23
    and in 4 billion years, managed
    to pack this information
  • 2:23 - 2:26
    in a small crystal we call DNA.
  • 2:26 - 2:29
    We met it for the first time in 1950
    when Rosalind Franklin,
  • 2:29 - 2:32
    an amazing scientist,
    a woman,
  • 2:32 - 2:33
    took a picture of it.
  • 2:33 - 2:38
    But it took us more than 40 years
    to finally poke inside a human cell,
  • 2:38 - 2:40
    take out this crystal,
  • 2:40 - 2:44
    unroll it, and read it for the first time.
  • 2:44 - 2:47
    While the code comes out to be
    a fairly simple alphabet,
  • 2:47 - 2:51
    four letters: A,T,C and G.
  • 2:51 - 2:55
    And to build a human,
    you need 3 billion of them.
  • 2:55 - 2:58
    3 billion, how many are 3 billion?
  • 2:58 - 3:01
    Doesn't really make
    much sense as a number?
  • 3:01 - 3:05
    So I was thinking how
    I could explain myself better
  • 3:05 - 3:08
    about how big and enormous
    is this code.
  • 3:08 - 3:11
    But there is...I'm going
    to have some help
  • 3:11 - 3:14
    and the best person who is
    going to help me introduce the code
  • 3:14 - 3:18
    is actually the first man
    to sequence it, Dr. Craig Venter.
  • 3:18 - 3:21
    So welcome on stage Dr. Craig Venter.
  • 3:21 - 3:28
    (Applause)
  • 3:28 - 3:32
    Not the man in his flesh,
  • 3:32 - 3:34
    but for the first time in history
  • 3:34 - 3:41
    this is the genome of a specific human
    printed page -by-page, letter-by-letter,
  • 3:41 - 3:45
    262,000 pages of information,
  • 3:45 - 3:50
    450 kilograms shipped from
    the United States to Canada
  • 3:50 - 3:55
    all the thanks to ?,
  • 3:55 - 3:56
    it was an amazing feat.
  • 3:56 - 4:01
    But this is the visual perception
    of what is the code of life,
  • 4:01 - 4:03
    and now, for the first time,
    I can do something funny.
  • 4:03 - 4:06
    I can actually poke inside it
    and read.
  • 4:06 - 4:13
    So let me take some interesting book
    like this one.
  • 4:13 - 4:16
    I have an annotation, it's a
    fairly big book.
  • 4:16 - 4:21
    So just to let you see
    what is the code of life.
  • 4:21 - 4:24
    Thousands and thousands
    and thousands
  • 4:24 - 4:27
    and millions of letters.
  • 4:27 - 4:29
    And they apparently
    make any sense.
  • 4:29 - 4:32
    Let's get to a specific part.
  • 4:32 - 4:34
    Let em read it to you:
  • 4:34 - 4:35
    (Laughter)
  • 4:35 - 4:39
    "AAGAATATA."
  • 4:39 - 4:41
    To you it sounds like mute letters,
  • 4:41 - 4:46
    but this sequence gives
    the color of the eyes to Craig.
  • 4:46 - 4:48
    I'll show you another part
    of the book.
  • 4:48 - 4:51
    It's actually a little more
    complicated.
  • 4:51 - 4:54
    Chromosome 14, book 132:
  • 4:54 - 4:56
    (Laughter)
  • 4:56 - 4:57
    As you might expect.
  • 4:57 - 5:03
    (Laughter)
  • 5:03 - 5:09
    "ATTCTTGATT."
  • 5:09 - 5:15
    This human is lucky, because if
    you miss just two letters in this position,
  • 5:15 - 5:17
    two letters out of 3 billion,
  • 5:17 - 5:19
    he will condemned to
    a terrible disease:
  • 5:19 - 5:20
    cystic fibrosis.
  • 5:20 - 5:24
    We have no cure for it, we don't
    know how to solve it,
  • 5:24 - 5:28
    and it's just two letter of difference
    from what we are.
  • 5:28 - 5:31
    A wonderful book, a mighty book,
  • 5:31 - 5:37
    a mighty book that helped me understand
    and show you something quite remarkable.
  • 5:37 - 5:41
    Every one of you, what makes me, me
    and you, you
  • 5:41 - 5:46
    is just bout 5 million of these,
    half a book.
  • 5:46 - 5:51
    For the rest, we all absolutely
    identical.
  • 5:51 - 5:55
    500 pages if the miracle
    of life that you are.
  • 5:55 - 5:58
    The rest, we all share it.
  • 5:58 - 6:01
    So think about that again
    when you think you are different.
  • 6:01 - 6:04
    This is the amount that we share.
  • 6:04 - 6:07
    So now that I have your attention,
  • 6:07 - 6:10
    the next question is,
    how do I read it?
  • 6:10 - 6:12
    How do I make sense of it?
  • 6:12 - 6:16
    Well, for how good you can be
    at assembling Swedish furnitures,
  • 6:16 - 6:20
    this instruction manual is nothing
    crack in your life.
  • 6:20 - 6:21
    (Laughter)
  • 6:21 - 6:27
    And so, in 2014, two famous TEDsters
    Peter Diamandis and Craig Venture himself
  • 6:27 - 6:30
    decided to assemble a new company
    and Human Longevity was born,
  • 6:30 - 6:31
    with one mission;
  • 6:31 - 6:36
    trying everything we can try and learning
    everything we can learn from these books
  • 6:36 - 6:42
    with one target; making real
    the dream of personalized medicine.
  • 6:42 - 6:46
    And understanding what things
    should be done to have better health
  • 6:46 - 6:49
    and what are the secrets in these books.
  • 6:49 - 6:53
    An amazing team, 14 other scientists
    and many, many more people,
  • 6:53 - 6:54
    a pleasure to work with,
  • 6:54 - 6:56
    the ceoncept is actually very simple.
  • 6:56 - 7:00
    We're going to use a technology
    called machine learning.
  • 7:00 - 7:04
    On one side, we have genomes,
    thousands of them.
  • 7:04 - 7:08
    On the other side, we collected
    the biggest database of human beings:
  • 7:08 - 7:13
    phenotypes, 3D scan, MRI, everything
    you can think of.
  • 7:13 - 7:18
    Inside there, on the two opposite sides,
    there is the secret the translation.
  • 7:18 - 7:21
    And in the middle, we build a machine.
  • 7:21 - 7:24
    We build a machine
    and we train a machine --
  • 7:24 - 7:27
    well, not exactly one machine,
    many, many machines --
  • 7:27 - 7:32
    to try to understand and translate
    the genome in a phenotype.
  • 7:32 - 7:35
    What are those letters and what
    do they do?
  • 7:35 - 7:38
    It's an approach that can be used
    for everything,
  • 7:38 - 7:41
    but using it in genomics is particularly
    complicated.
  • 7:41 - 7:41
    Little by little we grew and we wanted
    to build different challenges.
  • 7:41 - 7:47
    We start from the beginning
    from common traits.
  • 7:47 - 7:49
    Common traits are comfortable
    because they are common,
  • 7:49 - 7:51
    everyone has them.
  • 7:51 - 7:55
    So we started to ask our question,
    can we predict height?
  • 7:55 - 7:57
    Can we read the books and predict
    your height?
  • 7:57 - 8:00
    We actually can, with
    5 centimeters of precision.
  • 8:00 - 8:04
    BMI is fairly connected
    to your lifestyle,
  • 8:04 - 8:07
    but we still can, we get in the ballpark,
    kilograms of precision.
  • 8:07 - 8:09
    Can we predict the eye color?
  • 8:09 - 8:10
    Yeah, we can.
  • 8:10 - 8:12
    80 percent accuracy.
  • 8:12 - 8:14
    Can we predict the skin color?
  • 8:14 - 8:16
    Yeah we can, 80 percent accuracy.
  • 8:16 - 8:18
    Can we predict age?
  • 8:18 - 8:22
    We can, because apparently the code
    changes during your life.
  • 8:22 - 8:26
    It gets shorter, you loose pieces,
    it gets insertions.
  • 8:26 - 8:29
    We read the signals
    and we make a model.
  • 8:29 - 8:30
    Now, an interesting challenge.
  • 8:30 - 8:33
    Can we predict a human face?
  • 8:33 - 8:35
    It's a little complicated because
    a human face
  • 8:35 - 8:38
    is scattered among millions
    of these letters
  • 8:38 - 8:40
    and a human face is not
    a very well-defined object.
  • 8:40 - 8:42
    So, we had to build an entire tier of it
  • 8:42 - 8:45
    to learn and teach a machine
    what is a face
  • 8:45 - 8:47
    and embed and compress it,
  • 8:47 - 8:49
    and if you're comfortable with
    machine learning,
  • 8:49 - 8:52
    you understand what is the challenge.
  • 8:52 - 8:58
    Now, after 15 years, 15 years after
    we read the first sequence,
  • 8:58 - 9:01
    this October, we started to see
    some ?
  • 9:01 - 9:04
    and it was a very emotional moment.
  • 9:04 - 9:08
    What you see here is a subject
    coming in our lab.
  • 9:08 - 9:10
    This is a face for us.
  • 9:10 - 9:13
    So we take the real face of a subject,
    we reduce the complecity
  • 9:13 - 9:15
    because not everything is in your face,
  • 9:15 - 9:19
    lots of features and defects
    and asymmetries are coming from your life.
  • 9:19 - 9:23
    We symetrize the face and we run
    our algorithm.
  • 9:23 - 9:28
    The results I show you right now,
    this is the prediction we have
  • 9:28 - 9:30
    from the blot.
  • 9:30 - 9:31
    (Applause)
  • 9:31 - 9:33
    Wait a second.
  • 9:33 - 9:38
    In these seconds, your eyes are watching,
    left and right, left and right,
  • 9:38 - 9:41
    and your brain wants those pictures
    to be identical.
  • 9:41 - 9:44
    So I ask you to do another exercise
    to be honest.
  • 9:44 - 9:48
    please search for the differences,
    which are many.
  • 9:48 - 9:50
    The biggest amount of signal
    comes from gender,
  • 9:50 - 9:55
    then there is age, BMI,
    the ethnicity component of a human.
  • 9:55 - 9:59
    And scaling up after that signal
    is much more complicated.
  • 9:59 - 10:02
    What you see here, even in the
    differences,
  • 10:02 - 10:06
    let's you understand that we are
    in the right ballpark.
  • 10:06 - 10:10
    That we are getting closer
    and it's already giving you some emotions.
  • 10:10 - 10:15
    This is another subject that
    comes in our lab
  • 10:15 - 10:16
    and this is a prediction.
  • 10:16 - 10:19
    A little smaller face, we didn't get
    the complete cranial structure,
  • 10:19 - 10:22
    but still, it's in the ballpark.
  • 10:22 - 10:26
    This is a subject that comes in our lab,
    and this is the prediction.
  • 10:26 - 10:30
    So these people have never been seen
    in the training of the machine,
  • 10:30 - 10:34
    these are the "held out set" ?
  • 10:34 - 10:38
    But these are, as well, people that you'll
    probably never believe.
  • 10:38 - 10:40
    We are publishing everything
    in a scientific publication,
  • 10:40 - 10:41
    you can read it.
  • 10:41 - 10:44
    Since we are onstage,
    Chris challeneged me.
  • 10:44 - 10:47
    I probably exposed myself
    and tried to predict
  • 10:47 - 10:50
    someone that you might recognize.
  • 10:50 - 10:54
    In this vile of blood, and believe me,
  • 10:54 - 10:58
    you had no idea what we had to do
    to have blood now, here.
  • 10:58 - 11:02
    In this vile of blood, there is mountain
    of biological information
  • 11:02 - 11:04
    that we need to do
    a full genome sequence on.
  • 11:04 - 11:07
    We just need this amount.
  • 11:07 - 11:10
    We ran this sequence and I'm
    going to do it with you.
  • 11:10 - 11:14
    And we start to layer up
    all the understanding we have.
  • 11:14 - 11:19
    In the vile of blood, we predicted
    he's a male, the subject is a male.
  • 11:19 - 11:22
    We predict that he's a meter and 76,
  • 11:22 - 11:24
    the subject is a meter and 77.
  • 11:24 - 11:29
    We predict that he's 76,
    the subject is 82.
  • 11:29 - 11:32
    We predict his age, 38.
  • 11:32 - 11:34
    The subject is 35.
  • 11:34 - 11:37
    Predict his eye color?
  • 11:37 - 11:38
    Too dark.
  • 11:38 - 11:40
    Predict his skin color?
  • 11:40 - 11:42
    We are almost there.
  • 11:42 - 11:45
    That's his face.
  • 11:45 - 11:52
    Now, the reveal moment
    that the subject is this person.
  • 11:52 - 11:55
    And I did that intentionally.
  • 11:55 - 11:58
    I am a very particular
    and peculiar ethnicity.
  • 11:58 - 12:01
    Southern European Italians, they never
    fit in models.
  • 12:01 - 12:06
    And it's particular that ethnicity
    is a complex corner case for our model.
  • 12:06 - 12:08
    But there is another point.
  • 12:08 - 12:12
    So, one of the things that we
    use a lot to recognize poeple
  • 12:12 - 12:13
    will never be written in the genome,
    it's our free will.
  • 12:13 - 12:19
    Not my haircut in this case,
    but my beard cut.
  • 12:19 - 12:22
    So I'm going to show you, I"m going to,
    in this case, transfer it --
  • 12:22 - 12:27
    and this is nothing more than photoshop,
    no modeling -- the beard on the subject.
  • 12:27 - 12:31
    And immediately, we get
    much, much better and filling.
  • 12:31 - 12:36
    So, why do we do this?
  • 12:36 - 12:41
    We certainly don't do it
    for predicting the height
  • 12:41 - 12:45
    or taking a beautiful picture
    out of your blood.
  • 12:45 - 12:49
    We do it because the same technology
    and the same approach,
  • 12:49 - 12:54
    the machine learning of this code
    is helping us to understand how we work,
  • 12:54 - 12:56
    how your body works,
  • 12:56 - 12:58
    how your body ages,
  • 12:58 - 13:00
    how disease generates in your body,
  • 13:00 - 13:03
    how your cancer develops,
  • 13:03 - 13:08
    how drugs work and if they work
    on your body.
  • 13:08 - 13:10
    This is a huge challenge,
  • 13:10 - 13:13
    this is a challenge that we share with
    thousands of other researchers
  • 13:13 - 13:14
    around the world.
  • 13:14 - 13:17
    It's called personalized medicine.
  • 13:17 - 13:21
    It's the ability to move from
    a statistical appraoch
  • 13:21 - 13:23
    where you're a dot in the ocean,
  • 13:23 - 13:25
    to a personalized approach,
  • 13:25 - 13:27
    where we read all these books
  • 13:27 - 13:30
    and we get an understanding of exactly
    how you are.
  • 13:30 - 13:33
    But it is a particularly
    complicated challenge
  • 13:33 - 13:38
    because of all these books,
    as of today,
  • 13:38 - 13:41
    we probably know just 2 percent:
  • 13:41 - 13:46
    4 books out of 175.
  • 13:46 - 13:50
    And this is not the topic of my talk
  • 13:50 - 13:54
    because we will learn more.
  • 13:54 - 13:57
    There are the best minds in the world
    on this topic.
  • 13:57 - 13:59
    The prediction will better,
  • 13:59 - 14:01
    the model will get more precise.
  • 14:01 - 14:03
    And the more we will learn,
  • 14:03 - 14:08
    the more we will be confronted
    with decisions
  • 14:08 - 14:11
    that we never had to face before
  • 14:11 - 14:17
    about life, about death,
    about parenting.
  • 14:17 - 14:26
    So, this conversation....we are touching
    the very inner detail of how life works.
  • 14:26 - 14:30
    And ti's a revolution
    that cannot be confined
  • 14:30 - 14:33
    in the domain of science or technology.
  • 14:33 - 14:36
    This must be a global conversation.
  • 14:36 - 14:41
    We must start to think of the future
    we're building as a humanity.
  • 14:41 - 14:45
    We need to interact creatives,
    with artists, with philosophers,
  • 14:45 - 14:47
    with politicians,
  • 14:47 - 14:52
    everyone is involved because it's
    the future of our species.
  • 14:52 - 14:57
    Without fear, but with the understanding
    that the decisions
  • 14:57 - 15:03
    that we will take in the next year
    will change the course of history forever.
  • 15:03 - 15:05
    Thank you.
  • 15:05 - 15:15
    (Applause)
Title:
How to read the genome and build a human being
Speaker:
Riccardo Sabatini
Description:

more » « less
Video Language:
English
Team:
closed TED
Project:
TEDTalks
Duration:
15:28

English subtitles

Revisions Compare revisions