0:00:00.982,0:00:05.041 I'm going to tell you about the most[br]amazing machines in the world 0:00:05.065,0:00:06.833 and what we can now do with them. 0:00:07.396,0:00:08.559 Proteins, 0:00:08.583,0:00:10.833 some of which you see inside a cell here, 0:00:10.857,0:00:14.316 carry out essentially all the important[br]functions in our bodies. 0:00:14.972,0:00:16.851 Proteins digest your food, 0:00:16.875,0:00:18.581 contract your muscles, 0:00:18.605,0:00:20.182 fire your neurons 0:00:20.206,0:00:21.822 and power your immune system. 0:00:22.400,0:00:24.376 Everything that happens in biology -- 0:00:24.400,0:00:25.551 almost -- 0:00:25.575,0:00:26.987 happens because of proteins. 0:00:27.698,0:00:31.773 Proteins are linear chains[br]of building blocks called amino acids. 0:00:32.366,0:00:35.599 Nature uses an alphabet of 20 amino acids, 0:00:35.623,0:00:37.898 some of which have names[br]you may have heard of. 0:00:38.921,0:00:42.463 In this picture, for scale,[br]each bump is an atom. 0:00:43.351,0:00:47.995 Chemical forces between the amino acids[br]cause these long stringy molecules 0:00:48.019,0:00:51.480 to fold up into unique,[br]three-dimensional structures. 0:00:51.937,0:00:53.277 The folding process, 0:00:53.301,0:00:54.735 while it looks random, 0:00:54.759,0:00:56.722 is in fact very precise. 0:00:56.746,0:01:01.143 Each protein folds[br]to its characteristic shape each time, 0:01:01.167,0:01:04.555 and the folding process[br]takes just a fraction of a second. 0:01:06.029,0:01:07.873 And it's the shapes of proteins 0:01:07.897,0:01:11.867 which enable them to carry out[br]their remarkable biological functions. 0:01:12.520,0:01:13.671 For example, 0:01:13.695,0:01:17.203 hemoglobin has a shape[br]in the lungs perfectly suited 0:01:17.227,0:01:19.214 for binding a molecule of oxygen. 0:01:19.759,0:01:21.651 When hemoglobin moves to your muscle, 0:01:21.675,0:01:23.607 the shape changes slightly 0:01:23.631,0:01:25.822 and the oxygen comes out. 0:01:27.494,0:01:28.860 The shapes of proteins, 0:01:28.884,0:01:31.097 and hence their remarkable functions, 0:01:31.121,0:01:36.899 are completely specified by the sequence[br]of amino acids in the protein chain. 0:01:37.331,0:01:41.272 In this picture, each letter[br]on top is an amino acid. 0:01:42.860,0:01:44.697 Where do these sequences come from? 0:01:45.586,0:01:50.410 The genes in your genome[br]specify the amino acid sequences 0:01:50.434,0:01:51.832 of your proteins. 0:01:51.856,0:01:55.594 Each gene encodes the amino acid[br]sequence of a single protein. 0:01:57.515,0:02:01.317 The translation between[br]these amino acid sequences 0:02:01.341,0:02:03.799 and the structures[br]and functions of proteins 0:02:03.823,0:02:05.880 is known as the protein folding problem. 0:02:06.439,0:02:07.984 It's a very hard problem 0:02:08.008,0:02:11.188 because there's so many different[br]shapes a protein can adopt. 0:02:12.073,0:02:13.718 Because of this complexity, 0:02:13.742,0:02:16.679 humans have only been able[br]to harness the power of proteins 0:02:16.703,0:02:20.171 by making very small changes[br]to the amino acid sequences 0:02:20.195,0:02:22.286 of the proteins we've found in nature. 0:02:22.835,0:02:26.693 This is similar to the process[br]that our Stone Age ancestors used 0:02:26.717,0:02:30.076 to make tools and other implements[br]from the sticks and stones 0:02:30.100,0:02:32.103 that we found in the world around us. 0:02:33.226,0:02:38.250 But humans did not learn to fly[br]by modifying birds. 0:02:38.790,0:02:40.807 (Laughter) 0:02:40.831,0:02:47.141 Instead, scientists, inspired by birds,[br]uncovered the principles of aerodynamics. 0:02:47.165,0:02:51.560 Engineers then used those principles[br]to design custom flying machines. 0:02:52.195,0:02:53.440 In a similar way, 0:02:53.464,0:02:55.406 we've been working for a number of years 0:02:55.430,0:02:58.699 to uncover the fundamental[br]principles of protein folding 0:02:58.723,0:03:02.782 and encoding those principles[br]in the computer program called Rosetta. 0:03:03.742,0:03:06.255 We made a breakthrough in recent years. 0:03:07.029,0:03:11.488 We can now design completely new proteins[br]from scratch on the computer. 0:03:12.396,0:03:14.464 Once we've designed the new protein, 0:03:15.242,0:03:19.145 we encode its amino acid sequence[br]in a synthetic gene. 0:03:19.656,0:03:21.544 We have to make a synthetic gene 0:03:21.568,0:03:23.819 because since the protein[br]is completely new, 0:03:23.843,0:03:28.605 there's no gene in any organism on earth[br]which currently exists that encodes it. 0:03:29.697,0:03:33.884 Our advances in understanding[br]protein folding 0:03:33.908,0:03:35.630 and how to design proteins, 0:03:35.654,0:03:39.282 coupled with the decreasing cost[br]of gene synthesis 0:03:39.306,0:03:42.805 and the Moore's law increase[br]in computing power, 0:03:42.829,0:03:47.565 now enable us to design[br]tens of thousands of new proteins, 0:03:47.589,0:03:49.928 with new shapes and new functions, 0:03:49.952,0:03:51.465 on the computer, 0:03:51.489,0:03:55.404 and encode each one of those[br]in a synthetic gene. 0:03:56.248,0:03:57.916 Once we have those synthetic genes, 0:03:57.940,0:03:59.485 we put them into bacteria 0:03:59.509,0:04:02.814 to program them to make[br]these brand-new proteins. 0:04:03.197,0:04:05.270 We then extract the proteins 0:04:05.294,0:04:08.730 and determine whether they function[br]as we designed them to 0:04:08.754,0:04:10.165 and whether they're safe. 0:04:11.867,0:04:14.332 It's exciting to be able[br]to make new proteins, 0:04:14.356,0:04:16.852 because despite the diversity in nature, 0:04:16.876,0:04:22.968 evolution has only sampled a tiny fraction[br]of the total number of proteins possible. 0:04:23.572,0:04:27.067 I told you that nature uses[br]an alphabet of 20 amino acids, 0:04:27.091,0:04:31.540 and a typical protein is a chain[br]of about 100 amino acids, 0:04:31.564,0:04:37.116 so the total number of possibilities[br]is 20 times 20 times 20, 100 times, 0:04:37.140,0:04:40.957 which is a number on the order[br]of 10 to the 130th power, 0:04:40.981,0:04:44.793 which is enormously more[br]than the total number of proteins 0:04:44.817,0:04:47.233 which have existed[br]since life on earth began. 0:04:47.990,0:04:50.681 And it's this unimaginably large space 0:04:50.705,0:04:54.235 we can now explore[br]using computational protein design. 0:04:55.747,0:04:58.116 Now the proteins that exist on earth 0:04:58.140,0:05:02.133 evolved to solve the problems[br]faced by natural evolution. 0:05:02.705,0:05:05.058 For example, replicating the genome. 0:05:06.128,0:05:08.412 But we face new challenges today. 0:05:08.436,0:05:11.173 We live longer, so new[br]diseases are important. 0:05:11.197,0:05:13.412 We're heating up and polluting the planet, 0:05:13.436,0:05:16.994 so we face a whole host[br]of ecological challenges. 0:05:17.977,0:05:19.785 If we had a million years to wait, 0:05:19.809,0:05:23.017 new proteins might evolve[br]to solve those challenges. 0:05:23.787,0:05:25.846 But we don't have[br]millions of years to wait. 0:05:26.488,0:05:29.359 Instead, with computational[br]protein design, 0:05:29.383,0:05:33.822 we can design new proteins[br]to address these challenges today. 0:05:35.693,0:05:40.143 Our audacious idea is to bring[br]biology out of the Stone Age 0:05:40.167,0:05:43.142 through technological revolution[br]in protein design. 0:05:44.113,0:05:46.977 We've already shown[br]that we can design new proteins 0:05:47.001,0:05:48.684 with new shapes and functions. 0:05:49.174,0:05:53.482 For example, vaccines work[br]by stimulating your immune system 0:05:53.506,0:05:56.628 to make a strong response[br]against a pathogen. 0:05:57.698,0:05:59.249 To make better vaccines, 0:05:59.273,0:06:01.575 we've designed protein particles 0:06:01.599,0:06:05.186 to which we can fuse[br]proteins from pathogens, 0:06:05.210,0:06:09.544 like this blue protein here,[br]from the respiratory virus RSV. 0:06:10.131,0:06:11.861 To make vaccine candidates 0:06:11.885,0:06:15.548 that are literally bristling[br]with the viral protein, 0:06:15.572,0:06:18.142 we find that such vaccine candidates 0:06:18.166,0:06:21.468 produce a much stronger[br]immune response to the virus 0:06:21.492,0:06:24.195 than any previous vaccines[br]that have been tested. 0:06:24.648,0:06:28.498 This is important because RSV[br]is currently one of the leading causes 0:06:28.522,0:06:30.751 of infant mortality worldwide. 0:06:32.414,0:06:36.377 We've also designed new proteins[br]to break down gluten in your stomach 0:06:36.401,0:06:37.998 for celiac disease 0:06:38.022,0:06:42.398 and other proteins to stimulate[br]your immune system to fight cancer. 0:06:43.338,0:06:47.277 These advances are the beginning[br]of the protein design revolution. 0:06:48.850,0:06:52.040 We've been inspired by a previous[br]technological revolution: 0:06:52.064,0:06:53.409 the digital revolution, 0:06:53.433,0:06:58.558 which took place in large part[br]due to advances in one place, 0:06:58.582,0:06:59.854 Bell Laboratories. 0:07:00.337,0:07:03.631 Bell Labs was a place with an open,[br]collaborative environment, 0:07:03.655,0:07:06.838 and was able to attract top talent[br]from around the world. 0:07:07.418,0:07:10.860 And this led to a remarkable[br]string of innovations -- 0:07:10.884,0:07:15.075 the transistor, the laser,[br]satellite communication 0:07:15.099,0:07:16.825 and the foundations of the internet. 0:07:17.761,0:07:21.602 Our goal is to build[br]the Bell Laboratories of protein design. 0:07:22.076,0:07:25.591 We are seeking to attract[br]talented scientists from around the world 0:07:25.615,0:07:28.550 to accelerate the protein[br]design revolution, 0:07:28.574,0:07:32.662 and we'll be focusing[br]on five grand challenges. 0:07:34.136,0:07:39.733 First, by taking proteins from flu strains[br]from around the world 0:07:39.757,0:07:43.311 and putting them on top[br]of the designed protein particles 0:07:43.335,0:07:45.002 I showed you earlier, 0:07:45.026,0:07:48.416 we aim to make a universal flu vaccine, 0:07:48.440,0:07:52.391 one shot of which gives a lifetime[br]of protection against the flu. 0:07:53.356,0:07:54.968 The ability to design -- 0:07:54.992,0:08:00.216 (Applause) 0:08:00.240,0:08:03.308 The ability to design[br]new vaccines on the computer 0:08:03.332,0:08:08.640 is important both to protect[br]against natural flu epidemics 0:08:08.664,0:08:12.144 and, in addition, intentional[br]acts of bioterrorism. 0:08:13.272,0:08:16.562 Second, we're going far beyond[br]nature's limited alphabet 0:08:16.586,0:08:18.297 of just 20 amino acids 0:08:18.321,0:08:23.056 to design new therapeutic candidates[br]for conditions such as chronic pain, 0:08:23.080,0:08:25.711 using an alphabet[br]of thousands of amino acids. 0:08:26.602,0:08:30.415 Third, we're building[br]advanced delivery vehicles 0:08:30.439,0:08:34.603 to target existing medications[br]exactly where they need to go in the body. 0:08:35.226,0:08:37.875 For example, chemotherapy to a tumor 0:08:37.899,0:08:42.202 or gene therapies to the tissue[br]where gene repair needs to take place. 0:08:43.000,0:08:49.532 Fourth, we're designing smart therapeutics[br]that can do calculations within the body 0:08:49.556,0:08:51.770 and go far beyond current medicines, 0:08:51.794,0:08:54.058 which are really blunt instruments. 0:08:54.082,0:08:58.431 For example, to target a small[br]subset of immune cells 0:08:58.455,0:09:00.536 responsible for an autoimmune disorder, 0:09:00.560,0:09:04.018 and distinguish them from the vast[br]majority of healthy immune cells. 0:09:04.899,0:09:08.311 Finally, inspired by remarkable[br]biological materials 0:09:08.335,0:09:13.443 such as silk, abalone shell,[br]tooth and others, 0:09:13.467,0:09:16.351 we're designing new[br]protein-based materials 0:09:16.375,0:09:20.538 to address challenges in energy[br]and ecological issues. 0:09:21.558,0:09:24.403 To do all this,[br]we're growing our institute. 0:09:24.768,0:09:30.367 We seek to attract energetic,[br]talented and diverse scientists 0:09:30.391,0:09:33.471 from around the world,[br]at all career stages, 0:09:33.495,0:09:34.645 to join us. 0:09:35.304,0:09:38.607 You can also participate[br]in the protein design revolution 0:09:38.631,0:09:42.375 through our online[br]folding and design game, "Foldit." 0:09:43.214,0:09:47.065 And through our distributed[br]computing project, Rosetta@home, 0:09:47.089,0:09:50.820 which you can join from your laptop[br]or your Android smartphone. 0:09:52.547,0:09:56.514 Making the world a better place[br]through protein design is my life's work. 0:09:56.996,0:09:59.274 I'm so excited about[br]what we can do together. 0:09:59.583,0:10:01.053 I hope you'll join us, 0:10:01.077,0:10:02.235 and thank you. 0:10:02.259,0:10:06.714 (Applause and cheers)