0:00:01.014,0:00:04.185 (lift) 0:00:04.185,0:00:07.244 (lift 12 - Feb 24 2012 - Geneva) 0:00:07.244,0:00:10.044 (Rufus Pollock - Stories) 0:00:10.044,0:00:11.788 [Rufus Pollock] Just to say for those of you who don't know: 0:00:11.788,0:00:13.666 the Open Knowledge Foundation is a not-profit -- not for profit 0:00:13.666,0:00:15.611 founded in 2004 0:00:15.611,0:00:17.865 and which builds tools and communities 0:00:17.865,0:00:20.934 to create, use and share open information 0:00:20.934,0:00:24.585 and that's information that anyone can use, reuse and redistribute. 0:00:24.585,0:00:28.321 And as such, we've been working on open data for quite a long time 0:00:28.321,0:00:30.011 since we started in 2004. 0:00:30.011,0:00:34.817 And today, I want to start the story by going back in time 5000 years, 0:00:34.817,0:00:37.610 to ancient Mesopotamia. 0:00:37.610,0:00:41.393 There, between the Tigris and the Euphrates rivers, 0:00:42.069,0:00:44.390 flourished the Sumerian civilization. 0:00:44.390,0:00:47.298 And they were confronted by a problem. 0:00:47.298,0:00:50.269 They were confronted by the limitations of human memory 0:00:50.899,0:00:54.338 in the recording of taxes, food and other goods. 0:00:54.338,0:00:59.642 And those ancient civil servants and businessmen hit on a novel solution: 0:01:00.380,0:01:04.666 What they decided to do was they would start counting things with small clay chits, 0:01:04.666,0:01:09.234 which they would bake inside of a clay -- a little clay box 0:01:09.234,0:01:12.617 and then mark, on the outside of that box, what they were counting. 0:01:12.617,0:01:15.303 You know, was it grain, was it tax payments, whatever. 0:01:16.150,0:01:19.786 And so, born out of necessity for a state and a society, 0:01:20.632,0:01:25.773 came one of the great information technology revolutions of all time: writing. 0:01:25.773,0:01:28.172 The Sumerians invented writing via cuneiform. 
0:01:28.910,0:01:34.039 And if we fast-forward from that a few thousand years, we come to the UK census. 0:01:34.039,0:01:37.577 Again, it's always interesting that states, governments are often at the forefront 0:01:37.577,0:01:42.681 of at least driving information technology and information systems innovations. 0:01:42.681,0:01:44.654 The UK census: again, the state, 0:01:44.654,0:01:46.565 this is during the Napoleonic Wars, 0:01:46.565,0:01:48.601 desired to count the population more accurately: 0:01:48.601,0:01:51.995 and we have the first UK census in 1801. 0:01:51.995,0:01:56.189 And in the US, they also had censuses, in fact starting in 1790. 0:01:56.819,0:01:59.383 And one of the problems encountered in the 1880 census 0:01:59.383,0:02:01.592 was they tabulated the census by hand. 0:02:02.345,0:02:05.699 And by the 1880 census, it was taking seven years 0:02:05.699,0:02:06.822 to tabulate the census. 0:02:06.822,0:02:10.241 So after it got taken in 1880, it wasn't until 1887 0:02:10.241,0:02:12.892 they actually had any data they could use. 0:02:12.892,0:02:16.004 And they calculated that for the next census in 1890, 0:02:16.004,0:02:18.164 they wouldn't be finished by 1900. 0:02:18.164,0:02:21.936 They still wouldn't have the results of the census by the time they started the next one. 0:02:21.936,0:02:24.233 They had a crisis of information technology. 0:02:24.233,0:02:26.979 And what they went and did is they commissioned Herman Hollerith 0:02:26.979,0:02:29.747 to build the first automatic tabulator. 0:02:29.747,0:02:32.835 And for those of you who know your company history, of course, 0:02:32.835,0:02:34.513 Herman Hollerith's company went on 0:02:34.513,0:02:35.899 to be one of the founders, if you like, 0:02:35.899,0:02:38.808 one of the companies that came and created IBM.
0:02:38.808,0:02:42.258 And IBM, by the sixties, were building this 0:02:42.258,0:02:44.374 -- they replaced those hand -- 0:02:44.374,0:02:45.905 those kind of wooden, mechanical tabulators 0:02:45.905,0:02:48.524 with this stuff: digital tabulators, 0:02:48.524,0:02:50.375 the modern computer of this age. 0:02:50.375,0:02:52.610 And again, much of this -- I don't know if you guys know -- 0:02:52.610,0:02:53.705 IBM would have gone bankrupt 0:02:53.705,0:02:58.477 if it hadn't been for Franklin Roosevelt passing the Social Security Act in the States, 0:02:58.477,0:03:01.132 which necessitated a huge amount of new tabulation. 0:03:01.132,0:03:04.629 So, again, a lot of innovation in this space came out of government need 0:03:04.629,0:03:06.370 and also, of course, the nuclear program, 0:03:06.370,0:03:08.641 the other great needer of computational power. 0:03:09.317,0:03:11.899 And today, today, 0:03:12.623,0:03:15.485 we find ourselves again in the midst of a revolution. 0:03:16.438,0:03:19.331 It's a revolution driven by two needs: 0:03:19.331,0:03:22.027 ones that have been the same throughout history as I've just shown, 0:03:22.027,0:03:23.886 information complexity, which is the necessity, 0:03:24.456,0:03:27.575 and information technology, which is the opportunity. 0:03:28.544,0:03:32.702 And what we're doing in this case is a policy innovation, if you like. 0:03:32.702,0:03:36.468 We are innovating by opening up information. 0:03:37.052,0:03:39.436 So just take the obvious example, government, 0:03:39.436,0:03:41.097 as I said, often the innovator. 0:03:41.097,0:03:43.308 In the last -- 3 years ago, you go back 3 years, 0:03:43.308,0:03:45.829 there's almost no open government data initiatives 0:03:45.829,0:03:46.688 in the world. 0:03:46.688,0:03:48.442 Today there are dozens. 0:03:48.442,0:03:51.162 The UK, the US, Finland, Kenya, The Netherlands, 0:03:51.162,0:03:53.049 and there's new ones almost every week. 
0:03:53.049,0:03:57.407 There's been a launch of an official kind of movement as a part of the UN 0:03:57.407,0:04:00.097 called the Open Government Partnership in which countries sign up, 0:04:00.097,0:04:02.433 and among other things, they open up their data. 0:04:03.002,0:04:05.325 And of course, it's been, in the UK and other countries, 0:04:05.325,0:04:06.562 Tim Berners-Lee has been involved. 0:04:06.562,0:04:09.106 I've helped advise the government around this in the UK. 0:04:09.106,0:04:11.221 But it's not just government, it's also companies. 0:04:11.651,0:04:13.982 Companies are opening up data. 0:04:13.982,0:04:15.690 Very interestingly, last year, 0:04:15.690,0:04:19.092 Nike started an open data initiative there 0:04:19.092,0:04:21.372 to open up supply chain and sustainability data, 0:04:21.372,0:04:23.931 for themselves and also for their suppliers, 0:04:23.931,0:04:26.800 which I think is a very interesting change. 0:04:26.800,0:04:28.004 And it's also communities. 0:04:28.004,0:04:29.715 Often, in fact, back there in the beginning, 0:04:29.715,0:04:31.927 this incredible map that you saw in an earlier slide, 0:04:31.927,0:04:35.002 is an OpenStreetMap activity, around the world. 0:04:35.002,0:04:38.073 People adding to this crowd-built map of the world. 0:04:38.073,0:04:41.074 And in the last 6 years, OpenStreetMap, 0:04:41.074,0:04:42.445 from a bottom-up community, 0:04:42.445,0:04:44.435 have built a complete, comprehensive, 0:04:44.435,0:04:47.918 map of the world, of fully open data. 0:04:48.872,0:04:50.898 So I've just gone on about Open Data, 0:04:50.898,0:04:52.766 and one thing I'm aware of, of this audience, 0:04:52.766,0:04:54.035 is you might not all know what it is. 0:04:54.035,0:04:59.152 So I'm going to take a brief moment, a brief moment, to say what it is. 0:04:59.152,0:05:01.493 What does it mean when I say 'open'? 0:05:01.493,0:05:05.557 And was it, you know, what's different from anything else?
What's different from simply public data? 0:05:05.557,0:05:07.083 So there's actually a definition, 0:05:07.083,0:05:10.177 a definition we the Open Knowledge Foundation helped write, it's very simple. 0:05:10.177,0:05:13.671 In a nutshell, a piece of information, a piece of data, 0:05:13.671,0:05:18.384 is open if anyone is free to use, reuse, 0:05:18.384,0:05:20.797 and redistribute it, subject only at most 0:05:20.797,0:05:22.891 to a requirement to attribute and share alike. 0:05:23.214,0:05:25.784 And anyone means anyone! 0:05:25.784,0:05:28.055 It doesn't mean -- there can't be any commercial restrictions. 0:05:28.055,0:05:32.262 You can't say: hey, here's this data, but only people using it for non-commercial purposes. 0:05:32.262,0:05:34.849 Or only people working in education. 0:05:34.849,0:05:38.051 Or only people living in the developing world, or the developed world. 0:05:38.051,0:05:40.743 There can't be any restrictions like that. 0:05:41.343,0:05:43.189 And there's a reason for this, by the way, 0:05:43.209,0:05:48.615 and it isn't just because one's obsessed about if you like, trademarking an attractive term. 0:05:49.315,0:05:51.081 It's because it's about interoperability. 0:05:51.291,0:05:54.617 One of my experiences at this conference, which I remember from previous trips to Geneva, 0:05:54.627,0:05:56.974 is I've been unable to plug in my laptop! 0:05:56.974,0:06:02.048 Even though I have a French adaptor, in fact, these wonderful Swiss plugs here, are, you know, 0:06:02.048,0:06:03.582 these wonderful, small octagonal shape. 0:06:03.582,0:06:05.379 And even with my adaptor I can't plug in. 0:06:05.379,0:06:07.347 Right? And it's called interoperability. 0:06:07.347,0:06:10.929 When we travel around to different countries, our power adaptors don't actually fit in. 0:06:10.929,0:06:12.581 We have to buy something. 
0:06:12.581,0:06:16.755 And the point about this definition, and the point about caring about Open Data, 0:06:16.755,0:06:18.317 is, it's about interoperability. 0:06:18.317,0:06:22.112 The dream of Open Data is interoperability. 0:06:22.112,0:06:26.058 Of seamlessly being able to share and interweave information. 0:06:27.898,0:06:31.704 And if every time I get information from two different people I have to consult a lawyer, 0:06:31.704,0:06:35.300 I have to work out whether I'm allowed to do it, whether I'm allowed to put these things together, 0:06:35.300,0:06:37.634 we lose that dream, that dream is shattered. 0:06:37.634,0:06:42.166 And the key point is, this definition, and those conditions, ensure interoperability. 0:06:42.166,0:06:45.744 If you comply with them, we know that any piece of info, of Open Data, 0:06:45.744,0:06:47.880 will work with any other piece of Open Data. 0:06:48.681,0:06:52.932 And also, it's worth saying for a quick moment, what kind of data, and to emphasize a point. 0:06:52.932,0:06:55.985 Just to foreclose those kinds of questions, otherwise I always get asked. 0:06:55.985,0:06:58.809 When we talk about opening up data, in general, 0:06:58.809,0:07:01.026 we're not talking about personal data. 0:07:01.026,0:07:04.161 We're not talking about opening up your private health records 0:07:04.161,0:07:08.302 or opening up your personal tax information. 0:07:08.302,0:07:11.267 We're talking about information that is non-personal in nature. 0:07:11.267,0:07:15.667 And for the government for example: transport, geodata, statistics, electoral, legal. 0:07:15.667,0:07:19.510 Stuff that the UK has, in fact, for example been opening up over the last few years. 0:07:19.510,0:07:23.381 This financial information, on government spending, this information on health outcomes, 0:07:23.381,0:07:28.625 on prescriptions, this information on educational outcomes, this information on the law. 
0:07:28.625,0:07:30.765 This information -- statistical information. 0:07:30.785,0:07:32.691 That's the kind of thing that we're talking about. 0:07:34.186,0:07:37.393 Now, I want to say, it's in this story, we have this story of over time. 0:07:37.393,0:07:38.996 But why governments are doing it now? 0:07:39.596,0:07:40.598 And why Open Data? 0:07:41.268,0:07:43.930 So, okay, for thousands of years, governments innovate, 0:07:43.930,0:07:47.274 but why do they innovate at this particular moment and in this way? 0:07:47.274,0:07:51.976 So I want to start here with a quick story, a story of medicine gone wrong. 0:07:52.006,0:07:54.484 It is from a great book by a guy called Stephen Klaidman. 0:07:54.484,0:07:55.918 It's in fact one of the things 0:07:55.918,0:07:57.781 that made me think quite deeply about this: 0:07:57.781,0:07:59.852 why I was interested in Open Data. 0:08:01.172,0:08:02.917 In that picture there, you can see 0:08:02.917,0:08:05.726 what was the Redding Medical Centre in Northern California. 0:08:05.726,0:08:10.471 There, in 2002, in the Summer of 2002, John Corapi, 0:08:11.231,0:08:12.401 in typical American style, 0:08:12.401,0:08:15.374 an ex-accountant from Vegas turned Catholic priest, 0:08:15.374,0:08:17.243 [scattered laughter] 0:08:17.783,0:08:22.274 ...arrived at the Redding Medical Centre having been referred by his doctor for having chest pains. 0:08:22.784,0:08:28.419 He had a cardiogram by the local cardiologist and was told that he needed an immediate heart bypass, 0:08:28.419,0:08:31.484 that he was at serious risk, and that he should come back later that day, 0:08:31.484,0:08:34.514 or at the latest, tomorrow, to have open heart surgery. 0:08:35.764,0:08:37.985 Rather shocked, and dazed by this news, 0:08:37.985,0:08:41.225 he returned home to pack his bags in order to return to hospital. 
0:08:41.225,0:08:45.102 He called up his best friend, who was still an accountant in Vegas, 0:08:46.032,0:08:52.568 whose partner was a hospital nurse, and who advised him that he should get a second opinion, 0:08:52.568,0:08:55.904 that, according to his partner, it was not, you know, 0:08:55.904,0:08:58.981 it was very unusual that you would need to have immediate open heart surgery, 0:08:58.981,0:09:00.235 and that he should get a second opinion. 0:09:00.975,0:09:04.507 Rather doubtful about this, because he was extremely worried, he did get on a plane. 0:09:04.507,0:09:07.919 He went to Vegas, he got seen by another specialist... 0:09:07.919,0:09:11.785 who, to his complete surprise, told him there was nothing wrong with his heart. 0:09:12.805,0:09:15.289 He saw another specialist, just to make sure. 0:09:15.289,0:09:18.563 They told him also, there was nothing wrong with his heart. 0:09:19.343,0:09:25.067 Relieved, and rather, you know, happy, he returned home and just wanted to really forget about it. 0:09:25.067,0:09:27.389 But his friend said: "No, what's going on here? Something's wrong". 0:09:27.389,0:09:32.613 And they went in to see the CEO of the Tenet Healthcare, the people running this hospital 0:09:32.613,0:09:35.654 (which, by the way, was a private hospital), and said: 0:09:35.654,0:09:38.614 "Look, something's wrong, what's going on, what are you going to do about this?" 0:09:38.614,0:09:40.256 And basically they were told: not very much. 0:09:40.256,0:09:44.581 You know, mistakes get made, it's bad luck, don't worry about it, 0:09:44.581,0:09:46.233 we'll look into it, but thank you very much. 0:09:46.763,0:09:51.631 They weren't convinced by this, and eventually they decided to contact the FBI. 0:09:51.631,0:09:53.826 The reason they contacted the FBI, by the way, 0:09:53.826,0:09:56.401 is it's a private healthcare provider in the United States, 0:09:56.401,0:10:00.476 they provide Medicare provision of healthcare to the Federal Government. 
0:10:00.476,0:10:04.202 So, if the Federal Government is getting defrauded, the FBI can get involved. 0:10:04.982,0:10:06.850 The FBI started investigating. 0:10:08.281,0:10:12.081 Eventually it turned out, that hundreds, probably thousands of people 0:10:12.081,0:10:15.854 over a ten or longer year period, had been operated on unnecessarily. 0:10:16.704,0:10:19.561 Most of them had had serious procedures performed on them, 0:10:19.561,0:10:22.189 open heart surgery, some had died as a result. 0:10:22.189,0:10:24.325 Obviously it's quite a serious operation. 0:10:24.325,0:10:27.437 Some people had basically been condemned to a lifetime of pain. 0:10:27.437,0:10:31.437 One of the most traumatic examples was a 36-year-old, he had been cut open, 0:10:31.437,0:10:33.000 which is obviously what happens in open heart surgery, 0:10:33.000,0:10:35.369 and his chest had never knitted back together correctly. 0:10:35.999,0:10:38.125 Basically, he would be in pain for the rest of his life. 0:10:39.395,0:10:43.000 So, hundreds, thousands of people had been harmed. 0:10:43.610,0:10:45.968 One of the interesting things was that in this community 0:10:45.968,0:10:48.159 there was already some suspicion, there were anecdotes. 0:10:48.159,0:10:50.853 I mean, one of the ones I really liked from this book was the story that went: 0:10:50.853,0:10:56.021 'Don't get a flat tyre outside of Redding Medical Centre because you'll end up with a heart bypass.' 0:10:56.021,0:10:57.303 [scattered laughter] 0:10:57.303,0:11:00.258 You know, but the thing was, there was no data. 0:11:00.728,0:11:04.563 People were you know, a bit suspicious, but it was among doctors who knew, 0:11:04.563,0:11:06.867 you know, in the community, and who wants to doubt it. 0:11:06.867,0:11:12.171 And guess what? 
Also, Redding Medical Centre had one of the best mortality rates, 0:11:13.001,0:11:15.350 for cardiac procedures in the United States, 0:11:15.350,0:11:19.609 because if you operate on healthy people, you have a good mortality rate! 0:11:19.619,0:11:21.129 [scattered laughter] 0:11:21.129,0:11:23.390 So, the other thing, though, 0:11:23.390,0:11:25.452 and this is the point that comes to Open Data for me 0:11:25.452,0:11:28.722 the other red flag if you had been looking at the data, 0:11:28.741,0:11:31.927 was these two things: one is incredibly low mortality rate, 0:11:31.927,0:11:35.351 and (B) that it had almost the highest number of procedures 0:11:35.351,0:11:37.464 for the population that it covered in the United States, 0:11:38.144,0:11:39.634 which should be a red flag, right? 0:11:39.634,0:11:42.618 Because, one, it's just a massive outlier on that basis, and also, 0:11:42.618,0:11:45.815 the more people you should be operating on, the more you're doing marginal cases, 0:11:45.815,0:11:49.450 the higher should be your mortality rate unless something very odd is going on. 0:11:50.030,0:11:53.015 The thought was: what if people had been looking at this data? 0:11:53.015,0:11:56.045 What if we'd - if this data had been open and public, 0:11:56.045,0:11:59.517 and not maybe just for particular researchers to look at or the government? 0:11:59.927,0:12:04.129 And it kind of reminded me of a phrase that's very famous in Open Source software, which is: 0:12:04.129,0:12:05.965 "To many eyes, all bugs are shallow". 0:12:05.965,0:12:10.504 What's great about Open Source software is lots of people can look at it, lots of people can fix it. 0:12:10.504,0:12:14.730 And for me, what this was saying was: to many eyes, all anomalies are noticeable. 
0:12:14.730,0:12:16.679 It's somewhat of an exaggeration, 0:12:16.679,0:12:18.908 but what happens if rather than ten or twenty people 0:12:18.908,0:12:22.077 who worked in monitoring Medicare provision in the US government, 0:12:22.077,0:12:23.877 we'd had thousands or millions of people? 0:12:23.877,0:12:26.919 If the local journalists or citizens, who had suspicions, 0:12:26.919,0:12:28.747 had been able to go and look at that data and say: 0:12:28.747,0:12:32.485 "Whoa! What's going on here? This isn't just anecdotes, there's some data". 0:12:34.205,0:12:40.225 And so, and it's not just then, about kind of spotting healthcare errors, or issues, or risks, 0:12:40.225,0:12:42.415 it's also about things like apps and services 0:12:42.415,0:12:43.857 that you can build with Open Data. 0:12:43.867,0:12:46.667 This is a great app built by mySociety in the UK, 0:12:46.667,0:12:47.640 called Mapumental. 0:12:47.640,0:12:48.974 And the question is, I don't know if people know, 0:12:48.974,0:12:50.646 London house prices are very expensive, 0:12:50.646,0:12:52.510 I don't know whether they rival Geneva's, 0:12:52.510,0:12:55.238 but they're, it's a pretty difficult thing. 0:12:55.238,0:12:57.978 And one of the questions was, if I have to work somewhere, 0:12:57.978,0:13:01.752 and I want to know where I can live, and afford, 0:13:01.752,0:13:05.757 and I can commute to work in a certain time, and it's not too ugly, 0:13:05.757,0:13:07.583 this is what this app does. 0:13:07.583,0:13:11.200 You can choose the price, you can say where you're going to work, 0:13:11.200,0:13:14.195 you can choose the commute time, and you can choose the scenicness. 0:13:14.195,0:13:17.167 And it will show you, on this map, where you can live. 0:13:18.427,0:13:20.796 Another example, more about transparency, 0:13:20.796,0:13:22.746 is a project we did called "Where Does My Money Go?". 
0:13:23.976,0:13:25.406 It's an interactive version, 0:13:25.406,0:13:26.211 you can kind of draw it out, 0:13:26.211,0:13:29.114 so what it starts with, is one, is it tells you what your tax is, 0:13:29.114,0:13:30.821 something that most people often don't know, 0:13:30.821,0:13:33.668 and it will tell you how much you're paying each day 0:13:33.668,0:13:36.254 to a particular area of society. 0:13:36.254,0:13:37.328 And the dream for me, 0:13:37.328,0:13:39.127 a dream that we're on the way to realising, 0:13:39.127,0:13:42.817 is in this visualisation, you can drill down into areas. 0:13:42.817,0:13:45.092 And my dream is to keep drilling down. 0:13:45.472,0:13:47.633 So depending on what day we have, I want to go down, 0:13:47.633,0:13:49.628 right down through those bubbles, step by step, 0:13:49.628,0:13:52.403 until I see the money spent on street lights on my street, 0:13:52.403,0:13:55.270 on filling in potholes, on collecting my rubbish. 0:13:56.190,0:13:57.138 And for two reasons: 0:13:57.138,0:13:59.704 One, obviously there's a question, particularly in some countries, 0:13:59.704,0:14:01.016 of inefficiency or corruption, 0:14:01.436,0:14:05.176 but also, just because most of us don't feel very happy about paying tax. 0:14:06.066,0:14:08.157 It's not one of those things people welcome! 0:14:08.157,0:14:09.817 But it's something that we should. 0:14:09.817,0:14:11.960 Government does an awful lot for us, 0:14:11.960,0:14:14.287 and having a better sense of where it's going 0:14:14.287,0:14:17.120 could make us feel an awful lot better about paying that tax. 0:14:17.120,0:14:18.657 In the way that when we go to a restaurant, 0:14:18.657,0:14:21.397 we don't, when we get the bill, we don't necessarily feel bad. 0:14:21.397,0:14:24.334 We feel "Wow, I had a great meal. That was worth it." 0:14:25.274,0:14:26.366 But why Open? 0:14:26.366,0:14:29.079 I've given you examples, and you know, we see a lot of apps and services. 
0:14:29.079,0:14:30.948 Why is Open relevant here? 0:14:31.598,0:14:36.350 This goes back to what I said about the information technology, the revolution. 0:14:36.350,0:14:37.813 So it's the challenge and the opportunity. 0:14:37.813,0:14:42.387 It's the challenge that we see today, is exploding informational complexity. 0:14:42.797,0:14:43.924 I mean, another great story: 0:14:43.924,0:14:47.728 in the 1820s, all bank clearing in the largest financial centre in the world 0:14:47.728,0:14:51.848 was done in a single room, where people -- one person from each bank gathered 0:14:51.848,0:14:56.402 and they'd go round the room pulling out gold, and swapping it around, between different banks. 0:14:56.402,0:14:58.434 And that's how they did bank clearing. 0:14:59.074,0:15:01.991 Today we have billions of transactions a minute. 0:15:01.991,0:15:07.945 And the way we as humans deal with complexity is by dividing and conquering it. 0:15:07.945,0:15:10.683 We split it up into manageable chunks that we deal with. 0:15:11.013,0:15:12.381 The other answer, 0:15:12.381,0:15:14.883 and this answer's particularly relevant about Open Data, 0:15:14.883,0:15:16.219 is information technology. 0:15:16.219,0:15:18.951 Today, a smartphone has as much computing power 0:15:18.951,0:15:22.260 as the system that ran the Apollo moon landings. 0:15:22.260,0:15:24.027 And an even better example is storage: 0:15:24.027,0:15:26.930 one terabyte of storage today is a hundred dollars. 0:15:26.930,0:15:30.297 In 1994, this would have cost 400,000 dollars. 0:15:30.297,0:15:33.977 I can have every financial transaction 0:15:33.977,0:15:38.376 the UK government, or the US government made last year, or even for the last decade, 0:15:38.376,0:15:39.665 on my laptop. 0:15:39.665,0:15:43.543 That was not possible for an average citizen a decade ago. 0:15:44.283,0:15:48.187 So it's mass participation, information access, processing, and production. 0:15:48.187,0:15:49.557 It's decentralisation. 
0:15:49.557,0:15:51.728 And the claim here is that openness is key. 0:15:51.728,0:15:53.957 It's because it's about scaling. 0:15:54.547,0:15:57.399 What we are doing is weaving data together. 0:15:57.399,0:15:59.615 As I said, we deal with complexity by splitting it up. 0:15:59.615,0:16:02.928 We componentise, we split data up into blocks 0:16:02.928,0:16:04.670 that we recombine. 0:16:04.670,0:16:07.201 But if we are going to recombine information, 0:16:07.961,0:16:10.076 we need to put Humpty Dumpty back together again, 0:16:10.076,0:16:12.909 it won't work most of the time if it is closed. 0:16:13.449,0:16:17.039 We need Open Data to scale and to componentise. 0:16:17.679,0:16:21.518 And it's a point just to make here in this respect, that you might think: 0:16:21.518,0:16:23.351 "Well you know, you're talking about Open Data, 0:16:23.351,0:16:24.721 you know, this could be true of anything! 0:16:24.721,0:16:25.789 Why don't we have like, 0:16:25.789,0:16:28.232 Open Cars, and Open Shoes, and you know, 0:16:28.232,0:16:29.578 why don't we just share everything, man! 0:16:29.578,0:16:31.026 It would be so beautiful!". 0:16:31.706,0:16:33.393 Right? And the sad thing is, 0:16:33.393,0:16:39.070 is that that hasn't generally worked as a way of organising most production in our society. 0:16:39.070,0:16:44.074 Instead, we have private property, and so we don't do that much openness relatively. 0:16:44.074,0:16:45.848 But there's something different about digital information. 0:16:45.848,0:16:48.944 We all know it, but it's worth emphasising, which is, it's very cheaply copied. 0:16:49.344,0:16:52.782 I mean, give me a copy of your data isn't a problem if you're the government. 0:16:52.782,0:16:56.393 Give me a copy of your car, or your house, or whatever, is. 0:16:56.803,0:16:58.584 And it's also about innovation here. 0:16:58.584,0:17:01.219 I mean, in a way it's almost the purest aspect of markets. 
0:17:01.219,0:17:05.619 Markets are about moving things to the person who could use them most best. 0:17:06.389,0:17:07.200 And that's true of data. 0:17:07.200,0:17:10.660 The best thing to do with your data will likely be thought of by someone else. 0:17:11.340,0:17:14.973 And vice versa! You will think of the best thing to do with someone else's data. 0:17:15.783,0:17:20.103 And Open Data allows us, in the most frictionless, easiest way, 0:17:20.103,0:17:22.708 to move data to where it can be most optimally used, 0:17:22.728,0:17:23.905 particularly if you're government. 0:17:24.275,0:17:26.843 So in short, it's about better understanding, it's about better government, 0:17:26.843,0:17:29.115 it's about better research, it's about better economy. 0:17:29.115,0:17:31.124 And something also for companies and governments: 0:17:31.124,0:17:32.707 I think it's about better engagement. 0:17:33.137,0:17:34.986 It's about a closer relationship, sometimes, 0:17:34.986,0:17:37.331 between your citizens and you as the government. 0:17:37.331,0:17:40.960 Between you, even possibly, as a company, and your users. 0:17:41.690,0:17:43.972 So I wanted to kind of finish here by saying where we're going. 0:17:43.972,0:17:46.429 The story was, of this talk, was, you know, where are we? 0:17:46.869,0:17:49.654 Why have we got here? And where are we going? 0:17:50.734,0:17:52.248 So one answer is just more use. 0:17:52.248,0:17:55.313 So right now, I just said at the beginning, Open Data is relatively young. 0:17:55.313,0:17:57.844 This vast outpouring, for example, of government data, 0:17:57.844,0:18:02.058 that anyone can freely use, reuse, and redistribute, is really new, 0:18:02.058,0:18:03.382 even if it's done three years ago. 0:18:03.382,0:18:06.479 For example, in the UK, much of the most useful data that could be released 0:18:06.479,0:18:09.091 has only been released in the last six months or a year. 0:18:09.091,0:18:10.478 You want prescription data? 
0:18:10.478,0:18:11.894 Are you a pharmaceutical company, 0:18:11.894,0:18:15.261 and you want to know what kind of prescription habits are going on in the UK? 0:18:15.261,0:18:18.837 I would emphasise: at an anonymised or somewhat aggregate level. 0:18:18.837,0:18:20.707 Do you want to know about what crime is going on? 0:18:20.707,0:18:24.241 Are you building a real estate website and you want data on environment, 0:18:24.241,0:18:25.742 or you want data on unemployment, 0:18:25.742,0:18:28.742 or other information about where properties are situated? 0:18:28.742,0:18:30.077 You can now get that. 0:18:30.687,0:18:32.783 So I think there's going to be a lot more use from business. 0:18:33.743,0:18:35.398 There'll be a lot more use from everyone. 0:18:35.398,0:18:38.686 But I think particularly business is going to wake up to the opportunities here. 0:18:38.686,0:18:40.192 I think it's also going to lead to more data. 0:18:40.192,0:18:41.789 One is, government is going to be more data. 0:18:41.789,0:18:45.597 I think also businesses are going to realise, and communities, 0:18:45.597,0:18:47.719 that they want to share back some of that data, 0:18:47.719,0:18:48.764 some of the data they have. 0:18:48.764,0:18:50.863 It's not going to be their kind of crown jewels, 0:18:50.863,0:18:53.707 and it's not going -- often start out with data that's not core to their business. 0:18:53.707,0:18:58.108 It's like, kind of Nike, they realised that by opening and sharing data, 0:18:58.108,0:19:00.569 they can scale in a way they can't on their own. 0:19:01.029,0:19:03.069 And does it mean that richer data, going back 0:19:03.069,0:19:05.809 -- how could I leave out Hegel and Marx in a talk like this -- 0:19:05.809,0:19:08.869 "Quantity changes quality" as Hegel told us. 0:19:09.319,0:19:14.156 And more data, going back to that woven ball, more data actually means better data. 0:19:14.156,0:19:17.634 It means richer data, it's a qualitative difference in what we can do.
0:19:17.634,0:19:19.781 Geodata on its own isn't that useful. 0:19:19.781,0:19:21.593 Transport data on its own isn't useful. 0:19:21.593,0:19:24.023 Geodata plus transport data is useful! 0:19:24.703,0:19:26.187 And we're going to be seeing data refining. 0:19:26.187,0:19:27.441 Data is the new oil, right? 0:19:27.441,0:19:28.928 So, we're going to refine it. 0:19:28.928,0:19:32.037 And that's going to be a big business: higher quality data. 0:19:32.487,0:19:34.149 I want to leave you with a couple of thoughts. 0:19:34.149,0:19:35.646 So, one is, some people say: 0:19:35.646,0:19:37.527 "Well, okay, but, you know, selling data is big business". 0:19:37.527,0:19:40.987 And it is, but going forward in some of these things like software, 0:19:40.987,0:19:42.485 data is going to be a platform. 0:19:42.485,0:19:43.720 It's not a commodity. 0:19:43.720,0:19:46.084 Businesses built purely on selling data, 0:19:46.084,0:19:47.558 I just don't think are going to make it. 0:19:47.558,0:19:51.668 You need to be building on your data, not attempting to purely sell it. 0:19:52.588,0:19:54.638 And the other answer is to be modest. 0:19:55.278,0:19:56.797 So I said: where are we going? 0:19:56.797,0:19:57.917 I don't know if people know 0:19:57.917,0:20:02.124 -- and this takes us back to an earlier age, an age of electricity and steam -- of Faraday. 0:20:02.124,0:20:04.678 So he's demonstrating electricity at the Royal Society, 0:20:04.678,0:20:08.316 and Gladstone, the future Prime Minister of England, sees him do this stuff, you know, 0:20:08.316,0:20:10.445 the frog legs move, and Gladstone's like: 0:20:10.445,0:20:12.392 "Well, I mean, this is a party trick, Faraday. 0:20:12.392,0:20:15.971 It's great, but, what's really, you know, what's electricity going to amount to?" 0:20:15.971,0:20:20.832 And Faraday says to him: "Well, what's the use of a baby?" 0:20:20.832,0:20:24.299 You know, a baby when it's young is not very useful.
0:20:24.299,0:20:25.718 [scattered laughter] 0:20:25.718,0:20:27.426 But it grows up into something! 0:20:27.916,0:20:29.725 And that is where we are going today. 0:20:29.725,0:20:32.488 We are the beginning of the Open Data journey. 0:20:33.088,0:20:36.157 And partly is, we don't know what it's going to grow up into. 0:20:36.157,0:20:37.004 Thank you very much! 0:20:37.004,0:20:40.558 [Applause] 0:20:40.558,0:20:44.708 [Questioner] Um, citizens and I guess patients at hospitals, 0:20:44.708,0:20:48.683 assume that the institutions have all this data and it's very well organised, 0:20:48.683,0:20:49.925 and it's a question of will. 0:20:50.555,0:20:53.889 Have you encountered cases in which they simply don't have it, 0:20:53.889,0:20:57.268 or they have it, and it's just such a mess that they're too embarrassed to give it out? 0:20:57.268,0:20:58.661 [Rufus Pollock] Absolutely. 0:20:58.661,0:21:00.959 I mean, one story that kind of intrigues me, 0:21:00.959,0:21:03.914 is we've been building this "Where Does My Money Go?" open spending project. 0:21:04.094,0:21:06.948 And one of the things the government mandated was giving out, 0:21:06.948,0:21:09.065 rather than just high-level financial information, 0:21:09.065,0:21:11.445 giving out information at a detailed level, you know, 0:21:11.445,0:21:12.838 so they now publish, for example, 0:21:12.838,0:21:15.601 spending data from each government department monthly, 0:21:15.601,0:21:17.713 every transaction within 5,000 pounds (check). 0:21:17.713,0:21:22.119 Every purchase they make, every mobile phone provider they contract with, we get that data. 0:21:22.119,0:21:25.590 And one of the intriguing things, of their mandating this, was it turned out, 0:21:25.590,0:21:30.022 before, they had no way, before they did this, of actually seeing, on any regular basis, 0:21:30.022,0:21:31.232 what their department spent money on. 
0:21:31.232,0:21:35.029 Because in fact, the only thing they reported up on to, in central government to Treasury, 0:21:35.029,0:21:38.871 was kind of like, how much did you spend against Project X that you were allocated budget for? 0:21:38.871,0:21:41.156 You know, departments, were actually really intrigued, they [say]: 0:21:41.156,0:21:43.435 "Oh, that other department's going with Vodafone, 0:21:43.435,0:21:46.349 and we're with Orange, and look how much they're paying per month!" 0:21:46.349,0:21:49.310 So I think in essence, it is really driving changes in government, 0:21:49.310,0:21:52.700 and yeah, there are people, I think you'd been worried about giving out data quality. 0:21:52.700,0:21:54.841 I was just talking to the Department of Education last week and they said 0:21:54.841,0:21:57.650 -- you know, one of the things -- they had financial information from schools, 0:21:57.650,0:21:59.567 and which they were slowly being mandated to publish. 0:21:59.567,0:22:01.220 And schools are suddenly all ringing up, saying: 0:22:01.220,0:22:04.290 "Well we never really bothered to really update that information to be accurate! 0:22:04.290,0:22:05.900 Uh, we really want to do it right now". 0:22:06.210,0:22:08.164 So I think that definitely does happen, yep. 0:22:08.164,0:22:12.460 [Questioner] Are you seeing now new roles in government, to help facilitate this? 0:22:13.099,0:22:16.009 [Rufus Pollock] Yeah. I mean, to take another example, I, sorry. 0:22:16.009,0:22:19.492 Both in government, so the UK government has a transparency kind of 'czar' if you like. 0:22:19.492,0:22:22.812 Also I learnt, is Nike hired an Open Data evangelist. 0:22:22.812,0:22:24.611 One of the things they, while they were implementing this programme, 0:22:24.611,0:22:27.990 they actually hired explicitly, an Open Data evangelist. 0:22:27.990,0:22:30.023 So yeah, I think we are, we're definitely seeing this in government. 
0:22:30.023,0:22:32.357 Both in the tech level, but also at the policy level. 0:22:32.357,0:22:34.729 And I think it's not just government, 0:22:34.729,0:22:37.957 it will also be companies doing this, and so on, who will be saying: 0:22:37.957,0:22:39.192 "We need an Open Data expert. 0:22:39.192,0:22:43.662 We need to be aware of what's going on here and be able to plan it as part of our strategy." 0:22:44.472,0:22:45.489 [Questioner] A final question. 0:22:45.489,0:22:48.972 You mentioned that, kind of outsourcing, almost, some of this data refining, 0:22:48.972,0:22:51.306 outside government or the big institutions, has helped them. 0:22:51.306,0:22:54.446 Can you tell us any stories of kind of gratitude being expressed by the government? I mean... 0:22:55.146,0:22:57.247 [Rufus Pollock] Well, I mean, to kind of, yeah. 0:22:57.247,0:22:58.773 I mean there was an interesting example actually 0:22:58.773,0:23:01.792 where we had some complaint because the open spending data I told you about 0:23:01.792,0:23:05.323 where we're aggregating the government spending and financial data 0:23:05.323,0:23:10.651 -- you know, the site had a few performance issues, occasionally, as we loaded more data in. 0:23:10.651,0:23:12.755 I remember kind of getting this call kind of going: 0:23:12.755,0:23:16.266 "Well, you know, we're a little bit upset, you know, data.gov.uk," 0:23:16.266,0:23:19.133 and it turned out the reason was, the Treasury kept looking at this data, 0:23:19.133,0:23:21.002 and they were annoyed when the site was going down. 0:23:21.002,0:23:22.563 So that was really intriguing to me 0:23:22.563,0:23:26.139 that we were kind of one of the best, at least, up-to-date aggregators out there. 0:23:26.139,0:23:29.484 Um, I think you are already seeing people doing stuff with the data 0:23:29.484,0:23:30.979 and kind of doing stuff, sometimes for free, you know. 0:23:31.289,0:23:33.131 You don't have to have the shiny front-end. 
0:23:33.131,0:23:35.019 I mean, one of the things we went about, on about, 0:23:35.019,0:23:36.493 I know Tim Berners-Lee went on about -- 0:23:36.493,0:23:41.393 raw data now, you know, you can build fewer shiny front-ends, and just release raw data. 0:23:41.393,0:23:46.455 And you know, someone else will help you build the app, the front-end, the interface, 0:23:46.455,0:23:47.762 and help you innovate about it. 0:23:47.762,0:23:50.917 What is the best way to provide healthcare data to citizens, 0:23:50.917,0:23:52.254 or education data to citizens, 0:23:52.254,0:23:54.367 so they make better and more informed choices? 0:23:54.367,0:23:56.608 I don't know, and the government probably doesn't know. 0:23:56.608,0:23:59.155 But somewhere out there, someone is going to innovate 0:23:59.155,0:24:02.672 and really provide the best way for us to deliver that kind of information to citizens. 0:24:02.672,0:24:04.250 [Questioner] Thank you very much. 0:24:04.250,0:24:05.296 [Rufus Pollock] Thank you. 0:24:05.296,0:24:06.901 [Applause] 0:24:06.901,0:24:09.423 lift _ Video Production ACTUA 0:24:09.423,0:24:11.311 Copyright (c) 2012 Lift conference