WEBVTT 00:00:01.014 --> 00:00:04.185 (lift) 00:00:04.185 --> 00:00:07.244 (lift 12 - Feb 24 2012 - Geneva) 00:00:07.244 --> 00:00:10.121 (Rufus Pollock - Stories) 00:00:10.121 --> 00:00:11.788 [Rufus Pollock] Just to say for those of you who don't know: 00:00:11.788 --> 00:00:13.666 the Open Knowledge Foundation is a non-profit -- not for profit 00:00:13.666 --> 00:00:15.611 founded in 2004 00:00:15.611 --> 00:00:17.865 and which builds tools and communities 00:00:17.865 --> 00:00:20.934 to create, use and share open information 00:00:20.934 --> 00:00:24.585 and that's information that anyone can use, reuse and redistribute. 00:00:24.585 --> 00:00:28.321 And as such, we've been working on open data for quite a long time 00:00:28.321 --> 00:00:30.011 since we started in 2004. 00:00:30.011 --> 00:00:34.817 And today, I want to start the story by going back in time 5000 years, 00:00:34.817 --> 00:00:37.610 to ancient Mesopotamia. 00:00:37.610 --> 00:00:41.393 There, between the Tigris and the Euphrates rivers, 00:00:41.393 --> 00:00:43.714 flourished the Sumerian civilization. 00:00:43.714 --> 00:00:46.729 And they were confronted by a problem. 00:00:46.729 --> 00:00:50.269 They were confronted by the limitations of human memory 00:00:50.269 --> 00:00:53.985 in the recording of taxes, food and other goods. 00:00:53.985 --> 00:00:59.473 And those ancient civil servants and businessmen hit on a novel solution: 00:00:59.473 --> 00:01:04.237 What they decided to do was they would start counting things with small clay chits, 00:01:04.237 --> 00:01:08.819 which they would bake inside of a clay -- a little clay box 00:01:08.819 --> 00:01:12.617 and then mark, on the outside of that box, what they were counting. 00:01:12.617 --> 00:01:15.027 You know, was it grain, was it tax payments, whatever. 00:01:15.027 --> 00:01:19.786 And so, born out of necessity for a state and a society, 00:01:19.786 --> 00:01:25.081 came one of the great information technology revolutions of all time: writing. 00:01:25.081 --> 00:01:28.172 The Sumerians invented writing via cuneiform. 00:01:28.172 --> 00:01:34.039 And if we fast-forward from that a few thousand years, we come to the UK census. 00:01:34.039 --> 00:01:38.393 Again, it's always interesting that states, governments are often at the forefront 00:01:38.393 --> 00:01:42.681 of at least driving information technology and information systems innovations. 00:01:42.681 --> 00:01:45.657 The UK census: again, the state, this is during the Napoleon Wars, 00:01:45.657 --> 00:01:48.601 desire to count the population more accurately: 00:01:48.601 --> 00:01:50.626 and we have the first UK census in 1801. 00:01:50.626 --> 00:01:55.482 And in the US, they also had censuses, in fact starting in 1790. 00:01:55.482 --> 00:01:59.153 And one of the problems encountered in the 1880 census 00:01:59.153 --> 00:02:01.146 was they tabulated the census by hand. 00:02:01.146 --> 00:02:06.746 And by the 1880 census, it was taking seven years to tabulate the census. 00:02:06.746 --> 00:02:12.407 So after it got taken in 1880, it wasn't until 1887 they actually had any data they could use. 00:02:12.407 --> 00:02:15.513 And they calculated that for the next census in 1890, 00:02:15.513 --> 00:02:18.042 they wouldn't be finished by 1900. 00:02:18.042 --> 00:02:21.736 They still wouldn't have the results of the census by the time they started the next one. 00:02:21.736 --> 00:02:24.233 They had a crisis of information technology. 00:02:24.233 --> 00:02:26.810 And what they went and did is they commissioned Herman Hollerith 00:02:26.810 --> 00:02:29.378 to build the first automatic tabulator. 00:02:29.378 --> 00:02:32.466 And for those of you who know your company history, of course, 00:02:32.466 --> 00:02:35.113 Herman Hollerith's company went on to be one of the founders, 00:02:35.113 --> 00:02:37.948 if you like, one of the companies that came and created IBM. 00:02:37.948 --> 00:02:43.797 And IBM, by the sixties, were building this -- they replaced those hand -- 00:02:43.797 --> 00:02:46.605 those kind of wooden, mechanical tabulators with this stuff: 00:02:46.605 --> 00:02:50.283 digital tabulators, the modern computer of this age 00:02:50.283 --> 00:02:52.610 and again, much of this -- I don't know if you guys know -- 00:02:52.610 --> 00:02:53.705 IBM would have gone bankrupt 00:02:53.705 --> 00:02:58.154 if it hadn't been for Franklin Roosevelt passing the Social Security Act in the States, 00:02:58.154 --> 00:03:01.132 which necessitated a huge amount of new tabulation. 00:03:01.132 --> 00:03:04.276 So, again, a lot of innovation in this space came out of government need 00:03:04.276 --> 00:03:06.370 and also, of course, the nuclear program, 00:03:06.370 --> 00:03:08.518 the other great needer of computational power. 00:03:08.518 --> 00:03:15.211 And today, today, we find ourselves again in the midst of a revolution. 00:03:15.211 --> 00:03:18.962 It's a revolution driven by two needs: 00:03:18.962 --> 00:03:21.443 ones that have been the same throughout history as I've just shown, 00:03:21.443 --> 00:03:23.810 information complexity, which is the necessity, 00:03:23.810 --> 00:03:27.345 and information technology, which is the opportunity. 00:03:27.345 --> 00:03:32.441 And what we're doing in this case is a policy innovation, if you like. 00:03:32.441 --> 00:03:36.361 We are innovating by opening up information. 00:03:36.361 --> 00:03:40.945 So just take the obvious example, government, as I said often the innovator. 00:03:40.945 --> 00:03:46.330 In the last 3 years ago, you go back 3 years, there's almost no open government data initiatives in the world. 00:03:46.330 --> 00:03:52.857 Today there are dozens. The UK, the US, Finland, Kenya, The Netherlands, and there's new ones almost every week. 00:03:52.857 --> 00:04:02.001 There's been a launch of an official kind of movement as apart of the UN called the Open Government Partnership in which countries sign up, and among other things, they open up their data. 00:04:02.001 --> 00:04:08.833 And of course, it's been, in the UK and other countries, Tim Berners-Lee has been involved. I've helped advise the government around this in the UK. 00:04:08.833 --> 00:04:11.545 But it's not just government. 00:04:11.545 --> 00:04:23.858 It's also companies. Companies are opening up data. Very interestingly, last year, Nike started an open data initiative there to open up supply chain and sustainability data, for themselves and also for their suppliers. 00:04:23.858 --> 00:04:34.830 Which I think is a very interesting change. And it's also communities. Often, in fact, back there in the beginning, this incredible map that you saw in an earlier slide, is a OpenStreetMap activity, around the world. 00:04:34.830 --> 00:04:47.971 People adding to this crowd-built map of the world. And in the last 6 years, OpenStreetMap, from a bottom-up community, have built a complete, comprehensive, map of the world, of fully open data. 00:04:47.971 --> 00:04:54.002 So I've just gone on about Open Data, and one thing I'm aware of, of this audience, is you might not all know what it is. 00:04:54.002 --> 00:04:59.152 So I'm going to take a brief moment, a brief moment, to say what it is. 00:04:59.152 --> 00:05:05.414 What does it mean when I say 'open'? And was it, you know, what's different from anything else? What's different from simply public data? 00:05:05.414 --> 00:05:22.794 So there's actually a definition, a definition we the Open Knowledge Foundation helped write, it's very simple. In a nutshell, a piece of information, a piece of data, is open if anyone is free to use, reuse, and redistribute it subject only at most to a requirement to attribute and share alike. 00:05:22.794 --> 00:05:25.784 And anyone means anyone! 00:05:25.784 --> 00:05:34.430 It doesn't mean there can't be any commercial restrictions. You can't say: hey, here's this data, but only people using it for non-commercial purposes. Or only people working in education. 00:05:34.430 --> 00:05:40.678 Or only people living in the developing world, or the developed world. There can't be any restrictions like that. 00:05:40.678 --> 00:05:48.566 And there's a reason for this, by the way, and it isn't just because one's obsessed about if you like, trademarking an attractive term. 00:05:48.566 --> 00:05:51.291 It's because it's about interoperability. 00:05:51.291 --> 00:06:03.316 One of my experiences at this conference, which I remember from previous trips to Geneva, is I've been unable to plug in my laptop! Even though I have a French adaptor, in fact, these wonderful Swiss plugs here, are you know, these wonderful, small octagonal shape. 00:06:03.316 --> 00:06:12.198 And even with my adaptor I can't plug in. Right? And it's called interoperability. When we travel around to different countries, our power adaptors don't actually fit in. We have to buy something. 00:06:12.198 --> 00:06:18.094 And the point about this definition, and the point about caring about Open Data, is about interoperability. 00:06:18.094 --> 00:06:25.974 The dream of Open Data is interoperability. Of seamlessly being able to share and interweave information. 00:06:25.974 --> 00:06:36.558 And if every time I get information from two different people I have to consult a lawyer, I have to work out whether I'm allowed to do it, whether I'm allowed to put these things together, we lose that dream. 00:06:36.558 --> 00:06:42.166 That dream is shattered. And the key point is, this definition, and those conditions, ensure interoperability. 00:06:42.166 --> 00:06:48.083 If you comply with them, we know that any piece of info, of Open Data, will work with any other piece of Open Data. 00:06:48.083 --> 00:06:56.021 And also, it's worth saying for a quick moment, what kind of data. And to emphasize a point. Just to foreclose those kinds of questions, otherwise I always get asked. 00:06:56.021 --> 00:07:03.974 When we talk about opening up data, in general, we're not talking about personal data. We're not talking about opening up your private health records. 00:07:03.974 --> 00:07:19.680 Or opening up your personal tax information. We're talking about information that is non-personal in nature. And for the government for example: transport, geodata, statistics, electoral, legal. Stuff that the UK has, in fact, for example been opening up over the last few years. 00:07:19.680 --> 00:07:30.418 This financial information, on government spending, this information on health outcomes, on prescriptions. This information on educational outcomes, this information on the law. There's information -- statistical information. 00:07:30.418 --> 00:07:33.146 That's the kind of thing that we're talking about. 00:07:33.146 --> 00:07:39.142 Now, I want to say... because in this story, we have this story, this story over time. But why governments are doing it now? 00:07:39.142 --> 00:07:47.228 And why Open Data? So, okay, for thousands of years, governments innovate, but why do they innovate at this particular moment and in this way? 00:07:47.228 --> 00:07:54.912 So I want to start here with a quick story. A story of medicine gone wrong. Just from a great book by a guy called Stephen Klaidman. 00:07:54.912 --> 00:07:59.790 It's in fact one of the things that made me think quite deeply about this: why I was interested in Open Data. 00:07:59.790 --> 00:08:15.032 In that picture there, you can see what was the Redding Medical Centre in Northern California. There, in 2002, in the summer of 2002, John Corapi, in typical American style, an ex-accountant from Vegas turned Catholic priest, 00:08:15.032 --> 00:08:17.602 [scattered laughter] 00:08:17.602 --> 00:08:22.408 ...arrived at the Redding Medical Centre having been referred by his doctor for having chest pains. 00:08:22.408 --> 00:08:34.910 He had a cardiogram by the local cardiologist and was told that he needed an immediate heart bypass. That he was at serious risk, and that he should come back later that day, or at the latest, tomorrow, to have open heart surgery. 00:08:34.910 --> 00:08:41.191 Rather shocked, and dazed by this news, he returned home to pack his bags in order to return to hospital. 00:08:41.191 --> 00:08:52.504 He called up his best friend, who was still an accountant in Vegas, whose partner was a hospital nurse. And who advised him that he should get a second opinion. 00:08:52.504 --> 00:09:00.135 That, according to his partner, it was not, you know, it was very unusual that you would need to have immediate open heart surgery, and that he should get a second opinion. 00:09:00.135 --> 00:09:11.503 Rather doubtful about this, because he was extremely worried, he did get on a plane. He went to Vegas, he got seen by another specialist... who, to his complete surprise told him there was nothing wrong with his heart. 00:09:11.503 --> 00:09:18.463 He saw another specialist, just to make sure. They told him also, there was nothing wrong with his heart. 00:09:18.463 --> 00:09:27.367 Relieved, and rather, you know, happy, he returned home and just wanted to really forget about it. But his friend said: "No, what's going on here? Somethings wrong". 00:09:27.367 --> 00:09:38.621 And they went in to see the CEO of Tenet Healthcare, the people running this hospital (which, by the way, was a private hospital). And said: "Look, something's wrong, what's going on, what are you going to do about this?" 00:09:38.621 --> 00:09:46.337 And basically they were told: not very much. You know, mistakes get made, it's bad luck, don't worry about it, we'll look into it, but thank you very much. 00:09:46.337 --> 00:10:00.426 They weren't convinced by this, and eventually they decided to contact the FBI. The reason they contacted the FBI, by the way, is it was a private healthcare provider in the United States, they provide Medicare provision of healthcare to the Federal Government. 00:10:00.426 --> 00:10:04.792 So, if the Federal Government is getting defrauded, the FBI can get involved. 00:10:04.792 --> 00:10:07.071 The FBI started investigating. 00:10:07.071 --> 00:10:16.105 Eventually it turned out, that hundreds, probably thousands of people over a ten, or longer year period, had been operated on unnecessarily. 00:10:16.105 --> 00:10:24.350 Most of them had had serious procedures performed on them, open heart surgery, some had died as a result. Obviously it's quite a serious operation. 00:10:24.350 --> 00:10:35.391 Some people had basically been condemned to a lifetime of pain, one of the most traumatic examples was a 36-year-old, he had been cut open, which is obviously what happens in open heart surgery, and his chest had never knitted back together correctly. 00:10:35.391 --> 00:10:38.463 Basically, he would be in pain for the rest of his life. 00:10:38.463 --> 00:10:55.911 So, hundreds, thousands of people had been harmed. One of the interesting things was that in this community there was already some suspicion, there were anecdotes, I mean, one of the ones I really liked from this book was the story that went: don't get a flat tyre outside of Redding Medical Centre because you'll end up with a heart bypass. 00:10:55.911 --> 00:10:57.583 [scattered laughter] 00:10:57.583 --> 00:11:06.690 You know, but the thing was, there was no data. People were you know, a bit suspicious, but it was among doctors, and you, you know, in the community, and who wants to doubt it. 00:11:06.690 --> 00:11:19.737 And guess what? Also, Redding Medical Centre had one of the best mortality rates, for cardiac procedures in the United States, because if you operate on healthy people, you have a good mortality rate! 00:11:19.737 --> 00:11:21.129 [scattered laughter] 00:11:21.129 --> 00:11:37.603 So, the other thing, though, and this is the point that comes to Open Data for me, the other red flag if you had been looking at the data, was these two things: one is incredibly low mortality rate, and (B) that it had almost the highest number of procedures for the population that it covered in the United States. 00:11:37.603 --> 00:11:49.559 Which should be a red flag, right? Because, one, it's just a massive outlier on that basis, and also, the more people you should be operating on, the more you're doing marginal cases, the higher should be your mortality rate unless something very odd is going on. 00:11:49.559 --> 00:11:59.783 The thought was: what if people had been looking at this data? What if we'd - if this data had been open and public? And not maybe just for particular researchers to look at or the government. 00:11:59.783 --> 00:12:12.041 And it kind of reminded me of a phrase that's very famous in Open Source software, which is: "to many eyes, all bugs are shallow". What's great about Open Source software is lots of people can look at it, lots of people can fix it. 00:12:12.041 --> 00:12:23.753 And for me, what this was saying was: to many eyes, all anomalies are noticeable. It's somewhat of an exaggeration, but what happens if rather than ten or twenty people who worked in monitoring Medicare provision in the US government, we'd had thousands or millions of people? 00:12:23.753 --> 00:12:33.161 If the local journalists or citizens, who had suspicions, had been able to go and look at that data and say: "Whoa! What's going on here? This isn't just anecdotes, there's some data". 00:12:33.161 --> 00:12:42.269 And so, and it's not just then, about kind of spotting healthcare errors, or issues, or risks, it's also about things like apps and services. 00:12:42.269 --> 00:12:44.007 That you can build with Open Data. 00:12:44.007 --> 00:12:54.991 This is a great app built by mySociety in the UK called Mapumental. And the question is, I don't know if people know, London house prices are very expensive, I don't know whether they rival Geneva's, but they're, it's a pretty difficult thing. 00:12:54.991 --> 00:13:07.203 And one of the questions was, if I have to work somewhere, and I want to know where I can live, and afford, and I can commute to work in a certain time, and it's not too ugly, this is what this app does. 00:13:07.203 --> 00:13:17.307 You can choose the price, you can say where you're going to work, you can choose the commute time, and you can choose the scenicness. And it will show you, on this map, where you can live. 00:13:17.307 --> 00:13:35.936 Another example, more about transparency, is a project we did called "Where Does My Money Go?". It's an interactive version, you can kind of draw it out, so what it starts with, is one, is it tells you what your tax is, something that most people often don't know. And it will tell you how much you're paying each day to a particular area of society. 00:13:35.936 --> 00:13:45.320 And the dream for me, a dream that we're on the way to realising, is in this visualisation, you can drill down into areas. And my dream is to keep drilling down. 00:13:45.320 --> 00:13:55.114 So depending on what day we have, I want to go down, right down through those bubbles, step by step, until I see the money spent on street lights on my street. On filling in potholes. On collecting my rubbish. 00:13:55.114 --> 00:14:05.820 And for two reasons: One, obviously there's a question, particularly in some countries, of inefficiency or corruption, but also, just because most of us don't feel very happy about paying tax. 00:14:05.820 --> 00:14:17.141 It's not one of those things people welcome! But it's something that we should. Government does an awful lot for us, and having a better sense of where it's going could make us feel an awful lot better about paying that tax. 00:14:17.141 --> 00:14:24.887 In the way that when we go to a restaurant, we don't, when we get the bill we don't necessarily feel bad. We feel "Wow, that was a great meal. That was worth it." 00:14:24.887 --> 00:14:31.240 But why Open? I've given you examples, and you know, we see a lot of apps and services. Why is Open relevant here? 00:14:31.240 --> 00:14:42.055 This goes back to what I said about the information technology, the revolution. So it's the challenge and the opportunity. It's the challenge that we see today, is exploding informational complexity. 00:14:42.055 --> 00:14:56.434 I mean, another great story, in the 1820s, all bank clearing in the largest financial centre in the world was done in a single room, where one person from each bank gathered and they'd go round the room pulling out gold, and swapping it around, between different banks. 00:14:56.434 --> 00:14:59.256 And that's how they did bank clearing. 00:14:59.256 --> 00:15:10.507 Today we have billions of transactions a minute. And the way we as humans deal with complexity is by dividing and conquering it. We split it up into manageable chunks that we deal with. 00:15:10.507 --> 00:15:22.079 The other answer, and this answer's particularly relevant about Open Data, is information technology. Today, a smartphone has as much computing power as the system that ran the Apollo moon landings. 00:15:22.079 --> 00:15:39.967 And in an even better example of storage, one terabyte of storage today is a hundred dollars. In 1994, this would have cost 400,000 dollars. I can have every financial transaction the UK government, or the US government made last year, or even for the last decade, on my laptop. 00:15:39.967 --> 00:15:44.490 That was not possible for an average citizen a decade ago. 00:15:44.490 --> 00:15:51.778 So it's mass participation information access, processing, and production. It's decentralisation. And the claim here is that openness is key. 00:15:51.778 --> 00:16:13.208 It's because it's about scaling. What we are doing is weaving data together. Like I said, we deal with complexity by splitting it up, we componentise, we split data up into blocks that we recombine. But if we are going to recombine information, we need to put Humpty Dumpty back together again, it won't work most of the time if it is closed. 00:16:13.208 --> 00:16:17.292 We need Open Data to scale and to componentise. 00:16:17.292 --> 00:16:31.262 And it's a point just to make here in this respect, that you might think: "Well you know, you're talking about Open Data, you know, this could be true of anything! Why don't we have like, Open Cars, and Open Shoes, and you know, why don't we just share everything, man! It would be so beautiful!". 00:16:31.262 --> 00:16:38.750 Right? And the sad thing is, is that that hasn't generally worked as a way of organising most production in our society. 00:16:38.750 --> 00:16:48.843 Instead, we have private property, and so we don't do that much openness relatively. But there's something different about digital information. We all know it, but it's worth emphasising, which is, it's very cheaply copied. 00:16:48.843 --> 00:16:56.689 I mean, give me a copy of your data isn't a problem if you're the government. Give me a copy of your car, or your house, or whatever, is. 00:16:56.689 --> 00:17:05.488 And it's also about innovation here. I mean, in a way it's almost the purest aspect of markets. Markets are about moving things to the person who could use them most best. 00:17:05.488 --> 00:17:10.900 And that's true of data. The best thing to do with your data will likely be thought of by someone else. 00:17:10.900 --> 00:17:22.968 And vice versa! You will think of the best thing to do with someone else's data. And Open Data allows us, in the most frictionless, easiest way, to move data to where it can be most optimally used. 00:17:22.968 --> 00:17:24.416 Particularly, if you're government. 00:17:24.416 --> 00:17:29.277 So in short, it's about better understanding. It's about better government. It's about better research. It's about better economy. 00:17:29.277 --> 00:17:37.240 And something also for companies and governments, I think it's about better engagement. It's about a closer relationship, sometimes, between your citizens and you as the government. 00:17:37.240 --> 00:17:40.600 Between you, even possibly, as a company, and your users. 00:17:40.600 --> 00:17:43.972 So I wanted to kind of finish here by saying where we're going. 00:17:43.972 --> 00:17:50.017 The story was, of this talk, was, you know, where are we? Why have we got here? And where are we going? 00:17:50.017 --> 00:18:03.287 So one answer is just more use. So right now, I just said at the beginning, Open Data is relatively young. This vast outpouring, for example, of government data, that anyone can freely use, reuse, and redistribute, is really new. Even if it's done three years ago. 00:18:03.287 --> 00:18:08.959 For example, in the UK, much of the most useful data that could be released has only been released in the last six months or a year. 00:18:08.959 --> 00:18:19.160 You want prescription data? Are you a pharmaceutical company? And you want to know what kind of prescription habits are going on in the UK? I would emphasise: at an anonymised or somewhat aggregate level. 00:18:19.160 --> 00:18:28.420 Do you want to know about what crimes are going on? Are you building a real estate website and you want data on environment, or you want data on unemployment, or other information about where properties are situated? 00:18:28.420 --> 00:18:35.351 You can now get that. So I think there's going to be a lot more use from business. There'll be a lot more use from everyone. 00:18:35.351 --> 00:18:38.446 But I think particularly business is going to wake up to the opportunities here. 00:18:38.446 --> 00:18:48.666 I think it's also going to lead to more data. One is, government is going to be more data. I think also businesses are going to realise, and communities, that they want to share back some of that data, some of the data they have. 00:18:48.666 --> 00:19:00.990 It's not going to be their kind of crown jewels, and it's not going to, it's often going to start out with data that's not core to their business. Right? It's kind of like Nike, they realised that by opening and sharing data, they can scale in a way they can't on their own. 00:19:00.990 --> 00:19:14.017 And does it mean that richer data, going back -- how could I leave out Hegel and Marx in a talk like this -- "Quantity changes quality" as Hegel told us. And more data, going back to that woven ball, more data actually means better data. 00:19:14.017 --> 00:19:24.040 It means richer data, it's a qualitative difference in what we can do. Geodata on it's own isn't that useful. Transport data on it's own isn't useful. Geodata plus transport data is useful! 00:19:24.040 --> 00:19:31.912 And we're going to be seeing data refining. Data is the new oil, right? So, we're going to refine it. And that's going to be a big business. Higher quality data. 00:19:31.912 --> 00:19:42.440 And I want to leave you with a couple of thoughts. So, one is: Some people say, "well, okay, but, you know, selling data is big business". And it is, but going forward in some of these things like software, data is going to be a platform. 00:19:42.440 --> 00:19:47.548 It's not a commodity. Businesses built purely on selling data, I just don't think I'm going to make it. 00:19:47.548 --> 00:19:52.067 You need to be building on your data, not attempting to purely sell it. 00:19:52.067 --> 00:20:02.223 And the other answer is to be modest. So I said: where are we going? I don't know if people know, and this takes us back to an earlier age, an age of electricity and steam, of Faraday. 00:20:02.223 --> 00:20:16.694 So he's demonstrating electricity at the Royal Society, and Gladstone, the future Prime Minister of England, sees him do this stuff, you know, the frog legs move, and Gladstone's like: "well, I mean, this is party trick, Faraday, it's great, but, what's really, you know, what's electricity going to amount to?" 00:20:16.694 --> 00:20:20.832 And Faraday says to him: "Well, what's the use of a baby?" 00:20:20.832 --> 00:20:23.739 You know, a baby when it's young is not very useful. 00:20:23.739 --> 00:20:25.718 [scattered laughter] 00:20:25.718 --> 00:20:36.165 But it grows up into something! And that is where we are going today. We are the beginning of the Open Data journey. And partly is, we don't know what it's going to grow up into. 00:20:36.165 --> 00:20:37.774 Thank you very much! 00:20:37.774 --> 00:20:40.558 [Applause] 00:20:40.558 --> 00:20:57.332 QUESTIONER: Um, citizens and um, I guess patients in hospitals, assume that the institutions have all this data and it's very well organised, and it's a question of will. Have you encountered cases in which they simply don't have it, or they have it, and it's just such a mess that they're too embarrassed to give it out? 00:20:57.332 --> 00:21:17.527 RUFUS: Absolutely. I mean, one story that kind of intrigues me, is we've been building this "Where Does My Money Go?" open spending project. And one of the things the government mandated was giving out, rather than just high-level financial information, giving out information at a detailed level. You know, so they now publish, for example, spending data from each government department monthly, every transaction over 25,000 pounds. 00:21:17.527 --> 00:21:31.130 Every purchase they make, every mobile phone provider they contract with, we get that data. And one of the intriguing things, of their mandating this, was it turned out, before, they had no way, before they did this, of actually seeing, on any regular basis, what their department spent money on. 00:21:31.130 --> 00:21:38.839 Because in fact, the only thing they reported up onto, in central government to Treasury, was kind of like, how much did you spend against Project X that you were allocated budget for? 00:21:38.839 --> 00:21:46.454 You know, departments, were actually really intrigued, they say "Oh, well that other department's going with Vodafone, and we're with Orange, and look how much they're paying per month!" 00:21:46.454 --> 00:21:59.519 So I think in essence it is really driving changes in government, and yeah, there are people, I think you'd been worried about giving out data quality. I was just talking to the Department of Education last week and they said -- you know, one of the things -- they had financial information from schools, and which they were slowly being mandated to publish. 00:21:59.519 --> 00:22:05.974 And schools are suddenly all ringing up, saying: "Well we never really bothered to really update that information to be accurate! Uh, we really want to do it right now". 00:22:05.974 --> 00:22:08.659 So I think that definitely does happen, yep. 00:22:08.659 --> 00:22:12.617 QUESTIONER: Are you seeing now new roles in government, to help facilitate this? 00:22:12.617 --> 00:22:27.580 RUFUS: Yeah. I mean, to take another example, I, sorry. Both in government, so the UK government has a transparency kind of 'czar' if you like. Also I learnt, is Nike hired an Open Data evangelist. One of the things they, while they were implementing this programme, they actually hired explicitly, an Open Data evangelist. 00:22:27.580 --> 00:22:32.406 So yeah, I think we are, we're definitely seeing this in government. Both in the tech level, but also at the policy level. 00:22:32.406 --> 00:22:43.837 And I think it's not just government, it will also be companies doing this, and so on, who will be saying: "We need an Open Data expert. We need to be aware of what's going on here and be able to plan it as part of our strategy." 00:22:43.837 --> 00:22:55.085 QUESTIONER: A final question. You mentioned that, kind of outsourcing, almost, some of this data refining, outside government or the big institutions, has helped them. Can you tell us any stories of kind of gratitude being expressed by the government? 00:22:55.085 --> 00:23:10.447 RUFUS: Well, I mean, to kind of, yeah. I mean there was an interesting example actually where we had some complaint because the open spending data I told you about, where we're aggregating the government spending and financial data -- you know, the site had a few performance issues, occasionally, as we loaded more data in. 00:23:10.447 --> 00:23:20.864 I remember kind of getting this call kind of going : "Well, you know, we're a little bit upset". You know, data.gov.uk, and it turned out the reason was, the Treasury kept looking at this data, and they were annoyed when the site was going down. 00:23:20.864 --> 00:23:25.878 So that was really intriguing to me, that we were kind of one of the best, at least, up-to-date aggregators out there. 00:23:25.878 --> 00:23:33.184 Um, I think you are already seeing people doing stuff with the data and kind of doing stuff, sometimes for free. And you don't have to have the shiny front-end. 00:23:33.184 --> 00:23:46.246 I mean, one of the things, we went about, we went on about, I know Tim Berners-Lee went on about -- raw data now, you know, you can build fewer shiny front-ends, and just release raw data. And you know, someone else will help you build the app, the front-end, the interface. 00:23:46.246 --> 00:23:54.122 And help you innovate about it. What is the best way to provide healthcare data to citizens, or education data to citizens, so they make better and more informed choices? 00:23:54.122 --> 00:23:56.767 I don't know, and the government probably doesn't know. 00:23:56.767 --> 00:24:02.799 But somewhere out there, someone is going to innovate and really provide the best way for us to deliver that kind of information to citizens. 00:24:02.799 --> 00:24:04.250 QUESTIONER: Thank you very much. 00:24:04.250 --> 00:24:05.496 RUFUS: Thank you. 00:24:05.496 --> 00:24:06.901 [Applause] 00:24:06.901 --> 00:24:09.423 lift _ Video Production ACTUA 00:24:09.423 --> 00:24:11.751 Copyright (c) 2012 Lift conference