1 00:00:06,292 --> 00:00:10,018 Well, it's almost time to begin the presentation. 2 00:00:11,598 --> 00:00:17,114 We will begin this last session with a presentation on WikiCite, 3 00:00:17,883 --> 00:00:22,029 led by Elizabeth Seiver, Simon Cobb, and Liam Wyatt. 4 00:00:22,029 --> 00:00:26,280 And I'll just let you introduce yourself. 5 00:00:27,129 --> 00:00:30,288 Please don't hesitate to take notes on Etherpad. 6 00:00:30,746 --> 00:00:33,214 Thank you for everything. 7 00:00:35,359 --> 00:00:37,055 Alright, let's get started. 8 00:00:37,055 --> 00:00:38,474 So, I'm Elizabeth Seiver. 9 00:00:38,474 --> 00:00:42,784 I'm the outgoing program manager for WikiCite. 10 00:00:42,784 --> 00:00:45,247 And I wanted to tell you all a little bit about it. 11 00:00:45,247 --> 00:00:48,375 Just as a show of hands, how many people are already familiar with WikiCite? 12 00:00:49,020 --> 00:00:51,773 That's great. I'm just glad that so many of you are. 13 00:00:51,773 --> 00:00:54,366 I was wondering how many people here-- I was thinking about it-- 14 00:00:54,366 --> 00:00:57,641 is just like, "Who are all these people putting all the citations 15 00:00:57,641 --> 00:00:59,629 in Wikidata and filling it up?" 16 00:00:59,629 --> 00:01:01,052 And WikiCite is so much more. 17 00:01:01,052 --> 00:01:03,850 So, we're all excited to tell you about it today. 18 00:01:05,676 --> 00:01:07,425 So, what is WikiCite? 19 00:01:08,275 --> 00:01:12,098 The goal of WikiCite is to collect all citations 20 00:01:12,098 --> 00:01:14,185 for the sum of all human knowledge. 21 00:01:14,185 --> 00:01:16,046 You know, just a little something. 22 00:01:16,046 --> 00:01:19,696 And we're doing this in a number of ways. 23 00:01:20,915 --> 00:01:25,286 And one of them is via conferences and workshops 24 00:01:25,286 --> 00:01:27,727 and getting together the community of people 25 00:01:27,727 --> 00:01:30,482 who are interested in working on citations. 26 00:01:30,482 --> 00:01:32,992 And it's a very diverse group of people. 27 00:01:32,992 --> 00:01:35,463 So, of course, we have people who are working in Wikidata, 28 00:01:35,463 --> 00:01:36,879 and other Wikimedians. 29 00:01:36,879 --> 00:01:40,580 We have librarians, people into linked open data, 30 00:01:40,580 --> 00:01:44,751 software engineers, data scientists, open knowledge advocates-- 31 00:01:45,517 --> 00:01:50,211 coming together about linked open bibliographic data. 32 00:01:53,226 --> 00:01:55,858 So, in terms of the history of WikiCite, 33 00:01:55,858 --> 00:01:59,185 it was founded as an initiative in 2016. 34 00:01:59,485 --> 00:02:05,456 And we secured dedicated funding for events for three years in 2018. 35 00:02:05,456 --> 00:02:10,677 And as I mentioned, you're probably familiar with the big-- 36 00:02:11,380 --> 00:02:13,445 the millions of citations that we already have 37 00:02:13,445 --> 00:02:15,184 that are hosted on Wikidata. 38 00:02:23,293 --> 00:02:27,601 So, what are we doing in WikiCite and with all these citations? 39 00:02:27,601 --> 00:02:30,278 It's not just about collecting them. It's about using them. 40 00:02:30,278 --> 00:02:36,038 And it creates so many opportunities for new projects. 41 00:02:36,567 --> 00:02:38,798 So, one of the things you can do with this data 42 00:02:38,798 --> 00:02:42,161 is build data models for bibliographic item types, 43 00:02:42,161 --> 00:02:44,998 which should be exciting for people who are into schemas. 44 00:02:45,973 --> 00:02:50,319 You can also do open cataloging and disambiguation-- 45 00:02:51,856 --> 00:02:54,209 sorry, my notes are not in sync with this. 46 00:02:54,448 --> 00:02:57,396 And people are also building tools on top of this. 47 00:02:57,896 --> 00:03:00,833 Visualization tools, such as Scholia. 48 00:03:03,275 --> 00:03:06,995 If you're interested at all in open cataloging, 49 00:03:06,995 --> 00:03:09,305 or author disambiguation, 50 00:03:10,201 --> 00:03:12,912 or just even figuring out how sources link together, 51 00:03:12,912 --> 00:03:15,306 WikiCite is a good way to do that. 52 00:03:17,331 --> 00:03:20,454 So, in terms of the direction that WikiCite is heading in, 53 00:03:21,021 --> 00:03:23,306 one of the things we're trying to do is expand 54 00:03:23,306 --> 00:03:25,566 all the types of things that are cited. 55 00:03:25,566 --> 00:03:28,441 Right now, in Wikidata, it's mostly journal articles. 56 00:03:28,927 --> 00:03:31,078 We'd like to keep growing our community, 57 00:03:31,078 --> 00:03:32,870 especially outside of the Global North 58 00:03:32,870 --> 00:03:35,862 and outside of English language publications. 59 00:03:36,635 --> 00:03:41,409 And I realize this is actually something that Liam will be talking about. 60 00:03:42,704 --> 00:03:45,537 So, what we wanted to do now, 61 00:03:45,537 --> 00:03:49,840 to do sort of a deep dive into one of the uses of Wikidata. 62 00:03:49,840 --> 00:03:53,725 So, for that, I would like to introduce Simon Cobb. 63 00:03:57,294 --> 00:03:58,348 Hi, everyone. 64 00:03:58,348 --> 00:04:01,260 So, what I want to talk about is an example of something 65 00:04:01,260 --> 00:04:05,061 we could potentially focus on within the scope of WikiCite. 66 00:04:05,061 --> 00:04:08,056 And that's the data quality issues that I've been encountering 67 00:04:08,056 --> 00:04:12,388 over the last year, as I've been editing on scholarly papers. 68 00:04:15,549 --> 00:04:17,934 The three issues I'm going to briefly touch on 69 00:04:17,934 --> 00:04:21,059 are the quality of the author items that are getting attached 70 00:04:21,059 --> 00:04:24,996 to scholarly articles, issues around DOI formats, 71 00:04:24,996 --> 00:04:29,038 and just general curation of the data that we're creating. 72 00:04:31,269 --> 00:04:33,578 Firstly, we look at some authors. 73 00:04:33,578 --> 00:04:35,798 Oh, sorry, firstly, I'll provide some context. 74 00:04:35,798 --> 00:04:38,952 We've got 26 million scholarly article items now. 75 00:04:38,952 --> 00:04:41,943 And the data quality issues I'm going to talk about, 76 00:04:41,943 --> 00:04:43,844 a very small proportion of these 77 00:04:43,844 --> 00:04:47,448 are generally creating quite good quality data. 78 00:04:48,467 --> 00:04:53,645 We have a lot of external identifiers-- 21.65 million PubMed IDs, 79 00:04:53,645 --> 00:04:59,595 19 million DOIs, and we've added 8.3 million author statements, 80 00:04:59,972 --> 00:05:05,200 although we still have 105.5 million author name strings to replace. 81 00:05:07,308 --> 00:05:09,562 In terms of the authors, 82 00:05:09,562 --> 00:05:12,823 we've been creating a lot of items from ORCID IDs. 83 00:05:14,583 --> 00:05:17,950 We've got over half a million items with an ORCID ID now. 84 00:05:18,437 --> 00:05:22,848 But over 50% of those do not have any affiliation data yet. 85 00:05:23,455 --> 00:05:26,697 And that's now in employer or in educated at. 86 00:05:26,697 --> 00:05:28,727 I found 25,000 87 00:05:28,727 --> 00:05:30,757 where we only have two statements. 88 00:05:30,757 --> 00:05:34,786 That's an ORCID ID, and an instance of a human. 89 00:05:34,786 --> 00:05:37,747 This isn't particularly useful in terms of-- 90 00:05:39,105 --> 00:05:42,267 we use for anyone else and beyond Wikidata. 91 00:05:42,267 --> 00:05:45,510 If we're serious about approaching a bibliographic database 92 00:05:45,510 --> 00:05:47,696 and providing open data for people, 93 00:05:47,696 --> 00:05:50,773 we really need to be focusing on quality, I believe. 94 00:05:52,083 --> 00:05:54,014 So, there's a lot of work to be done. 95 00:05:54,014 --> 00:05:56,852 We've done really well with automatic input, 96 00:05:56,852 --> 00:05:59,287 but I think we need to, in the future, 97 00:05:59,287 --> 00:06:02,612 step back and think how can we really make this data useful. 98 00:06:03,638 --> 00:06:04,910 And one of the ways to do that 99 00:06:04,910 --> 00:06:09,917 is by making our author items better quality 100 00:06:09,917 --> 00:06:14,274 by adding affiliation information, adding first names, surnames, 101 00:06:14,864 --> 00:06:18,928 and just moving beyond occupation researcher, 102 00:06:18,928 --> 00:06:22,445 trying to get what field people are working in, for example. 103 00:06:23,916 --> 00:06:25,699 Moving on to DOIs. 104 00:06:25,699 --> 00:06:29,429 When I was looking at how many scholarly papers we have now, 105 00:06:29,429 --> 00:06:35,040 I immediately noticed that we have DOIs that are just four characters. 106 00:06:35,040 --> 00:06:37,231 And that is not a correct DOI. 107 00:06:37,231 --> 00:06:42,288 We've got about 110 items with this DOI format. 108 00:06:42,868 --> 00:06:46,702 In the grand scheme of things, not that big a problem. 109 00:06:46,702 --> 00:06:49,055 But that's never been a correct DOI 110 00:06:49,055 --> 00:06:51,525 that's being created by an automatic process. 111 00:06:51,525 --> 00:06:55,279 No one's checked that and realized we had this error and corrected it. 112 00:06:55,279 --> 00:06:57,974 So, it's kind of an appeal I want to make to people-- 113 00:06:58,534 --> 00:07:01,099 if you're doing batch imports, to check what you're doing, 114 00:07:01,099 --> 00:07:03,648 look for these obvious data quality problems. 115 00:07:06,544 --> 00:07:10,607 And another final issue that I've noticed is errata. 116 00:07:10,607 --> 00:07:15,718 We have over 13 thousand items that are instance of errata, 117 00:07:16,050 --> 00:07:18,604 but they're not linked to the paper they're correcting. 118 00:07:18,609 --> 00:07:25,343 So, I've also produced a table of the top ten titles of the-- 119 00:07:26,153 --> 00:07:27,341 these are errata items. 120 00:07:27,341 --> 00:07:30,030 You will notice they're not particularly informative. 121 00:07:30,030 --> 00:07:32,495 So, as some point, we're going to have to go back 122 00:07:32,495 --> 00:07:36,868 and look at how we can actually get the information 123 00:07:36,868 --> 00:07:39,839 about what these errata are correcting, 124 00:07:39,839 --> 00:07:43,919 because they're not really of much use to anyone at the moment. 125 00:07:44,711 --> 00:07:48,834 So, in the future, I hope this is one area that we can work on as a community, 126 00:07:48,834 --> 00:07:53,354 and we can coordinate a bit better with what data imports we're doing, 127 00:07:53,354 --> 00:07:57,773 and how we can curate all our data, bring it all together, 128 00:07:57,773 --> 00:07:59,876 and combine our expertise. 129 00:08:01,619 --> 00:08:03,277 I'm going to pass over to Liam now 130 00:08:03,277 --> 00:08:07,905 to talk a bit about how we might be able to coordinate our efforts in the future. 131 00:08:15,119 --> 00:08:16,536 Thank you. 132 00:08:21,296 --> 00:08:26,837 So, as mentioned in the final slide from Elizabeth, 133 00:08:26,837 --> 00:08:33,115 WikiCite is trying to be more and more diverse, 134 00:08:33,115 --> 00:08:36,139 and high quality, and more widely spread. 135 00:08:36,139 --> 00:08:39,947 The idea is over the next year or so, with the dedicated funding 136 00:08:39,947 --> 00:08:43,114 that's been provided and is available over a three-year period, 137 00:08:43,114 --> 00:08:45,346 of which we've entered, 138 00:08:45,346 --> 00:08:51,189 to change WikiCite-- the conference-- which there's been a few-- 139 00:08:51,189 --> 00:08:58,162 into a series of proposals from you, 140 00:08:58,891 --> 00:09:02,858 into what we're calling "satellite events" around the world. 141 00:09:03,342 --> 00:09:08,240 This will be focusing-- there'll be a call for a proposal system-- 142 00:09:08,240 --> 00:09:12,043 like reviewing a procedure 143 00:09:12,043 --> 00:09:14,393 that is currently not yet invented 144 00:09:14,393 --> 00:09:18,877 for deciding on how to-- 145 00:09:20,967 --> 00:09:22,482 what's the word I'm after-- 146 00:09:22,482 --> 00:09:24,804 prioritize these requests. 147 00:09:24,804 --> 00:09:29,378 And see if we can't get a wider diversity 148 00:09:29,378 --> 00:09:32,784 of content contributor and topic 149 00:09:33,115 --> 00:09:36,546 supported in the WikiCite umbrella, 150 00:09:38,244 --> 00:09:41,009 through this series of satellite events. 151 00:09:41,347 --> 00:09:42,886 To that end, 152 00:09:43,102 --> 00:09:46,702 the WikiCite grant-- 153 00:09:47,037 --> 00:09:52,345 was successfully applied for and received 154 00:09:52,345 --> 00:09:57,460 through the work of WikiCite's father, Dario, 155 00:09:57,460 --> 00:10:00,982 who many of you might know from the Wikimedia Foundation. 156 00:10:00,982 --> 00:10:04,109 Dario no longer works with the Wikimedia Foundation, 157 00:10:04,109 --> 00:10:06,662 and so this grant has a-- 158 00:10:08,038 --> 00:10:09,858 needed a home. 159 00:10:09,858 --> 00:10:14,275 What has happened is that the WikiCite steering committee, 160 00:10:14,275 --> 00:10:17,104 primarily made up of the organizing team 161 00:10:17,104 --> 00:10:20,317 from last year's WikiCite conference, 162 00:10:20,317 --> 00:10:23,923 will continue to oversee this work, 163 00:10:23,923 --> 00:10:27,654 and the Wikimedia Foundation has hired a temporary 164 00:10:28,218 --> 00:10:31,567 or a part-time coordinator, 165 00:10:32,024 --> 00:10:34,617 to oversee and support that work, 166 00:10:34,617 --> 00:10:38,790 and to promote and receive those applications 167 00:10:38,790 --> 00:10:40,453 for the satellite events. 168 00:10:40,453 --> 00:10:41,998 And that will be me. 169 00:10:42,838 --> 00:10:45,405 (laughter and cheers) 170 00:10:48,323 --> 00:10:50,591 So, I got the call yesterday 171 00:10:50,591 --> 00:10:54,320 so that I could be able to like confirm that in-- 172 00:10:54,320 --> 00:10:58,641 among an audience which is highly relevant to that topic. 173 00:10:59,308 --> 00:11:02,592 Which is helpful, so I can talk to you here and now about that. 174 00:11:03,967 --> 00:11:08,671 So, this is listed as a panel in the program. 175 00:11:09,562 --> 00:11:11,262 Even though it's a bit of a-- 176 00:11:12,194 --> 00:11:13,776 I think panel is a generous way 177 00:11:13,776 --> 00:11:17,342 of describing the three of us in this context. 178 00:11:17,342 --> 00:11:19,573 But the idea is we would like to hear from you 179 00:11:19,573 --> 00:11:22,181 on that immediate thought about-- 180 00:11:22,573 --> 00:11:24,908 or questions to Simon, as well-- 181 00:11:24,908 --> 00:11:27,494 if you have questions for Simon, specifically-- 182 00:11:27,494 --> 00:11:31,912 about what you think are good directions 183 00:11:31,912 --> 00:11:35,316 that should be addressed or should be attempted 184 00:11:35,316 --> 00:11:37,308 in this forthcoming year, 185 00:11:38,605 --> 00:11:40,429 either individually, online-- 186 00:11:41,516 --> 00:11:43,987 and things that not necessarily you can do, 187 00:11:43,987 --> 00:11:46,135 but think should be done. 188 00:11:47,431 --> 00:11:52,023 And specifically, to start thinking about 189 00:11:52,023 --> 00:11:55,624 what a satellite event would mean 190 00:11:55,624 --> 00:11:58,459 with relation to open citations 191 00:11:58,459 --> 00:12:03,193 and how the community at large would best be served 192 00:12:03,193 --> 00:12:04,731 by that kind of support. 193 00:12:04,731 --> 00:12:08,862 Beyond merely financial, but what does support mean 194 00:12:08,862 --> 00:12:13,364 for satellite events in open citations according to you. 195 00:12:14,502 --> 00:12:16,726 If you want to come back up, and we can-- 196 00:12:17,293 --> 00:12:18,584 Did you have a question? 197 00:12:18,584 --> 00:12:23,039 (woman) Ah, yes. I do research on predatory publishing 198 00:12:23,039 --> 00:12:25,062 and on retractions. 199 00:12:25,062 --> 00:12:26,807 You only mentioned errata. 200 00:12:26,807 --> 00:12:29,999 So, how are you dealing with expressions of concern 201 00:12:29,999 --> 00:12:31,666 and retractions? 202 00:12:31,666 --> 00:12:36,000 And what is your policy on trying to identify predatory publishers? 203 00:12:37,014 --> 00:12:39,802 Okay, so, within the scope of preparing for this, 204 00:12:39,802 --> 00:12:45,552 I wasn't looking at retractions, but people have been doing work on that 205 00:12:45,552 --> 00:12:49,039 and trying to-- we have a property-- notice of retractions-- 206 00:12:49,039 --> 00:12:51,509 so we can be creating those links. 207 00:12:52,449 --> 00:12:55,482 I don't know what extent that's happened in the same way. 208 00:12:55,482 --> 00:13:00,302 Not all the errata are linked to the paper that's being corrected. 209 00:13:00,302 --> 00:13:02,500 I suspect that's a similar case with-- 210 00:13:02,500 --> 00:13:04,439 - (woman) It's exactly the same. - Yeah. 211 00:13:05,327 --> 00:13:06,906 As I said, I wasn't looking at that, 212 00:13:06,906 --> 00:13:12,490 but we can potentially link the retraction to the retracted article, 213 00:13:12,490 --> 00:13:15,144 the retraction notice to the retracted article. 214 00:13:15,144 --> 00:13:17,246 In terms of predatory publishers, 215 00:13:18,278 --> 00:13:21,379 I'm not aware of anyone having done any work in this area, 216 00:13:21,379 --> 00:13:23,842 but I wouldn't like to say that hasn't happened. 217 00:13:23,842 --> 00:13:26,835 We have Charles, whose hand is going up there. 218 00:13:26,835 --> 00:13:30,080 Do you want to comment on predatory publishers, Charles? 219 00:13:35,170 --> 00:13:38,843 (Charles) Well, I encountered this problem in the ScienceSource project. 220 00:13:38,843 --> 00:13:43,462 And first of all, I did what I could to put fields list in Wikidata format. 221 00:13:43,462 --> 00:13:46,984 Fields list isn't sort of what everybody wants to be dealing with, 222 00:13:46,984 --> 00:13:48,442 but it was a starting point. 223 00:13:48,442 --> 00:13:52,684 So, that has been done, as far as I was able to. 224 00:13:53,274 --> 00:13:56,744 But the thing I rely on more, perhaps, 225 00:13:56,744 --> 00:14:00,717 is DOAJ IDs. 226 00:14:00,717 --> 00:14:04,182 That is, if we put all the DOAJ IDs 227 00:14:04,182 --> 00:14:07,826 into Wikidata, 228 00:14:07,826 --> 00:14:09,908 we'd have made a really good attempt 229 00:14:09,908 --> 00:14:12,620 to isolate the predatory publishers. 230 00:14:12,620 --> 00:14:14,057 And that is not the whole story, 231 00:14:14,057 --> 00:14:16,926 but these days, it's the bulk of the story. 232 00:14:16,926 --> 00:14:19,072 (woman) [Is the directory of open access there?] 233 00:14:19,072 --> 00:14:22,042 - (Charles) Directory of open access, yes. - (woman) Alright, good. 234 00:14:31,452 --> 00:14:34,241 (man) To start with, I just spent a year traveling around New Zealand 235 00:14:34,241 --> 00:14:37,775 trying to explain Wikidata to the library community, 236 00:14:37,775 --> 00:14:40,222 and as soon as I mentioned WikiCite, their eyes rolled, 237 00:14:40,222 --> 00:14:42,868 because they've just been told they have to be [up] with Wikipedia, 238 00:14:42,868 --> 00:14:43,956 Wiki Commons, Wikidata. 239 00:14:43,956 --> 00:14:46,232 Here's another Wiki project that they need to know about. 240 00:14:46,232 --> 00:14:48,739 "Why can't we just do it all with Wikidata?" they were saying. 241 00:14:48,739 --> 00:14:50,893 So, there's a public perception problem straightaway, 242 00:14:50,893 --> 00:14:53,803 and that's the very community that we need to have onboard 243 00:14:53,803 --> 00:14:54,819 for this to work. 244 00:14:54,819 --> 00:14:57,096 I'm interested in thinking how we are going to reach 245 00:14:57,096 --> 00:15:01,420 the library community, educate them, and get them integrally involved 246 00:15:01,420 --> 00:15:03,119 in this process? 247 00:15:06,449 --> 00:15:09,328 I have thoughts, but I'd like to hear your thoughts first. 248 00:15:10,909 --> 00:15:14,035 - Sure, I think-- - (assistant) [This one is on.] 249 00:15:16,384 --> 00:15:18,692 This better? Alright. 250 00:15:18,692 --> 00:15:20,535 Feel like I'm in a concert. 251 00:15:20,535 --> 00:15:23,088 So, one of the things we've tried to do 252 00:15:23,088 --> 00:15:29,666 is incorporate librarians and libraries 253 00:15:29,666 --> 00:15:32,616 into WikiCite in everything that we do. 254 00:15:32,616 --> 00:15:33,917 So, on the steering committee, 255 00:15:33,917 --> 00:15:39,457 we have at least two librarians, if not more. 256 00:15:39,457 --> 00:15:44,230 And at our actual WikiCite events, 257 00:15:44,230 --> 00:15:46,702 one of the things that's actually pretty great about WikiCite 258 00:15:46,702 --> 00:15:50,911 is that we end up getting both speakers and participants, 259 00:15:50,911 --> 00:15:54,846 who maybe are not actually involved in any Wiki projects. 260 00:15:54,846 --> 00:15:56,908 So, we don't have Wiki fatigue. 261 00:15:57,472 --> 00:16:01,117 And a lot of times, they're coming from the perspective of... 262 00:16:01,688 --> 00:16:04,841 "Well, I'm interested in linked open data, 263 00:16:04,841 --> 00:16:07,427 I love to use citations at my university, 264 00:16:07,427 --> 00:16:09,750 can you tell me a little bit more about how Wikidata works, 265 00:16:09,750 --> 00:16:12,378 and how I might use the citations that are in Wikidata?" 266 00:16:12,378 --> 00:16:17,057 So, I think it's very much about bringing these communities together, 267 00:16:17,057 --> 00:16:20,702 which might seem disparate, around these common goals 268 00:16:20,702 --> 00:16:23,075 for people who are really concerned about curating data, 269 00:16:23,075 --> 00:16:26,926 and then, people who might already know about how to do that on Wikidata. 270 00:16:28,346 --> 00:16:31,397 I would say, in terms of the confusion, 271 00:16:31,397 --> 00:16:35,322 the complexity implied by the question of well, there's WikiCite, 272 00:16:35,322 --> 00:16:37,351 and there's Wikidata, and there's this... 273 00:16:37,351 --> 00:16:41,999 WikiCite is a brand name, it's a project-- GLAM-Wiki-- 274 00:16:42,894 --> 00:16:45,529 GLAM-Wiki also uses the word Wiki, 275 00:16:45,529 --> 00:16:48,758 but it's not pretending to be a Wiki 276 00:16:48,758 --> 00:16:51,186 or competing with Wikipedia and Wikidata. 277 00:16:51,186 --> 00:16:56,118 It's the particular focus area of reference information, 278 00:16:56,118 --> 00:16:58,789 "referenceable" information. 279 00:16:58,789 --> 00:17:04,908 Now, particularly in the context of a series of conferences 280 00:17:04,908 --> 00:17:07,055 that have happened over the last few years, 281 00:17:07,055 --> 00:17:09,447 and the conference is called WikiCite-- 282 00:17:11,959 --> 00:17:15,811 particularly within this community, the Wikidata core group, 283 00:17:15,811 --> 00:17:21,397 WikiCite is seen, known, understood as a large number of items 284 00:17:21,397 --> 00:17:24,188 uploaded to Wikidata about scholarly publications. 285 00:17:24,674 --> 00:17:30,462 That is what is understood as WikiCite by this community, mostly. 286 00:17:32,937 --> 00:17:35,125 I would like to-- 287 00:17:36,054 --> 00:17:38,045 there is a question about, 288 00:17:38,045 --> 00:17:42,060 could WikiCite be made into its own Wikibase 289 00:17:42,060 --> 00:17:45,228 of just citation stuff? 290 00:17:45,228 --> 00:17:47,982 Not Wikidata, and then there's federation, 291 00:17:48,914 --> 00:17:50,573 and funky things like that, 292 00:17:50,573 --> 00:17:54,905 and you could put a lot more very specific information 293 00:17:54,905 --> 00:17:58,431 about individual, citable things there, 294 00:17:59,644 --> 00:18:01,590 which is a perfectly valid way 295 00:18:01,590 --> 00:18:04,346 of dealing with questions of notability and properties. 296 00:18:04,346 --> 00:18:06,452 But the technology for doing that 297 00:18:06,452 --> 00:18:10,791 is not yet relevant in any way. 298 00:18:10,791 --> 00:18:17,003 We need a lot more work, particularly on federation in Wikibase 299 00:18:17,539 --> 00:18:19,553 to make sure everything syncs neatly. 300 00:18:20,122 --> 00:18:25,954 So, until such time as that would be a viable outcome, 301 00:18:26,724 --> 00:18:31,315 in the meantime, all of the things that would serve that kind of outcome 302 00:18:32,532 --> 00:18:35,969 also serve just improving the quality on Wikidata 303 00:18:35,969 --> 00:18:39,087 and improving the links with Wikipedia and Wikisource. 304 00:18:40,145 --> 00:18:43,665 The brand name is, as far as I'm concerned, irrelevant. 305 00:18:43,665 --> 00:18:47,953 It's just the project to make better footnotes. 306 00:18:50,093 --> 00:18:52,454 (woman 2) Just a comment in relation to your query 307 00:18:52,454 --> 00:18:57,726 about satellite proposals 308 00:18:57,726 --> 00:18:59,956 for satellite conferences-- 309 00:19:00,768 --> 00:19:04,007 I don't think you realize the level of ignorance 310 00:19:04,007 --> 00:19:08,351 about Wiki-anything from our country in New Zealand. 311 00:19:08,351 --> 00:19:09,731 I mean, seriously. 312 00:19:09,731 --> 00:19:12,497 As an Australian, I recognize the ignorance of New Zealanders-- 313 00:19:12,497 --> 00:19:15,142 (laughter) 314 00:19:17,747 --> 00:19:21,270 (woman 2) Oh, [inaudible], come on! 315 00:19:26,987 --> 00:19:31,248 What I'm trying to say is that if we have a satellite 316 00:19:31,248 --> 00:19:36,159 or somehow organize a joint satellite conference, 317 00:19:37,399 --> 00:19:39,793 from my perspective, what I'm looking for 318 00:19:39,793 --> 00:19:43,192 is strategies and how to engage the community. 319 00:19:43,192 --> 00:19:48,215 They aren't even at the level of being-- 320 00:19:48,215 --> 00:19:50,681 they don't know enough to even be enthusiastic 321 00:19:50,681 --> 00:19:52,997 about Wikidata and WikiCite yet. 322 00:19:52,997 --> 00:19:57,041 They look at it with a lot of skepticism, if they're even aware of it. 323 00:19:57,041 --> 00:20:02,632 So I, in particular, want to be able to have a meeting 324 00:20:02,632 --> 00:20:04,902 in order to be able to learn from those. 325 00:20:04,902 --> 00:20:08,387 We've already engaged more successfully with the community 326 00:20:08,387 --> 00:20:14,339 to get a skill base in order to build some collaborations in New Zealand. 327 00:20:15,117 --> 00:20:18,555 You're talking about extra people to actually engage with. 328 00:20:18,555 --> 00:20:21,885 I just want the core library community to get on board, 329 00:20:21,885 --> 00:20:24,367 and then go the extra step. 330 00:20:24,367 --> 00:20:28,284 It's like I'm looking at you saying that we want to reach out 331 00:20:28,284 --> 00:20:29,663 to other communities, 332 00:20:29,663 --> 00:20:32,820 and I'm saying, I just want to reach out to a community. 333 00:20:32,820 --> 00:20:36,550 You know, we're a lot further behind where we are. 334 00:20:36,550 --> 00:20:37,963 So, yeah. 335 00:20:39,292 --> 00:20:44,937 I would not wish to pretend that WikiCite and open bibliographic information 336 00:20:44,937 --> 00:20:50,478 is the be-all and end-all of Wikidata or Wikimedia outreach. 337 00:20:51,209 --> 00:20:53,107 It's a specific subset. 338 00:20:53,107 --> 00:20:59,333 And I would not wish to try and make WikiCite a brand, 339 00:21:00,477 --> 00:21:04,242 appear to be overriding or replacing 340 00:21:04,242 --> 00:21:06,841 or somehow getting in the way 341 00:21:06,841 --> 00:21:10,970 of just general, good quality outreach about Wikimedia, 342 00:21:11,462 --> 00:21:16,085 and working with libraries, in general, and Wikidata, 343 00:21:16,085 --> 00:21:17,183 even more specific. 344 00:21:17,183 --> 00:21:19,386 This is a subset of Wikidata. 345 00:21:19,386 --> 00:21:22,770 So, particularly, for WikiCite satellite events, 346 00:21:24,100 --> 00:21:30,251 I don't want to make it appear like there's a competition for Wiki-- 347 00:21:30,662 --> 00:21:33,976 so, everything about Wikidata now has to be called WikiCite-- no. 348 00:21:33,976 --> 00:21:37,948 This is a really quite niche-- in the scheme of things-- topic area, 349 00:21:37,948 --> 00:21:40,497 supporting general awareness-raising 350 00:21:40,497 --> 00:21:43,044 about Wikidata and open access information, 351 00:21:43,044 --> 00:21:47,656 and Wikimedia is far beyond the scope 352 00:21:47,656 --> 00:21:52,031 of this kind of particular specialist outreach. 353 00:21:52,413 --> 00:21:57,812 And that's not to say that it's not a good thing, too. 354 00:21:59,304 --> 00:22:01,898 (woman 2) I just perceived-- sorry, one more comment-- 355 00:22:01,898 --> 00:22:06,198 WikiCite as the possible inroad 356 00:22:06,209 --> 00:22:09,703 to those at the wider community 357 00:22:09,703 --> 00:22:13,321 for the people we want to get on board. 358 00:22:13,321 --> 00:22:16,510 So, to me, WikiCite is-- yes, it's a subset, 359 00:22:16,510 --> 00:22:22,630 and really a much smaller set of beliefs and information, et cetera-- 360 00:22:23,075 --> 00:22:26,844 but I see it as an easy steppingstone to get them addicted, 361 00:22:26,844 --> 00:22:28,870 and then you can open it up. 362 00:22:29,356 --> 00:22:30,648 So, yeah. 363 00:22:35,901 --> 00:22:38,814 (assistant) We have just time for one short question. 364 00:22:41,062 --> 00:22:45,549 So, one of you have another question for the WikiCite team? 365 00:22:47,970 --> 00:22:49,749 Thank you for sharing this feedback with us. 366 00:22:49,749 --> 00:22:52,161 Oh, somebody has a question. 367 00:22:58,502 --> 00:23:01,402 (assistant) Which one of you wants to... 368 00:23:06,032 --> 00:23:07,732 (woman 3) Hi, thank you so much for this. 369 00:23:07,732 --> 00:23:11,259 I was just wondering, is there ever going to be 370 00:23:11,989 --> 00:23:17,861 a paring of the bibliography used in Wikipedia articles and WikiCite? 371 00:23:17,867 --> 00:23:22,831 Are you planning to move all those references and parse them 372 00:23:22,831 --> 00:23:27,452 so that we can do some analyses of which references we're using 373 00:23:27,452 --> 00:23:30,249 in the Wikipedia articles-- 374 00:23:30,249 --> 00:23:33,808 and when you create an article in another language 375 00:23:33,808 --> 00:23:38,510 just to get suggestions of this, are the references that have been used, 376 00:23:38,510 --> 00:23:39,855 kind of like that. 377 00:23:39,855 --> 00:23:45,137 I know one of the short-term goals of WikiCite is to have all citations 378 00:23:45,137 --> 00:23:48,058 in WikiProjects represented in Wikidata. 379 00:23:48,503 --> 00:23:50,613 Currently, there's not an automatic pipeline 380 00:23:50,613 --> 00:23:53,045 that keeps that updated, 381 00:23:53,045 --> 00:23:55,831 but that's definitely one of our primary goals. 382 00:23:57,410 --> 00:24:01,266 And ultimately, there is not specific support 383 00:24:01,266 --> 00:24:07,834 in the developer community for that kind of activity in particular. 384 00:24:07,834 --> 00:24:10,467 That's on the interests of individual community members 385 00:24:10,467 --> 00:24:14,842 to do exports-- like all this work that's been demonstrated 386 00:24:14,842 --> 00:24:17,273 that's not from the foundation-- 387 00:24:17,273 --> 00:24:20,520 people doing individual work on their interests. 388 00:24:20,520 --> 00:24:23,316 So, that could be a good satellite event 389 00:24:23,316 --> 00:24:26,438 to try and explore that kind of work. 390 00:24:26,438 --> 00:24:31,301 Getting it a good pipeline so that you can make references 391 00:24:31,301 --> 00:24:36,445 in Wikipedia's easily hook into Wikidata items, 392 00:24:36,445 --> 00:24:38,144 multilingual, et cetera-- 393 00:24:38,144 --> 00:24:41,273 does not yet exist technologically, 394 00:24:41,273 --> 00:24:45,876 and certain languages have concerns about that. 395 00:24:45,876 --> 00:24:50,135 The larger the Wikipedia language, the more defensive they are 396 00:24:50,135 --> 00:24:53,224 about using Wikidata directly. 397 00:24:54,402 --> 00:24:56,101 But that'll come. 398 00:24:57,719 --> 00:25:00,273 Yeah, I was just going to say when Liam's finished with that-- 399 00:25:00,273 --> 00:25:03,288 that it's strictly citations or something that are very much within scope, 400 00:25:03,288 --> 00:25:07,547 and what we would like to work for, but that needs community to build this, 401 00:25:07,547 --> 00:25:09,653 to take on that challenge, I think. 402 00:25:10,353 --> 00:25:15,198 And also, we need to be doing the outreach to the Wikipedians to show them 403 00:25:15,198 --> 00:25:18,618 that we can provide good quality data consistently. 404 00:25:22,185 --> 00:25:24,162 (assistant) We are running out of time. 405 00:25:24,162 --> 00:25:27,665 So, if someone has another question 406 00:25:27,665 --> 00:25:33,075 I think that these nice people will ask you privately after. 407 00:25:33,617 --> 00:25:36,452 So, it's time for us, for the last edition, 408 00:25:36,452 --> 00:25:39,386 and we are welcoming on stage. 409 00:25:39,956 --> 00:25:44,012 Jean-Fred, Envel, and... 410 00:25:44,012 --> 00:25:46,621 (applause)