WEBVTT 00:00:06.292 --> 00:00:10.018 Well, it's almost time to begin the presentation. 00:00:11.598 --> 00:00:17.114 We will begin this last session with a presentation on WikiCite, 00:00:17.883 --> 00:00:22.029 led by Elizabeth Seiver, Simon Cobb, and Liam Wyatt. 00:00:22.029 --> 00:00:26.280 And I'll just let you introduce yourself. 00:00:27.129 --> 00:00:30.288 Please don't hesitate to take notes on Etherpad. 00:00:30.746 --> 00:00:33.214 Thank you for everything. 00:00:35.359 --> 00:00:37.055 Alright, let's get started. 00:00:37.055 --> 00:00:38.474 So, I'm Elizabeth Seiver. 00:00:38.474 --> 00:00:42.784 I'm the outgoing program manager for WikiCite. 00:00:42.784 --> 00:00:45.247 And I wanted to tell you all a little bit about it. 00:00:45.247 --> 00:00:48.375 Just as a show of hands, how many people are already familiar with WikiCite? 00:00:49.020 --> 00:00:51.773 That's great. I'm just glad that so many of you are. 00:00:51.773 --> 00:00:54.366 I was wondering how many people here-- I was thinking about it-- 00:00:54.366 --> 00:00:57.641 is just like, "Who are all these people putting all the citations 00:00:57.641 --> 00:00:59.629 in Wikidata and filling it up?" 00:00:59.629 --> 00:01:01.052 And WikiCite is so much more. 00:01:01.052 --> 00:01:03.850 So, we're all excited to tell you about it today. 00:01:05.676 --> 00:01:07.425 So, what is WikiCite? 00:01:08.275 --> 00:01:12.098 The goal of WikiCite is to collect all citations 00:01:12.098 --> 00:01:14.185 for the sum of all human knowledge. 00:01:14.185 --> 00:01:16.046 You know, just a little something. 00:01:16.046 --> 00:01:19.696 And we're doing this in a number of ways. 00:01:20.915 --> 00:01:25.286 And one of them is via conferences and workshops 00:01:25.286 --> 00:01:27.727 and getting together the community of people 00:01:27.727 --> 00:01:30.482 who are interested in working on citations. 00:01:30.482 --> 00:01:32.992 And it's a very diverse group of people. 00:01:32.992 --> 00:01:35.463 So, of course, we have people who are working in Wikidata, 00:01:35.463 --> 00:01:36.879 and other Wikimedians. 00:01:36.879 --> 00:01:40.580 We have librarians, people into linked open data, 00:01:40.580 --> 00:01:44.751 software engineers, data scientists, open knowledge advocates-- 00:01:45.517 --> 00:01:50.211 coming together about linked open bibliographic data. 00:01:53.226 --> 00:01:55.858 So, in terms of the history of WikiCite, 00:01:55.858 --> 00:01:59.185 it was founded as an initiative in 2016. 00:01:59.485 --> 00:02:05.456 And we secured dedicated funding for events for three years in 2018. 00:02:05.456 --> 00:02:10.677 And as I mentioned, you're probably familiar with the big-- 00:02:11.380 --> 00:02:13.445 the millions of citations that we already have 00:02:13.445 --> 00:02:15.184 that are hosted on Wikidata. 00:02:23.293 --> 00:02:27.601 So, what are we doing in WikiCite and with all these citations? 00:02:27.601 --> 00:02:30.278 It's not just about collecting them. It's about using them. 00:02:30.278 --> 00:02:36.038 And it creates so many opportunities for new projects. 00:02:36.567 --> 00:02:38.798 So, one of the things you can do with this data 00:02:38.798 --> 00:02:42.161 is build data models for bibliographic item types, 00:02:42.161 --> 00:02:44.998 which should be exciting for people who are into schemas. 00:02:45.973 --> 00:02:50.319 You can also do open cataloging and disambiguation-- 00:02:51.856 --> 00:02:54.209 sorry, my notes are not in sync with this. 00:02:54.448 --> 00:02:57.396 And people are also building tools on top of this. 00:02:57.896 --> 00:03:00.833 Visualization tools, such as Scholia. 00:03:03.275 --> 00:03:06.995 If you're interested at all in open cataloging, 00:03:06.995 --> 00:03:09.305 or author disambiguation, 00:03:10.201 --> 00:03:12.912 or just even figuring out how sources link together, 00:03:12.912 --> 00:03:15.306 WikiCite is a good way to do that. 00:03:17.331 --> 00:03:20.454 So, in terms of the direction that WikiCite is heading in, 00:03:21.021 --> 00:03:23.306 one of the things we're trying to do is expand 00:03:23.306 --> 00:03:25.566 all the types of things that are cited. 00:03:25.566 --> 00:03:28.441 Right now, in Wikidata, it's mostly journal articles. 00:03:28.927 --> 00:03:31.078 We'd like to keep growing our community, 00:03:31.078 --> 00:03:32.870 especially outside of the Global North 00:03:32.870 --> 00:03:35.862 and outside of English language publications. 00:03:36.635 --> 00:03:41.409 And I realize this is actually something that Liam will be talking about. 00:03:42.704 --> 00:03:45.537 So, what we wanted to do now, 00:03:45.537 --> 00:03:49.840 to do sort of a deep dive into one of the uses of Wikidata. 00:03:49.840 --> 00:03:53.725 So, for that, I would like to introduce Simon Cobb. 00:03:57.294 --> 00:03:58.348 Hi, everyone. 00:03:58.348 --> 00:04:01.260 So, what I want to talk about is an example of something 00:04:01.260 --> 00:04:05.061 we could potentially focus on within the scope of WikiCite. 00:04:05.061 --> 00:04:08.056 And that's the data quality issues that I've been encountering 00:04:08.056 --> 00:04:12.388 over the last year, as I've been editing on scholarly papers. 00:04:15.549 --> 00:04:17.934 The three issues I'm going to briefly touch on 00:04:17.934 --> 00:04:21.059 are the quality of the author items that are getting attached 00:04:21.059 --> 00:04:24.996 to scholarly articles, issues around DOI formats, 00:04:24.996 --> 00:04:29.038 and just general curation of the data that we're creating. 00:04:31.269 --> 00:04:33.578 Firstly, we look at some authors. 00:04:33.578 --> 00:04:35.798 Oh, sorry, firstly, I'll provide some context. 00:04:35.798 --> 00:04:38.952 We've got 26 million scholarly article items now. 00:04:38.952 --> 00:04:41.943 And the data quality issues I'm going to talk about, 00:04:41.943 --> 00:04:43.844 a very small proportion of these 00:04:43.844 --> 00:04:47.448 are generally creating quite good quality data. 00:04:48.467 --> 00:04:53.645 We have a lot of external identifiers-- 21.65 million PubMed IDs, 00:04:53.645 --> 00:04:59.595 19 million DOIs, and we've added 8.3 million author statements, 00:04:59.972 --> 00:05:05.200 although we still have 105.5 million author name strings to replace. 00:05:07.308 --> 00:05:09.562 In terms of the authors, 00:05:09.562 --> 00:05:12.823 we've been creating a lot of items from ORCID IDs. 00:05:14.583 --> 00:05:17.950 We've got over half a million items with an ORCID ID now. 00:05:18.437 --> 00:05:22.848 But over 50% of those do not have any affiliation data yet. 00:05:23.455 --> 00:05:26.697 And that's now in employer or in educated at. 00:05:26.697 --> 00:05:28.727 I found 25,000 00:05:28.727 --> 00:05:30.757 where we only have two statements. 00:05:30.757 --> 00:05:34.786 That's an ORCID ID, and an instance of a human. 00:05:34.786 --> 00:05:37.747 This isn't particularly useful in terms of-- 00:05:39.105 --> 00:05:42.267 we use for anyone else and beyond Wikidata. 00:05:42.267 --> 00:05:45.510 If we're serious about approaching a bibliographic database 00:05:45.510 --> 00:05:47.696 and providing open data for people, 00:05:47.696 --> 00:05:50.773 we really need to be focusing on quality, I believe. 00:05:52.083 --> 00:05:54.014 So, there's a lot of work to be done. 00:05:54.014 --> 00:05:56.852 We've done really well with automatic input, 00:05:56.852 --> 00:05:59.287 but I think we need to, in the future, 00:05:59.287 --> 00:06:02.612 step back and think how can we really make this data useful. 00:06:03.638 --> 00:06:04.910 And one of the ways to do that 00:06:04.910 --> 00:06:09.917 is by making our author items better quality 00:06:09.917 --> 00:06:14.274 by adding affiliation information, adding first names, surnames, 00:06:14.864 --> 00:06:18.928 and just moving beyond occupation researcher, 00:06:18.928 --> 00:06:22.445 trying to get what field people are working in, for example. 00:06:23.916 --> 00:06:25.699 Moving on to DOIs. 00:06:25.699 --> 00:06:29.429 When I was looking at how many scholarly papers we have now, 00:06:29.429 --> 00:06:35.040 I immediately noticed that we have DOIs that are just four characters. 00:06:35.040 --> 00:06:37.231 And that is not a correct DOI. 00:06:37.231 --> 00:06:42.288 We've got about 110 items with this DOI format. 00:06:42.868 --> 00:06:46.702 In the grand scheme of things, not that big a problem. 00:06:46.702 --> 00:06:49.055 But that's never been a correct DOI 00:06:49.055 --> 00:06:51.525 that's being created by an automatic process. 00:06:51.525 --> 00:06:55.279 No one's checked that and realized we had this error and corrected it. 00:06:55.279 --> 00:06:57.974 So, it's kind of an appeal I want to make to people-- 00:06:58.534 --> 00:07:01.099 if you're doing batch imports, to check what you're doing, 00:07:01.099 --> 00:07:03.648 look for these obvious data quality problems. 00:07:06.544 --> 00:07:10.607 And another final issue that I've noticed is errata. 00:07:10.607 --> 00:07:15.718 We have over 13 thousand items that are instance of errata, 00:07:16.050 --> 00:07:18.604 but they're not linked to the paper they're correcting. 00:07:18.609 --> 00:07:25.343 So, I've also produced a table of the top ten titles of the-- 00:07:26.153 --> 00:07:27.341 these are errata items. 00:07:27.341 --> 00:07:30.030 You will notice they're not particularly informative. 00:07:30.030 --> 00:07:32.495 So, as some point, we're going to have to go back 00:07:32.495 --> 00:07:36.868 and look at how we can actually get the information 00:07:36.868 --> 00:07:39.839 about what these errata are correcting, 00:07:39.839 --> 00:07:43.919 because they're not really of much use to anyone at the moment. 00:07:44.711 --> 00:07:48.834 So, in the future, I hope this is one area that we can work on as a community, 00:07:48.834 --> 00:07:53.354 and we can coordinate a bit better with what data imports we're doing, 00:07:53.354 --> 00:07:57.773 and how we can curate all our data, bring it all together, 00:07:57.773 --> 00:07:59.876 and combine our expertise. 00:08:01.619 --> 00:08:03.277 I'm going to pass over to Liam now 00:08:03.277 --> 00:08:07.905 to talk a bit about how we might be able to coordinate our efforts in the future. 00:08:15.119 --> 00:08:16.536 Thank you. 00:08:21.296 --> 00:08:26.837 So, as mentioned in the final slide from Elizabeth, 00:08:26.837 --> 00:08:33.115 WikiCite is trying to be more and more diverse, 00:08:33.115 --> 00:08:36.139 and high quality, and more widely spread. 00:08:36.139 --> 00:08:39.947 The idea is over the next year or so, with the dedicated funding 00:08:39.947 --> 00:08:43.114 that's been provided and is available over a three-year period, 00:08:43.114 --> 00:08:45.346 of which we've entered, 00:08:45.346 --> 00:08:51.189 to change WikiCite-- the conference-- which there's been a few-- 00:08:51.189 --> 00:08:58.162 into a series of proposals from you, 00:08:58.891 --> 00:09:02.858 into what we're calling "satellite events" around the world. 00:09:03.342 --> 00:09:08.240 This will be focusing-- there'll be a call for a proposal system-- 00:09:08.240 --> 00:09:12.043 like reviewing a procedure 00:09:12.043 --> 00:09:14.393 that is currently not yet invented 00:09:14.393 --> 00:09:18.877 for deciding on how to-- 00:09:20.967 --> 00:09:22.482 what's the word I'm after-- 00:09:22.482 --> 00:09:24.804 prioritize these requests. 00:09:24.804 --> 00:09:29.378 And see if we can't get a wider diversity 00:09:29.378 --> 00:09:32.784 of content contributor and topic 00:09:33.115 --> 00:09:36.546 supported in the WikiCite umbrella, 00:09:38.244 --> 00:09:41.009 through this series of satellite events. 00:09:41.347 --> 00:09:42.886 To that end, 00:09:43.102 --> 00:09:46.702 the WikiCite grant-- 00:09:47.037 --> 00:09:52.345 was successfully applied for and received 00:09:52.345 --> 00:09:57.460 through the work of WikiCite's father, Dario, 00:09:57.460 --> 00:10:00.982 who many of you might know from the Wikimedia Foundation. 00:10:00.982 --> 00:10:04.109 Dario no longer works with the Wikimedia Foundation, 00:10:04.109 --> 00:10:06.662 and so this grant has a-- 00:10:08.038 --> 00:10:09.858 needed a home. 00:10:09.858 --> 00:10:14.275 What has happened is that the WikiCite steering committee, 00:10:14.275 --> 00:10:17.104 primarily made up of the organizing team 00:10:17.104 --> 00:10:20.317 from last year's WikiCite conference, 00:10:20.317 --> 00:10:23.923 will continue to oversee this work, 00:10:23.923 --> 00:10:27.654 and the Wikimedia Foundation has hired a temporary 00:10:28.218 --> 00:10:31.567 or a part-time coordinator, 00:10:32.024 --> 00:10:34.617 to oversee and support that work, 00:10:34.617 --> 00:10:38.790 and to promote and receive those applications 00:10:38.790 --> 00:10:40.453 for the satellite events. 00:10:40.453 --> 00:10:41.998 And that will be me. 00:10:42.838 --> 00:10:45.405 (laughter and cheers) 00:10:48.323 --> 00:10:50.591 So, I got the call yesterday 00:10:50.591 --> 00:10:54.320 so that I could be able to like confirm that in-- 00:10:54.320 --> 00:10:58.641 among an audience which is highly relevant to that topic. 00:10:59.308 --> 00:11:02.592 Which is helpful, so I can talk to you here and now about that. 00:11:03.967 --> 00:11:08.671 So, this is listed as a panel in the program. 00:11:09.562 --> 00:11:11.262 Even though it's a bit of a-- 00:11:12.194 --> 00:11:13.776 I think panel is a generous way 00:11:13.776 --> 00:11:17.342 of describing the three of us in this context. 00:11:17.342 --> 00:11:19.573 But the idea is we would like to hear from you 00:11:19.573 --> 00:11:22.181 on that immediate thought about-- 00:11:22.573 --> 00:11:24.908 or questions to Simon, as well-- 00:11:24.908 --> 00:11:27.494 if you have questions for Simon, specifically-- 00:11:27.494 --> 00:11:31.912 about what you think are good directions 00:11:31.912 --> 00:11:35.316 that should be addressed or should be attempted 00:11:35.316 --> 00:11:37.308 in this forthcoming year, 00:11:38.605 --> 00:11:40.429 either individually, online-- 00:11:41.516 --> 00:11:43.987 and things that not necessarily you can do, 00:11:43.987 --> 00:11:46.135 but think should be done. 00:11:47.431 --> 00:11:52.023 And specifically, to start thinking about 00:11:52.023 --> 00:11:55.624 what a satellite event would mean 00:11:55.624 --> 00:11:58.459 with relation to open citations 00:11:58.459 --> 00:12:03.193 and how the community at large would best be served 00:12:03.193 --> 00:12:04.731 by that kind of support. 00:12:04.731 --> 00:12:08.862 Beyond merely financial, but what does support mean 00:12:08.862 --> 00:12:13.364 for satellite events in open citations according to you. 00:12:14.502 --> 00:12:16.726 If you want to come back up, and we can-- 00:12:17.293 --> 00:12:18.584 Did you have a question? 00:12:18.584 --> 00:12:23.039 (woman) Ah, yes. I do research on predatory publishing 00:12:23.039 --> 00:12:25.062 and on retractions. 00:12:25.062 --> 00:12:26.807 You only mentioned errata. 00:12:26.807 --> 00:12:29.999 So, how are you dealing with expressions of concern 00:12:29.999 --> 00:12:31.666 and retractions? 00:12:31.666 --> 00:12:36.000 And what is your policy on trying to identify predatory publishers? 00:12:37.014 --> 00:12:39.802 Okay, so, within the scope of preparing for this, 00:12:39.802 --> 00:12:45.552 I wasn't looking at retractions, but people have been doing work on that 00:12:45.552 --> 00:12:49.039 and trying to-- we have a property-- notice of retractions-- 00:12:49.039 --> 00:12:51.509 so we can be creating those links. 00:12:52.449 --> 00:12:55.482 I don't know what extent that's happened in the same way. 00:12:55.482 --> 00:13:00.302 Not all the errata are linked to the paper that's being corrected. 00:13:00.302 --> 00:13:02.500 I suspect that's a similar case with-- 00:13:02.500 --> 00:13:04.439 - (woman) It's exactly the same. - Yeah. 00:13:05.327 --> 00:13:06.906 As I said, I wasn't looking at that, 00:13:06.906 --> 00:13:12.490 but we can potentially link the retraction to the retracted article, 00:13:12.490 --> 00:13:15.144 the retraction notice to the retracted article. 00:13:15.144 --> 00:13:17.246 In terms of predatory publishers, 00:13:18.278 --> 00:13:21.379 I'm not aware of anyone having done any work in this area, 00:13:21.379 --> 00:13:23.842 but I wouldn't like to say that hasn't happened. 00:13:23.842 --> 00:13:26.835 We have Charles, whose hand is going up there. 00:13:26.835 --> 00:13:30.080 Do you want to comment on predatory publishers, Charles? 00:13:35.170 --> 00:13:38.843 (Charles) Well, I encountered this problem in the ScienceSource project. 00:13:38.843 --> 00:13:43.462 And first of all, I did what I could to put fields list in Wikidata format. 00:13:43.462 --> 00:13:46.984 Fields list isn't sort of what everybody wants to be dealing with, 00:13:46.984 --> 00:13:48.442 but it was a starting point. 00:13:48.442 --> 00:13:52.684 So, that has been done, as far as I was able to. 00:13:53.274 --> 00:13:56.744 But the thing I rely on more, perhaps, 00:13:56.744 --> 00:14:00.717 is DOAJ IDs. 00:14:00.717 --> 00:14:04.182 That is, if we put all the DOAJ IDs 00:14:04.182 --> 00:14:07.826 into Wikidata, 00:14:07.826 --> 00:14:09.908 we'd have made a really good attempt 00:14:09.908 --> 00:14:12.620 to isolate the predatory publishers. 00:14:12.620 --> 00:14:14.057 And that is not the whole story, 00:14:14.057 --> 00:14:16.926 but these days, it's the bulk of the story. 00:14:16.926 --> 00:14:19.072 (woman) [Is the directory of open access there?] 00:14:19.072 --> 00:14:22.042 - (Charles) Directory of open access, yes. - (woman) Alright, good. 00:14:31.452 --> 00:14:34.241 (man) To start with, I just spent a year traveling around New Zealand 00:14:34.241 --> 00:14:37.775 trying to explain Wikidata to the library community, 00:14:37.775 --> 00:14:40.222 and as soon as I mentioned WikiCite, their eyes rolled, 00:14:40.222 --> 00:14:42.868 because they've just been told they have to be [up] with Wikipedia, 00:14:42.868 --> 00:14:43.956 Wiki Commons, Wikidata. 00:14:43.956 --> 00:14:46.232 Here's another Wiki project that they need to know about. 00:14:46.232 --> 00:14:48.739 "Why can't we just do it all with Wikidata?" they were saying. 00:14:48.739 --> 00:14:50.893 So, there's a public perception problem straightaway, 00:14:50.893 --> 00:14:53.803 and that's the very community that we need to have onboard 00:14:53.803 --> 00:14:54.819 for this to work. 00:14:54.819 --> 00:14:57.096 I'm interested in thinking how we are going to reach 00:14:57.096 --> 00:15:01.420 the library community, educate them, and get them integrally involved 00:15:01.420 --> 00:15:03.119 in this process? 00:15:06.449 --> 00:15:09.328 I have thoughts, but I'd like to hear your thoughts first. 00:15:10.909 --> 00:15:14.035 - Sure, I think-- - (assistant) [This one is on.] 00:15:16.384 --> 00:15:18.692 This better? Alright. 00:15:18.692 --> 00:15:20.535 Feel like I'm in a concert. 00:15:20.535 --> 00:15:23.088 So, one of the things we've tried to do 00:15:23.088 --> 00:15:29.666 is incorporate librarians and libraries 00:15:29.666 --> 00:15:32.616 into WikiCite in everything that we do. 00:15:32.616 --> 00:15:33.917 So, on the steering committee, 00:15:33.917 --> 00:15:39.457 we have at least two librarians, if not more. 00:15:39.457 --> 00:15:44.230 And at our actual WikiCite events, 00:15:44.230 --> 00:15:46.702 one of the things that's actually pretty great about WikiCite 00:15:46.702 --> 00:15:50.911 is that we end up getting both speakers and participants, 00:15:50.911 --> 00:15:54.846 who maybe are not actually involved in any Wiki projects. 00:15:54.846 --> 00:15:56.908 So, we don't have Wiki fatigue. 00:15:57.472 --> 00:16:01.117 And a lot of times, they're coming from the perspective of... 00:16:01.688 --> 00:16:04.841 "Well, I'm interested in linked open data, 00:16:04.841 --> 00:16:07.427 I love to use citations at my university, 00:16:07.427 --> 00:16:09.750 can you tell me a little bit more about how Wikidata works, 00:16:09.750 --> 00:16:12.378 and how I might use the citations that are in Wikidata?" 00:16:12.378 --> 00:16:17.057 So, I think it's very much about bringing these communities together, 00:16:17.057 --> 00:16:20.702 which might seem disparate, around these common goals 00:16:20.702 --> 00:16:23.075 for people who are really concerned about curating data, 00:16:23.075 --> 00:16:26.926 and then, people who might already know about how to do that on Wikidata. 00:16:28.346 --> 00:16:31.397 I would say, in terms of the confusion, 00:16:31.397 --> 00:16:35.322 the complexity implied by the question of well, there's WikiCite, 00:16:35.322 --> 00:16:37.351 and there's Wikidata, and there's this... 00:16:37.351 --> 00:16:41.999 WikiCite is a brand name, it's a project-- GLAM-Wiki-- 00:16:42.894 --> 00:16:45.529 GLAM-Wiki also uses the word Wiki, 00:16:45.529 --> 00:16:48.758 but it's not pretending to be a Wiki 00:16:48.758 --> 00:16:51.186 or competing with Wikipedia and Wikidata. 00:16:51.186 --> 00:16:56.118 It's the particular focus area of reference information, 00:16:56.118 --> 00:16:58.789 "referenceable" information. 00:16:58.789 --> 00:17:04.908 Now, particularly in the context of a series of conferences 00:17:04.908 --> 00:17:07.055 that have happened over the last few years, 00:17:07.055 --> 00:17:09.447 and the conference is called WikiCite-- 00:17:11.959 --> 00:17:15.811 particularly within this community, the Wikidata core group, 00:17:15.811 --> 00:17:21.397 WikiCite is seen, known, understood as a large number of items 00:17:21.397 --> 00:17:24.188 uploaded to Wikidata about scholarly publications. 00:17:24.674 --> 00:17:30.462 That is what is understood as WikiCite by this community, mostly. 00:17:32.937 --> 00:17:35.125 I would like to-- 00:17:36.054 --> 00:17:38.045 there is a question about, 00:17:38.045 --> 00:17:42.060 could WikiCite be made into its own Wikibase 00:17:42.060 --> 00:17:45.228 of just citation stuff? 00:17:45.228 --> 00:17:47.982 Not Wikidata, and then there's federation, 00:17:48.914 --> 00:17:50.573 and funky things like that, 00:17:50.573 --> 00:17:54.905 and you could put a lot more very specific information 00:17:54.905 --> 00:17:58.431 about individual, citable things there, 00:17:59.644 --> 00:18:01.590 which is a perfectly valid way 00:18:01.590 --> 00:18:04.346 of dealing with questions of notability and properties. 00:18:04.346 --> 00:18:06.452 But the technology for doing that 00:18:06.452 --> 00:18:10.791 is not yet relevant in any way. 00:18:10.791 --> 00:18:17.003 We need a lot more work, particularly on federation in Wikibase 00:18:17.539 --> 00:18:19.553 to make sure everything syncs neatly. 00:18:20.122 --> 00:18:25.954 So, until such time as that would be a viable outcome, 00:18:26.724 --> 00:18:31.315 in the meantime, all of the things that would serve that kind of outcome 00:18:32.532 --> 00:18:35.969 also serve just improving the quality on Wikidata 00:18:35.969 --> 00:18:39.087 and improving the links with Wikipedia and Wikisource. 00:18:40.145 --> 00:18:43.665 The brand name is, as far as I'm concerned, irrelevant. 00:18:43.665 --> 00:18:47.953 It's just the project to make better footnotes. 00:18:50.093 --> 00:18:52.454 (woman 2) Just a comment in relation to your query 00:18:52.454 --> 00:18:57.726 about satellite proposals 00:18:57.726 --> 00:18:59.956 for satellite conferences-- 00:19:00.768 --> 00:19:04.007 I don't think you realize the level of ignorance 00:19:04.007 --> 00:19:08.351 about Wiki-anything from our country in New Zealand. 00:19:08.351 --> 00:19:09.731 I mean, seriously. 00:19:09.731 --> 00:19:12.497 As an Australian, I recognize the ignorance of New Zealanders-- 00:19:12.497 --> 00:19:15.142 (laughter) 00:19:17.747 --> 00:19:21.270 (woman 2) Oh, [inaudible], come on! 00:19:26.987 --> 00:19:31.248 What I'm trying to say is that if we have a satellite 00:19:31.248 --> 00:19:36.159 or somehow organize a joint satellite conference, 00:19:37.399 --> 00:19:39.793 from my perspective, what I'm looking for 00:19:39.793 --> 00:19:43.192 is strategies and how to engage the community. 00:19:43.192 --> 00:19:48.215 They aren't even at the level of being-- 00:19:48.215 --> 00:19:50.681 they don't know enough to even be enthusiastic 00:19:50.681 --> 00:19:52.997 about Wikidata and WikiCite yet. 00:19:52.997 --> 00:19:57.041 They look at it with a lot of skepticism, if they're even aware of it. 00:19:57.041 --> 00:20:02.632 So I, in particular, want to be able to have a meeting 00:20:02.632 --> 00:20:04.902 in order to be able to learn from those. 00:20:04.902 --> 00:20:08.387 We've already engaged more successfully with the community 00:20:08.387 --> 00:20:14.339 to get a skill base in order to build some collaborations in New Zealand. 00:20:15.117 --> 00:20:18.555 You're talking about extra people to actually engage with. 00:20:18.555 --> 00:20:21.885 I just want the core library community to get on board, 00:20:21.885 --> 00:20:24.367 and then go the extra step. 00:20:24.367 --> 00:20:28.284 It's like I'm looking at you saying that we want to reach out 00:20:28.284 --> 00:20:29.663 to other communities, 00:20:29.663 --> 00:20:32.820 and I'm saying, I just want to reach out to a community. 00:20:32.820 --> 00:20:36.550 You know, we're a lot further behind where we are. 00:20:36.550 --> 00:20:37.963 So, yeah. 00:20:39.292 --> 00:20:44.937 I would not wish to pretend that WikiCite and open bibliographic information 00:20:44.937 --> 00:20:50.478 is the be-all and end-all of Wikidata or Wikimedia outreach. 00:20:51.209 --> 00:20:53.107 It's a specific subset. 00:20:53.107 --> 00:20:59.333 And I would not wish to try and make WikiCite a brand, 00:21:00.477 --> 00:21:04.242 appear to be overriding or replacing 00:21:04.242 --> 00:21:06.841 or somehow getting in the way 00:21:06.841 --> 00:21:10.970 of just general, good quality outreach about Wikimedia, 00:21:11.462 --> 00:21:16.085 and working with libraries, in general, and Wikidata, 00:21:16.085 --> 00:21:17.183 even more specific. 00:21:17.183 --> 00:21:19.386 This is a subset of Wikidata. 00:21:19.386 --> 00:21:22.770 So, particularly, for WikiCite satellite events, 00:21:24.100 --> 00:21:30.251 I don't want to make it appear like there's a competition for Wiki-- 00:21:30.662 --> 00:21:33.976 so, everything about Wikidata now has to be called WikiCite-- no. 00:21:33.976 --> 00:21:37.948 This is a really quite niche-- in the scheme of things-- topic area, 00:21:37.948 --> 00:21:40.497 supporting general awareness-raising 00:21:40.497 --> 00:21:43.044 about Wikidata and open access information, 00:21:43.044 --> 00:21:47.656 and Wikimedia is far beyond the scope 00:21:47.656 --> 00:21:52.031 of this kind of particular specialist outreach. 00:21:52.413 --> 00:21:57.812 And that's not to say that it's not a good thing, too. 00:21:59.304 --> 00:22:01.898 (woman 2) I just perceived-- sorry, one more comment-- 00:22:01.898 --> 00:22:06.198 WikiCite as the possible inroad 00:22:06.209 --> 00:22:09.703 to those at the wider community 00:22:09.703 --> 00:22:13.321 for the people we want to get on board. 00:22:13.321 --> 00:22:16.510 So, to me, WikiCite is-- yes, it's a subset, 00:22:16.510 --> 00:22:22.630 and really a much smaller set of beliefs and information, et cetera-- 00:22:23.075 --> 00:22:26.844 but I see it as an easy steppingstone to get them addicted, 00:22:26.844 --> 00:22:28.870 and then you can open it up. 00:22:29.356 --> 00:22:30.648 So, yeah. 00:22:35.901 --> 00:22:38.814 (assistant) We have just time for one short question. 00:22:41.062 --> 00:22:45.549 So, one of you have another question for the WikiCite team? 00:22:47.970 --> 00:22:49.749 Thank you for sharing this feedback with us. 00:22:49.749 --> 00:22:52.161 Oh, somebody has a question. 00:22:58.502 --> 00:23:01.402 (assistant) Which one of you wants to... 00:23:06.032 --> 00:23:07.732 (woman 3) Hi, thank you so much for this. 00:23:07.732 --> 00:23:11.259 I was just wondering, is there ever going to be 00:23:11.989 --> 00:23:17.861 a paring of the bibliography used in Wikipedia articles and WikiCite? 00:23:17.867 --> 00:23:22.831 Are you planning to move all those references and parse them 00:23:22.831 --> 00:23:27.452 so that we can do some analyses of which references we're using 00:23:27.452 --> 00:23:30.249 in the Wikipedia articles-- 00:23:30.249 --> 00:23:33.808 and when you create an article in another language 00:23:33.808 --> 00:23:38.510 just to get suggestions of this, are the references that have been used, 00:23:38.510 --> 00:23:39.855 kind of like that. 00:23:39.855 --> 00:23:45.137 I know one of the short-term goals of WikiCite is to have all citations 00:23:45.137 --> 00:23:48.058 in WikiProjects represented in Wikidata. 00:23:48.503 --> 00:23:50.613 Currently, there's not an automatic pipeline 00:23:50.613 --> 00:23:53.045 that keeps that updated, 00:23:53.045 --> 00:23:55.831 but that's definitely one of our primary goals. 00:23:57.410 --> 00:24:01.266 And ultimately, there is not specific support 00:24:01.266 --> 00:24:07.834 in the developer community for that kind of activity in particular. 00:24:07.834 --> 00:24:10.467 That's on the interests of individual community members 00:24:10.467 --> 00:24:14.842 to do exports-- like all this work that's been demonstrated 00:24:14.842 --> 00:24:17.273 that's not from the foundation-- 00:24:17.273 --> 00:24:20.520 people doing individual work on their interests. 00:24:20.520 --> 00:24:23.316 So, that could be a good satellite event 00:24:23.316 --> 00:24:26.438 to try and explore that kind of work. 00:24:26.438 --> 00:24:31.301 Getting it a good pipeline so that you can make references 00:24:31.301 --> 00:24:36.445 in Wikipedia's easily hook into Wikidata items, 00:24:36.445 --> 00:24:38.144 multilingual, et cetera-- 00:24:38.144 --> 00:24:41.273 does not yet exist technologically, 00:24:41.273 --> 00:24:45.876 and certain languages have concerns about that. 00:24:45.876 --> 00:24:50.135 The larger the Wikipedia language, the more defensive they are 00:24:50.135 --> 00:24:53.224 about using Wikidata directly. 00:24:54.402 --> 00:24:56.101 But that'll come. 00:24:57.719 --> 00:25:00.273 Yeah, I was just going to say when Liam's finished with that-- 00:25:00.273 --> 00:25:03.288 that it's strictly citations or something that are very much within scope, 00:25:03.288 --> 00:25:07.547 and what we would like to work for, but that needs community to build this, 00:25:07.547 --> 00:25:09.653 to take on that challenge, I think. 00:25:10.353 --> 00:25:15.198 And also, we need to be doing the outreach to the Wikipedians to show them 00:25:15.198 --> 00:25:18.618 that we can provide good quality data consistently. 00:25:22.185 --> 00:25:24.162 (assistant) We are running out of time. 00:25:24.162 --> 00:25:27.665 So, if someone has another question 00:25:27.665 --> 00:25:33.075 I think that these nice people will ask you privately after. 00:25:33.617 --> 00:25:36.452 So, it's time for us, for the last edition, 00:25:36.452 --> 00:25:39.386 and we are welcoming on stage. 00:25:39.956 --> 00:25:44.012 Jean-Fred, Envel, and... 00:25:44.012 --> 00:25:46.621 (applause)