1
00:00:06,292 --> 00:00:10,018
Well, it's almost time
to begin the presentation.
2
00:00:11,598 --> 00:00:17,114
We will begin this last session
with a presentation on WikiCite,
3
00:00:17,883 --> 00:00:22,029
led by Elizabeth Seiver,
Simon Cobb, and Liam Wyatt.
4
00:00:22,029 --> 00:00:26,280
And I'll just let you introduce yourself.
5
00:00:27,129 --> 00:00:30,288
Please don't hesitate
to take notes on Etherpad.
6
00:00:30,746 --> 00:00:33,214
Thank you for everything.
7
00:00:35,359 --> 00:00:37,055
Alright, let's get started.
8
00:00:37,055 --> 00:00:38,474
So, I'm Elizabeth Seiver.
9
00:00:38,474 --> 00:00:42,784
I'm the outgoing
program manager for WikiCite.
10
00:00:42,784 --> 00:00:45,247
And I wanted to tell you all
a little bit about it.
11
00:00:45,247 --> 00:00:48,375
Just as a show of hands, how many people
are already familiar with WikiCite?
12
00:00:49,020 --> 00:00:51,773
That's great. I'm just glad
that so many of you are.
13
00:00:51,773 --> 00:00:54,366
I was wondering how many people here--
I was thinking about it--
14
00:00:54,366 --> 00:00:57,641
is just like, "Who are all these people
putting all the citations
15
00:00:57,641 --> 00:00:59,629
in Wikidata and filling it up?"
16
00:00:59,629 --> 00:01:01,052
And WikiCite is so much more.
17
00:01:01,052 --> 00:01:03,850
So, we're all excited
to tell you about it today.
18
00:01:05,676 --> 00:01:07,425
So, what is WikiCite?
19
00:01:08,275 --> 00:01:12,098
The goal of WikiCite
is to collect all citations
20
00:01:12,098 --> 00:01:14,185
for the sum of all human knowledge.
21
00:01:14,185 --> 00:01:16,046
You know, just a little something.
22
00:01:16,046 --> 00:01:19,696
And we're doing this in a number of ways.
23
00:01:20,915 --> 00:01:25,286
And one of them
is via conferences and workshops
24
00:01:25,286 --> 00:01:27,727
and getting together
the community of people
25
00:01:27,727 --> 00:01:30,482
who are interested
in working on citations.
26
00:01:30,482 --> 00:01:32,992
And it's a very diverse group of people.
27
00:01:32,992 --> 00:01:35,463
So, of course, we have people
who are working in Wikidata,
28
00:01:35,463 --> 00:01:36,879
and other Wikimedians.
29
00:01:36,879 --> 00:01:40,580
We have librarians,
people into linked open data,
30
00:01:40,580 --> 00:01:44,751
software engineers, data scientists,
open knowledge advocates--
31
00:01:45,517 --> 00:01:50,211
coming together about
linked open bibliographic data.
32
00:01:53,226 --> 00:01:55,858
So, in terms of the history of WikiCite,
33
00:01:55,858 --> 00:01:59,185
it was founded as an initiative in 2016.
34
00:01:59,485 --> 00:02:05,456
And we secured dedicated funding
for events for three years in 2018.
35
00:02:05,456 --> 00:02:10,677
And as I mentioned,
you're probably familiar with the big--
36
00:02:11,380 --> 00:02:13,445
the millions of citations
that we already have
37
00:02:13,445 --> 00:02:15,184
that are hosted on Wikidata.
38
00:02:23,293 --> 00:02:27,601
So, what are we doing in WikiCite
and with all these citations?
39
00:02:27,601 --> 00:02:30,278
It's not just about collecting them.
It's about using them.
40
00:02:30,278 --> 00:02:36,038
And it creates so many opportunities
for new projects.
41
00:02:36,567 --> 00:02:38,798
So, one of the things
you can do with this data
42
00:02:38,798 --> 00:02:42,161
is build data models
for bibliographic item types,
43
00:02:42,161 --> 00:02:44,998
which should be exciting for people
who are into schemas.
44
00:02:45,973 --> 00:02:50,319
You can also do open cataloging
and disambiguation--
45
00:02:51,856 --> 00:02:54,209
sorry, my notes are not in sync with this.
46
00:02:54,448 --> 00:02:57,396
And people are also building
tools on top of this.
47
00:02:57,896 --> 00:03:00,833
Visualization tools, such as Scholia.
48
00:03:03,275 --> 00:03:06,995
If you're interested at all
in open cataloging,
49
00:03:06,995 --> 00:03:09,305
or author disambiguation,
50
00:03:10,201 --> 00:03:12,912
or just even figuring out
how sources link together,
51
00:03:12,912 --> 00:03:15,306
WikiCite is a good way to do that.
52
00:03:17,331 --> 00:03:20,454
So, in terms of the direction
that WikiCite is heading in,
53
00:03:21,021 --> 00:03:23,306
one of the things
we're trying to do is expand
54
00:03:23,306 --> 00:03:25,566
all the types of things that are cited.
55
00:03:25,566 --> 00:03:28,441
Right now, in Wikidata,
it's mostly journal articles.
56
00:03:28,927 --> 00:03:31,078
We'd like to keep growing our community,
57
00:03:31,078 --> 00:03:32,870
especially outside of the Global North
58
00:03:32,870 --> 00:03:35,862
and outside of English
language publications.
59
00:03:36,635 --> 00:03:41,409
And I realize this is actually something
that Liam will be talking about.
60
00:03:42,704 --> 00:03:45,537
So, what we wanted to do now,
61
00:03:45,537 --> 00:03:49,840
to do sort of a deep dive
into one of the uses of Wikidata.
62
00:03:49,840 --> 00:03:53,725
So, for that, I would like
to introduce Simon Cobb.
63
00:03:57,294 --> 00:03:58,348
Hi, everyone.
64
00:03:58,348 --> 00:04:01,260
So, what I want to talk about
is an example of something
65
00:04:01,260 --> 00:04:05,061
we could potentially focus on
within the scope of WikiCite.
66
00:04:05,061 --> 00:04:08,056
And that's the data quality issues
that I've been encountering
67
00:04:08,056 --> 00:04:12,388
over the last year, as I've been editing
on scholarly papers.
68
00:04:15,549 --> 00:04:17,934
The three issues I'm going
to briefly touch on
69
00:04:17,934 --> 00:04:21,059
are the quality of the author items
that are getting attached
70
00:04:21,059 --> 00:04:24,996
to scholarly articles,
issues around DOI formats,
71
00:04:24,996 --> 00:04:29,038
and just general curation
of the data that we're creating.
72
00:04:31,269 --> 00:04:33,578
Firstly, we look at some authors.
73
00:04:33,578 --> 00:04:35,798
Oh, sorry, firstly,
I'll provide some context.
74
00:04:35,798 --> 00:04:38,952
We've got 26 million
scholarly article items now.
75
00:04:38,952 --> 00:04:41,943
And the data quality issues
I'm going to talk about,
76
00:04:41,943 --> 00:04:43,844
a very small proportion of these
77
00:04:43,844 --> 00:04:47,448
are generally creating
quite good quality data.
78
00:04:48,467 --> 00:04:53,645
We have a lot of external identifiers--
21.65 million PubMed IDs,
79
00:04:53,645 --> 00:04:59,595
19 million DOIs, and we've added
8.3 million author statements,
80
00:04:59,972 --> 00:05:05,200
although we still have 105.5 million
author name strings to replace.
81
00:05:07,308 --> 00:05:09,562
In terms of the authors,
82
00:05:09,562 --> 00:05:12,823
we've been creating a lot
of items from ORCID IDs.
83
00:05:14,583 --> 00:05:17,950
We've got over half a million items
with an ORCID ID now.
84
00:05:18,437 --> 00:05:22,848
But over 50% of those
do not have any affiliation data yet.
85
00:05:23,455 --> 00:05:26,697
And that's now in employer
or in educated at.
86
00:05:26,697 --> 00:05:28,727
I found 25,000
87
00:05:28,727 --> 00:05:30,757
where we only have two statements.
88
00:05:30,757 --> 00:05:34,786
That's an ORCID ID,
and an instance of a human.
89
00:05:34,786 --> 00:05:37,747
This isn't particularly
useful in terms of--
90
00:05:39,105 --> 00:05:42,267
we use for anyone else
and beyond Wikidata.
91
00:05:42,267 --> 00:05:45,510
If we're serious about approaching
a bibliographic database
92
00:05:45,510 --> 00:05:47,696
and providing open data for people,
93
00:05:47,696 --> 00:05:50,773
we really need to be focusing
on quality, I believe.
94
00:05:52,083 --> 00:05:54,014
So, there's a lot of work to be done.
95
00:05:54,014 --> 00:05:56,852
We've done really well
with automatic input,
96
00:05:56,852 --> 00:05:59,287
but I think we need to, in the future,
97
00:05:59,287 --> 00:06:02,612
step back and think
how can we really make this data useful.
98
00:06:03,638 --> 00:06:04,910
And one of the ways to do that
99
00:06:04,910 --> 00:06:09,917
is by making our author items
better quality
100
00:06:09,917 --> 00:06:14,274
by adding affiliation information,
adding first names, surnames,
101
00:06:14,864 --> 00:06:18,928
and just moving beyond
occupation researcher,
102
00:06:18,928 --> 00:06:22,445
trying to get what field people
are working in, for example.
103
00:06:23,916 --> 00:06:25,699
Moving on to DOIs.
104
00:06:25,699 --> 00:06:29,429
When I was looking at how many
scholarly papers we have now,
105
00:06:29,429 --> 00:06:35,040
I immediately noticed that we have DOIs
that are just four characters.
106
00:06:35,040 --> 00:06:37,231
And that is not a correct DOI.
107
00:06:37,231 --> 00:06:42,288
We've got about 110 items
with this DOI format.
108
00:06:42,868 --> 00:06:46,702
In the grand scheme of things,
not that big a problem.
109
00:06:46,702 --> 00:06:49,055
But that's never been a correct DOI
110
00:06:49,055 --> 00:06:51,525
that's being created
by an automatic process.
111
00:06:51,525 --> 00:06:55,279
No one's checked that and realized
we had this error and corrected it.
112
00:06:55,279 --> 00:06:57,974
So, it's kind of an appeal
I want to make to people--
113
00:06:58,534 --> 00:07:01,099
if you're doing batch imports,
to check what you're doing,
114
00:07:01,099 --> 00:07:03,648
look for these obvious
data quality problems.
115
00:07:06,544 --> 00:07:10,607
And another final issue
that I've noticed is errata.
116
00:07:10,607 --> 00:07:15,718
We have over 13 thousand items
that are instance of errata,
117
00:07:16,050 --> 00:07:18,604
but they're not linked
to the paper they're correcting.
118
00:07:18,609 --> 00:07:25,343
So, I've also produced a table
of the top ten titles of the--
119
00:07:26,153 --> 00:07:27,341
these are errata items.
120
00:07:27,341 --> 00:07:30,030
You will notice they're not
particularly informative.
121
00:07:30,030 --> 00:07:32,495
So, as some point,
we're going to have to go back
122
00:07:32,495 --> 00:07:36,868
and look at how we can actually
get the information
123
00:07:36,868 --> 00:07:39,839
about what these errata are correcting,
124
00:07:39,839 --> 00:07:43,919
because they're not really
of much use to anyone at the moment.
125
00:07:44,711 --> 00:07:48,834
So, in the future, I hope this is one area
that we can work on as a community,
126
00:07:48,834 --> 00:07:53,354
and we can coordinate a bit better
with what data imports we're doing,
127
00:07:53,354 --> 00:07:57,773
and how we can curate all our data,
bring it all together,
128
00:07:57,773 --> 00:07:59,876
and combine our expertise.
129
00:08:01,619 --> 00:08:03,277
I'm going to pass over to Liam now
130
00:08:03,277 --> 00:08:07,905
to talk a bit about how we might be able
to coordinate our efforts in the future.
131
00:08:15,119 --> 00:08:16,536
Thank you.
132
00:08:21,296 --> 00:08:26,837
So, as mentioned
in the final slide from Elizabeth,
133
00:08:26,837 --> 00:08:33,115
WikiCite is trying to be
more and more diverse,
134
00:08:33,115 --> 00:08:36,139
and high quality, and more widely spread.
135
00:08:36,139 --> 00:08:39,947
The idea is over the next year or so,
with the dedicated funding
136
00:08:39,947 --> 00:08:43,114
that's been provided and is available
over a three-year period,
137
00:08:43,114 --> 00:08:45,346
of which we've entered,
138
00:08:45,346 --> 00:08:51,189
to change WikiCite-- the conference--
which there's been a few--
139
00:08:51,189 --> 00:08:58,162
into a series of proposals from you,
140
00:08:58,891 --> 00:09:02,858
into what we're calling
"satellite events" around the world.
141
00:09:03,342 --> 00:09:08,240
This will be focusing--
there'll be a call for a proposal system--
142
00:09:08,240 --> 00:09:12,043
like reviewing a procedure
143
00:09:12,043 --> 00:09:14,393
that is currently not yet invented
144
00:09:14,393 --> 00:09:18,877
for deciding on how to--
145
00:09:20,967 --> 00:09:22,482
what's the word I'm after--
146
00:09:22,482 --> 00:09:24,804
prioritize these requests.
147
00:09:24,804 --> 00:09:29,378
And see if we can't get a wider diversity
148
00:09:29,378 --> 00:09:32,784
of content contributor and topic
149
00:09:33,115 --> 00:09:36,546
supported in the WikiCite umbrella,
150
00:09:38,244 --> 00:09:41,009
through this series of satellite events.
151
00:09:41,347 --> 00:09:42,886
To that end,
152
00:09:43,102 --> 00:09:46,702
the WikiCite grant--
153
00:09:47,037 --> 00:09:52,345
was successfully applied for and received
154
00:09:52,345 --> 00:09:57,460
through the work
of WikiCite's father, Dario,
155
00:09:57,460 --> 00:10:00,982
who many of you might know
from the Wikimedia Foundation.
156
00:10:00,982 --> 00:10:04,109
Dario no longer works
with the Wikimedia Foundation,
157
00:10:04,109 --> 00:10:06,662
and so this grant has a--
158
00:10:08,038 --> 00:10:09,858
needed a home.
159
00:10:09,858 --> 00:10:14,275
What has happened
is that the WikiCite steering committee,
160
00:10:14,275 --> 00:10:17,104
primarily made up of the organizing team
161
00:10:17,104 --> 00:10:20,317
from last year's WikiCite conference,
162
00:10:20,317 --> 00:10:23,923
will continue to oversee this work,
163
00:10:23,923 --> 00:10:27,654
and the Wikimedia Foundation
has hired a temporary
164
00:10:28,218 --> 00:10:31,567
or a part-time coordinator,
165
00:10:32,024 --> 00:10:34,617
to oversee and support that work,
166
00:10:34,617 --> 00:10:38,790
and to promote and receive
those applications
167
00:10:38,790 --> 00:10:40,453
for the satellite events.
168
00:10:40,453 --> 00:10:41,998
And that will be me.
169
00:10:42,838 --> 00:10:45,405
(laughter and cheers)
170
00:10:48,323 --> 00:10:50,591
So, I got the call yesterday
171
00:10:50,591 --> 00:10:54,320
so that I could be able
to like confirm that in--
172
00:10:54,320 --> 00:10:58,641
among an audience
which is highly relevant to that topic.
173
00:10:59,308 --> 00:11:02,592
Which is helpful, so I can talk to you
here and now about that.
174
00:11:03,967 --> 00:11:08,671
So, this is listed as a panel
in the program.
175
00:11:09,562 --> 00:11:11,262
Even though it's a bit of a--
176
00:11:12,194 --> 00:11:13,776
I think panel is a generous way
177
00:11:13,776 --> 00:11:17,342
of describing the three of us
in this context.
178
00:11:17,342 --> 00:11:19,573
But the idea is we would like
to hear from you
179
00:11:19,573 --> 00:11:22,181
on that immediate thought about--
180
00:11:22,573 --> 00:11:24,908
or questions to Simon, as well--
181
00:11:24,908 --> 00:11:27,494
if you have questions
for Simon, specifically--
182
00:11:27,494 --> 00:11:31,912
about what you think are good directions
183
00:11:31,912 --> 00:11:35,316
that should be addressed
or should be attempted
184
00:11:35,316 --> 00:11:37,308
in this forthcoming year,
185
00:11:38,605 --> 00:11:40,429
either individually, online--
186
00:11:41,516 --> 00:11:43,987
and things that not
necessarily you can do,
187
00:11:43,987 --> 00:11:46,135
but think should be done.
188
00:11:47,431 --> 00:11:52,023
And specifically, to start thinking about
189
00:11:52,023 --> 00:11:55,624
what a satellite event would mean
190
00:11:55,624 --> 00:11:58,459
with relation to open citations
191
00:11:58,459 --> 00:12:03,193
and how the community at large
would best be served
192
00:12:03,193 --> 00:12:04,731
by that kind of support.
193
00:12:04,731 --> 00:12:08,862
Beyond merely financial,
but what does support mean
194
00:12:08,862 --> 00:12:13,364
for satellite events in open citations
according to you.
195
00:12:14,502 --> 00:12:16,726
If you want to come back up, and we can--
196
00:12:17,293 --> 00:12:18,584
Did you have a question?
197
00:12:18,584 --> 00:12:23,039
(woman) Ah, yes. I do research
on predatory publishing
198
00:12:23,039 --> 00:12:25,062
and on retractions.
199
00:12:25,062 --> 00:12:26,807
You only mentioned errata.
200
00:12:26,807 --> 00:12:29,999
So, how are you dealing
with expressions of concern
201
00:12:29,999 --> 00:12:31,666
and retractions?
202
00:12:31,666 --> 00:12:36,000
And what is your policy on trying
to identify predatory publishers?
203
00:12:37,014 --> 00:12:39,802
Okay, so, within the scope
of preparing for this,
204
00:12:39,802 --> 00:12:45,552
I wasn't looking at retractions,
but people have been doing work on that
205
00:12:45,552 --> 00:12:49,039
and trying to-- we have a property--
notice of retractions--
206
00:12:49,039 --> 00:12:51,509
so we can be creating those links.
207
00:12:52,449 --> 00:12:55,482
I don't know what extent
that's happened in the same way.
208
00:12:55,482 --> 00:13:00,302
Not all the errata are linked
to the paper that's being corrected.
209
00:13:00,302 --> 00:13:02,500
I suspect that's a similar case with--
210
00:13:02,500 --> 00:13:04,439
- (woman) It's exactly the same.
- Yeah.
211
00:13:05,327 --> 00:13:06,906
As I said, I wasn't looking at that,
212
00:13:06,906 --> 00:13:12,490
but we can potentially link the retraction
to the retracted article,
213
00:13:12,490 --> 00:13:15,144
the retraction notice
to the retracted article.
214
00:13:15,144 --> 00:13:17,246
In terms of predatory publishers,
215
00:13:18,278 --> 00:13:21,379
I'm not aware of anyone
having done any work in this area,
216
00:13:21,379 --> 00:13:23,842
but I wouldn't like to say
that hasn't happened.
217
00:13:23,842 --> 00:13:26,835
We have Charles, whose hand
is going up there.
218
00:13:26,835 --> 00:13:30,080
Do you want to comment
on predatory publishers, Charles?
219
00:13:35,170 --> 00:13:38,843
(Charles) Well, I encountered
this problem in the ScienceSource project.
220
00:13:38,843 --> 00:13:43,462
And first of all, I did what I could
to put fields list in Wikidata format.
221
00:13:43,462 --> 00:13:46,984
Fields list isn't sort of what everybody
wants to be dealing with,
222
00:13:46,984 --> 00:13:48,442
but it was a starting point.
223
00:13:48,442 --> 00:13:52,684
So, that has been done,
as far as I was able to.
224
00:13:53,274 --> 00:13:56,744
But the thing I rely on more, perhaps,
225
00:13:56,744 --> 00:14:00,717
is DOAJ IDs.
226
00:14:00,717 --> 00:14:04,182
That is, if we put all the DOAJ IDs
227
00:14:04,182 --> 00:14:07,826
into Wikidata,
228
00:14:07,826 --> 00:14:09,908
we'd have made a really good attempt
229
00:14:09,908 --> 00:14:12,620
to isolate the predatory publishers.
230
00:14:12,620 --> 00:14:14,057
And that is not the whole story,
231
00:14:14,057 --> 00:14:16,926
but these days,
it's the bulk of the story.
232
00:14:16,926 --> 00:14:19,072
(woman) [Is the directory
of open access there?]
233
00:14:19,072 --> 00:14:22,042
- (Charles) Directory of open access, yes.
- (woman) Alright, good.
234
00:14:31,452 --> 00:14:34,241
(man) To start with, I just spent a year
traveling around New Zealand
235
00:14:34,241 --> 00:14:37,775
trying to explain Wikidata
to the library community,
236
00:14:37,775 --> 00:14:40,222
and as soon as I mentioned WikiCite,
their eyes rolled,
237
00:14:40,222 --> 00:14:42,868
because they've just been told
they have to be [up] with Wikipedia,
238
00:14:42,868 --> 00:14:43,956
Wiki Commons, Wikidata.
239
00:14:43,956 --> 00:14:46,232
Here's another Wiki project
that they need to know about.
240
00:14:46,232 --> 00:14:48,739
"Why can't we just do it all
with Wikidata?" they were saying.
241
00:14:48,739 --> 00:14:50,893
So, there's a public perception
problem straightaway,
242
00:14:50,893 --> 00:14:53,803
and that's the very community
that we need to have onboard
243
00:14:53,803 --> 00:14:54,819
for this to work.
244
00:14:54,819 --> 00:14:57,096
I'm interested in thinking
how we are going to reach
245
00:14:57,096 --> 00:15:01,420
the library community, educate them,
and get them integrally involved
246
00:15:01,420 --> 00:15:03,119
in this process?
247
00:15:06,449 --> 00:15:09,328
I have thoughts, but I'd like
to hear your thoughts first.
248
00:15:10,909 --> 00:15:14,035
- Sure, I think--
- (assistant) [This one is on.]
249
00:15:16,384 --> 00:15:18,692
This better? Alright.
250
00:15:18,692 --> 00:15:20,535
Feel like I'm in a concert.
251
00:15:20,535 --> 00:15:23,088
So, one of the things we've tried to do
252
00:15:23,088 --> 00:15:29,666
is incorporate librarians and libraries
253
00:15:29,666 --> 00:15:32,616
into WikiCite in everything that we do.
254
00:15:32,616 --> 00:15:33,917
So, on the steering committee,
255
00:15:33,917 --> 00:15:39,457
we have at least
two librarians, if not more.
256
00:15:39,457 --> 00:15:44,230
And at our actual WikiCite events,
257
00:15:44,230 --> 00:15:46,702
one of the things that's actually
pretty great about WikiCite
258
00:15:46,702 --> 00:15:50,911
is that we end up getting
both speakers and participants,
259
00:15:50,911 --> 00:15:54,846
who maybe are not actually involved
in any Wiki projects.
260
00:15:54,846 --> 00:15:56,908
So, we don't have Wiki fatigue.
261
00:15:57,472 --> 00:16:01,117
And a lot of times, they're coming
from the perspective of...
262
00:16:01,688 --> 00:16:04,841
"Well, I'm interested in linked open data,
263
00:16:04,841 --> 00:16:07,427
I love to use citations at my university,
264
00:16:07,427 --> 00:16:09,750
can you tell me a little bit more
about how Wikidata works,
265
00:16:09,750 --> 00:16:12,378
and how I might use the citations
that are in Wikidata?"
266
00:16:12,378 --> 00:16:17,057
So, I think it's very much
about bringing these communities together,
267
00:16:17,057 --> 00:16:20,702
which might seem disparate,
around these common goals
268
00:16:20,702 --> 00:16:23,075
for people who are really concerned
about curating data,
269
00:16:23,075 --> 00:16:26,926
and then, people who might already know
about how to do that on Wikidata.
270
00:16:28,346 --> 00:16:31,397
I would say, in terms of the confusion,
271
00:16:31,397 --> 00:16:35,322
the complexity implied by the question
of well, there's WikiCite,
272
00:16:35,322 --> 00:16:37,351
and there's Wikidata, and there's this...
273
00:16:37,351 --> 00:16:41,999
WikiCite is a brand name,
it's a project-- GLAM-Wiki--
274
00:16:42,894 --> 00:16:45,529
GLAM-Wiki also uses the word Wiki,
275
00:16:45,529 --> 00:16:48,758
but it's not pretending to be a Wiki
276
00:16:48,758 --> 00:16:51,186
or competing with Wikipedia and Wikidata.
277
00:16:51,186 --> 00:16:56,118
It's the particular focus area
of reference information,
278
00:16:56,118 --> 00:16:58,789
"referenceable" information.
279
00:16:58,789 --> 00:17:04,908
Now, particularly in the context
of a series of conferences
280
00:17:04,908 --> 00:17:07,055
that have happened
over the last few years,
281
00:17:07,055 --> 00:17:09,447
and the conference is called WikiCite--
282
00:17:11,959 --> 00:17:15,811
particularly within this community,
the Wikidata core group,
283
00:17:15,811 --> 00:17:21,397
WikiCite is seen, known, understood
as a large number of items
284
00:17:21,397 --> 00:17:24,188
uploaded to Wikidata
about scholarly publications.
285
00:17:24,674 --> 00:17:30,462
That is what is understood as WikiCite
by this community, mostly.
286
00:17:32,937 --> 00:17:35,125
I would like to--
287
00:17:36,054 --> 00:17:38,045
there is a question about,
288
00:17:38,045 --> 00:17:42,060
could WikiCite be made
into its own Wikibase
289
00:17:42,060 --> 00:17:45,228
of just citation stuff?
290
00:17:45,228 --> 00:17:47,982
Not Wikidata, and then there's federation,
291
00:17:48,914 --> 00:17:50,573
and funky things like that,
292
00:17:50,573 --> 00:17:54,905
and you could put a lot more
very specific information
293
00:17:54,905 --> 00:17:58,431
about individual, citable things there,
294
00:17:59,644 --> 00:18:01,590
which is a perfectly valid way
295
00:18:01,590 --> 00:18:04,346
of dealing with questions
of notability and properties.
296
00:18:04,346 --> 00:18:06,452
But the technology for doing that
297
00:18:06,452 --> 00:18:10,791
is not yet relevant in any way.
298
00:18:10,791 --> 00:18:17,003
We need a lot more work,
particularly on federation in Wikibase
299
00:18:17,539 --> 00:18:19,553
to make sure everything syncs neatly.
300
00:18:20,122 --> 00:18:25,954
So, until such time
as that would be a viable outcome,
301
00:18:26,724 --> 00:18:31,315
in the meantime, all of the things
that would serve that kind of outcome
302
00:18:32,532 --> 00:18:35,969
also serve just improving
the quality on Wikidata
303
00:18:35,969 --> 00:18:39,087
and improving the links
with Wikipedia and Wikisource.
304
00:18:40,145 --> 00:18:43,665
The brand name is,
as far as I'm concerned, irrelevant.
305
00:18:43,665 --> 00:18:47,953
It's just the project to make
better footnotes.
306
00:18:50,093 --> 00:18:52,454
(woman 2) Just a comment
in relation to your query
307
00:18:52,454 --> 00:18:57,726
about satellite proposals
308
00:18:57,726 --> 00:18:59,956
for satellite conferences--
309
00:19:00,768 --> 00:19:04,007
I don't think you realize
the level of ignorance
310
00:19:04,007 --> 00:19:08,351
about Wiki-anything
from our country in New Zealand.
311
00:19:08,351 --> 00:19:09,731
I mean, seriously.
312
00:19:09,731 --> 00:19:12,497
As an Australian, I recognize
the ignorance of New Zealanders--
313
00:19:12,497 --> 00:19:15,142
(laughter)
314
00:19:17,747 --> 00:19:21,270
(woman 2) Oh, [inaudible], come on!
315
00:19:26,987 --> 00:19:31,248
What I'm trying to say
is that if we have a satellite
316
00:19:31,248 --> 00:19:36,159
or somehow organize
a joint satellite conference,
317
00:19:37,399 --> 00:19:39,793
from my perspective, what I'm looking for
318
00:19:39,793 --> 00:19:43,192
is strategies and how
to engage the community.
319
00:19:43,192 --> 00:19:48,215
They aren't even at the level of being--
320
00:19:48,215 --> 00:19:50,681
they don't know enough
to even be enthusiastic
321
00:19:50,681 --> 00:19:52,997
about Wikidata and WikiCite yet.
322
00:19:52,997 --> 00:19:57,041
They look at it with a lot of skepticism,
if they're even aware of it.
323
00:19:57,041 --> 00:20:02,632
So I, in particular, want to be able
to have a meeting
324
00:20:02,632 --> 00:20:04,902
in order to be able to learn from those.
325
00:20:04,902 --> 00:20:08,387
We've already engaged
more successfully with the community
326
00:20:08,387 --> 00:20:14,339
to get a skill base in order to build
some collaborations in New Zealand.
327
00:20:15,117 --> 00:20:18,555
You're talking about extra people
to actually engage with.
328
00:20:18,555 --> 00:20:21,885
I just want the core library community
to get on board,
329
00:20:21,885 --> 00:20:24,367
and then go the extra step.
330
00:20:24,367 --> 00:20:28,284
It's like I'm looking at you saying
that we want to reach out
331
00:20:28,284 --> 00:20:29,663
to other communities,
332
00:20:29,663 --> 00:20:32,820
and I'm saying, I just want
to reach out to a community.
333
00:20:32,820 --> 00:20:36,550
You know, we're a lot further
behind where we are.
334
00:20:36,550 --> 00:20:37,963
So, yeah.
335
00:20:39,292 --> 00:20:44,937
I would not wish to pretend that WikiCite
and open bibliographic information
336
00:20:44,937 --> 00:20:50,478
is the be-all and end-all of Wikidata
or Wikimedia outreach.
337
00:20:51,209 --> 00:20:53,107
It's a specific subset.
338
00:20:53,107 --> 00:20:59,333
And I would not wish to try
and make WikiCite a brand,
339
00:21:00,477 --> 00:21:04,242
appear to be overriding or replacing
340
00:21:04,242 --> 00:21:06,841
or somehow getting in the way
341
00:21:06,841 --> 00:21:10,970
of just general, good quality outreach
about Wikimedia,
342
00:21:11,462 --> 00:21:16,085
and working with libraries,
in general, and Wikidata,
343
00:21:16,085 --> 00:21:17,183
even more specific.
344
00:21:17,183 --> 00:21:19,386
This is a subset of Wikidata.
345
00:21:19,386 --> 00:21:22,770
So, particularly, for WikiCite
satellite events,
346
00:21:24,100 --> 00:21:30,251
I don't want to make it appear
like there's a competition for Wiki--
347
00:21:30,662 --> 00:21:33,976
so, everything about Wikidata now
has to be called WikiCite-- no.
348
00:21:33,976 --> 00:21:37,948
This is a really quite niche--
in the scheme of things-- topic area,
349
00:21:37,948 --> 00:21:40,497
supporting general awareness-raising
350
00:21:40,497 --> 00:21:43,044
about Wikidata
and open access information,
351
00:21:43,044 --> 00:21:47,656
and Wikimedia is far beyond the scope
352
00:21:47,656 --> 00:21:52,031
of this kind of particular
specialist outreach.
353
00:21:52,413 --> 00:21:57,812
And that's not to say
that it's not a good thing, too.
354
00:21:59,304 --> 00:22:01,898
(woman 2) I just perceived--
sorry, one more comment--
355
00:22:01,898 --> 00:22:06,198
WikiCite as the possible inroad
356
00:22:06,209 --> 00:22:09,703
to those at the wider community
357
00:22:09,703 --> 00:22:13,321
for the people we want to get on board.
358
00:22:13,321 --> 00:22:16,510
So, to me, WikiCite is--
yes, it's a subset,
359
00:22:16,510 --> 00:22:22,630
and really a much smaller set
of beliefs and information, et cetera--
360
00:22:23,075 --> 00:22:26,844
but I see it as an easy steppingstone
to get them addicted,
361
00:22:26,844 --> 00:22:28,870
and then you can open it up.
362
00:22:29,356 --> 00:22:30,648
So, yeah.
363
00:22:35,901 --> 00:22:38,814
(assistant) We have just time
for one short question.
364
00:22:41,062 --> 00:22:45,549
So, one of you have another question
for the WikiCite team?
365
00:22:47,970 --> 00:22:49,749
Thank you for sharing
this feedback with us.
366
00:22:49,749 --> 00:22:52,161
Oh, somebody has a question.
367
00:22:58,502 --> 00:23:01,402
(assistant) Which one of you wants to...
368
00:23:06,032 --> 00:23:07,732
(woman 3) Hi, thank you so much for this.
369
00:23:07,732 --> 00:23:11,259
I was just wondering,
is there ever going to be
370
00:23:11,989 --> 00:23:17,861
a paring of the bibliography
used in Wikipedia articles and WikiCite?
371
00:23:17,867 --> 00:23:22,831
Are you planning to move
all those references and parse them
372
00:23:22,831 --> 00:23:27,452
so that we can do some analyses
of which references we're using
373
00:23:27,452 --> 00:23:30,249
in the Wikipedia articles--
374
00:23:30,249 --> 00:23:33,808
and when you create an article
in another language
375
00:23:33,808 --> 00:23:38,510
just to get suggestions of this,
are the references that have been used,
376
00:23:38,510 --> 00:23:39,855
kind of like that.
377
00:23:39,855 --> 00:23:45,137
I know one of the short-term goals
of WikiCite is to have all citations
378
00:23:45,137 --> 00:23:48,058
in WikiProjects represented in Wikidata.
379
00:23:48,503 --> 00:23:50,613
Currently, there's not
an automatic pipeline
380
00:23:50,613 --> 00:23:53,045
that keeps that updated,
381
00:23:53,045 --> 00:23:55,831
but that's definitely one
of our primary goals.
382
00:23:57,410 --> 00:24:01,266
And ultimately, there
is not specific support
383
00:24:01,266 --> 00:24:07,834
in the developer community
for that kind of activity in particular.
384
00:24:07,834 --> 00:24:10,467
That's on the interests
of individual community members
385
00:24:10,467 --> 00:24:14,842
to do exports-- like all this work
that's been demonstrated
386
00:24:14,842 --> 00:24:17,273
that's not from the foundation--
387
00:24:17,273 --> 00:24:20,520
people doing individual work
on their interests.
388
00:24:20,520 --> 00:24:23,316
So, that could be a good satellite event
389
00:24:23,316 --> 00:24:26,438
to try and explore that kind of work.
390
00:24:26,438 --> 00:24:31,301
Getting it a good pipeline
so that you can make references
391
00:24:31,301 --> 00:24:36,445
in Wikipedia's easily hook
into Wikidata items,
392
00:24:36,445 --> 00:24:38,144
multilingual, et cetera--
393
00:24:38,144 --> 00:24:41,273
does not yet exist technologically,
394
00:24:41,273 --> 00:24:45,876
and certain languages
have concerns about that.
395
00:24:45,876 --> 00:24:50,135
The larger the Wikipedia language,
the more defensive they are
396
00:24:50,135 --> 00:24:53,224
about using Wikidata directly.
397
00:24:54,402 --> 00:24:56,101
But that'll come.
398
00:24:57,719 --> 00:25:00,273
Yeah, I was just going to say
when Liam's finished with that--
399
00:25:00,273 --> 00:25:03,288
that it's strictly citations or something
that are very much within scope,
400
00:25:03,288 --> 00:25:07,547
and what we would like to work for,
but that needs community to build this,
401
00:25:07,547 --> 00:25:09,653
to take on that challenge, I think.
402
00:25:10,353 --> 00:25:15,198
And also, we need to be doing the outreach
to the Wikipedians to show them
403
00:25:15,198 --> 00:25:18,618
that we can provide good
quality data consistently.
404
00:25:22,185 --> 00:25:24,162
(assistant) We are running out of time.
405
00:25:24,162 --> 00:25:27,665
So, if someone has another question
406
00:25:27,665 --> 00:25:33,075
I think that these nice people
will ask you privately after.
407
00:25:33,617 --> 00:25:36,452
So, it's time for us,
for the last edition,
408
00:25:36,452 --> 00:25:39,386
and we are welcoming on stage.
409
00:25:39,956 --> 00:25:44,012
Jean-Fred, Envel, and...
410
00:25:44,012 --> 00:25:46,621
(applause)