cdn.media.ccc.de/.../wikidatacon2019-5-eng-Wikibase_inspiration_panel_hd.mp4

Edit subtitles

0:06 - 0:09

Willkommen, Bienvenue-- Welcome.
0:09 - 0:11

I always wanted to say that on a stage.
0:11 - 0:13

(laughter)
0:13 - 0:15

This is going to be inspirational,
0:15 - 0:19

because this is the official
Wikibase inspiration panel
0:19 - 0:23

of WikidataCon 2019.
0:24 - 0:28

The point of this panel
is to be inspired by all the things
0:28 - 0:34

that people, in various countries,
in various fields, do with Wikibase,
0:34 - 0:36

the software behind Wikidata.
0:36 - 0:39

I was really surprised to learn today
that someone came to me and said,
0:39 - 0:42

"I learned about Wikibase
the first time today."
0:43 - 0:47

So, it is the software that runs Wikidata.
0:47 - 0:51

And if you want
to order things in the world
0:51 - 0:54

the way Wikidata orders things
in the world,
0:55 - 0:59

but you don't agree with the items
that we have in there,
0:59 - 1:02

because you might need
a finer level of granularity,
1:02 - 1:06

or maybe you don't want to start
with Q1, which is the universe,
1:06 - 1:10

because in your little world,
Q1 could be a book, if you are a library,
1:10 - 1:14

or it could be some kind of animal,
if you work in biology,
1:14 - 1:19

or it could be a historic person,
if you do digital humanities,
1:19 - 1:22

but you still want
the same system of ordering,
1:22 - 1:25

then Wikibase is the thing for you.
1:25 - 1:30

Over the last one or two years,
we have made contact
1:30 - 1:34

with extraordinary people,
who are pioneers, who are trailblazing,
1:34 - 1:37

who are evaluating Wikibase,
1:37 - 1:40

and who are doing
extremely great stuff with that.
1:41 - 1:44

This panel is going to be very rushed.
1:44 - 1:48

Every one of the participants
of this panel would have deserved
1:48 - 1:51

a one-hour slot to present their thing.
1:51 - 1:54

But our program is packed.
1:54 - 2:00

So, yeah, keep your seat belt fastened
for a fast-paced ride
2:00 - 2:04

through the inspirational
world of Wikibases.
2:04 - 2:10

And the first one is a project
from two organizations,
2:10 - 2:12

which is a little sensation in itself.
2:13 - 2:16

The Bibliothèque nationale de France,
the French National Library,
2:16 - 2:22

and Abes, which is an authority
for higher education.
2:23 - 2:26

But I think you will talk about that
more in your presentation,
2:26 - 2:31

and yeah, we'd like to welcome
Anila Angjeli and Benjamin Bober
2:31 - 2:35

on stage for the first
ten minutes of inspiration.
2:36 - 2:41

(applause)
2:46 - 2:47

Hi, everybody.
2:47 - 2:49

So, yeah, my name is Benjamin Bober.
2:49 - 2:52

So, I work for Abes,
2:52 - 2:54

which stands for Higher Education Agency,
2:54 - 2:56

Bibliographic Higher Education Agency.
2:56 - 3:01

Basically, we work with all
the university libraries in France,
3:01 - 3:03

and manage the union catalog.
3:03 - 3:06

And also their authority files.
3:07 - 3:10

And I'm here with Anila Angjeli,
from the BnF,
3:10 - 3:12

French National Library.
3:12 - 3:16

And we're going to talk to you
about our joint project,
3:17 - 3:21

which is about creating
a new production tool
3:21 - 3:24

for authorities data--
3:25 - 3:29

person, corporate bodies,
concepts, and so on.
3:29 - 3:33

And we spent the last months
3:33 - 3:37

asking Wikibase to do this stuff.
3:38 - 3:44

So, I will give you some context
really quickly,
3:46 - 3:49

because it's important for us,
as libraries--
3:49 - 3:54

There's been this technological
shift recently
3:56 - 3:58

with the linked open data movement,
3:58 - 4:02

and we wanted, as a bibliographical agency,
4:02 - 4:06

to follow this new trend.
4:06 - 4:08

And, well, it's been years since we've--
4:10 - 4:13

experimenting with linked open data,
4:13 - 4:16

with RDF, SPARQL and so on.
4:16 - 4:22

But we think that now
is the good time to move forward.
4:23 - 4:28

It's also a good time
because there's been a-- not a shift,
4:30 - 4:31

there's a fundamental change
4:31 - 4:37

in the way we consider
bibliographical data.
4:38 - 4:41

We used to, and we still have data
4:42 - 4:45

stored in records, we call it MARC records
4:45 - 4:48

in the library landscape.
4:48 - 4:51

We used a specific format called MARC.
4:53 - 4:57

But recently, there has been some way
4:59 - 5:02

to think about it
from another point of view.
5:02 - 5:07

And to go from a record-based world,
to an entity-based world
5:07 - 5:12

when we try to interconnect
people, works,
5:14 - 5:17

and other entities.
5:18 - 5:24

So, in this context, we decided
to launch this joint initiative.
5:26 - 5:29

But our goal is far beyond libraries.
5:29 - 5:32

We would like to have with us
5:36 - 5:38

other French GLAMS, for instance,
5:38 - 5:42

because we think our project
can help them also.
5:44 - 5:49

So basically, our project is called
Fichier National d'Entités,
5:49 - 5:51

so National Entity Files.
5:52 - 5:56

And it will be a shared platform
for collaboratively create
5:56 - 5:59

and maintain reference
data about entities.
5:59 - 6:02

Like I said, persons,
corporate bodies, places, concepts,
6:02 - 6:03

and creative works.
6:03 - 6:06

So, we embrace a lot of things.
6:07 - 6:10

And it's a challenge
because it's the first time
6:10 - 6:16

BnF and Abes collaborate
at such a level.
6:19 - 6:22

Giving you a quick view
about where we are--
6:23 - 6:25

where we've come from
and where we are now.
6:25 - 6:28

We have been working
on this project since 2017.
6:29 - 6:32

We've benchmarked,
other similar initiatives,
6:32 - 6:34

and came to the conclusion last year
6:34 - 6:41

that there was a strong interest
in Wikibase as the FNE's backbone.
6:42 - 6:45

We were considering it a good solution
6:45 - 6:49

to build upon, but we still
had doubts at this time,
6:50 - 6:54

because we have specific needs to fulfill.
6:55 - 6:59

So we decided to launch,
to spend this year
6:59 - 7:02

to build a proof of concept with real data
7:02 - 7:06

both from BnF catalog,
authority catalog, and our catalogs.
7:07 - 7:11

And well, try to merge this data
into a Wikibase,
7:11 - 7:13

and to try to see how they behave
7:13 - 7:18

and how the tool can fulfill our needs.
7:18 - 7:22

And we were helped
in this proof of concept
7:22 - 7:27

by Maxime and Vincent
from Inventaire.io,
7:28 - 7:33

who helped us have a better idea
7:33 - 7:37

about what Wikibase can bring us.
7:37 - 7:40

And Anila will talk
about the first findings.
7:42 - 7:47

So, while this decision to go
7:47 - 7:50

with experiments with the Wikibase
7:50 - 7:53

as the technical infrastructure backbone
7:53 - 7:57

or the basic layer for our FNE
7:57 - 8:04

was because it's not trivial
to move from one system to another,
8:05 - 8:10

and because the initiative
of using the Wikibase
8:11 - 8:16

as the technical infrastructure
for our data--
8:18 - 8:19

it was both--
8:20 - 8:26

means that we move from our classical
8:27 - 8:28

system information
8:28 - 8:33

or library information system
to quite another thing.
8:34 - 8:36

And so, we needed to experiment first,
8:36 - 8:42

and just to see whether a set
of functionalities that are--
8:42 - 8:48

that we usually need to perform
and fulfill in our environment--
8:48 - 8:50

professional environment.
8:50 - 8:53

I'm talking here about creating
and maintaining,
8:53 - 8:57

and not publishing,
which is a big difference.
8:57 - 9:00

You were at the session,
the previous session,
9:00 - 9:04

with just Wikidata Commons,
9:04 - 9:07

contribution strategies for GLAM--
9:07 - 9:13

it was about publication
and ways about creation in itself.
9:13 - 9:16

So, we need to go step by step,
9:16 - 9:20

and that's why we conducted
this experiment, this proof of concept.
9:21 - 9:27

And, good surprise, no major obstacle
to ingest library data
9:27 - 9:31

according to a specific ontology,
which is, while we--
9:31 - 9:38

I briefly mentioned that we put their data
in two different flavors of MARC,
9:39 - 9:43

then we defined
some [inaudible] properties
9:43 - 9:47

in order to be able to experiment
with merging the data,
9:47 - 9:53

and there was no major obstacle
from the technical point of view.
9:53 - 9:57

Of course, we came up with a confirmation
9:57 - 10:00

that Wikibase does offer built-in features
10:00 - 10:05

that could be used as the basis
for the technical infrastructure for FNE.
10:06 - 10:09

But again, the decision is not yet made,
10:09 - 10:12

because the experiment is still--
10:12 - 10:16

let's say, the developments
have been completed.
10:17 - 10:22

Now, we're in the phase of writing
the final conclusions,
10:22 - 10:29

and the decision is not yet made
from the strategic point of view,
10:29 - 10:34

but these are really the first findings
we can talk about.
10:35 - 10:38

And Wikibase-- it appears to us
10:38 - 10:43

that a Wikibase might be
a good operational solution
10:43 - 10:49

for managing this initiative--
that is jointly, collaboratively,
10:49 - 10:52

create these entity, these things,
10:53 - 10:57

to remind you of the opposition,
which is things and strings.
10:58 - 11:01

However, we noticed there are gaps.
11:01 - 11:05

Within the specific needs
of our specific institutions,
11:06 - 11:12

there are defined communities
with their own culture, practices and,
11:15 - 11:20

well, it is certain processes
that are inherent to the libraries,
11:21 - 11:26

and the solution offered by Wikibase,
for example, the search.
11:27 - 11:29

I mean, from the professional standpoint,
11:29 - 11:32

not only from this end-user standpoint,
11:32 - 11:35

but professional, we need some indexes
11:35 - 11:39

in order to ensure
data quality, data curation,
11:39 - 11:41

and it is very important
for the professional,
11:41 - 11:46

and Wikibase with its Elasticsearch
11:46 - 11:49

and CirrusSearch doesn't offer.
11:49 - 11:52

But still areas of investigation there.
11:52 - 11:54

The roles-- how are the roles managed?
11:54 - 11:57

The bureaucrat, the patrolling of--
11:57 - 12:01

it's not exactly what happened
in our world.
12:01 - 12:05

Although there is a layer
that can be used,
12:05 - 12:11

upon which we can build
other roles that are more in compliance
12:11 - 12:15

with our way of managing the data.
12:16 - 12:20

Or different constraints,
constraints related to data publication,
12:21 - 12:26

or data-- there's an error there
we need to correct.
12:27 - 12:29

Data policy-- okay, thank you.
12:30 - 12:33

So, there are things that need to be--
12:33 - 12:39

other layers, bricks,
need to be built upon Wikibase.
12:39 - 12:43

And of course, one of the reasons,
the major reasons,
12:43 - 12:45

the reason why we are here with you,
12:45 - 12:50

is that we-- we are willing,
and we feel the necessity
12:50 - 12:54

to be part of a community
sharing the same concerns.
12:54 - 12:59

And we all know, given the program,
12:59 - 13:02

that libraries and GLAMs
13:02 - 13:05

are heavily represented in this event.
13:06 - 13:12

So, I think-- we think that maybe
13:12 - 13:14

in a couple of weeks,
13:14 - 13:19

or next year, we will able
to communicate more openly
13:19 - 13:24

on our decision to go forward
with this solution.
13:24 - 13:26

Thank you.
13:26 - 13:28

Thank you so much.
13:28 - 13:31

(applause)
13:31 - 13:34

So, we will have short
presentations first,
13:34 - 13:35

and we will all return on stage
13:35 - 13:38

for questions, if we have
the time for that.
13:38 - 13:41

But yeah, we heard something from France.
13:43 - 13:44

There's another project.
13:45 - 13:48

It's not Fichier National d'Ent--
13:48 - 13:50

(jokingly struggles with name)
13:50 - 13:52

But it's Gemeinsame Normdatei,
13:53 - 13:57

the universal authority file
13:57 - 13:58

for the German-speaking world.
13:58 - 14:04

And I'm so happy to have good friends
of the Wikimedia movement here.
14:05 - 14:09

Barbara Fischer and Sarah Hartmann.
14:12 - 14:15

Thanks alot for the invitation
to talk about our project,
14:15 - 14:18

which is called GND meets Wikibase.
14:19 - 14:22

And it's a joint project
of Wikimedia Deutschland,
14:22 - 14:23

and the GND.
14:24 - 14:26

And we'd like to give you
a quick overview,
14:26 - 14:29

as Jens said before,
there are just 10 minutes.
14:30 - 14:33

Why we go for that approach
to evaluate Wikibase,
14:33 - 14:37

if it fulfills the requirements
for managing authority data
14:37 - 14:40

on a collaborative level, I would say.
14:42 - 14:46

So, where do we come from,
and what's the idea of authority control?
14:46 - 14:50

And GND, which stands for
Gemeinsame Normdatei,
14:51 - 14:52

what's the idea of it?
14:52 - 14:56

And yeah, where do we come from,
as I said before.
14:56 - 14:59

It's not that different
from what Anila and Ben said,
15:00 - 15:02

just a few seconds ago.
15:03 - 15:06

The GND is used
for the description of resources,
15:06 - 15:10

such as publications,
and objects, for example,
15:10 - 15:14

and in order to enable
accurate data retrieval,
15:14 - 15:19

I would say, the GND provides
unambiguous and distinct entities
15:19 - 15:21

for that retrieval.
15:22 - 15:25

And so, there are persistent identifiers,
as well, as you all know,
15:25 - 15:29

for identification and reference
for these entities.
15:31 - 15:34

The authority file is used
by mainly libraries,
15:35 - 15:38

we would say,
in the German-speaking countries,
15:38 - 15:41

but a few other institutions
from the cultural heritage domain,
15:41 - 15:45

are using the authority file already.
15:46 - 15:53

And all in all there are
around about 60 million records,
15:53 - 15:55

and in Wikibase, we would say "items,"
15:55 - 15:58

which refer to persons, names of persons,
15:58 - 16:01

corporate bodies, for example,
geographic names, and works.
16:02 - 16:07

And the GND is run cooperatively
by so-called GND agencies,
16:07 - 16:11

and at the moment, there are
around about 1,000 institutions
16:11 - 16:15

who are active users of the GND--
that means they establish new records
16:15 - 16:20

and added records or items
on a regular basis.
16:21 - 16:24

And the most important thing, I would say,
16:24 - 16:28

is that the GND data
is provided free of charge
16:28 - 16:30

under CC0 conditions,
16:30 - 16:33

and that all the APIs
and documentation is open as well.
16:35 - 16:37

Yeah, talking about open--
16:38 - 16:42

that's the point,
and the crucial one here--
16:42 - 16:45

at the moment, we challenge
to open up the GND
16:45 - 16:51

for other GLAM institutions
and institutions from the science domain.
16:52 - 16:56

At the moment, it's really focused
on the library sector.
16:57 - 17:00

That means that the handy tool
of librarians has to evolve
17:01 - 17:06

into a tool that is used
and accepted across domains.
17:06 - 17:10

And that means a lot of work
on organizational stuff,
17:10 - 17:15

community building, discussions
about the current data model,
17:15 - 17:18

and infrastructural and technical issues.
17:18 - 17:20

And, yeah.
17:21 - 17:23

Talking about the infrastructural issues,
17:24 - 17:29

we came up with the idea
to become partners in crime
17:30 - 17:35

with Wikibase, I would say,
so have slightly the same aims,
17:35 - 17:40

namely make cultural data
more accessible and interoperable.
17:41 - 17:45

And therefore we now
evaluate the software,
17:45 - 17:50

which was originally conceived
for a sole application, Wikidata,
17:50 - 17:53

if it's sufficient for managing
authority data.
17:58 - 18:01

Right-- hi from my side as well.
18:01 - 18:06

We're focusing in our evaluation
[inaudible] we do commonly
18:06 - 18:07

with Wikimedia Deutschland.
18:08 - 18:11

First of all, if Wikibase meets
the requirements
18:11 - 18:15

of GLAM institutions, galleries,
libraries, archives, and museums,
18:15 - 18:18

to drive collaboratively
an authority file,
18:18 - 18:21

which is like our basic question.
18:22 - 18:26

We also would like to see
Wikibase to increase usability
18:26 - 18:29

as the software system
we're using right now
18:29 - 18:33

is, let's say, quite a complex software
18:33 - 18:37

that is not as handy
as you might like it to be.
18:39 - 18:42

Well, and then, we would like to know
18:42 - 18:46

if Wikibase would also ease
both data linking
18:46 - 18:49

and growing a diverse community.
18:49 - 18:52

As Sarah said before, we are right now
in a process of opening up
18:52 - 18:58

towards a broader scope
of GLAM institutions,
18:58 - 19:00

and science institutions.
19:00 - 19:06

And of course, they are working
within their own software structures,
19:06 - 19:09

and we would like to know
if Wikibase would ease
19:09 - 19:12

the cooperation-- collaboration with us.
19:13 - 19:15

So, why do we do that?
19:16 - 19:19

This is because we consider that Wikibase
19:19 - 19:22

might be the attractive community zone,
19:22 - 19:26

which means--I had to write that down--
19:27 - 19:31

first of all, as it is open source,
it will be more accessible
19:31 - 19:35

than any proprietary source
software system that is used
19:35 - 19:39

in the cataloging fields
of the GLAM institutions.
19:40 - 19:43

Then, we feel that the Wikibase community
19:43 - 19:46

already by now
is a very dedicated community,
19:46 - 19:50

and we would like to participate
in that dedicated community,
19:50 - 19:53

because we believe that sharing is caring.
19:54 - 19:59

What we want to share
is our knowledge is your knowledge,
19:59 - 20:03

and together, in order to omit redundance,
20:03 - 20:07

not by editing the same information
over and over again,
20:07 - 20:09

but reuse data, link it,
20:09 - 20:12

quoting it, and enriching it.
20:13 - 20:17

And I placed here on the picture
one of the tools
20:17 - 20:23

that is broadly spread within Wikidata,
this Histropedia,
20:23 - 20:29

because we also feel that if we are able
to introduce our data into Wikibase,
20:29 - 20:34

we might be able to share tools,
improving the code,
20:34 - 20:38

and thus being an active,
contributing part of the community.
20:38 - 20:40

Thank you.
20:40 - 20:43

I'd like to debate that with you later on.
20:43 - 20:45

Thank you so much.
20:45 - 20:46

(applause)
20:46 - 20:48

Thank you so much.
20:50 - 20:54

So, at some point,
we ask ourselves, did we--
20:57 - 21:00

by accident, write a library software?
21:00 - 21:05

Because the adoption of Wikibase
in the library fields is so overwhelming.
21:06 - 21:08

But there's more to it.
21:09 - 21:14

And of course, we didn't
accidentally write a library system.
21:14 - 21:18

It can be used for other fields as well.
21:18 - 21:20

For instance, for biology.
21:20 - 21:23

And David Fichtmueller will tell us
about using Wikibase
21:23 - 21:26

as a platform for biodiversity.
21:27 - 21:29

- I think that was grayed.
- Yeah.
21:29 - 21:32

Full screen? Oh, okay.
21:38 - 21:40

Yes. Hello, everybody.
21:41 - 21:43

I'm David, and I work
at the Botanic Garden,
21:43 - 21:45

Botanical Museum here in Berlin.
21:46 - 21:48

And I work there as a computer scientist.
21:48 - 21:51

We have an entire department
called Biodiversity Informatics.
21:52 - 21:54

Generally speaking, we write the software
21:54 - 21:56

that biologists use in their daily work.
21:56 - 21:59

And on my private side,
21:59 - 22:03

I've been a Wikipedia contributor
for almost 15 years now,
22:03 - 22:06

and Wikidata contributor
for almost five years now.
22:07 - 22:09

And also, as part of my job,
22:09 - 22:12

I'm a co-administrator of a MediaWiki farm
22:12 - 22:17

with more than 80 wikis
regarding the biology community.
22:19 - 22:22

And a couple of years ago,
I was assigned to a project
22:23 - 22:27

that was, yeah, about working
on a standard.
22:27 - 22:30

In particular, it's a standard
called ABCD,
22:31 - 22:33

that we needed to do some work on.
22:33 - 22:37

And I assume most of you
haven't heard about ABCD,
22:37 - 22:40

that's not really a bad thing.
22:40 - 22:41

It's really specific.
22:41 - 22:44

It stands for Access to Biological
Collection Data.
22:45 - 22:47

And it's an XML schema.
22:47 - 22:50

So, it can express
biological information,
22:50 - 22:54

particular things like information
about herbarium sheets,
22:54 - 23:00

about collections, like fish in
alcohol jars, or--
23:01 - 23:02

but also observations--
23:02 - 23:05

scientists being out in the field,
seeing certain plants,
23:05 - 23:07

seeing certain animals.
23:07 - 23:09

A lot of variety in here,
and because of this,
23:09 - 23:10

it's quite a huge standard.
23:10 - 23:14

So, we have 1,800
different concepts in there.
23:15 - 23:18

That's counting the different XPaths
there are within the file.
23:20 - 23:22

And so the challenge was to convert this
23:22 - 23:25

into a new modern semantic standard.
23:25 - 23:27

We wanted to use an OWL ontology
23:27 - 23:31

that is able to express
the same kind of information
23:31 - 23:34

that has previously been expressed
with the XML files,
23:35 - 23:38

and also keep all the existing
documentation,
23:38 - 23:41

and restrictions,
and all of the connections
23:41 - 23:43

between the items
23:43 - 23:46

and have a collaborative platform
23:46 - 23:50

where other scientists can come in
and give us advice
23:50 - 23:53

on their specific fields of focus.
23:53 - 23:55

Did we model this correctly?
23:55 - 23:57

Is there anything missing?
23:57 - 24:01

So, yeah, with all of this in mind,
we went looking around,
24:01 - 24:04

and found a solution, and I guess
it wouldn't surprise anybody here,
24:04 - 24:07

it's Wikibase, otherwise
I wouldn't have been talking here.
24:08 - 24:11

So, we decided on using Wikibase.
24:11 - 24:14

And we started to install it
without the Docker Image.
24:15 - 24:17

Big mistake. Don't do this.
24:17 - 24:18

(laughter)
24:18 - 24:21

In our defense, we started this
two and a half years ago.
24:22 - 24:24

And it was two years ago
at the WikidataCon
24:24 - 24:26

that the Docker Image was first released.
24:27 - 24:30

So, we had to figure out our own way.
24:30 - 24:32

And once we had things up and running,
24:32 - 24:35

we didn't really want to break
changing things.
24:35 - 24:40

We do have the Docker installed
for the Query Service,
24:40 - 24:43

and we have a weird, hybrid
of custom installation
24:43 - 24:46

and Docker installation
and modified scripts
24:46 - 24:49

connecting those two instances.
24:49 - 24:52

We then installed
QuickStatements, again, manually,
24:52 - 24:57

because by that time, it wasn't part
of the Query Service,
24:57 - 25:00

did some slight modifications,
and adjustments to get it to work.
25:01 - 25:05

I know it's now part
of the Docker Image.
25:06 - 25:11

But yeah, we had it running,
so, we didn't bother changing it.
25:12 - 25:13

Keep this in mind for later on.
25:14 - 25:16

But before I go into what we did,
25:16 - 25:18

I'm going to avoid
a possible confusion here,
25:18 - 25:22

because we're talking
about data standards,
25:22 - 25:25

and when we express things
in a semantic way,
25:25 - 25:30

we will convert the concepts
from the XML into Classes and Properties.
25:31 - 25:34

So, this being Object Properties
connecting the different classes,
25:34 - 25:37

and Datatype Properties
that actually contain the content,
25:37 - 25:40

that is to store text, numbers,
things like that.
25:41 - 25:44

And we express all of this
within Wikibase,
25:44 - 25:47

but all of those are items in Wikibase.
25:48 - 25:51

And they are then described
using Wikibase Properties.
25:51 - 25:55

So, we have ABCD properties
being items being described
25:55 - 25:57

as Wikibase Properties.
25:57 - 26:01

I try to make sure to use
the prefixes accordingly,
26:01 - 26:04

so you know what I'm talking about
when I talk about properties
26:04 - 26:05

in this talk.
26:06 - 26:08

So, let's look at the properties,
26:08 - 26:10

in particular, with Wikibase Properties.
26:10 - 26:13

We sat down and thought,
"Okay, what do we need
26:13 - 26:16

to describe the concepts
we want to model?"
26:17 - 26:19

And we ended up using around 25 properties
26:20 - 26:23

in addition to, of course, label,
description, alias.
26:23 - 26:24

I'm not going to mention all of them,
26:24 - 26:26

just so you see the variety.
26:27 - 26:30

Those fulfill our requirements.
26:30 - 26:36

And yeah, some things
express some restrictions,
26:36 - 26:39

and others--
26:39 - 26:40

Most of them are optional.
26:41 - 26:43

Only very few are mandatory.
26:43 - 26:46

So then, we set on importing
all of this information.
26:47 - 26:51

We wrote a Schema Parser that extracts
all of the different concepts.
26:51 - 26:54

So everything that has an XPath
within the XML Schema,
26:54 - 26:57

and all of the documentation
that is part of the XML schema,
26:57 - 27:00

and so we got this into a nice CSV file,
27:00 - 27:05

and then we could work on this
and import it using QuickStatements.
27:06 - 27:07

Worked quite well.
27:07 - 27:11

But then, we had, as I said,
1,800-plus concepts
27:11 - 27:13

in our Wikibase instance.
27:14 - 27:17

But then, when we had things like person--
27:18 - 27:20

person name, and contact email--
27:20 - 27:23

those appear a couple of times
within the schema--
27:23 - 27:27

for the data set owner, for the person
who took an image, things like that.
27:27 - 27:29

So, of course, we needed to reduce those,
27:29 - 27:32

and combine those to reusable classes.
27:32 - 27:35

So, there was a lot of manual editing
27:35 - 27:36

to reduce the number of concepts,
27:36 - 27:40

and in the end, we ended up
with a little more than 500.
27:40 - 27:44

So, we have Classes, Object Properties,
Datatype Properties,
27:44 - 27:45

a couple of other ones I'm skipping
27:45 - 27:47

to avoid additional complexity here.
27:48 - 27:53

And for certain large-scale edits,
we also used QuickStatements again.
27:55 - 27:57

So now, we did all of the editing,
27:57 - 27:59

now we wanted to make sure
that the data we have
27:59 - 28:01

is actually consistent.
28:01 - 28:05

So, that's where we used what we call
Maintenance Queries,
28:06 - 28:10

used the query interface
with some SPARQL queries,
28:10 - 28:12

basically to check for missing properties,
28:13 - 28:15

wrong links between concepts,
28:16 - 28:19

basically, things that didn't match
28:19 - 28:21

with our concept, with our structure.
28:22 - 28:24

And in the end, we also had to do
28:24 - 28:26

a manual review of all of the concepts
28:26 - 28:28

just to make sure we didn't miss anything.
28:28 - 28:30

This was kind of a lot of work,
28:30 - 28:34

because if you only take
like five minutes per item,
28:34 - 28:36

multiply it by 550,
28:37 - 28:40

it's over one week of full
and concentrated work.
28:41 - 28:43

But of course, we don't need five minutes,
28:43 - 28:46

because you sometimes spend
like half an hour to fix a certain item
28:46 - 28:48

when there's problems with the modeling.
28:49 - 28:51

So, we now had all of the data.
28:51 - 28:53

Now, it was time to get the data
out of Wikibase.
28:54 - 28:58

We wrote an export script in Python
that uses the Query Service
28:58 - 29:01

to get the information about the concepts,
29:01 - 29:05

and fill them in templates--
prepared templates.
29:05 - 29:08

So, in the end, we get
a nice valid OWL file
29:08 - 29:10

that contains everything we need.
29:10 - 29:13

And this is the actual basis
of the standard.
29:13 - 29:17

For future versions,
when we're going to make revisions,
29:17 - 29:20

the Wikibase is our working platform.
29:20 - 29:23

And once we do an export,
this is the new version of the standard.
29:23 - 29:25

Keeping those separate,
this would also allow us
29:25 - 29:29

to move the server
to a different instance,
29:29 - 29:33

or as I said, change the installation.
29:33 - 29:36

We export JSON
for the documentation of the website.
29:37 - 29:41

And we also export the data
to a second Wikibase instance.
29:41 - 29:43

This is like really
experimental, right now.
29:43 - 29:47

We haven't really used this
in production where it can--
29:47 - 29:49

where the concepts can then be used
to describe actual data.
29:49 - 29:51

So we're breaking down those--
29:52 - 29:56

we're taking them a step down
from properties being Wikibase items,
29:56 - 29:59

and converting them into actual
Wikibase properties.
30:00 - 30:03

This is quite a lot of requests--
quite a lot of steps
30:03 - 30:05

to keep all of the data
and all of the linking consistent,
30:05 - 30:07

but it works.
30:07 - 30:09

And in the end, well,
it was quite successful.
30:10 - 30:12

There is a huge community--
30:12 - 30:15

there is a community about
Biodiversity Information Standards,
30:15 - 30:18

who also had their annual meeting
just in the past days.
30:19 - 30:22

So, there's a huge interest
in reusing this approach
30:22 - 30:23

for other standards, as well.
30:24 - 30:25

And so, in the future,
30:25 - 30:28

we want to try a bit
about Shape Expressions--
30:28 - 30:31

as I said, we have some restrictions
in there to export them--
30:32 - 30:35

and build some better workflows
for the versioning.
30:35 - 30:37

We haven't done this yet.
30:37 - 30:39

And switch up the Docker instance.
30:39 - 30:42

So, at the end, I'm gong to have
a small wish list--
30:42 - 30:43

what things could be improved.
30:43 - 30:47

Well, there are a lot more tools
out there that are really written
30:47 - 30:50

for Wikidata, but could be more agnostic,
30:52 - 30:53

in particular, QuickStatements.
30:53 - 30:57

As I said, I did
some adjustments manually.
30:57 - 31:00

Many of the issues I had
are probably solved by now,
31:00 - 31:02

but I don't think all of them.
31:02 - 31:07

Then we want to import existing templates,
31:07 - 31:09

or the SPARQL template,
the Q and the P template.
31:09 - 31:12

They are really useful
when working with Wikibase.
31:12 - 31:15

So, this would be done automatically.
31:15 - 31:17

And as I said, we did a lot
of manual editing.
31:17 - 31:21

So, it would be useful,
just ideal to have a tool where you can--
31:21 - 31:22

Like in an Excel table--
31:22 - 31:26

you load a couple of items,
and you load a couple of properties,
31:26 - 31:28

and then just jump from cell to cell,
31:28 - 31:32

really quickly edit a lot of things
31:32 - 31:33

in a semi-automated way.
31:35 - 31:36

Thanks. That's the end.
31:37 - 31:38

Thank you so much.
31:38 - 31:41

(applause)
31:41 - 31:43

So much to talk about on this.
31:43 - 31:48

So, there is not only--
well, how do I get back from here.
31:51 - 31:54

It's not only about science.
It's not only about libraries.
31:54 - 31:57

You can also create
art and beauty with Wikibase.
31:57 - 32:02

And who would be better to tell us
about this than Stuart Prior.
32:12 - 32:15

Now, slightly embarrassingly,
we talk about art and beauty,
32:15 - 32:17

but this is a really ugly presentation.
32:17 - 32:19

(laughter)
32:20 - 32:23

Starting off with a room
full of Wikimedians,
32:23 - 32:24

trains--people like trains.
32:25 - 32:26

But it has a purpose.
32:26 - 32:31

So, this is Hackney Downs Station
in Northeast London.
32:31 - 32:34

And this is about
Banner Repeater and Wikibase,
32:34 - 32:36

which I'll explain further.
32:36 - 32:38

So, this is a terrible photo.
32:38 - 32:43

But it is actually where
an artists' publishing archive is held,
32:44 - 32:46

which is on the platform
of a train station.
32:47 - 32:51

Within there, they've got
several hundred copies
32:51 - 32:53

of various types of artists' publishing.
32:53 - 32:54

They get a lot of public footfall.
32:54 - 32:57

It does a lot of outreach
to actual general public.
32:57 - 32:58

Like you get on the train,
32:58 - 33:02

you'll find bits of
sort of obscure art on the train.
33:03 - 33:05

So, it's a really interesting project,
33:05 - 33:07

but part of a much wider community.
33:07 - 33:10

So, what is Artists' Publishing?
What are Artists' Books?
33:10 - 33:12

Like, I didn't know either.
33:14 - 33:15

So, the definition,
according to Wikipedia,
33:15 - 33:19

is "Artists' books are works of art
that utilize the form of the book."
33:19 - 33:21

Well, you can read it.
33:22 - 33:24

But it's individual pieces of art,
33:24 - 33:28

or sometimes collections of art,
using publishing as a medium.
33:29 - 33:31

This varies quite a lot.
It's very interesting.
33:31 - 33:33

It was kind of--
33:33 - 33:35

There was a lot of it
in the early '20s and '30s,
33:35 - 33:38

and it had a bit of a renaissance,
'60s and 70's,
33:38 - 33:39

and continues to expand.
33:39 - 33:42

Has a large global community,
multilingual,
33:43 - 33:48

somewhat separate from large
institutional art institutions.
33:48 - 33:50

So, you'll find collections,
33:50 - 33:54

such as the V&A
has a collection, obviously.
33:55 - 33:58

So, they've got various kind
of items such as these.
33:59 - 34:02

This is just an article,
so it's just not the best display.
34:03 - 34:08

But it's a really kind of interesting,
yet slightly niche field of work.
34:09 - 34:12

But it's not very good on Wikidata.
34:14 - 34:18

This is, again, a really terrible photo--
it's not my photo--
34:18 - 34:21

of some the stuff held
in Banner Repeater's archive.
34:21 - 34:24

If you see in the middle,
the pink one, Blast,
34:24 - 34:28

that's actually a fairly notable
piece of artists' publishing
34:28 - 34:30

from the '20s.
34:31 - 34:33

What does it look like on Wikidata?
34:33 - 34:34

It's not good on Wikidata.
34:35 - 34:38

It's often just confused with books
34:38 - 34:40

or other forms of publishing.
34:40 - 34:43

The average kind of Wikidata item for
34:43 - 34:46

a notable piece of artists' publishing
34:47 - 34:51

doesn't really have much to say about it.
34:51 - 34:54

You know, it's just--
there you go, that's it.
34:55 - 34:57

There's not a huge amount
of identifier numbers as well.
34:58 - 35:01

So, there's clearly a lot missing
35:01 - 35:04

when it comes to artists' publishing,
35:04 - 35:07

certainly compared
to more traditional forms of art--
35:07 - 35:09

paintings and sculpture and so forth.
35:10 - 35:13

And there's a huge desire
within the community
35:13 - 35:16

to start codifying this,
and making it a real thing.
35:17 - 35:19

So, I'll give you an example
of what is actually available.
35:19 - 35:22

You can point out what's wrong
with this query.
35:24 - 35:28

So, this is basically all there is.
35:29 - 35:32

That's every artists' book on Wikidata.
35:32 - 35:33

So, there's really not a lot.
35:33 - 35:36

Some of them don't even
have labels for a start.
35:36 - 35:39

And it's something
that really needs expanding.
35:39 - 35:41

And something that has capacity
to be expanded.
35:41 - 35:43

Has anyone seen what's wrong
with this query yet?
35:45 - 35:47

The labels-- the labels say "sausage",
35:48 - 35:51

because I just stole
someone else's query,
35:51 - 35:52

and changed the key number.
35:52 - 35:53

(laughter)
35:53 - 35:55

It's actually a query about sausages.
35:56 - 35:58

Anyway, moving on.
35:58 - 36:00

But yeah, you see it doesn't really have
much of a presence.
36:01 - 36:04

We were approached by Banner Repeater.
36:05 - 36:07

So, I work with Wikimedia UK.
36:07 - 36:10

We were approached by Banner Repeater
to help them with this--
36:11 - 36:12

with setting up a Wikibase--
36:13 - 36:16

in terms of funding,
in getting extra funding,
36:16 - 36:18

but also in terms of bringing in
a wider community,
36:18 - 36:20

and being part of the process.
36:21 - 36:24

So, the process is basically
to gather this community
36:24 - 36:27

of artists, archivists,
and linked data experts,
36:29 - 36:32

and work out what the schema,
the data model,
36:32 - 36:34

for artists' publishing should be.
36:34 - 36:36

It's a very specialized field.
36:36 - 36:38

Doesn't really map
onto Wikidata perfectly.
36:38 - 36:41

It's probably too granular for it.
36:42 - 36:44

And the other thing
is the kind of flexibility of it.
36:45 - 36:47

Maybe it doesn't really fit in Wikidata.
36:47 - 36:50

Maybe it's too rigid at the moment.
36:50 - 36:53

The Wikibase is being built,
36:53 - 36:56

so I haven't got much to show you,
because it's not been built yet,
36:56 - 36:57

but this more about the process.
36:57 - 37:01

And the process is extensive
community consultation,
37:01 - 37:02

a few kind of layers of it.
37:02 - 37:05

So, we're not just going
to do this in one session.
37:05 - 37:06

It's not a few individuals deciding.
37:06 - 37:08

It's kind of ongoing,
and ongoing, and ongoing.
37:09 - 37:13

The impact of this
could be fairly substantial,
37:13 - 37:15

because no one else is doing this work.
37:15 - 37:19

A lot of the larger institutions
have artists' publishing
37:19 - 37:20

sitting in their kind of back room.
37:20 - 37:22

They don't really know
how to categorize it.
37:22 - 37:24

They haven't categorized it very well.
37:24 - 37:26

They're not very interested in it.
37:26 - 37:29

But there is a huge community
that is interested in doing this.
37:31 - 37:34

So, this is basically
the process at the moment.
37:35 - 37:37

So, the initial workshop has happened.
37:37 - 37:40

So, it was an expert workshop
with some people
37:40 - 37:44

deep in the field of artists' publishing--
37:44 - 37:46

archivists, people
who own collections, and so forth--
37:46 - 37:49

to establish a kind of
basic set of priors,
37:49 - 37:52

to look at what things were existing.
37:52 - 37:55

The existing status was on Wikidata,
37:55 - 37:57

and look at how that
could be expanded or improved.
37:58 - 38:01

And then they documented that,
38:01 - 38:04

and established this basic structure.
38:04 - 38:06

And now, we move into the next process
38:06 - 38:08

where it's bringing in
a much wider community.
38:08 - 38:11

So that's-- it's not just data people,
it's creators, as well.
38:12 - 38:13

There'll be a lot of narrative in this,
38:13 - 38:15

and a lot of qualitative things.
38:15 - 38:18

Again, stuff that just
doesn't really belong on Wikidata.
38:19 - 38:21

But also working with archivists,
38:21 - 38:24

and working with linked
data experts, and so forth,
38:24 - 38:26

to hopefully bring this all together,
38:26 - 38:30

to create a resource that will have
a nice accessible front end,
38:30 - 38:33

and also build this community--
people who can contribute to it,
38:33 - 38:36

and kind of own this data set.
38:36 - 38:39

I'll show you what we've got ready.
38:41 - 38:44

This is subject to change.
38:45 - 38:47

But this is basically kind of
where we've got so far
38:47 - 38:49

with the expert ones.
38:49 - 38:52

So, you see different P numbers
being developed,
38:52 - 38:55

and look at what
their equivalent on Wikidata is.
38:55 - 38:58

And obviously, it's a lot more granular
38:58 - 39:01

than probably the information
on Wikidata is at the moment, so--
39:03 - 39:07

There's a lot of detailed stuff,
so there's qualities
39:07 - 39:09

such as height, width,
thickness, and so forth,
39:10 - 39:12

which aren't necessarily that present
39:12 - 39:15

on other groups
of artists' publishing on Wikidata.
39:15 - 39:20

But there's also other things like
"commissioned by", and "contributors to",
39:20 - 39:23

and a lot of these works
will have multiple contributors.
39:23 - 39:26

And multiple editions
and things like that.
39:26 - 39:27

There's really a lot
of granular information
39:27 - 39:29

that can come about these things.
39:29 - 39:31

And a lot of narrative as well, you know,
39:32 - 39:33

as things have changed over time,
39:33 - 39:35

as people have reinterpreted things.
39:36 - 39:38

And this was what was created.
39:40 - 39:43

Again, most of it has
Wikidata equivalents,
39:43 - 39:44

but some of it doesn't yet.
39:44 - 39:47

So, what do we have here.
39:48 - 39:50

Other editions, and things like that.
39:50 - 39:52

So, it's fairly specialized.
39:52 - 39:53

This is the first stage.
39:53 - 39:55

And this will go through another process,
39:55 - 39:57

as people take things away from it
or contribute, too.
39:58 - 40:01

The flexibility is really
important in this.
40:02 - 40:05

It's kind of getting away
from older kind of standards,
40:05 - 40:07

and moving to something
which is a bit more up-to-date,
40:07 - 40:09

and something where the community
can really change things,
40:09 - 40:12

and not be dictated to--
and I'll start speaking quicker.
40:14 - 40:18

So, power dynamics, at the moment,
and why Wikibase.
40:18 - 40:20

So at the moment, this is the art world.
40:20 - 40:22

This is what the art world looks like.
40:22 - 40:23

It's a big orange thing.
40:24 - 40:25

But you've got these large institutions,
40:25 - 40:28

and then you've got sort of
groups of artists' publishing.
40:28 - 40:32

That could be Delhi, Mexico City,
London, and so forth.
40:32 - 40:36

And what we don't want
is this kind of thing
40:36 - 40:39

where large institutions and experts
get to dictate
40:39 - 40:42

the kind of ontology,
and how these things are going to work.
40:43 - 40:47

So, working to establish a Wikibase
among an artist community
40:48 - 40:52

can help them work out
what they're going to do,
40:52 - 40:55

and then they start pushing back
into the larger institutions,
40:55 - 40:57

with a more kind of flexible data model,
40:57 - 40:59

with something that's more up-to-date
40:59 - 41:02

and coming from grassroots organizations,
41:02 - 41:05

as opposed as coming
from institutions, so to speak.
41:06 - 41:08

So, I think there's huge value
in this approach
41:08 - 41:12

in terms of creating
a sort of parallel infrastructure
41:12 - 41:16

for communities of people
who own content, and so forth,
41:16 - 41:19

much like Wikimedia is,
41:20 - 41:22

and kind of pushing out to institutions,
41:22 - 41:24

rather than doing it the other way around.
41:25 - 41:27

Do I have another slide?
What next?
41:27 - 41:29

I always put this slide in,
because it's always the worst slide,
41:29 - 41:30

and it's such a stereotype.
41:31 - 41:35

What next? We're moving on
to the community consultation stage,
41:35 - 41:39

so we'll get a bit more kind of
expansive and interesting.
41:39 - 41:44

This obviously, this database
will be talking to Wikidata,
41:44 - 41:47

but on what term,
we're not 100% sure.
41:47 - 41:49

But it could be that this becomes very--
41:49 - 41:51

just a very specific instance
for artists' publishing
41:51 - 41:54

that Wikidata can draw from,
and vice versa.
41:55 - 41:57

And I'll just finish off
with that picture again,
41:57 - 41:58

because I just quite like it.
41:59 - 42:01

And that's all I have to say.
Thank you.
42:01 - 42:05

- Thank you so much.
- (applause)
42:06 - 42:09

We're almost at the end
of our fast-paced ride,
42:09 - 42:13

and we'll-- what to say?
we saved the best for last?
42:13 - 42:16

No, but we give the last presentation
42:16 - 42:20

to someone who's a true pioneer
of using Wikibase
42:20 - 42:23

in the field of digital humanities.
42:23 - 42:26

And, yeah-- Olaf Simons.
42:26 - 42:28

You have not prepared any slides,
but you will do some live action.
42:29 - 42:30

Exactly.
42:31 - 42:34

And I have been on Wikipedia
since 2004, actually.
42:34 - 42:36

I have the 15 years.
42:38 - 42:40

What am I going to show?
42:42 - 42:44

I've been congratulated for this.
42:44 - 42:48

I'm going to show you
the Wikibase instance we created.
42:48 - 42:49

It's not a Docker Image.
42:49 - 42:52

And I could agree, it's not the best
to have a Docker--
42:52 - 42:57

it's not the best to have
an independent installation.
42:57 - 42:58

It's difficult,
42:58 - 43:00

and it has been extremely
difficult for us,
43:00 - 43:04

and we're grateful
for the Wikimedia Germany
43:05 - 43:09

to help us get it done
on a mutual agreement we had.
43:09 - 43:16

So, basically, we have here
several projects on this.
43:16 - 43:18

It's more project-oriented than Wikidata.
43:19 - 43:21

And my thing should be in here.
43:22 - 43:27

I open that and go--
just should have done that before.
43:27 - 43:29

Here we are.
43:30 - 43:34

The history of the Illuminati--
I start with this one.
43:34 - 43:36

This has been a little film
43:36 - 43:40

which has been created
by Paul-Olivier Dehaye,
43:42 - 43:44

whom I only know from Twitter,
43:44 - 43:46

as he asked us what kind of experience
43:46 - 43:50

did we make when we got our Wikibase,
43:50 - 43:52

and he was experimenting with his own.
43:52 - 43:56

And I talked to him
about things we could do,
43:56 - 43:57

and things we could not do.
43:57 - 44:00

This was a film I would love
to be able to do.
44:00 - 44:02

And he said, "It's easy for me.
44:02 - 44:05

I can run a SPARQL search,
get the information,
44:05 - 44:08

and put it into a program,
in which you can then see this thing."
44:09 - 44:12

It's actually 20 years of research
on the Illuminati,
44:12 - 44:16

and gives you a short history
of the entire organization
44:16 - 44:18

and all its correspondences.
44:18 - 44:20

That's not a Wikimedia tool.
44:20 - 44:23

It's not a tool of Wikibase.
44:23 - 44:25

But it's something you can do.
44:25 - 44:30

And actually, I like it
that it is not a tool already.
44:30 - 44:31

It should become a tool.
44:31 - 44:34

I like it because it shows
our data is really free.
44:34 - 44:37

Someone can download our data,
someone can do something with it,
44:37 - 44:42

which we haven't expected,
and it can be done within two hours,
44:42 - 44:44

if you're bright--
and he is bright, of course.
44:45 - 44:47

So, he created this for us.
44:47 - 44:49

I go back to my presentation.
44:50 - 44:53

Why on Wikibase?
44:53 - 44:56

This was the immediate question
when we approached Wikimedia.
44:56 - 44:59

I knew of Wikidata since 2010,
44:59 - 45:05

and in 2017, it was ready
to be used by us.
45:06 - 45:11

And there was actually an interest
from Wikimedia people to say,
45:11 - 45:13

"Do it, and we support you."
45:14 - 45:15

Why our own base?
45:16 - 45:20

Basically, as original research
that we have to do.
45:20 - 45:25

And the entire installation
is a research tool.
45:25 - 45:28

It's not only there to take a look
at what we did
45:28 - 45:29

and for presentation purposes,
45:29 - 45:32

but actually, I use it every day
for my research.
45:32 - 45:35

I change dates of documents,
45:35 - 45:39

and take a look at how things look
when I have changed that.
45:39 - 45:41

I do a lot with working hypothesis.
45:41 - 45:48

And we ask projects that have data
to give us their data,
45:48 - 45:50

and to feed them in,
45:50 - 45:54

and they can, again, put a label,
45:54 - 45:58

put an item to their data sets,
45:58 - 46:02

that says this has been produced
by the following project.
46:02 - 46:05

Next projects can continue with it.
46:05 - 46:07

But it's already there as a marker
46:07 - 46:11

that this is a data set
with work from a certain project.
46:11 - 46:14

And if you have a project, DFG--
46:15 - 46:18

DFG funded, the German
research institution--
46:18 - 46:19

if you have a project, you want to show
46:19 - 46:21

what kind of work you have done.
46:21 - 46:23

And you can now do a SPARQL search
46:23 - 46:26

and present your entire group of data sets
46:26 - 46:30

in the final résumé of your work.
46:31 - 46:36

So we get original research,
we identify research,
46:36 - 46:39

we encourage the working hypothesis.
46:39 - 46:40

This is a working tool,
46:40 - 46:43

and it's actually quite useful
to start from the beginning,
46:43 - 46:44

not to present something in the end.
46:44 - 46:47

But from day one, you work with it,
46:47 - 46:50

and what you think is
the proper answer to that question,
46:50 - 46:53

you can put it into Wikibase, and then
46:53 - 46:55

you can substantiate information
46:55 - 46:57

until you see this
is the right identification
46:57 - 47:00

of a person or the right date for a thing
47:00 - 47:02

which we haven't been able to date so far.
47:02 - 47:05

So, actually, accumulate work
while you are doing it,
47:05 - 47:08

use the Wikibase as a kind of tool
47:08 - 47:10

that is getting you closer
to the final result.
47:11 - 47:15

Our first meeting took place
on December 1, 2017.
47:15 - 47:19

And I remember I had
a little challenge for you,
47:19 - 47:25

and that was a death date--
a date of death for a person--
47:25 - 47:30

where I wanted to have someone
to show a source for that,
47:30 - 47:31

and that was extremely difficult,
47:31 - 47:33

because he had to create the source
47:33 - 47:35

before he could connect it to that.
47:35 - 47:36

And in the room, we were--
47:36 - 47:40

we had the clear idea,
if we do this, we'd do it
47:40 - 47:45

with the sources already part
of the Wikibase installation we have.
47:45 - 47:46

And if we have the sources in there--
47:46 - 47:50

that is, all the early modern books
that have been printed
47:50 - 47:51

would be the ideal.
47:51 - 47:53

If we have that in there,
we need the GND in there.
47:53 - 48:00

And when we heard that the GND people
are on their track to test the software,
48:00 - 48:02

I approached them and asked,
"Wouldn't you like to do this
48:02 - 48:05

in a cooperation with us,
so that we can have your data,
48:05 - 48:07

which we want to have, anyway,
48:07 - 48:10

and that you can see
how it works on a Wikibase."
48:10 - 48:12

And this is where we are at the moment.
48:12 - 48:15

And presently, I would say,
a lot of things,
48:15 - 48:16

we're not sure how they are done,
48:16 - 48:18

or at least I am not sure
how they are done.
48:18 - 48:21

How's the input done, how do you get
from a resource of strings
48:21 - 48:24

to an item-based resource--
lots of things.
48:25 - 48:28

And basically, my talk here
is an invitation.
48:28 - 48:30

Join us.
48:31 - 48:33

We are still not really part
of the Wikibase community.
48:33 - 48:34

That doesn't exist.
48:34 - 48:36

We have a Wikidata community.
48:36 - 48:38

And lots of things
are taking place in Wikidata,
48:38 - 48:43

but if I ask for help for a Wikibase
that is not Wikidata,
48:43 - 48:45

that's a difficult thing.
48:46 - 48:49

First thing I would say is,
actually, to work with us is cool,
48:49 - 48:54

because you can grab the data
for Wikidata anytime, any moment, at CC0.
48:54 - 48:58

So, actually, you can use it
as an incubator of your work,
48:58 - 49:00

and drag it to Wikidata.
49:01 - 49:06

And also, we will work with big data,
when we have the GND
49:06 - 49:08

in there, that will be quite something.
49:08 - 49:10

So, if you really want the challenge,
49:10 - 49:12

you can get it also on our platform.
49:12 - 49:15

And we offer interesting communities.
49:16 - 49:18

Basically, one of the things
that is different
49:18 - 49:21

is that we have all clear-name accounts
and institutions.
49:21 - 49:24

So, but that also means you can do things
49:24 - 49:26

which you couldn't do on Wikidata.
49:26 - 49:28

You can do your genealogy at our site.
49:28 - 49:29

We don't mind.
49:29 - 49:32

It's interesting to have people
getting such data.
49:32 - 49:36

You can do your city's search--
research, historical research
49:36 - 49:38

on our platform-- we don't mind.
49:38 - 49:42

You can be with research on our platform.
49:43 - 49:46

So, lots of things need to be done.
49:46 - 49:49

We have immense problems
running the database.
49:49 - 49:51

It was implemented by Wikimedia,
49:51 - 49:53

but now, we see lots of things
don't really work.
49:53 - 49:54

We can't really fix that.
49:54 - 49:58

It's extremely difficult to get help
49:58 - 50:00

to run the database,
to update the database,
50:00 - 50:03

to solve little technical problems,
50:03 - 50:09

which we face as soon as we run
an instance outside Wikidata.
50:09 - 50:13

Like getting the direct
GND link is difficult.
50:13 - 50:16

It works on Wikidata,
it doesn't work on our instance.
50:16 - 50:20

Getting images from Wikimedia Commons
50:20 - 50:23

on our Wikibase is not that easy.
50:23 - 50:25

Lots of little things still remain.
50:25 - 50:28

So, actually, this is an invitation.
50:28 - 50:32

If you want to join us
on the mass input, do that.
50:34 - 50:35

Approach us.
50:35 - 50:37

If you want to help us
with technical things,
50:37 - 50:39

this is highly welcome.
50:39 - 50:40

And then, we need tools.
50:40 - 50:42

You saw the tool we had in the beginning.
50:42 - 50:45

Actually, it's not that difficult
to get such tools.
50:46 - 50:51

I saw what kind of query you do
to get such a visualization,
50:51 - 50:55

and once you have it,
you should be able to modify it easily.
50:57 - 50:59

These tools are extremely precious
50:59 - 51:03

in our community
of digital humanities projects.
51:03 - 51:06

And there are little companies
that create these tools,
51:06 - 51:09

again, and again, and again,
and get money for that.
51:09 - 51:12

I would love to have these tools
just once and for all free
51:12 - 51:15

and on the market and working
with a Wikibase instance.
51:15 - 51:20

So, anyone who is interested
in developing tools,
51:20 - 51:22

approach us, and we have plenty of ideas
51:22 - 51:25

of what visualizations
historians would love to see,
51:25 - 51:27

and that should be done.
51:28 - 51:31

So, basically, lots of things,
like, still remain.
51:32 - 51:34

I've got one minute.
I don't need that one minute.
51:34 - 51:36

And you're putting pressure on me.
51:37 - 51:39

(person) Give it to the audience.
51:39 - 51:40

I give the minute to the audience.
51:40 - 51:42

Yeah. Thank you so much.
51:42 - 51:44

And maybe you want to sit down,
51:44 - 51:49

because I would like everyone
to join me back on stage.
51:50 - 51:52

And we can have a round of questions.
51:52 - 51:55

I really like that we ended
with an invitation,
51:55 - 51:57

because this is what this is now.
51:57 - 51:59

You are invited to ask questions.
51:59 - 52:03

You are also invited to join us tomorrow
at the Wikibase meetup.
52:03 - 52:06

If you are-- if you have some idea
52:06 - 52:09

for an awesome Wikibase installation,
52:09 - 52:12

for your institution, for your hobby,
for changing the world--
52:13 - 52:16

please come and join us,
we will meet up, and--
52:18 - 52:20

There's some complication
with the chairs.
52:20 - 52:22

Well, let's stand up. Okay.
52:22 - 52:24

I think we have another microphone, here.
52:24 - 52:27

(person) I have the microphone
for the questions.
52:27 - 52:29

Okay. So--
52:31 - 52:33

Thank you for the presenters.
52:33 - 52:36

And meet us at the Wikibase meetup,
52:36 - 52:39

and now, I can't wait to hear
your questions to the panel.
52:41 - 52:42

(person) Who's the first?
52:44 - 52:47

(person) Hi. I will be talking
in the lightning session, too,
52:47 - 52:51

about geosciences, and how in geosciences,
52:51 - 52:54

there's many data repositories
that have collected
52:54 - 52:57

and shared data with the community
52:57 - 52:59

for years, for decades in some cases.
53:00 - 53:05

And they curate the data set,
their schemas evolve continuously,
53:05 - 53:07

they get a lot of feedback
from the community.
53:07 - 53:10

All they desire is to organize
the community,
53:10 - 53:13

to enable the growth
of these repositories.
53:13 - 53:17

So, they don't necessarily desire
to put all their content in Wikidata
53:17 - 53:19

and lose control over it.
53:19 - 53:22

They offer a tremendous service
curating this content.
53:23 - 53:28

So, I just wanted to point out
that some of the requirements
53:28 - 53:31

and needs that have been voiced
by the panelists
53:31 - 53:33

appear in my communities.
53:33 - 53:40

And my question is, how do you mix
or maintain control
53:40 - 53:43

over those schemas, over the standards,
53:43 - 53:48

while allowing the community
to continue to introduce feedback
53:48 - 53:52

and have more of this crowdsourcing
spirit that Wikidata has?
53:53 - 53:56

I think everyone could answer that,
but maybe David, you want to start?
53:57 - 53:59

I'm not sure whether I'm the right
person to answer this,
53:59 - 54:01

because in our use case--
54:02 - 54:04

in terms of data modeling,
54:04 - 54:09

it's really a narrow set of people
who actually do the work.
54:09 - 54:13

We contact experts
for the relevant segments,
54:14 - 54:17

and some of them could contribute,
but for the current iteration,
54:17 - 54:21

it was only me and two colleagues
who actually worked on it.
54:21 - 54:26

So, we want to have this option,
that we get experts in,
54:26 - 54:29

but it's always in close
collaboration with us,
54:29 - 54:32

so that we don't really have to worry
54:32 - 54:34

about the problem of crowdsourcing.
54:36 - 54:38

Being part of the Wikimedia community,
54:38 - 54:41

I would say, I would not be that worried.
54:41 - 54:46

95% of the edits are good edits,
and improving things--more than that.
54:47 - 54:50

As soon as we have an instance
that is actually closed--
54:50 - 54:53

where I offer the accounts on real name,
54:53 - 54:59

that's an additional hurdle
that no fool is going to go over.
55:00 - 55:05

People are required on our instance
to offer an address, on page--
55:05 - 55:07

not to me, but on page--
55:07 - 55:10

and this is something only
institutions usually do,
55:10 - 55:12

or private people that say,
55:12 - 55:14

"Okay, I'm a private person.
I love this research.
55:14 - 55:16

This is my personal field.
I give you my address."
55:16 - 55:20

And this is a thing that puts off every--
55:20 - 55:24

any vandal who wants to destroy Wikidata.
55:24 - 55:28

So, you can close the system, but then,
55:28 - 55:30

you are not really part
of the same flowing community.
55:30 - 55:33

But again, I would say, if you go to CC0,
55:33 - 55:36

then you can open up,
you can be the incubator
55:36 - 55:41

where people do the research,
and then it goes out to the community.
55:41 - 55:45

But it's an invitation--
use maybe closed works,
55:45 - 55:49

and use an instance where
you work together with people you like.
55:54 - 55:56

Well, I think that--
56:00 - 56:04

I don't think that it's only my opinion--
56:04 - 56:07

it is there are different perspectives,
56:07 - 56:13

and it will be hard to reconcile
all perspectives and say,
56:13 - 56:19

"Wikidata is the solution
for the entire world to go into."
56:20 - 56:24

I don't say by this that Wikidata
is not a solution,
56:25 - 56:28

but there are different perspectives,
there are different needs.
56:28 - 56:35

The world is-- really, there is
a large variety of needs,
56:35 - 56:40

of professional perspectives,
that you cannot reconcile
56:40 - 56:45

in a unique worldwide database.
56:45 - 56:49

So, I think that both are--
56:49 - 56:52

The trickiest thing is how to reconcile
56:52 - 56:59

and find angles of dialogue
between these two large families
56:59 - 57:01

of needs and perspectives.
57:03 - 57:05

If there are more questions,
57:05 - 57:08

I would rather like to go
to more questions.
57:09 - 57:10

Anybody else?
57:12 - 57:15

If not, meanwhile you're thinking
about your questions--
57:15 - 57:18

I would just like to say
that's one of the reasons
57:18 - 57:20

why we consider Wikibase,
57:20 - 57:24

because we believe that adding,
editing information
57:24 - 57:28

within the Wikibase instance,
where you have rights and roles,
57:28 - 57:31

as you have in Wikidata,
gives us the opportunity
57:31 - 57:36

to share that information
with the information in Wikidata
57:36 - 57:39

in a more easy way,
a more convenient way
57:39 - 57:44

than if we try to build these bridges
in between our authority file
57:44 - 57:47

and Wikidata at the moment.
57:47 - 57:48

(person) So, I find it quite exciting
57:48 - 57:52

hearing about how
you're energizing communities
57:52 - 57:55

to find their own ways for data modeling,
57:55 - 57:59

and that you can put into Wikibase.
57:59 - 58:03

Will you-- I'm just saying
of Stuart Prior's community,
58:03 - 58:04

but also some of the others--
58:04 - 58:06

be trying to feed the approaches
58:06 - 58:10

that as a community
that you decide work back to Wikidata,
58:10 - 58:13

to say, "We've done artists' books,
58:13 - 58:15

we've thrashed through several iterations,
58:15 - 58:18

this is what we found really worked,
58:18 - 58:20

and the properties that you should have
58:20 - 58:23

or revisions you should make
to the Wikidata data model.
58:24 - 58:26

Good question. Very short answer.
58:27 - 58:29

It's an interesting question.
58:30 - 58:32

I don't know whether this is a model
58:32 - 58:34

that's going to work for other types.
58:34 - 58:35

I hope it is.
58:36 - 58:39

But it's a difficult one if you question
58:39 - 58:43

of whether the Wikidata community
accepts the kind of authority
58:43 - 58:46

of a separate community that goes off
and does the work on its own.
58:47 - 58:48

But I would certainly hope
58:48 - 58:50

that it's a way of people
feeding back into this process,
58:50 - 58:54

without necessarily needing to go
onto Wikidata and do it.
58:57 - 58:59

Well, I would say, grab it.
58:59 - 59:02

Grab it if it's convenient, take it,
and take a look at how it works
59:02 - 59:03

in the other instance.
59:03 - 59:06

And if you feel like
this is a cool property
59:06 - 59:09

to do certain searches,
then that will be adopted,
59:09 - 59:11

that will be flowing.
59:11 - 59:13

I wouldn't think
of authorities doing this.
59:13 - 59:15

(person) Coming from
a Wikidata user perspective,
59:15 - 59:18

the great thing you're doing
is showing you've established code
59:18 - 59:19

that works and runs.
59:19 - 59:21

You've established a data model
that people can see,
59:21 - 59:23

is implementable, and works.
59:23 - 59:26

And so, in the open source community,
59:26 - 59:28

you know, show us the code.
59:28 - 59:29

You can do that.
59:29 - 59:33

And that's why I think it's very exciting
to have these branches
59:33 - 59:35

that can then fold it back
for data modeling.
59:35 - 59:36

Yeah, thank you.
59:36 - 59:38

I think that is exactly the point.
59:39 - 59:42

I also like the verb
that you used-- energize.
59:42 - 59:44

This is exactly what we want to do.
59:44 - 59:47

Energize, as in Star Trek.
59:48 - 59:50

Yeah, this panel comes to an end.
59:51 - 59:54

And if you have any more questions
59:54 - 59:57

on all these Wikibase projects, talk.
59:57 - 60:00

- Please come tomorrow.
- Have conversations.
60:00 - 60:02

This is what this conference is about.
60:02 - 60:03

Thank you very much.
60:03 - 60:08

(applause)

Title:: cdn.media.ccc.de/.../wikidatacon2019-5-eng-Wikibase_inspiration_panel_hd.mp4
Video Language:: English
Duration:: 01:00:14

	Bar Sch edited English subtitles for cdn.media.ccc.de/.../wikidatacon2019-5-eng-Wikibase_inspiration_panel_hd.mp4
	C3Subtitles edited English subtitles for cdn.media.ccc.de/.../wikidatacon2019-5-eng-Wikibase_inspiration_panel_hd.mp4

English subtitles

Revisions

Revision 2 Uploaded

Bar Sch

cdn.media.ccc.de/.../wikidatacon2019-5-eng-Wikibase_inspiration_panel_hd.mp4

Revisions

Our website uses cookies

Operating cookies (Required)