Hi, guys! Can everybody hear me?
So, hi! Nice to meet you all.
I'm Erica Azzellini.
I'm one of Wiki Movement Brazil's liaisons,
and this is my first international
Wikimedia event,
so I'm super excited to be here
and hopefully I will share
something interesting with you all
in this lightning talk.
So this work starts with research
that I was developing in Brazil,
Computational Journalism
and Structured Narratives with Wikidata.
So in journalism,
they're using natural language
generation software
to automate news stories
that have quite similar
narrative structures.
And we developed this concept
of structured narratives,
thinking about this practice
in computational journalism:
the development of verbal text,
understandable by humans,
automated from predetermined
arrangements that process information
from structured databases,
which looks a lot like
the Wikimedia universe
and the tool that we developed.
So, when I'm talking about verbal text
understandable by humans,
I'm talking about Wikipedia entries.
When I'm talking about
structured databases,
of course, I'm talking about
Wikidata here.
And when I'm talking about
predetermined arrangements,
I'm talking about Mbabel,
which is this tool.
The Mbabel tool was inspired by a template
by user Pharos, right here in front of me,
thank you very much,
and it was developed with Ederporto,
who is right here too,
the brilliant Ederporto.
We developed this tool
that automatically generates
Wikipedia entries
based on information from Wikidata.
We create thematic templates
built on the Wikidata module,
the WikidataIB module,
and these are predetermined,
generic, and editable templates
for various article themes.
We realized that many Wikipedia entries
had a quite similar structured narrative
so we could create a tool
that automatically generates that
for many Wikidata items.
Until now we have templates for museums,
works of art, books, films,
journals, earthquakes, libraries,
archives,
and Brazilian municipal
and state elections, and growing.
So, everybody here is able to contribute
and create new templates.
Each narrative template includes
an introduction, Wikidata infobox,
section suggestions for the users,
content tables or lists with Listeria,
depending on the case,
references and categories,
and of course the sentences,
that are created
with the Wikidata information.
I'm gonna show you in a sec
an example of that.
It's an integration between Wikipedia
and Wikidata,
so the more properties properly filled in
on Wikidata,
the more text you'll get
in your article stub.
That's very important to highlight here.
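To make that principle concrete, here is a minimal Python sketch of the idea, not the actual Mbabel code (which is built as wikitext templates and Lua on top of the WikidataIB module): it fetches an item from Wikidata and only emits the sentences whose properties are actually filled in, so better-described items yield longer stubs. The museum wording, property choices, and example item are illustrative assumptions.

```python
# Minimal sketch of the structured-narrative idea (not the actual Mbabel code):
# fill sentence fragments only for the properties that exist on the item,
# so a better-described item yields a longer article stub.
import requests

def get_item(qid):
    """Fetch one item's labels and claims from Wikidata."""
    url = "https://www.wikidata.org/wiki/Special:EntityData/{}.json".format(qid)
    return requests.get(url, timeout=30).json()["entities"][qid]

def label_of(qid, lang="en"):
    return get_item(qid)["labels"].get(lang, {}).get("value", qid)

def first_value(item, prop):
    """Return the first value of a property claim, or None if absent."""
    claims = item.get("claims", {}).get(prop, [])
    if not claims:
        return None
    snak = claims[0]["mainsnak"]
    if snak.get("snaktype") != "value":
        return None
    value = snak["datavalue"]["value"]
    return value["id"] if isinstance(value, dict) and "id" in value else value

def museum_stub(qid):
    """Compose an introduction from whatever properties are filled in."""
    item = get_item(qid)
    name = item["labels"].get("en", {}).get("value", qid)
    sentences = ["{} is a museum.".format(name)]
    country = first_value(item, "P17")        # P17 = country
    if country:
        sentences[0] = "{} is a museum in {}.".format(name, label_of(country))
    inception = first_value(item, "P571")     # P571 = inception
    if inception:
        sentences.append("It was established in {}.".format(inception["time"][1:5]))
    return " ".join(sentences)

print(museum_stub("Q160236"))  # Q160236 = Metropolitan Museum of Art
```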
Structuring the Wikidata data
can get more complex,
as I'm going to show you
with the election projects that we've made.
So I'm going to leave this
Wikidata Lab XIV link here for you
for after this lightning talk,
which is very brief,
so you'll be able to look
at the work that we've been doing
on structuring Wikidata
for this purpose too.
We have this challenge of building
a narrative template
that is generic enough
to cover different Wikidata items
and to overcome the gender
and number difficulties
of languages,
while still sounding natural for the user,
because if it doesn't sound natural,
it doesn't click for the user
to edit after that.
This is what the Mbabel input form
looks like.
You just have to insert the item number,
call the desired template,
and then you have an article to edit
and expand, and everything.
So, more importantly, why did we do it?
Not just because it's cool to develop
things here with Wikidata;
we all know about that.
But we are experimenting
with this integration
from Wikidata to Wikipedia
and we want to focus
on meaningful individual contributions.
So we've been working
on education programs
and we want the students to feel the value
of their entries too, but not only--
Oh, five minutes only,
Geez, I'm gonna rush here.
(laughing)
And we want it to ease tasks
for users in general,
especially tables
and this kind of content
that is a bit of a chore to do.
And we're working on this concept
of abstract Wikipedia.
Denny Vrandečić wrote a super interesting
article about it,
so I linked it here too.
And we also want to support
small language communities,
to fill the lack of content there.
This is an example of how we've been using
this Mbabel tool for GLAM
and education programs,
and I showed you earlier
the input form of the Mbabel tool,
but we can also make red links
that aren't exactly empty.
So you click on this red link
and you automatically have
this article draft
on your user page to edit.
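As a rough illustration of how a red link can carry a draft with it, here is a hedged Python sketch that builds a MediaWiki edit URL using the standard preload parameter, which pre-fills the edit box with the content of another page. Whether Mbabel uses exactly this mechanism is an assumption on my part, and the wiki, user, title, and preload page names below are purely hypothetical.

```python
# Hedged sketch: a "red link that isn't empty" can be an edit link into the
# user's space with MediaWiki's standard &preload= parameter, which pre-fills
# the editor with the text of another page. Whether Mbabel works exactly this
# way is an assumption; all names below are hypothetical examples.
from urllib.parse import urlencode

def prefilled_draft_link(wiki="https://pt.wikipedia.org",
                         user="ExampleUser",
                         draft_title="Eleições municipais de Exemplo em 2016",
                         preload_page="Usuário(a):ExampleUser/MbabelDraft"):
    params = {
        "title": "Usuário(a):{}/{}".format(user, draft_title),  # draft in user space
        "action": "edit",
        "preload": preload_page,  # page whose wikitext pre-fills the edit box
    }
    return "{}/w/index.php?{}".format(wiki, urlencode(params))

print(prefilled_draft_link())
```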
And I'm going to briefly talk about it
because I only have a few minutes left.
On educational projects,
we've been doing this with elections
in Brazil for journalism students.
We have the experience
with the [inaudible] students
with user Joalpe--
he's not here right now,
but we all know him, I think.
And we realized that we have the data
about Brazilian elections
but we don't have media coverage of it.
So we were also lacking
Wikipedia entries on it.
How do we insert this meaningful
information on Wikipedia
that people really access?
Next year we're going
to have elections,
people are going to look for
this kind of information on Wikipedia
and they simply won't find it.
So this tool looks quite useful
for this purpose
and the students were introduced,
not only to Wikipedia,
but also to Wikidata.
Actually, they were introduced
to Wikipedia with Wikidata,
which is a super interesting experience
and we had a lot of fun,
and it was quite challenging
to organize all that.
We can talk about it later too.
And they also added the background
and the analysis sections
on these election articles,
because we don't want them
to just simply automate the content there.
We can do better.
So this is the example
I'm going to show you.
This is from a municipal election
in Brazil.
Two minutes... oh my!
This example here was entirely created
with the Mbabel tool.
You have here this introduction text.
It really sounds natural for the reader.
The Wikidata infobox here--
it's a masterpiece
of Ederporto right there.
(laughter)
And we have here the tables with the
election results for each position.
And we also have these results here
in textual form too,
so it really looks like an article
that was handcrafted.
The references here were also made
with the Mbabel tool
and we used identifiers
to build these references here
and the categories too.
So, to wrap things up here,
it is still a work in progress,
and we have some outreach
and technical challenges
in bringing Mbabel
to other language communities,
especially the smaller ones,
and in supporting these tools
in lower-resource
language communities too.
And finally, is it possible
to create an Mbabel
that overcomes language barriers?
I think that's a very interesting
question for the conference
and hopefully we can figure
that out together.
So, thank you very much,
and look for the Mbabel poster downstairs
if you'd like to have all this information
wrapped up, okay?
Thank you.
(audience clapping)
(moderator) I'm afraid
we're a little too short for questions
but yes, Erica, as she said,
has a poster and is very friendly.
So I'm sure you can talk to her
afterwards,
and if there's time at the end,
I'll allow it.
But in the meantime,
I'd like to bring up our next speaker...
Thank you.
(audience chattering)
Next we've got Yolanda Gil,
talking about Wikidata and Geosciences.
Thank you.
I come from the University
of Southern California
and I've been working with
Semantic Technologies for a long time.
I want to talk about geosciences
in particular,
where this idea of crowd-sourcing
from the community is very important.
So, to give you a sense:
individual scientists,
most of them in colleges,
collect their own data
for their particular projects.
They describe it in their own way.
They use their own properties,
their own metadata characteristics.
This is an example
of some collaborators of mine
that collect data from a river.
They have their own sensors,
their own robots,
and they study the water quality.
I'm going to talk today about an effort
that we did to crowdsource metadata
for a community that works
in paleoclimate.
The article just came out
so it's in the slides if you're curious,
but it's a pretty large community
that works together
to integrate data more efficiently
through crowdsourcing.
So, if you've heard of the
hockey stick graph for climate,
this is the community that produces it.
This is a study of climate
in the last 200 years,
and it takes them literally many years
to look at data
from different parts of the globe.
Each dataset is collected by
a different investigator.
The data is very, very different,
so it takes them a long time
to put together
these global studies of climate,
and our goal is to make that
more efficient.
So, I've done a lot of work
over the years.
Going back to 2005, we used to call it,
"Knowledge Collection from Web Volunteers"
or from netizens at that time.
We had a system called "Learner."
It collected 700,000 common sense,
common knowledge statements
about the world.
We used a lot of different techniques.
The forms that we built
to extract knowledge from volunteers
really fit the knowledge models,
the data models that we used
and the properties that we wanted to use.
I worked with Denny
on a system called "Shortipedia"
when he was a postdoc at ISI,
looking at keeping track
of the provenance of the assertions,
and we started to build
on Semantic MediaWiki software.
So everything that
I'm going to describe today
builds on that software,
but I think that now we have Wikibase,
we'll be starting to work more
on Wikibase.
So LinkedEarth is the project
where we work with paleoclimate scientists
to crowdsource the metadata,
and you can see in the title
that we call it
"controlled crowdsourcing."
where we could let them create
new properties
but we had an editorial process for it.
So I'll describe to you how it works.
For them, if you're looking at a sample
from lake sediments from 200 years ago,
you use different properties
to describe it
than if you're looking at coral samples
that you extract from the ocean.
Palmyra is a coral atoll in the Pacific.
So if you have coral, you care
about the species and the genus,
but if you're just looking at lake sand,
you don't have that.
So each type of sample
has very different properties.
In LinkedEarth,
they're able to see on a map
where the datasets are.
They actually annotate their own datasets
or the datasets of other researchers
when they're using it.
So they have a reason
why they want certain properties
to describe those datasets.
Whenever there are disagreements,
or whenever there are agreements,
there are community discussions
about them,
and there are also polls to decide
which properties to settle on.
So it's a nice ecosystem.
I'll give you examples.
You look at a particular dataset,
in this case it's a lake in Africa.
So you have the category of the page;
it can be a dataset,
it can be other things.
You can download the dataset itself
and you have kind of canonical properties
that they have all agreed to have
for datasets,
and then under Extra Information,
those are properties
that the person describing this dataset
added of their own accord.
So these can be new properties.
We call them "crowd properties,"
rather than "core properties."
And then when you're describing
your dataset,
in this case
an ice core dataset that you got
from a glacier,
and you're adding a dataset
and want to talk about measurements,
you're offered
all the existing properties
that match what you're saying.
So we do this search completion
so that you can adopt those.
That promotes normalization.
The core of the properties
has been agreed on by the community
so we're really extending that core.
And that core is very important
because it gives structure
to all the extensions.
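Just to illustrate that normalization idea (this is not the LinkedEarth code, and the property names are made up): a tiny sketch of an autocomplete that offers the community-agreed core properties before the ad-hoc crowd ones, so annotators reuse existing terms instead of inventing near-duplicates.

```python
# Minimal sketch of the normalization idea (an illustration, not LinkedEarth's
# code): as an annotator types a property name, suggest matching existing
# properties, with community-agreed "core" ones listed before ad-hoc "crowd"
# ones. The property names are invented for this example.
CORE_PROPERTIES = ["archiveType", "collectedBy", "measuredVariable", "sensorSpecies"]
CROWD_PROPERTIES = ["coralGenus", "lakeDepth", "waterTemperatureUnits"]

def suggest(prefix):
    prefix = prefix.lower()
    core = [p for p in CORE_PROPERTIES if p.lower().startswith(prefix)]
    crowd = [p for p in CROWD_PROPERTIES if p.lower().startswith(prefix)]
    return core + crowd  # core suggestions first, then crowd extensions

print(suggest("c"))  # ['collectedBy', 'coralGenus']
```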
We engage the community
in many different ways.
We had one face-to-face meeting
at the beginning
and after about a year and a half,
we do have a new standard,
and a new way for them
to continue to evolve that standard.
They have editors, very much
in the Wikipedia style
of editorial boards.
They have working groups
for different types of data.
They do polls with the community,
and they have pretty nice engagement
of the community at large,
even if they've never visited our Wiki.
The metadata evolves:
people annotate
their datasets,
then the schema evolves,
the properties evolve,
and we have an entire infrastructure
and mechanisms
to re-annotate the datasets
with the new structure of the ontology
and the new properties.
This is described in the paper.
I won't go into the details.
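Just to give a flavor of what such a re-annotation step can look like, here is a rough sketch under my own assumptions, not their actual mechanism (which is in the paper): once the community promotes or renames properties, existing annotations are rewritten against the new schema using a migration mapping. The property names in the mapping are invented.

```python
# Very rough sketch of the re-annotation idea: rewrite stored annotations
# against the evolved schema. The mapping below is invented for illustration;
# the real mechanism is described in the LinkedEarth paper.
SCHEMA_MIGRATION = {
    "coralGenus": "sensorGenus",  # crowd property promoted under a new core name
    "lakeDepth": "waterDepth",
}

def reannotate(annotations):
    """Rewrite one dataset's property/value pairs using the migration mapping."""
    return {SCHEMA_MIGRATION.get(prop, prop): value
            for prop, value in annotations.items()}

print(reannotate({"coralGenus": "Porites", "archiveType": "coral"}))
# {'sensorGenus': 'Porites', 'archiveType': 'coral'}
```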
But I think that
having that kind of capability
in Wikibase would be really interesting.
We basically extended
Semantic MediaWiki and MediaWiki
to create our own infrastructure.
I think a lot of this is now something
that we find in Wikibase,
but this is older than that.
And in general, we have many projects
where we look at crowdsourcing
not just descriptions of datasets
but also descriptions of hydrology models,
descriptions of multi-step
data analytic workflows
and many other things in the sciences.
So we are also interested in including
in Wikidata additional things
that are not just datasets or entities
but also other things
that have to do with science.
I think Geosciences are more complex
in this sense than Biology, for example.
That's it.
Thank you.
(audience clapping)
- Do I have time for questions?
- Yes.
(moderator) We have time
for just a couple of short questions.
When answering,
can you go back to the microphone?
- Yes.
- Hopefully, yeah.
(audience 1) Does the structure allow
tabular datasets to be described
and can you talk a bit about that?
Yes. So the properties of the datasets
talk more about who collected them,
what kind of data was collected,
what kind of sample it was,
and then there's a separate standard,
which is called LiPD,
that's complementary and mapped
to the properties
that describes the format
of the actual files
and the actual structure of the data.
So, you're right that there's both,
"how do I find data about x"
but also, "Now, how do I use it?
How do I know where
the temperature that I'm looking for
is actually in the file?"
(moderator) This will be the last.
(audience 2) I'll have
to make it relevant.
So, you have shown this process
of how users can suggest
or like actually already put in
properties,
and I didn't fully understand
how this thing works,
or what's the process behind it.
Is there some kind of
folksonomy approach--obviously--
but how is it promoted
into the core vocabulary
if something is promoted?
Yes, yes. It is.
So what we do is we have a core ontology
and the initial one was actually
very thoughtfully put together
through a lot of discussion
by very few people.
And then the idea was
the whole community can extend that
or propose changes to that.
So, as they are describing datasets,
they can add new properties
and those become "crowd properties."
And every now and then,
the Editorial Committee
looks at all of those properties,
the working groups look at all of those
crowd properties,
and decide whether to incorporate them
into the main ontology.
So it could be because they're used
for a lot of dataset descriptions.
It could be because
they are proposed by somebody
and they're found to be really interesting
or key, or uncontroversial.
So there's an entire editorial process
to incorporate those new crowd properties
or the folksonomy part of it,
but they are really built around the core
of the ontology.
The core ontology then grows
with more crowd properties
and then people propose
additional crowd properties again.
So we've gone through a couple
of these iterations
of rolling out a new core,
and then extending it,
and then rolling out a new core
and then extending it.
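A hedged sketch of one signal in that editorial process, heavy usage of a crowd property, which the committee can then review for promotion into the core; the data layout and threshold are illustrative assumptions, not how LinkedEarth actually implements it.

```python
# Hedged illustration of one promotion signal described above: crowd properties
# used in many dataset descriptions get flagged for editorial review.
# The data layout and threshold are assumptions made up for this example.
from collections import Counter

def promotion_candidates(dataset_annotations, core_properties, min_uses=10):
    """dataset_annotations: one dict of property -> value per annotated dataset."""
    usage = Counter(prop
                    for annotations in dataset_annotations
                    for prop in annotations
                    if prop not in core_properties)
    return sorted(prop for prop, count in usage.items() if count >= min_uses)

# Example: 'coralGenus' is used often enough to be worth a look by the editors.
datasets = [{"coralGenus": "Porites"}] * 12 + [{"lakeDepth": "4 m"}] * 3
print(promotion_candidates(datasets, core_properties={"archiveType"}))
# ['coralGenus']
```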
- (audience 2) Great. Thank you.
- Thanks.
(moderator) Thank you.
(audience applauding)
(moderator) Thank you, Yolanda.
And now we have Adam Shorn
with "Something About Wikibase,"
according to the title.
Uh... where's the internet? There it is.
So, I'm going to do a live demo,
which is probably a bad idea
but I'm going to try and do it
as the birthday present later
so I figure I might as well try it here.
And I also have some notes on my phone
because I have no slides.
So, two years ago,
I made these Wikibase Docker images
that quite a few people have tried out,
and even before then,
I was working on another project,
which is kind of ready now,
and here it is.
It's a website that allows you
to instantly create a Wikibase
with a query service and quick statements,
without needing to know about
any of the technical details,
without needing to manage
any of them either.
There are still lots of features to go
and there's still some bugs,
but here goes the demo.
Let me get my emails up ready...
because I need them too...
Da da da... Stopwatch.
Okay.
So it's as simple as...
at the moment it's locked down behind...
Oh no! German keyboard!
(audience laughing)
Foiled... okay.
Okay.
(audience continues to laugh)
Aha! Okay.
I'll remember that for later.
(laughs)
Yes.
♪ (humming) ♪
Oh my god... now it's American.
All you have to do is create an account...
da da da...
Click this button up here...
Come up with a name for Wiki--
"Demo1"
"Demo1"
"Demo user"
Agree to the terms
which don't really exist yet.
(audience laughing)
Click on this thing which isn't a link.
And then you have your Wikibase.
(audience cheers and claps)
"Anmelden"... that's "log in" in German.
Demo... oh god! I'm learning lots about
my demo later.
1-6-1-4-S-G...
- (audience 3) Y...
- (Adam) It's random.
(audience laughing)
Oh, come on....
(audience laughing)
Oh no. It's because this is a capital U...
(audience chattering)
6-1-4....
S-G-ENJ...
Is J... oh no. That's... oh yeah. Okay.
I'm really... I'm gonna have to look
at the laptop
that I'm doing this on later.
Cool...
Da da da da da...
Maybe I should have some things
in my clipboard ready.
Okay, so now I'm logged in.
Oh... keyboards.
So you can go and create an item...
Yeah, maybe I should make a video.
It might be easier.
So, yeah. You can make items,
you have quick statements here
that have... oh... it is all in German.
(audience laughing)
(sighs)
Oh, log in? Log in?
It has... Oh, set up ready.
Da da da...
It's as easy as...
I learned how to use
Quick Statements yesterday...
that's what I know how to do.
I can then go back to the Wiki...
We can go and see in Recent Changes
that there are now two items,
the one that I made
and the one from Quick Statements...
and then you go to Quick...
♪ (hums a tune) ♪
Stop...no...
No...
(audience laughing)
Oh god...
I'm glad I tried this out in advance.
There you go.
And the query service is updated.
(audience clapping)
And the idea of this is it'll allow
people to try out Wikibases.
Hopefully, it'll even be able
to allow people to...
have their real Wikibases here.
At the moment you can create
as many as you want
and they all just appear
in this lovely list.
As I said, there's lots of bugs
but it's all super quick.
Exactly how this is going to continue
in the future, we don't know yet
because I only finished writing this
in the last few days.
It's currently behind an invitation code
so that if you want to come try it out,
come and talk to me.
And if you have any other comments
or thoughts, let me know.
Oh, three minutes...40. That's...
That's not that bad.
Thanks.
(audience clapping)
Any questions?
(audience 5) Are Quick Statements
and the Query Service
automatically updated?
Yes. So the idea is that
there will be somebody,
at the moment, me,
maintaining, behind the scenes,
all of the horrible stuff
so that you don't have to.
So kind of think of it like GitHub.com,
but you don't have to know anything
about Git to use it. It's just all there.
- [inaudible]
- Yeah, we'll get that.
But any of those
big hosted solution things.
- (audience 6) A feature request.
- Yes.
Is there any-- in scope,
do you have plans to make it
so you can easily import existing...
- Wikidata...
- I have loads of plans.
Like I want there to be a button
where you can just import
another whole Wikibase and all of--yeah.
It will be on the future list,
which is really long. Yeah.
(audience 7) I understand that it's...
you want to make it user-friendly
but if I want access
to the machine itself, can I do that?
Nope.
(audience laughing)
So again, like, in the longer term future,
there are possib...
Everything's possible,
but at the moment, no.
(audience 8) Two questions.
Is there a plan to have export tools
so that you can export it
to your own Wikibase maybe at some point?
- Yes.
- Great.
And is this a business?
I have no idea.
(audience laughing)
Not currently.
(audience 9) What if I stop
using it tomorrow,
how long will the data be there?
So my plan was at the end of WikidataCon
I was going to delete all of the data
and there's a Wikibase Workshop
on a Sunday,
and we will maybe be using this
for the Wikibase workshop
so that everyone can have
their own Wikibase.
And then, from that point,
I probably won't be deleting the data
so it will all just stay there.
(moderator) Question.
(audience 10) It's two minutes...
Alright, fine. I'll allow two more
questions if you talk quickly.
(audience laughing)
- Alright, good people.
- Thank you, Adam.
Thank you for letting me test
my demo... I mean...
I'm going to do it differently.
(audience clapping)
(moderator) Thank you.
Now we have Dennis Diefenbach
presenting Q Answer.
Hello, I'm Dennis Diefenbach,
I would like to present Q-Answer
which is a question-answering system
on top of Wikidata.
So, what we need are some questions
and this is the interface of QAnswer.
For example, where is WikidataCon?
Alright, I think it's written like this.
2019... And we get this response
which is Berlin.
So, other questions. For example,
"When did Wikidata start?"
It started on 30 October 2012,
so its birthday is approaching.
It is 6 years old,
so it will be its 7th birthday.
Who is developing Wikidata?
The Wikimedia Foundation
and Wikimedia Deutschland,
so thank you very much to them.
Something like museums in Berlin...
I don't know why this is not so...
Only one museum... no, yeah, a few more.
So, when you ask something like this,
we allow the user
to explore the information
with different aggregations.
For example,
if there are many geo coordinates
attached to the entities,
we will display a map.
If there are many images attached to them,
we will display the images,
and otherwise there is a list
where you can explore
the different entities.
You can ask something like
"Who is the mayor of Berlin,"
"Give me politicians born in Berlin,"
and things like this.
So you can both ask keyword questions
and full natural language questions.
The whole data is coming from Wikidata
so all entities which are in Wikidata
are queryable by this service.
And the data is really all from Wikidata,
in the sense that,
apart from some Wikipedia snippets
and some images from Wikimedia Commons,
the rest is all Wikidata data.
We can do this in several languages.
This is now in Chinese.
I don't know what is written there
so do not ask me.
We are currently supporting these languages
with more or less good quality
because... yeah.
So, how can this be useful
for the Wikidata community?
I think there are different reasons.
First of all, this thing helps you
to generate SPARQL queries
and I know there are even some workshops
about how to use SPARQL.
It's not a language that everyone speaks.
So, if you ask something like
"a philosopher born before 1908,"
figuring out how to construct
a SPARQL query like this could be tricky.
In fact when you ask a question,
we generate many SPARQL queries
and the first one is always
the SPARQL query that we think
is the good one.
So, if you ask your question
and then you go to the SPARQL list,
then there is this button
for the Wikidata query service
and you have the SPARQL query right there
and you will get the same result
as you would get in the interface.
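For readers who haven't written SPARQL, this is roughly the kind of query behind a question like "philosophers born before 1908", run here against the public Wikidata Query Service from Python. The exact query QAnswer generates may differ; this is a hand-written illustration.

```python
# Roughly the kind of SPARQL that answers "philosophers born before 1908",
# run against the public Wikidata Query Service. The exact query QAnswer
# generates may differ; this is a hand-written illustration.
import requests

QUERY = """
SELECT ?person ?personLabel WHERE {
  ?person wdt:P106 wd:Q4964182 .   # occupation: philosopher
  ?person wdt:P569 ?born .         # date of birth
  FILTER(YEAR(?born) < 1908)
  SERVICE wikibase:label { bd:serviceParam wikibase:language "en". }
}
LIMIT 10
"""

response = requests.get(
    "https://query.wikidata.org/sparql",
    params={"query": QUERY, "format": "json"},
    headers={"User-Agent": "sparql-example/0.1"},
)
for row in response.json()["results"]["bindings"]:
    print(row["personLabel"]["value"])
```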
Another thing it could be useful for
is finding missing
contextual information.
For example, if you ask for actors
in "The Lord of the Rings,"
most of these entities
will have an associated image,
but not all of them.
So here there is some missing metadata
that could be added.
You could see first that an image
is missing, then go to this entity
and add one, and so on.
Another thing is that you could find
schema issues.
For example, if you ask
"books by Andrea Camilleri,"
who is a famous Italian writer,
you would currently get
these three books.
But he wrote many more.
He wrote more than 50.
And so the question is,
are they not in Wikidata,
or is the knowledge maybe not modeled
correctly as it currently is?
And in this case, I know
there is another book from him,
which is "Un mese con Montalbano."
It has only an Italian label
so you can only search it in Italian.
And if you go to this entity,
you will see that he has written it.
It's a short story by Andrea Camilleri
and it's an instance of literary work,
but it's not an instance of book,
so that's the reason why
it doesn't appear.
This is a way to track
where things are missing
or where the Wikidata model
is not as you would expect.
Another reason is just to have fun.
I imagine that many of you added
many Wikidata entities
so just search for the ones
that you care about most
or have edited yourself.
So in this case: who developed
QAnswer? And that's it.
For any other questions,
go to www.QAnswer.eu/qa
and hopefully we'll find
an answer for you.
(audience clapping)
- Sorry.
- I'm just the dumbest person here.
(audience 11) So I want to know
how is this kind of agnostic
to Wikibase instance,
or has it been tied to the exact
like property numbers
and things in Wikidata?
Has it learned in some way
or how was it set up?
There is training data
and we rely on training data,
and in most cases this is also
why you will not get good results.
But we're training the system
with simple yes and no answers.
When you ask a question,
we always ask for feedback, yes or no,
and this feedback is used by
the machine learning algorithm.
This is where machine learning
comes into play.
But basically, we put up separate
Wikibase instances
and we can plug this in.
In fact, the system is agnostic
in the sense that it only wants RDF.
And RDF you have in each Wikibase;
there are a few configurations,
but you can have this on top
of any Wikibase.
(audience 11) Awesome.
(audience 12) You mentioned that
it's being trained by yes/no answers.
So I guess this is assuming that
the Wikidata instance is free of errors
or is it also...?
You assume that the Wikidata instances...
(audience 12) I guess I'm asking, like,
are you distinguishing
between source level errors
or misunderstanding the question
versus a bad mapping, etc.?
Generally, we assume that the data
in Wikidata is true.
So if you click "no"
because the data in Wikidata is actually false,
then yeah... we would not catch
that difference.
But honestly, Wikidata quality
is very good,
so I have rarely had this problem.
(audience 12) Is this data available
as a dataset by any chance, sir?
- What is... direct service?
- The... dataset of...
"is this answer correct
versus the query versus the answer?"
Is that something you're publishing
as part of this?
- The training data that you've...
- We published the training data.
We published some old training data
but no, just a--
There is a question there.
I don't know if we have still time.
(audience 13) Maybe I just missed this
but is it running on a live,
like the Live Query Service,
or is it running on
some static dump you loaded
or where is the data source
for Wikidata?
Yes. The problem is
to apply this technology,
you need a local dump.
Because we do not rely only
on the SPARQL endpoint,
we rely on special indexes.
So, we are currently loading
the Wikidata dump.
We are updating this every two weeks.
We would like to do it more often,
in fact we would like to get the diffs
for each day, for example,
to put them in our index.
But unfortunately, right now,
the Wikidata dumps are released
only once every week.
So, we cannot be faster than that
and we also need some time
to re-index the data,
so it takes one or two days.
So we are always behind. Yeah.
(moderator) Any more?
- Okay, thank you very much.
- Thank you all very much.
(audience clapping)
(moderator) And now last, we have
Eugene Alvin Villar,
talking about Panandâ.
Good afternoon,
my name is Eugene Alvin Villar
and I'm from the Philippines,
and I'll be talking about Panandâ:
a mobile app powered by Wikidata.
This is a follow-up to my lightning talk
that I presented two years ago
at WikidataCon 2017
together with Carlo Moskito.
You can download the slides
and there's a link
to that presentation there.
I'll give you a bit of a background.
Wiki Society of the Philippines,
formerly, Wikimedia Philippines,
had a series of projects related
to Philippine heritage and history.
So we have the usual photo contests,
Wikipedia Takes Manila,
Wiki Loves Monuments,
and then our major project
was the Cultural Heritage Mapping Project
back in 2014-2015.
In that project, we trained volunteers
to edit articles
related to cultural heritage.
This is our biggest
and most successful project that we've had.
794 articles were created or improved,
including 37 "Did You Knows"
and 4 "Good Articles,"
and more than 5,000 images were uploaded
to Commons.
As a result of that, we then launched
the Encyclopedia
of Philippine Heritage program
in order to expand the scope
and also include Wikidata in the scope.
Here's the Core Team: myself,
Carlo and Roel.
Our first pilot project was to document
the country's historical markers
in Wikidata and Commons,
starting with those created by
our national historical agency, the NHCP.
For example, they installed a marker
for our national hero, here in Berlin,
so there's now a Wikidata page
for that marker
and a collection of photos of that marker
in Commons.
Unfortunately, the government agency
does not keep a good, up-to-date,
or complete database of their markers,
so we have to painstakingly input these
into Wikidata manually.
After careful research and confirmation,
here's a graph of the number of markers
that we've added to Wikidata over time,
over the past three years.
And we've developed
this Historical Markers Map web app
that lets users view
these markers on a map,
so we can browse it as a list,
view a good visualization of the markers
with information and inscriptions.
All of this is powered by live queries
to the Wikidata Query Service.
There's the link
if you want to play around with it.
And so we developed
a mobile app for this one.
To better publicize our project,
I developed Panandâ,
which is Tagalog for "marker,"
as an Android app
that was published back in 2018,
and I'll publish the iOS version
sometime in the future, hopefully.
I'd like to demo the app
but we have no time,
so here are some
of the features of the app.
There's a Map and a List view,
with text search,
so you can drill down as needed.
You can filter by region or by distance,
and by whether you have marked
these markers
as visited
or bookmarked them
for future visits.
Then you can use the GPS
on your mobile phone
for distance filtering.
For example, if I want markers
that are near me, you can do that.
And when you click on the Details page,
you can see the same thing,
photos from Commons,
inscription about the marker,
how to find the marker,
its location and address, etc.
And one thing that's unique to this app
is that you can, again, mark these
as visited or bookmark them,
so on the map or on the list,
or on the Details page,
you can just tap on those buttons
and say that you've visited them,
or you'd like to bookmark them
for future visits.
And my app has been covered by the press
and given recognition,
so plenty of local press articles.
Recently, it was selected
as one of the Top 5 finalists
for the Android Masters competition
in the App for Social Good category.
The final event will be next month.
Hopefully, we'll win.
Okay, so some behind the scenes.
How did I develop this app?
Panandâ is actually a hybrid app,
it's not native.
Basically it's just a web app
packaged as a mobile app
using Apache Cordova.
That reduces development time
because I don't have to learn
a different language.
I know JavaScript, HTML.
It's cross-platform, allows code reuse
from the Historical Markers Map.
And the app is also free and open source,
under the MIT license.
So there's the GitHub repository
over there.
The challenge is
that the app's data is not live,
because if you query the data live,
it means pulling around half
a megabyte of compressed JSON every time,
which is not friendly
for those on mobile data,
incurs too much delay when starting
the app,
and if there are any errors in Wikidata,
that may result in poor user experience.
So instead, what I did was
the app is updated every few months
with fresh data, compiled using
a Perl script
that queries Wikidata Query Service,
and this script also does
some data validation
to highlight consistency or schema errors,
so that allows fixes before updates
in order to provide a good experience
for the mobile user.
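The talk says the compile step is a Perl script; just to illustrate the idea, here is a hedged Python sketch of the same compile-and-validate approach: pull the markers from the Wikidata Query Service once, flag records with missing fields, and only then bundle the JSON into the app. The class QID, required properties, and file name are illustrative assumptions, not the project's actual schema rules.

```python
# Hedged Python sketch of the compile-and-validate step described above
# (the real project uses a Perl script). The class QID and required
# properties are illustrative assumptions only.
import json
import requests

WDQS = "https://query.wikidata.org/sparql"

def fetch_markers(marker_class_qid):
    """Fetch all items of the given class with label, coordinates, and inscription."""
    query = """
    SELECT ?marker ?markerLabel ?coord ?inscription WHERE {
      ?marker wdt:P31 wd:%s .                        # instance of the marker class
      OPTIONAL { ?marker wdt:P625 ?coord . }         # P625 = coordinate location
      OPTIONAL { ?marker wdt:P1684 ?inscription . }  # P1684 = inscription
      SERVICE wikibase:label { bd:serviceParam wikibase:language "en". }
    }
    """ % marker_class_qid
    rows = requests.get(WDQS, params={"query": query, "format": "json"},
                        headers={"User-Agent": "marker-compiler/0.1"}).json()
    return rows["results"]["bindings"]

def validate(rows):
    """Flag records that would give mobile users a poor experience."""
    problems = []
    for row in rows:
        qid = row["marker"]["value"].rsplit("/", 1)[-1]
        if "coord" not in row:
            problems.append("{}: missing coordinates (P625)".format(qid))
        if "inscription" not in row:
            problems.append("{}: missing inscription (P1684)".format(qid))
    return problems

if __name__ == "__main__":
    markers = fetch_markers("Q721747")  # illustrative class QID only; use the real one
    for issue in validate(markers):
        print(issue)
    with open("markers.json", "w", encoding="utf-8") as out:
        json.dump(markers, out, ensure_ascii=False)
```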
And if you're tech-oriented,
here are, more or less,
the technologies that I'm using:
a bunch of JavaScript libraries,
the Perl script
that queries Wikidata,
some Cordova plug-ins,
and building it using Cordova
and then publishing this app.
And that's it.
(audience clapping)
(moderator) I hope you win.
Alright, questions.
(audience 14) Sorry if I missed this.
Are you opening your code
so the people can adapt your app
and do it for other cities?
Yes, as I've mentioned,
the app is free and open source,
- (audience 14) But where is it?
- There's the GitHub repository.
You can download the slides,
and there's a link
in one of the previous slides
to the repository.
(audience 14) Okay. Can you put it?
Yeah, at the bottom.
(audience 15) Hi. Sorry, maybe
I also missed this,
but how do you check for schema errors?
Basically, we have a Wikiproject
on Wikidata,
so we try to put guidelines there
on how to model these markers correctly.
Although it's not updated right now.
As far as I know, we're the only country
that's currently modeling these
in Wikidata.
There's also an effort
to add [inaudible]
in Wikidata,
but I think that's
a different thing altogether.
(audience 16) So I guess this may be part
of this Wikiproject you just described,
but for the consistency checks,
have you considered moving those
into like complex schema constraints
that then can be flagged
on the Wikidata side for
what there is to fix on there?
I'm actually interested in seeing
if I can do, for example,
shape expressions, so that, yeah,
we can do those things.
(moderator) At this point,
we have quite a few minutes left.
The speakers did very well,
so if Erica is okay with it,
I'm also going to allow
some time for questions,
still about this presentation,
but also about Mbabel,
if anyone wants to jump in
with something there,
either presentation is fair game.
Unless like me, you're all so dazzled
that you just want to go to snacks
and think about it.
(audience giggles)
- (moderator) You know...
- Yeah.
(audience 17) I will always have
questions about everything.
So, I came in late for the Mbabel tool.
But I was looking through
and I saw there's a number of templates,
and I was wondering
if there's a place to contribute
to adding more templates
for different types
or different languages and the like?
(Erica) So for now, we're developing
those narrative templates
on Portuguese Wikipedia.
I can show you if you like.
We're inserting those templates
on English Wikipedia too.
It's not complicated to do
but we have to expand for other languages.
- French?
- French.
- Yes.
- French and German already have.
(laughing)
Yeah.
(inaudible chatter)
(audience 18) I also have a question
about Mbabel,
which is, is this really just templates?
Is this based on Lua scripting?
Is that all? Wow. Okay.
Yeah, so it's very deployable. Okay. Cool.
(moderator) Just to catch that
for the live stream,
the answer was an emphatic nod
of the head, and a yes.
(audience laughing)
- (Erica) Super simple.
- (moderator) Super simple.
(audience 19) Yeah.
I would also like to ask.
Sorry I haven't delved
into Mbabel earlier.
I'm wondering, you're working also
with the links, the red links.
Are you adding some code there?
- (Erica) For the lists?
- Wherever the link comes from...
(audience 19) The architecture.
Maybe I will have to look into it.
(Erica) I'll show you later.
(moderator) Alright. You're all ready
for snack break, I can tell.
So let's wrap it up.
But our kind speakers,
I'm sure will stick around
if you have questions for them.
Please join me in giving... first of all
we didn't give a round of applause yet.
I can tell you're interested in doing so.
(audience clapping)