< Return to Video

cdn.media.ccc.de/.../wikidatacon2019-11-eng-Open_data_panel_hd.mp4

  • 0:06 - 0:07
    Hello.
  • 0:12 - 0:15
    The two of us are starting
  • 0:15 - 0:20
    a level on a side-effect
    or side-project or whatever,
  • 0:20 - 0:23
    something which
    is loosely connected to Wikidata,
  • 0:23 - 0:25
    which is open data
  • 0:25 - 0:28
    and we're glad to see you're here.
  • 0:28 - 0:29
    I'm Alice Wiegand.
  • 0:29 - 0:34
    I'm the project lead for open data
    in the municipality of Düsseldorf,
  • 0:35 - 0:39
    and this is Knut Huhne, who is a student.
  • 0:39 - 0:40
    You may introduce yourself.
  • 0:42 - 0:44
    Yeah, I'm a software developer by day,
  • 0:44 - 0:47
    and in my spare time
    I do a lot of work at Code for Germany,
  • 0:47 - 0:50
    which is in community organization
    that I'll talk a bit about,
  • 0:50 - 0:54
    and we try to build civic tech tools
    based on open data.
  • 0:54 - 0:57
    Yeah, that's exactly what we need.
  • 0:57 - 1:02
    And so let's see where we are [on this].
  • 1:03 - 1:07
    [inaudible]
  • 1:07 - 1:09
    So if we talk about open government data,
  • 1:09 - 1:11
    this is something where I think
  • 1:12 - 1:15
    the entire world is much more forward
  • 1:15 - 1:19
    than Europe and especially Germany is.
  • 1:19 - 1:23
    But in Germany,
    where we both come from and live,
  • 1:23 - 1:28
    this is getting some dynamics
    because laws are changing.
  • 1:28 - 1:33
    And overall, we have just data
    which is used, produced,
  • 1:33 - 1:36
    and cared
  • 1:36 - 1:40
    and maintained by government,
  • 1:41 - 1:44
    which is just a reliable data source,
  • 1:44 - 1:48
    and it's official data with a high value,
  • 1:48 - 1:51
    and it is sometimes
    really surprising to see
  • 1:51 - 1:56
    what kind of data there is,
    openly kind of published.
  • 1:56 - 2:00
    So this is, for example...
  • 2:01 - 2:03
    I hope it opens soon.
  • 2:03 - 2:05
    This, for example, is...
  • 2:07 - 2:10
    it's the measure of radioactivity in kale.
  • 2:10 - 2:13
    And I think it's surprising,
  • 2:13 - 2:17
    I wonder why is it kale
    and not red cabbage?
  • 2:17 - 2:20
    And I wonder why is this a fixed date?
  • 2:20 - 2:24
    You know, 20th of November in 2013.
  • 2:24 - 2:27
    And I wonder why is it that far away?
  • 2:27 - 2:31
    What are we doing
    with radioactivity in kale today?
  • 2:31 - 2:32
    I don't know.
  • 2:32 - 2:35
    So you find a lot
    of these surprising things
  • 2:35 - 2:37
    when you start to...
  • 2:38 - 2:41
    What have I to do, do you know?
  • 2:41 - 2:45
    ...when you start to
    look at open data in Germany.
  • 2:49 - 2:52
    I'm confused with this computer.
  • 2:52 - 2:53
    Oh, yes. Thanks.
  • 2:54 - 2:58
    Yeah, and this data usually is up to date.
  • 2:58 - 3:01
    Well, it should be, of course.
  • 3:01 - 3:05
    As in all data, we have our gaps there.
  • 3:05 - 3:12
    And overall if I just look
    on the region I know best,
  • 3:12 - 3:17
    we have 86
  • 3:17 - 3:22
    of singular portals
    with open data within Germany,
  • 3:22 - 3:25
    which is on municipality level,
    on the country level,
  • 3:25 - 3:29
    on the federal country level,
    and on state level.
  • 3:29 - 3:34
    And in Austria, it's 19;
    and in Switzerland, it's 6,
  • 3:34 - 3:36
    and numbers are growing.
  • 3:36 - 3:39
    So, of course, also,
    question is why are we all doing
  • 3:39 - 3:42
    the same thing on different places?
  • 3:42 - 3:46
    It doesn't seem to be that efficient,
  • 3:46 - 3:50
    I'm not sure, but this is how
    our world today works.
  • 3:50 - 3:53
    So now I find the right key, thanks.
  • 3:54 - 3:59
    And there are a lot of challenges
    which we have to face
  • 3:59 - 4:04
    and kind of a huge gap
    between wish and reality.
  • 4:04 - 4:08
    So, after all, I do think there is a huge,
  • 4:08 - 4:13
    you know, kind of [friendliness]
  • 4:13 - 4:16
    between open data and Wikidata.
  • 4:16 - 4:18
    It's all about essential data.
  • 4:18 - 4:25
    It is about being as actual
    or being as up to date as possible.
  • 4:25 - 4:30
    But in the end, when we look
    at the open data platforms
  • 4:30 - 4:32
    in mostly Europe,
  • 4:32 - 4:36
    we find incompatible licenses.
  • 4:36 - 4:41
    So usually mainly municipalities
  • 4:41 - 4:44
    choose a BY license,
  • 4:44 - 4:49
    because they think it would be good
    to know where this data came from
  • 4:49 - 4:52
    and to be named there.
  • 4:52 - 4:54
    And this is really a crazy thing.
  • 4:54 - 4:57
    I looked at open data portals,
  • 4:57 - 5:02
    and we have a portal in Düsseldorf
    for two years now
  • 5:02 - 5:07
    and by design, we choose the 0 license.
  • 5:07 - 5:10
    And I found that open data in Zurich--
  • 5:10 - 5:12
    Okay, it's not Germany, but it's Zurich--
  • 5:12 - 5:16
    and they are doing
    a lot of cool stuff there as well.
  • 5:16 - 5:19
    And they also use the 0 license.
  • 5:19 - 5:26
    But usually municipalities
    like CC BY licenses, sadly.
  • 5:26 - 5:29
    And another thing we have to face
  • 5:29 - 5:35
    is that, especially in municipalities,
    this kind of task to publish
  • 5:35 - 5:39
    this internal data
    on a free and open license,
  • 5:39 - 5:40
    on a platform, wherever,
  • 5:40 - 5:45
    is just given to a person
    who usually does something else.
  • 5:45 - 5:48
    So it's not, you know, a 100 person task
  • 5:48 - 5:51
    for this person to do,
  • 5:51 - 5:55
    but something to do, you know,
    with all the other things.
  • 5:58 - 6:00
    Overall, I think we can say
  • 6:00 - 6:06
    that of course there are people
    who are really doing a great job.
  • 6:06 - 6:11
    Usually, we don't find
    that level of expertise
  • 6:11 - 6:15
    on data analysis and data management
  • 6:15 - 6:21
    that we would need to
    to really find high-quality data
  • 6:21 - 6:25
    within the open data
    which comes from governances.
  • 6:25 - 6:28
    And I think this is a problem,
  • 6:28 - 6:31
    and I realized also
    that there's a language issue.
  • 6:31 - 6:35
    So if I just think about
    putting my colleagues into this room,
  • 6:35 - 6:39
    into the session we had just before,
    about data quality,
  • 6:41 - 6:44
    it would be problematic
    to find a common language,
  • 6:44 - 6:49
    to figure out how we can start
    to improve our data quality
  • 6:49 - 6:54
    so that Wikidata's data quality
    is also improved.
  • 6:54 - 6:58
    Another thing
    is that we have no standards
  • 6:58 - 7:02
    in the name of anthologies,
  • 7:02 - 7:06
    in the name of how we prepare data.
  • 7:06 - 7:09
    There is a metadata standard,
    which is great,
  • 7:09 - 7:13
    but this, after all, does not mean
    that we all do the same thing
  • 7:13 - 7:16
    and that we find the same kind of data,
  • 7:16 - 7:18
    just because it is named in the same way.
  • 7:19 - 7:22
    But, overall, it's a lot of official data.
  • 7:22 - 7:25
    You can get from open data.
  • 7:26 - 7:31
    I made an example here
    which is about street names,
  • 7:31 - 7:34
    and usually you find a lot
  • 7:34 - 7:40
    of different forms and street names.
  • 7:40 - 7:43
    Sometimes something like the Karlsplatz
  • 7:43 - 7:47
    it's written with a C,
    or with a K, or separated,
  • 7:47 - 7:51
    and sometimes this is also developing
  • 7:51 - 7:53
    over the time.
  • 7:53 - 7:57
    And in the end, there's just
    only one official name
  • 7:57 - 7:59
    of a place or of a street,
  • 7:59 - 8:04
    and it's the municipality
    which can give you that name.
  • 8:04 - 8:07
    And this part, like a list
    of official street names
  • 8:07 - 8:10
    is something which is regularly published
  • 8:10 - 8:15
    by a lot of municipalities
    in their open data portals.
  • 8:15 - 8:20
    And I think that at all
    is a good start to figure out
  • 8:20 - 8:24
    what we can do with this
    in Wikidata as well.
  • 8:24 - 8:27
    So this is my short introduction,
  • 8:27 - 8:32
    and I'm happy to hear about
    community work with open data.
  • 8:34 - 8:38
    Yeah, I thought I would just kind of give
    a quick introduction from the other side,
  • 8:38 - 8:41
    of movement from the community side.
  • 8:41 - 8:44
    So, as I said, I work in my spare time
  • 8:44 - 8:47
    for an organization
    called Code for Germany.
  • 8:47 - 8:50
    We've been running since about five years
  • 8:50 - 8:52
    where we have labs,
    that is groups of people
  • 8:52 - 8:57
    that meet once a week,
    some once a month in Germany
  • 8:58 - 9:00
    in local, what we call labs.
  • 9:00 - 9:04
    And we try to build tools
    that somehow make it easier
  • 9:04 - 9:06
    for people to participate in politics,
  • 9:06 - 9:08
    to get an understanding
    of the environment around them,
  • 9:08 - 9:11
    to collect data about air pollution.
  • 9:11 - 9:14
    And, of course, we'd like to use
  • 9:14 - 9:17
    governmentally provided
    open data for that,
  • 9:17 - 9:21
    but we've also realized
    that there's difficulties with that,
  • 9:21 - 9:24
    that sometimes the data isn't there,
    it's under a difficult license,
  • 9:25 - 9:30
    which is kind of how we found our way
    to Wikidata also, I think.
  • 9:30 - 9:32
    We also happened to meet in Berlin
  • 9:32 - 9:34
    in the offices of Wikimedia Deutschland,
  • 9:34 - 9:38
    so this kind of brought us
    very close to Wikidata.
  • 9:39 - 9:40
    And I think it's cool to see
  • 9:40 - 9:44
    that we're kind of strengthening
    the relationship
  • 9:44 - 9:49
    between the Wikidata community in Germany
    and the Code for Germany community.
  • 9:50 - 9:53
    We also would like to work
    even closer with the government,
  • 9:53 - 9:55
    but talking about bridging gaps.
  • 9:55 - 10:00
    I mean, there's very basic problems
    such as us meeting after we work
  • 10:00 - 10:04
    and the people for the government
    wanting to meet when they work.
  • 10:04 - 10:08
    So I think when we think about
    how these communities can work together,
  • 10:08 - 10:11
    there's very mundane things,
    such as working times,
  • 10:11 - 10:13
    that we need to keep in mind.
  • 10:15 - 10:20
    So just a quick introduction
    to what we do at Code for Germany
  • 10:20 - 10:22
    especially with regards to Wikidata.
  • 10:23 - 10:26
    We've had a couple of hackathons now
    within the last years
  • 10:26 - 10:28
    where people from the Wikidata community
  • 10:28 - 10:31
    and the Code for Germany community
  • 10:31 - 10:34
    kind of came together to meet
  • 10:34 - 10:38
    and just spend a weekend
    to work on Wikidata.
  • 10:38 - 10:41
    And we've done
    all kinds of different things.
  • 10:42 - 10:45
    We've usually been very interested
    in political data,
  • 10:45 - 10:47
    so we've been importing a lot of data
  • 10:47 - 10:51
    regarding politicians
    and regarding elections.
  • 10:51 - 10:54
    We've thought about how to model
    election data in Wikidata a lot
  • 10:54 - 10:59
    and we've also had a lot of people
    that built games with Wikidata.
  • 10:59 - 11:01
    One of the nice examples for this
  • 11:01 - 11:05
    would be the Wikidata card game,
    where you can put in any Q number
  • 11:05 - 11:07
    and you get a nice trading card game.
  • 11:08 - 11:09
    You might have seen that.
  • 11:09 - 11:11
    If not, I encourage you to look for that.
  • 11:11 - 11:16
    I think that's a really cool way
    to sell Wikidata to other people.
  • 11:19 - 11:22
    Selling-- this is also
    something that we've realized
  • 11:22 - 11:23
    when we talk to data providers,
  • 11:23 - 11:27
    that often they're quite scared
    to give data to you
  • 11:27 - 11:29
    with the traditional argument
  • 11:29 - 11:33
    of "Our data is so complicated,
    you won't understand it,
  • 11:33 - 11:36
    and you'll build bad applications
    that will make us look bad."
  • 11:37 - 11:42
    And our strategy usually
    is to just take the data anyway,
  • 11:42 - 11:46
    build an application share it with them,
    and then their response is usually,
  • 11:47 - 11:50
    "Oh, this is pretty cool.
    Can we link to that from our website?"
  • 11:51 - 11:52
    And then, at some point,
  • 11:52 - 11:55
    maybe you can start having
    a discussion with them.
  • 11:56 - 11:58
    But, yeah, I think this is kind
    of what we can do as a community.
  • 11:58 - 12:03
    We can build little small games
    and tools to showcase.
  • 12:03 - 12:05
    Okay, there is Wikidata,
    and it's pretty cool,
  • 12:05 - 12:08
    and you have open data,
    and we can build cool things with it,
  • 12:08 - 12:09
    but you'll need to give it to us,
  • 12:09 - 12:12
    you'll need to publish it
    under a license that we can work with.
  • 12:13 - 12:18
    And this is one of the things
    that we try to do at Code for Germany.
  • 12:19 - 12:21
    [inaudible], thanks.
  • 12:23 - 12:25
    (applause)
  • 12:27 - 12:29
    Yeah, thank you.
  • 12:29 - 12:32
    Before we open
    the room for questions from you,
  • 12:32 - 12:38
    we would like to just open
    or ask some questions to you.
  • 12:38 - 12:41
    I think that Knut has really described
  • 12:41 - 12:44
    the challenges we face quite well.
  • 12:44 - 12:49
    But, still, I do think there's a lot
    of opportunities in these data,
  • 12:49 - 12:55
    and we just need to kind of harvest it
    better than we do it right now.
  • 12:55 - 12:59
    And so my questions--
    and maybe it helps you a bit
  • 12:59 - 13:03
    to think about that--
    is how could we integrate
  • 13:03 - 13:07
    more open government data
    into Wikidata in a more structured way.
  • 13:07 - 13:13
    Just keeping in mind that the people
    who are kind of providing these data
  • 13:13 - 13:17
    are not the experts you may expect.
  • 13:17 - 13:18
    And at the same time,
  • 13:19 - 13:23
    there already is a WikiProject,
    open government data,
  • 13:23 - 13:27
    and I'm not sure if you, Christina
    had opened it quite a while ago.
  • 13:27 - 13:31
    And I wonder in which way we can
  • 13:31 - 13:37
    kind of reanimate it
    and make the best out of it
  • 13:37 - 13:42
    because we still have this place,
    and we have people
  • 13:42 - 13:45
    who are engaged
    in the municipalities, in governments,
  • 13:45 - 13:48
    to open up data.
  • 13:48 - 13:51
    And maybe it's an opportunity
  • 13:51 - 13:55
    to just match these different
  • 13:55 - 13:58
    languages and expectations.
  • 13:58 - 14:01
    So, yeah, I'm open
    for any ideas to do that,
  • 14:01 - 14:05
    and I'm happy to engage
    a bit in that as well.
  • 14:05 - 14:07
    So, questions?
  • 14:11 - 14:13
    (person 1) Hi, thank you, guys.
  • 14:13 - 14:15
    Maybe an idea is one
  • 14:15 - 14:18
    we could be taking
    from the Wikipedia beginnings,
  • 14:18 - 14:20
    where I think it was Matthias Schindler,
  • 14:20 - 14:22
    who started
    with his Content Liberation Army.
  • 14:23 - 14:26
    And the idea that,
    you know, you have to really go in,
  • 14:26 - 14:29
    and the data is there.
  • 14:29 - 14:32
    But for example,
    I had a project with a student
  • 14:32 - 14:33
    where we were looking
  • 14:33 - 14:35
    at where the trees
    are geolocated in Berlin,
  • 14:35 - 14:39
    and this is sometimes on paper,
    it's sometimes on a stupid database.
  • 14:39 - 14:42
    We were accused of being terrorists
  • 14:42 - 14:44
    by the people who didn't want
    to give us the data.
  • 14:44 - 14:49
    We had to get really, really
    picky about this and point to the laws
  • 14:49 - 14:52
    saying, "This is open data,
    and you have to give it to us."
  • 14:52 - 14:56
    but we have to sort of go in friendly,
    as you were saying
  • 14:56 - 14:59
    and try and explain to them
    what they will have from it.
  • 14:59 - 15:01
    Many of them don't see
    that they have a use of it
  • 15:01 - 15:04
    because it's more work for them
    having to deal with us.
  • 15:04 - 15:08
    I think that's one
    of the main kind of fears
  • 15:08 - 15:13
    which is there are coming people
    who are just putting more work onto us.
  • 15:13 - 15:17
    And at the same time,
    there's so little understanding
  • 15:17 - 15:21
    that this is just part
    of what they are doing already.
  • 15:21 - 15:25
    And that they can really also
  • 15:25 - 15:28
    learn and get a lot of input
  • 15:28 - 15:31
    from the people
    who are asking about that data.
  • 15:31 - 15:33
    But this is really culture change,
  • 15:33 - 15:36
    a cultural change
    especially here in Germany.
  • 15:36 - 15:39
    So we are working on it.
  • 15:39 - 15:43
    We are working hard,
    but it's really kind of a tough thing.
  • 15:43 - 15:45
    - Maybe I can add?
    - Yes.
  • 15:45 - 15:47
    I think what's also
    really interesting to see
  • 15:47 - 15:48
    from the community's perspective
  • 15:48 - 15:50
    is that when we talk to different cities,
  • 15:50 - 15:52
    it so depends on who happens
    to work in the cities.
  • 15:52 - 15:54
    Like we have this very small city of Moers
  • 15:54 - 15:56
    that is very unknown,
  • 15:56 - 15:58
    but if you talk to people
    in the open data community,
  • 15:58 - 16:00
    everyone will know it
  • 16:00 - 16:04
    because they happen to pay someone
    to do work on open data.
  • 16:05 - 16:08
    And when I talk to people
    from the government in Berlin,
  • 16:08 - 16:12
    they tell me, "Okay, I now know
    I have to publish open data,
  • 16:12 - 16:17
    but I don't know how, for whom, or why.
  • 16:19 - 16:21
    And I think this is actually
  • 16:24 - 16:28
    a chance for the smaller cities
    to kind of champion this idea
  • 16:28 - 16:30
    because it's so much easier for them
  • 16:30 - 16:33
    to kind of get a movement
    and to liberate some data
  • 16:33 - 16:37
    where if we talk in Berlin,
    we always need to talk to 12 districts,
  • 16:37 - 16:39
    and they'll never align
    on what data they want to publish.
  • 16:47 - 16:49
    (person 2) And we have a remote comment
  • 16:49 - 16:50
    from Beat Estermann
  • 16:50 - 16:54
    who wants to point out
    he has some links in Etherpad
  • 16:54 - 16:58
    about "Interest in open government data
    helps Swiss authorities
  • 16:58 - 17:01
    prioritize base registers
    and controlled vocabularies."
  • 17:01 - 17:03
    And I'm told he just came in
  • 17:03 - 17:05
    while I'm reading his Etherpad entry.
  • 17:06 - 17:09
    So if you could just take the mic from me.
  • 17:12 - 17:13
    (person 2) Go on.
  • 17:13 - 17:15
    (Beat) Okay, thank you.
  • 17:16 - 17:18
    I missed the first introduction.
  • 17:19 - 17:20
    What did you start on?
  • 17:20 - 17:23
    - (person 2) I was just reading--
    - (Beat) Oh, you were reading. Okay.
  • 17:23 - 17:26
    So we're currently running--
  • 17:26 - 17:29
    In Switzerland, we're running a survey
  • 17:31 - 17:36
    to kind of prioritize data
    from within the government.
  • 17:36 - 17:40
    There are like base registers
    or controlled vocabularies.
  • 17:40 - 17:43
    Because we think
    that they would be crucial
  • 17:43 - 17:46
    to actually promote and boost
    the publication of linked open data
  • 17:46 - 17:48
    across the public authorities,
  • 17:48 - 17:51
    so we're running a server
    to prioritize them.
  • 17:52 - 17:57
    And for some authorities
    to know which ones to publish now
  • 17:57 - 17:58
    and for others--
  • 17:59 - 18:02
    for the community to know
    where to put pressure on
  • 18:02 - 18:03
    and how to actually,
  • 18:05 - 18:07
    yeah, argue why they should publish it.
  • 18:07 - 18:10
    We're also collecting use cases.
  • 18:10 - 18:13
    I posted the link to the Etherpad.
  • 18:13 - 18:17
    It's in German and French only,
    the questionnaires.
  • 18:17 - 18:20
    I'm sorry we're still not like up
  • 18:20 - 18:24
    five language count here,
    but you said four languages-
  • 18:24 - 18:26
    (person 3) Just switch to English.
  • 18:26 - 18:28
    (Beat) Yeah, we could switch
    to English, right.
  • 18:30 - 18:32
    Yeah, so that's one point.
  • 18:32 - 18:34
    The other point I think is we could...
  • 18:36 - 18:38
    and I'll put a little bit more love
  • 18:38 - 18:42
    into kind of documenting
    the whole Wiki project,
  • 18:43 - 18:44
    open common data,
  • 18:44 - 18:47
    and that's something
    we're not really doing
  • 18:47 - 18:51
    if you compare it
    to what is going on in GLAM.
  • 18:53 - 18:56
    I think that is definitely something
  • 18:56 - 19:00
    which I probably will try to figure out
  • 19:00 - 19:04
    after my vacation time,
  • 19:04 - 19:07
    which is starting on Monday.
  • 19:08 - 19:11
    There is this WikiProject,
  • 19:11 - 19:15
    and we need to figure out
    who is interested in it
  • 19:15 - 19:18
    what can we do there,
  • 19:18 - 19:21
    and how can we motivate people
  • 19:21 - 19:25
    from kind of [out] the Wikidata community
  • 19:25 - 19:29
    to add this important information to that.
  • 19:29 - 19:32
    So I do think there is a huge opportunity
  • 19:32 - 19:36
    to figure out how we can include
  • 19:36 - 19:43
    more of this really, really valuable
    and reliable data into Wikidata.
  • 19:43 - 19:46
    But overall, there's a lot
    of challenges as well,
  • 19:46 - 19:52
    and still it's kind of
    a different crowd of people,
  • 19:52 - 19:55
    and we need to figure out
    how to bring them together.
  • 19:58 - 20:00
    Any idea is welcome.
  • 20:00 - 20:01
    (Beat) Yeah, there is another point
  • 20:01 - 20:03
    which we're currently not focusing on
  • 20:03 - 20:06
    with this base register
    and vocabulary thing.
  • 20:06 - 20:09
    But what I have had as a request
  • 20:09 - 20:12
    is to be able
    to actually store tabular data
  • 20:12 - 20:14
    and to be able to pull it.
  • 20:15 - 20:17
    Because it does not make sense
  • 20:17 - 20:22
    to put like 200 years
    of population statistics from Zurich
  • 20:22 - 20:26
    into that Wikidata item for Zurich.
  • 20:28 - 20:34
    Maybe I just pick it up
    and just an anecdote from my day work.
  • 20:34 - 20:38
    So I started to introduce Wikidata
    to my colleagues.
  • 20:38 - 20:41
    We are a small team doing open data,
  • 20:41 - 20:47
    and it was fine,
    and they were really, really interested,
  • 20:47 - 20:53
    but in the end we started
    to add some of the population dates,
  • 20:53 - 20:56
    and then, you know, there isn't any order.
  • 20:56 - 21:02
    So it's so hard to figure out
    if you find a population date
  • 21:02 - 21:07
    for year Y or X or something,
    and if it is still missing.
  • 21:07 - 21:11
    So, of course,
    there are still a lot of things
  • 21:11 - 21:13
    to improve in Wikidata as well,
  • 21:13 - 21:17
    and tabular data could be one of it also.
  • 21:18 - 21:21
    (person 4) [inaudible] Is it working?
  • 21:21 - 21:23
    I have a comment on the tabular data.
  • 21:23 - 21:26
    I remember we had also discussions
  • 21:26 - 21:29
    with a canton and the city
    of Zurich about this,
  • 21:29 - 21:33
    and that it might make sense to start
  • 21:33 - 21:37
    discussions on whether
    we should maybe consider
  • 21:37 - 21:42
    setting up a Wikibase
    for open governmental data
  • 21:42 - 21:46
    and having such kind of datasets
  • 21:46 - 21:51
    and then link them to Wikidata
    or link them from Wikidata to them,
  • 21:51 - 21:55
    because mostly
    the linked open data technology
  • 21:55 - 21:57
    is actually enabling that
  • 21:57 - 22:00
    and is one of the key advantages
    of this technology.
  • 22:00 - 22:05
    It is, of course, something
    that doesn't relate only to OGD data,
  • 22:05 - 22:09
    it's a global divide
    in the whole Wikidata community.
  • 22:09 - 22:15
    Because the larger we make
    the central endpoint or the graph
  • 22:15 - 22:19
    the more difficult it is to handle it--
    I think we all agree on that.
  • 22:19 - 22:24
    So I think there should be
    a deeper conversation and discussion
  • 22:24 - 22:28
    on whether we should
    start building this network.
  • 22:28 - 22:30
    Well, actually, there is already
    a network of Wikibases.
  • 22:30 - 22:36
    We also work in the university
    with publications and research data
  • 22:36 - 22:38
    with our own Wikibase.
  • 22:39 - 22:43
    Yeah, and then another comment
    about the Wiki projects.
  • 22:43 - 22:48
    So we continued working
    and documenting the materials
  • 22:48 - 22:49
    of the events,
  • 22:49 - 22:53
    so we actually now have
    two upcoming events in November.
  • 22:53 - 22:57
    We have a full weekend
    technical training on Wikidata
  • 22:57 - 22:59
    in collaboration
    with the open data Zurich people
  • 22:59 - 23:00
    and the canton of Zurich,
  • 23:00 - 23:04
    and also Wikimedia Switzerland,
    and we have a hackathon.
  • 23:04 - 23:07
    But I totally agree that it would be great
  • 23:07 - 23:10
    to start having conversations
    with all the participants
  • 23:10 - 23:12
    that have been listed already
    in the project,
  • 23:12 - 23:14
    and start more discussions,
  • 23:14 - 23:18
    especially with all the countries
    that have many good initiatives,
  • 23:18 - 23:20
    like Germany, like what you described
  • 23:20 - 23:22
    and start documenting
  • 23:22 - 23:25
    what are the specific needs
    of these institutions,
  • 23:25 - 23:26
    what are the problems,
  • 23:26 - 23:30
    and what specific tools
    we need to develop, or procedures,
  • 23:30 - 23:34
    that we can help them import
    or link data in Wikidata.
  • 23:36 - 23:40
    I think we're out of time.
    One last question.
  • 23:41 - 23:44
    (person 5) So a proposal
    to use Wikibase for that?
  • 23:44 - 23:46
    I'm not sure whether
    that actually would solve
  • 23:46 - 23:49
    this tabular data problem.
  • 23:49 - 23:53
    And when thinking of statistical data,
    like population data,
  • 23:53 - 23:56
    that is not data
    that we want to really edit,
  • 23:57 - 23:59
    that's data we just want to consume.
  • 24:00 - 24:03
    So it means we have to ask ourselves
  • 24:03 - 24:07
    whether we want to build in
    the capability to actually pull data
  • 24:07 - 24:10
    directly from external third-party
    SPARQL endpoints,
  • 24:10 - 24:15
    and not just from
    within this Wikibase ecosystem
  • 24:15 - 24:17
    that we're planning to build up as well.
  • 24:17 - 24:19
    (person 4) So I agree
    that it doesn't solve the tabular data,
  • 24:19 - 24:21
    but what I was trying to say
  • 24:21 - 24:24
    is that the information
    that is more specific,
  • 24:24 - 24:27
    it might be the case that we want
    to export it to something else
  • 24:27 - 24:33
    and I see Wikibase also
    as a very good data modeling example.
  • 24:33 - 24:38
    So not only because you want
    to have humans editing,
  • 24:38 - 24:42
    but also because the whole data modeling
    happening in Wikidata
  • 24:42 - 24:44
    with all the qualifiers and references
  • 24:44 - 24:47
    adds a lot to all the datasets.
  • 24:47 - 24:50
    So if we would do it from scratch in RDF
  • 24:51 - 24:53
    we would be missing these features
  • 24:53 - 24:56
    that Wikidata has,
    and I see it has an advantage.
  • 24:56 - 24:59
    So that was a reason why I mentioned
  • 24:59 - 25:01
    that it would be very helpful
    to maybe think of
  • 25:01 - 25:04
    for the Wikibases around the OGD data.
  • 25:05 - 25:08
    (moderator) So, I'm sorry,
    but I think we just ran out of time,
  • 25:08 - 25:11
    and I encourage you
    to keep talking with our speakers,
  • 25:11 - 25:13
    [inaudible] during all the conference
  • 25:13 - 25:15
    and please, a round of applause for them.
  • 25:15 - 25:17
    (applause)
  • 25:20 - 25:21
    Thank you.
Title:
cdn.media.ccc.de/.../wikidatacon2019-11-eng-Open_data_panel_hd.mp4
Video Language:
English
Duration:
25:26

English subtitles

Revisions