< Return to Video

cdn.media.ccc.de/.../wikidatacon2019-13-eng-OpenStreetMap_and_Wikidata_hd.mp4

  • 0:06 - 0:08
    Hello. So, good afternoon.
  • 0:08 - 0:12
    Welcome to the OpenStreetMap
    and Wikidata workshop.
  • 0:12 - 0:13
    My name is Eugene.
  • 0:13 - 0:17
    And together with Edward,
    we'll be talking about OpenStreetMap
  • 0:17 - 0:20
    and how it can work together
    with Wikidata.
  • 0:20 - 0:22
    So, just a show of hands.
  • 0:22 - 0:25
    Who here has an OpenStreetMap account?
  • 0:27 - 0:31
    Okay, some. So, probably
    this is not new to you.
  • 0:31 - 0:34
    But for those who are not familiar
    with OpenStreetMap,
  • 0:34 - 0:37
    I'll give an introduction
    to OpenStreetMap and its data model.
  • 0:37 - 0:39
    So, basically, what is OpenStreetMap?
  • 0:39 - 0:43
    It is basically a crowdsourced project
    to map the whole world.
  • 0:44 - 0:47
    And the usual way we introduce
    OpenStreetMap to people
  • 0:47 - 0:51
    is like OpenStreetMap
    is like Wikipedia for maps.
  • 0:52 - 0:55
    But actually, a more accurate way
    to introduce OpenStreetMap
  • 0:55 - 0:58
    is that it is like Wikidata
    for geographical data.
  • 0:58 - 1:02
    But that presupposes that the audience
    already knows or is familiar
  • 1:02 - 1:04
    with what Wikidata is.
  • 1:04 - 1:08
    And why do we say
    that OpenStreetMap is like Wikidata?
  • 1:08 - 1:12
    And that's because both
    have quite a lot of things in common,
  • 1:12 - 1:15
    both being crowdsourced
    and open data projects.
  • 1:16 - 1:18
    So, you know Wikidata--
  • 1:18 - 1:21
    it has items, statements,
    properties, et cetera.
  • 1:21 - 1:27
    In the same way, OpenStreetMap
    has things like nodes, ways, relations,
  • 1:27 - 1:31
    that have members and roles,
    and these have tags
  • 1:31 - 1:33
    that are composed of keys and values.
  • 1:34 - 1:40
    So, as more detail, nodes, ways,
    and relations model the geometry
  • 1:40 - 1:42
    and topology of objects.
  • 1:43 - 1:47
    And then, we have tags,
    which are actually key value strings
  • 1:47 - 1:51
    that describe the actual things
    that those objects represent.
  • 1:52 - 1:58
    So, to give an example,
    here's the Wikidata item for Berlin.
  • 1:58 - 2:03
    So, we can have property
    like population, 3 million something,
  • 2:03 - 2:06
    with a qualifier, point in time,
    and references.
  • 2:06 - 2:10
    And they have counterparts
    in OpenStreetMap.
  • 2:10 - 2:13
    So, for example, the Berlin relation
    in OpenStreetMap
  • 2:13 - 2:17
    has the tag population is equal
    to 3.4 million something,
  • 2:18 - 2:24
    and it has another tag, source:population
    equals this URL and that date.
  • 2:24 - 2:27
    So, unlike in Wikidata,
  • 2:27 - 2:30
    wherein you can have qualifiers
    and references for your statements,
  • 2:30 - 2:32
    in OpenStreetMap, the tag is quite flat.
  • 2:33 - 2:36
    There's no secondary levels of tags.
  • 2:37 - 2:38
    Everything is flat.
  • 2:38 - 2:41
    And that's why we have to put
    what you call secondary tags.
  • 2:41 - 2:44
    So, for example here, source:population
  • 2:44 - 2:47
    to indicate that the population tag
    has this source.
  • 2:49 - 2:52
    Another thing is that OpenStreetMap's tags
    are not strictly controlled.
  • 2:52 - 2:56
    Unlike with Wikidata, wherein you have
    to have approval process
  • 2:56 - 2:58
    before properties are created,
  • 2:58 - 2:59
    here, OpenStreetMap mappers
  • 2:59 - 3:03
    can invent and add
    any tags that they like.
  • 3:03 - 3:06
    However, there is a tagging
    proposal process
  • 3:06 - 3:08
    in order to propose common tags
  • 3:08 - 3:11
    that will be used by mappers
    all over the world.
  • 3:14 - 3:18
    Okay, data modeling discussions
    on the Wikidata:Project chat page
  • 3:18 - 3:20
    are actually quite similar
    to the discussions
  • 3:20 - 3:22
    in OpenStreetMap's tagging mailing list.
  • 3:23 - 3:27
    For example, here's an example
    of discussion on the project tag,
  • 3:28 - 3:31
    how do we model a building
    that has changed its use?
  • 3:31 - 3:33
    In OpenStreetMap,
    we have similar discussions.
  • 3:33 - 3:36
    How do we tag these sorts of buildings?
  • 3:38 - 3:41
    So, I've given an introduction
    of what OpenStreetMap is.
  • 3:41 - 3:45
    I'd love to discuss it more,
    but we don't have enough time.
  • 3:45 - 3:49
    So, we'll go into how do we link
  • 3:49 - 3:51
    between OpenStreetMap
    and Wikidata together.
  • 3:51 - 3:55
    I don't have to explain
    why linking is a good thing.
  • 3:55 - 3:58
    We're all Wikidatans, and we know
    that linking data is a good thing.
  • 3:59 - 4:03
    So, how do we actually link
    Wikidata with OpenStreetMap?
  • 4:04 - 4:06
    So, from Wikidata to OpenStreetMap,
  • 4:07 - 4:11
    Wikidata items on places can link
    to OpenStreetMap relations
  • 4:11 - 4:15
    using the OSM relation ID,
    or the P402 property.
  • 4:15 - 4:18
    So, the question is: why only relations?
  • 4:18 - 4:21
    That's because OSM IDs are not stable.
  • 4:22 - 4:26
    For example, you can change nodes
    to represent a different object.
  • 4:27 - 4:29
    Ways can be split
  • 4:30 - 4:35
    to add new information about those ways.
  • 4:35 - 4:42
    However, relatively, relations
    in OpenStreetMap are relatively stable.
  • 4:43 - 4:47
    At least for major relations,
    such as administrative boundaries,
  • 4:47 - 4:50
    or highway routes,
    or public transportation routes.
  • 4:51 - 4:55
    That way, you can link at least the,
    for example, here,
  • 4:55 - 4:58
    Berlin Wikidata data edit item,
  • 4:58 - 5:00
    can link to the relation
    representing the boundary
  • 5:00 - 5:04
    in OpenStreetMap via its ID.
  • 5:05 - 5:09
    In terms of the ontology,
    Wikidata items and properties
  • 5:09 - 5:13
    for geographical features
    can link to "equivalent," in quotes,
  • 5:13 - 5:17
    OpenStreetMap classes
    using the OSM tag or key property.
  • 5:17 - 5:23
    For example, the lighthouse item
    in Wikidata has the value
  • 5:23 - 5:27
    for OpenStreetMap tag or key
    Tag:man_made=lighthouse.
  • 5:27 - 5:32
    That means that lighthouses are equivalent
    to objects that are tagged
  • 5:32 - 5:35
    in OpenStreetMap
    with man-made lighthouse.
  • 5:37 - 5:41
    Going in the other direction,
    OpenStreetMap objects can link
  • 5:41 - 5:45
    to corresponding Wikipedia articles
    and Wikidata items
  • 5:45 - 5:48
    using the Wikipedia
    and Wikidata tags, respectively.
  • 5:49 - 5:50
    So, here's an example.
  • 5:50 - 5:52
    The OpenStreetMap relation for Berlin.
  • 5:52 - 5:55
    We have the Wikidata tag, Q64,
  • 5:55 - 6:00
    and the Wikipedia article
    linking to the German article for Berlin.
  • 6:02 - 6:05
    There are also several Wikidata
    secondary tags,
  • 6:05 - 6:09
    such as for example, brand:wikidata,
    architect:wikidata, artist:wikidata,
  • 6:09 - 6:11
    or name:etymology:wikidata.
  • 6:11 - 6:14
    We use this in order to exactly specify
  • 6:15 - 6:18
    what we are referring to.
  • 6:18 - 6:21
    For example, on the top part here,
    we have the example.
  • 6:22 - 6:24
    There's an artwork in OpenStreetMap
  • 6:24 - 6:27
    that was created
    by the artist named Herakut,
  • 6:28 - 6:29
    but who is that?
  • 6:29 - 6:33
    So, in order to specify exactly,
    we use artist:wikidata,
  • 6:33 - 6:34
    and that Q ID number.
  • 6:34 - 6:40
    So that you can be exactly sure
    which Herakut artist it really is.
  • 6:40 - 6:44
    This is also useful, for example,
    if you're tagging, for example,
  • 6:46 - 6:49
    objects in OpenStreetMap
    that are in a different language.
  • 6:49 - 6:53
    For example, in Japan, you might have
    a fast-food restaurant
  • 6:53 - 6:56
    called Makudonarudo,
    which is actually McDonald's.
  • 6:56 - 6:59
    So, you can tag that using
    the brand:wikidata tag
  • 7:00 - 7:03
    pointing to the McDonald's
    item in Wikidata.
  • 7:06 - 7:11
    So, in terms of ontology,
    we define and describe the tags
  • 7:11 - 7:15
    in OpenStreetMap
    on the OpenStreetMap Wiki,
  • 7:15 - 7:20
    and we can add links to corresponding
    Wikipedia articles and Wikidata items
  • 7:21 - 7:26
    so that we can sort of explain
    the correspondences and relations
  • 7:26 - 7:30
    between these tags and items in Wikidata.
  • 7:33 - 7:39
    Okay, so how do OpenStreetMap
    and Wikimedia use each other's data?
  • 7:40 - 7:42
    So, first, we have the interactive maps.
  • 7:42 - 7:45
    So, OpenStreetMap data powers
  • 7:45 - 7:48
    the Wikimedia Foundation's
    Kartotherian map tile service,
  • 7:48 - 7:51
    which is used by the Kartographer
    MediaWiki extension.
  • 7:52 - 7:54
    So, basically, any time you see
    an interactive map
  • 7:54 - 7:58
    or almost any interactive map
    on any Wikimedia project,
  • 7:58 - 8:02
    that is usually powered
    by the Kartotherian map tile service.
  • 8:02 - 8:07
    For example, here's the interactive map
    for Berlin in the English Wikivoyage.
  • 8:08 - 8:12
    So, the base map there is all coming
    from OpenStreetMap.
  • 8:13 - 8:17
    So, another thing that
    the Kartographer extension can do
  • 8:17 - 8:20
    is it can pull and overlay geometry
    from OpenStreetMap.
  • 8:21 - 8:26
    So, here's the infobox on Commons
    for the Berlin category.
  • 8:26 - 8:31
    And the map there, you can see
    an outline for Berlin, there.
  • 8:31 - 8:34
    That outline comes from OpenStreetMap.
  • 8:36 - 8:41
    In 2008, the foundation released
    localized map tiles for Kartotherian,
  • 8:41 - 8:45
    and this leveraged the multilingual
    name tags in OpenStreetMap,
  • 8:45 - 8:49
    so that you can view those maps
    that you see on Wikimedia projects
  • 8:49 - 8:51
    in the user's language.
  • 8:53 - 8:56
    Then, how do we use Wikidata
    in OpenStreetMap?
  • 8:57 - 8:59
    For example, when tagging brands,
  • 8:59 - 9:02
    for example, in shops
    and restaurants or banks,
  • 9:03 - 9:08
    OpenStreetMap's Name Suggestion Index
    uses Wikidata to provide brand identity
  • 9:08 - 9:09
    and improved tagging.
  • 9:09 - 9:13
    So, for example, if you tag
    an object in OpenStreetMap
  • 9:13 - 9:18
    with brand Wikidata pointing
    to the McDonald's item in Wikidata,
  • 9:20 - 9:23
    the name field is now automatically locked
  • 9:23 - 9:27
    so that users cannot just change that
    to, for example, Burger King.
  • 9:28 - 9:33
    And then, you can edit or also pull icons,
  • 9:34 - 9:36
    the McDonald's icon there
  • 9:36 - 9:42
    that is taken from the Facebook
    item property
  • 9:42 - 9:44
    in Wikidata.
  • 9:44 - 9:46
    So, yeah.
  • 9:46 - 9:50
    So, that, at least, when users are tagging
    these shops in OpenStreetMap,
  • 9:50 - 9:53
    they can be sure
    that they're doing it correctly.
  • 9:55 - 9:59
    Okay, so Sophox is a SPARQL endpoint
    for OpenStreetMap data.
  • 10:00 - 10:04
    So, this service can use RDF federation
    to also query linked Wikidata items.
  • 10:05 - 10:09
    So, actually, in OpenStreetMap,
    we usually use other query services,
  • 10:09 - 10:11
    such as Overpass.
  • 10:11 - 10:17
    But if you want to also query
    using Wikidata items,
  • 10:17 - 10:22
    we have the Sophox endpoint
    that you can use.
  • 10:24 - 10:27
    And for geocoding,
    if you're not familiar with geocoding,
  • 10:27 - 10:32
    basically, that's the technology
    wherein given an address,
  • 10:33 - 10:37
    you are returned geocoordinates.
  • 10:38 - 10:43
    So, we have what we call Nominatim,
    which is the usual service
  • 10:43 - 10:45
    in OpenStreetMap for doing geocoding.
  • 10:45 - 10:50
    And previously, it already uses
    Wikipedia tags in OpenStreetMap.
  • 10:51 - 10:55
    But this year, we added
    Google Summer of Code project code
  • 10:55 - 10:59
    to integrate using Wikidata tags
    in Nominatim,
  • 10:59 - 11:03
    so that search results
    can become more relevant for users
  • 11:03 - 11:05
    who are doing the searches.
  • 11:07 - 11:12
    And for localization, Mapbox and MapTiler,
    which are third-party companies
  • 11:12 - 11:16
    that extensively use OpenStreetMap,
  • 11:16 - 11:19
    they use Wikidata to power
    their localized map products.
  • 11:19 - 11:24
    So, basically, if there
    are missing name tags in OpenStreetMap,
  • 11:24 - 11:30
    and if that object is linked to Wikidata,
    they can pull the labels from Wikidata,
  • 11:30 - 11:35
    and use that to show multilingual labels,
    if that is missing in OpenStreetMap.
  • 11:36 - 11:40
    The reason for that is because
    we have a philosophy in OpenStreetMap
  • 11:40 - 11:46
    that we do not try to add too many tags,
    especially if that can be automated.
  • 11:46 - 11:48
    For example,
    for automatic transliterations,
  • 11:49 - 11:53
    if that can be automated, we don't need
    to add that to OpenStreetMap.
  • 11:53 - 11:56
    But in Wikidata, that's no problem.
  • 11:56 - 12:01
    So, you can do that by doing this linking
    between OpenStreetMap and Wikidata.
  • 12:01 - 12:07
    You don't have to do
    that transliteration on your own.
  • 12:07 - 12:09
    You can just pull it from Wikidata.
  • 12:10 - 12:16
    And also, the OpenStreetMap Wiki
    has the Wikibase extension installed.
  • 12:17 - 12:22
    So, the idea here is that we want
    the tag information or the description
  • 12:22 - 12:27
    or the description of the tags,
    that we use in OpenStreetMap
  • 12:27 - 12:29
    to be machine-readable.
  • 12:29 - 12:33
    Hopefully, this will be used
    by software and editors
  • 12:33 - 12:38
    that use OpenStreetMap data to better see
  • 12:38 - 12:41
    how objects are described
    in OpenStreetMap.
  • 12:42 - 12:44
    Currently, this is not used as much,
  • 12:44 - 12:49
    but hopefully, as the tagging information
    becomes more complete and better,
  • 12:49 - 12:52
    this can be used
    by OpenStreetMap software,
  • 12:52 - 12:55
    thanks to the Wikibase installation.
  • 12:56 - 12:59
    Okay, some copyright and IP issues.
  • 13:00 - 13:04
    Wikidata can't import coordinates
    from OpenStreetMap.
  • 13:04 - 13:07
    The reason for that is because
    OpenStreetMap is licensed
  • 13:07 - 13:10
    under the Open Database License.
  • 13:12 - 13:15
    And also, we have conflicting doctrines.
  • 13:15 - 13:19
    Here in the European Union
    and the United Kingdom,
  • 13:19 - 13:21
    we have database rights.
  • 13:22 - 13:27
    Whereas, in the US, we have the idea
    that facts are not copyrightable.
  • 13:27 - 13:32
    So, we cannot just--
    even though you cannot say--
  • 13:32 - 13:36
    you cannot copyright the fact
    that this restaurant or this bank
  • 13:36 - 13:40
    or this place is at this location,
  • 13:40 - 13:45
    doing that as an import
    or as a batch job
  • 13:45 - 13:48
    is not allowed
  • 13:48 - 13:52
    because OpenStreetMap
    is protected by database rights
  • 13:53 - 13:56
    being hosted in the United Kingdom.
  • 13:58 - 14:04
    Conversely, OpenStreetMap
    will not import geodata from Wikidata,
  • 14:04 - 14:08
    despite the CC0 license,
    because of data provenance issues.
  • 14:10 - 14:15
    If you're not familiar
    with how geocoordinates are added
  • 14:15 - 14:20
    into Wikipedia articles,
    usually users just go to Google Maps,
  • 14:20 - 14:24
    search, and then copy the coordinates
    that show up in the results,
  • 14:24 - 14:27
    and place that
    into the Wikipedia articles.
  • 14:28 - 14:31
    In OpenStreetMap, we, as much as possible,
  • 14:31 - 14:35
    avoid copying data
    from third-party sources
  • 14:35 - 14:38
    that are proprietary, such as,
    for example, Google Maps.
  • 14:39 - 14:43
    And because of that, we will never,
    in OpenStreetMap, never import data
  • 14:43 - 14:47
    from Wikipedia and also Wikidata,
  • 14:47 - 14:51
    because most coordinates in Wikidata
    have been imported from Wikipedia.
  • 14:52 - 14:55
    So, it's an established principle
    on OpenStreetMap
  • 14:55 - 14:57
    that we don't import from Wikipedia.
  • 14:58 - 15:02
    Okay, I'll just then
    turn it over to Edward.
  • 15:03 - 15:06
    (Edward) I'm going to talk
    about the process for adding links
  • 15:06 - 15:08
    from OpenStreetMap to Wikidata.
  • 15:08 - 15:13
    So, I've written a tool
    for automating this process.
  • 15:14 - 15:16
    Like, it's user-assisted editing.
  • 15:16 - 15:18
    So, it's not a fully automated tool.
  • 15:18 - 15:20
    It's available. Anyone can use it.
  • 15:20 - 15:22
    There's the address.
  • 15:22 - 15:29
    So, when I run the tool on Berlin,
    it finds 2,800 matches.
  • 15:30 - 15:32
    So, these are Wikidata items
  • 15:32 - 15:37
    where it thinks it has found
    the same OpenStreetMap objects.
  • 15:38 - 15:41
    So, the matcher is using this criteria.
  • 15:41 - 15:43
    It looks for things
    that are the same entity type.
  • 15:43 - 15:45
    They've got the same coordinates,
  • 15:45 - 15:48
    and then either they've got the same name,
    street address, or identifier.
  • 15:49 - 15:53
    So, with the name, it's doing lots
    of kind of normalization,
  • 15:53 - 15:58
    like lower casing, removing spaces,
    all kinds of bits and pieces
  • 15:58 - 16:00
    to try and match up
    slightly different ways
  • 16:00 - 16:01
    that you could write a name.
  • 16:01 - 16:04
    And I'm looking at names
    from different sources,
  • 16:04 - 16:05
    like the labels, and the aliases,
  • 16:05 - 16:09
    but also the site links,
    the article titles,
  • 16:09 - 16:14
    and I pull anything in bold
    from the Wikipedia article,
  • 16:14 - 16:16
    so lots of sources for names.
  • 16:17 - 16:19
    These are the identifiers
    that I'm matching on.
  • 16:19 - 16:22
    So, we've got lots
    of identifiers in Wikidata.
  • 16:22 - 16:25
    OpenStreetMap has identifiers, as well.
  • 16:25 - 16:27
    So, I've got a mapping
    between the name of the tag
  • 16:27 - 16:32
    for the identifier in OpenStreetMap,
    and the property in Wikidata,
  • 16:32 - 16:34
    and I look for things
    that have the same identifier.
  • 16:35 - 16:39
    So, my first version of this,
    I tried to completely automate it,
  • 16:39 - 16:42
    and the OpenStreetMap community
    was not impressed.
  • 16:42 - 16:45
    So, better to have
    a semi-automated process,
  • 16:45 - 16:49
    so people put in a place name,
    and then they see a list of matches,
  • 16:49 - 16:52
    and they go through
    and they check the matches,
  • 16:52 - 16:54
    and when they're happy, they hit save.
  • 16:54 - 16:57
    And the OpenStreetMap community
    is much happier with that.
  • 16:57 - 17:01
    It does make mistakes, the software,
    it tries very carefully,
  • 17:01 - 17:03
    but there are errors in there.
  • 17:03 - 17:06
    So you have to have
    someone checking them.
  • 17:07 - 17:09
    I've got a question of like,
    when I designed this,
  • 17:09 - 17:11
    I felt like there should be
    a one-to-one mapping
  • 17:11 - 17:14
    between things
    in OpenStreetMap and Wikidata,
  • 17:14 - 17:15
    and it doesn't really work.
  • 17:15 - 17:21
    Like for my example, tunnels
    often get represented as two objects
  • 17:21 - 17:23
    in OpenStreetMap--
    one for each tunnel bore,
  • 17:23 - 17:26
    or each road, lane within the tunnel--
  • 17:26 - 17:30
    whereas in Wikidata they tend
    to be represented as a single item,
  • 17:31 - 17:34
    so I need to change my software
    to take account of this.
  • 17:36 - 17:39
    And I have difficulties with tram stops.
  • 17:39 - 17:43
    So one item in Wikidata for a tram stop,
  • 17:43 - 17:47
    but in OpenStreetMap, it's represented
  • 17:47 - 17:50
    as a relation with nodes
  • 17:50 - 17:53
    where the tram stops
    on either side of the road.
  • 17:53 - 17:58
    But I'm using a piece of software
    called osm2pgsql
  • 17:58 - 18:01
    to do the OpenStreetMap side of things.
  • 18:01 - 18:03
    And it doesn't really support
    these relations.
  • 18:03 - 18:06
    So, I'm struggling with tram stops.
  • 18:07 - 18:08
    And so, people are using this tool.
  • 18:08 - 18:12
    There's almost 10,000 changesets
    uploaded to OpenStreetMap.
  • 18:12 - 18:15
    Edits on OpenStreetMap
    are grouped into changesets;
  • 18:15 - 18:18
    they're not individual edits
    like on Wikidata.
  • 18:18 - 18:20
    And I've got over 200 users.
  • 18:21 - 18:26
    And using this tool, there's been
    a quarter of a million links added
  • 18:26 - 18:27
    to OpenStreetMap.
  • 18:28 - 18:32
    But overall, those people are adding
    Wikidata links by hand,
  • 18:32 - 18:36
    or with other tools, and there's now
    1.4 million OpenStreetMap objects
  • 18:36 - 18:38
    with a Wikidata tag.
  • 18:40 - 18:43
    Yeah, so that is our presentation.
  • 18:44 - 18:45
    Any questions?
  • 18:45 - 18:47
    And just while we're taking questions,
  • 18:47 - 18:50
    I'll see if I can do
    a live demo of the tool.
  • 18:52 - 18:54
    Any questions?
  • 19:00 - 19:05
    (woman) I'm very interested in sort of
    surpassing this license problem.
  • 19:06 - 19:12
    And can you tell me about strategies,
    that can be already used, such as--
  • 19:13 - 19:17
    I understand that there
    are some contributions that aren't CC0--
  • 19:17 - 19:22
    or like the users, or whatever they are--
  • 19:22 - 19:27
    that can facilitate the exchange
    of information between the systems.
  • 19:28 - 19:31
    (Edward) It's true that when you sign up
    to OpenStreetMap,
  • 19:31 - 19:34
    you can tick a box saying,
    "My edits are CC0."
  • 19:34 - 19:37
    But the difficulty is that you
    are often editing something
  • 19:37 - 19:39
    that somebody else has edited.
  • 19:39 - 19:46
    And so, it's difficult
    to isolate the CC0 edits.
  • 19:48 - 19:52
    (woman) Maybe like-- further question,
    like what can we do about that?
  • 19:52 - 19:56
    Like, can we discuss-- I mean,
    I think it has been tried,
  • 19:56 - 20:00
    but I don't think it's necessarily
    doomed to fail.
  • 20:03 - 20:08
    (Eugene) Well, the best thing we can do
    is try to link items together
  • 20:08 - 20:11
    using Edward's tool and other tools.
  • 20:12 - 20:19
    But for the moment, we just try
    to map these things separately.
  • 20:20 - 20:22
    Maybe we can coordinate--
  • 20:22 - 20:28
    for example, if the third-party database
    that we want to import is compatible
  • 20:28 - 20:33
    with both Wikidata and OpenStreetMap,
    you can do a coordinated import to both.
  • 20:33 - 20:38
    But otherwise, we really have
    to respect the license,
  • 20:38 - 20:42
    because in order
    for OpenStreetMap to work,
  • 20:43 - 20:46
    it really respects intellectual
    property and copyright.
  • 20:55 - 20:59
    (man) Thank you. Is it possible
    to change the language of the background
  • 20:59 - 21:02
    when you go to the map?
  • 21:03 - 21:06
    Because it appears the language
    of the local place
  • 21:06 - 21:08
    that you are looking for.
  • 21:10 - 21:16
    (Eugene) So, in OpenStreetMap,
    basically, we tag the default name
  • 21:16 - 21:21
    according to the local language
    of that place.
  • 21:22 - 21:25
    So, for example, if you go to Japan
    in OpenStreetMap,
  • 21:25 - 21:27
    you will see Japanese names.
  • 21:28 - 21:31
    You cannot do that using
    the OpenStreetMap website,
  • 21:31 - 21:35
    but there are third-party services
    or tile services
  • 21:35 - 21:40
    that provide multilingual maps.
  • 21:40 - 21:43
    As I mentioned, there's Mapbox,
    there's MapTiler.
  • 21:43 - 21:46
    They provide multilingual maps
    so that you can use that
  • 21:46 - 21:50
    instead of the default layer
    in OpenStreetMap.
  • 21:51 - 21:56
    (man) [inaudible]
  • 21:56 - 22:01
    or from the OpenStreetMap [inaudible]?
  • 22:02 - 22:04
    Yeah, for example--
  • 22:04 - 22:06
    (man) [inaudible]
  • 22:06 - 22:11
    not actually to this tool,
    but also [inaudible].
  • 22:11 - 22:15
    Well, currently, OpenStreetMap,
    as a project does not--
  • 22:17 - 22:19
    no project to provide this service,
  • 22:19 - 22:23
    because the idea
    is that we provide the data,
  • 22:23 - 22:27
    and other people can build on that
    to provide the services
  • 22:27 - 22:30
    that users will be able to use.
  • 22:33 - 22:39
    (man 2) Yeah, this is a great project
    for all to [inaudible] on Wikidata.
  • 22:39 - 22:45
    Say, in Wikidata,
    we have a lot of locations,
  • 22:46 - 22:49
    which is already coded and it is CC0.
  • 22:49 - 22:56
    So, there are a lot of images,
    a lot of other things are in Wikidata.
  • 22:56 - 23:01
    So, if we integrate
    this Wikidata Q items to OSM,
  • 23:04 - 23:07
    can we pull this,
    all the other information
  • 23:07 - 23:10
    from Wikidata directly to OpenStreetMap,
  • 23:10 - 23:12
    any kind of tool, or something like that?
  • 23:12 - 23:16
    Or can we add an image
    which is in Commons?
  • 23:16 - 23:20
    Can we add the link of the image
    in Commons to OpenStreetMap,
  • 23:20 - 23:23
    like this Wikidata ID?
  • 23:24 - 23:26
    I feel like you don't need to.
  • 23:26 - 23:29
    Just leave the data in Wikidata,
    and access it through the link.
  • 23:29 - 23:32
    Like just add the link
    from OpenStreetMap to Wikidata,
  • 23:32 - 23:35
    and then, if you want the images--
    don't duplicate the data,
  • 23:35 - 23:38
    don't have the same thing in both places.
  • 23:39 - 23:42
    And like Eugene was saying,
    it's a bit tricky copying the data.
  • 23:42 - 23:44
    It's true that it's CC0,
  • 23:44 - 23:46
    but if we just start
    importing lots of data,
  • 23:46 - 23:48
    then OpenStreetMap's going to ask
    what's the provenance of the data,
  • 23:48 - 23:51
    where has all this come from.
  • 23:53 - 23:55
    I mean, I don't know if Eugene--
    if you got anything to add to that.
  • 23:55 - 23:59
    (Eugene) Well, OpenStreetMap
    does have an image tag.
  • 23:59 - 24:02
    So, you can add that image tag
    pointing to a Commons file,
  • 24:02 - 24:04
    if you really want to.
  • 24:04 - 24:08
    But if you link it to the Wikidata item,
    you don't need that,
  • 24:08 - 24:11
    because the Wikidata item
  • 24:11 - 24:15
    would probably have
    a Commons category statement,
  • 24:15 - 24:19
    and that provides you links
    to several images, as well.
  • 24:20 - 24:23
    You don't need to add that
    in OpenStreetMap.
  • 24:24 - 24:26
    Can I just show this quick demo.
  • 24:26 - 24:28
    This is the software that I built.
  • 24:28 - 24:32
    So, I've looked up
    Orange County in Indiana.
  • 24:33 - 24:38
    You can see, I've already run the software
    in 2017, and I added 43 elements.
  • 24:39 - 24:41
    It guesses the language is English,
  • 24:41 - 24:46
    by looking at the number of languages
    that the Wikidata labels are in.
  • 24:47 - 24:49
    And so the software
    has found five matches,
  • 24:49 - 24:52
    and it's got a list of matches
    with tick boxes.
  • 24:52 - 24:54
    There's a map.
  • 24:54 - 24:59
    It shows you the first paragraph
    from the Wikipedia article
  • 24:59 - 25:01
    in the currently selected language.
  • 25:01 - 25:05
    If I say, show tags, these are the tags
    from OpenStreetMap,
  • 25:05 - 25:08
    so it's matched--the name is the same.
  • 25:08 - 25:12
    It says it's found eight name matches,
    and it's a hotel which matches.
  • 25:13 - 25:14
    I can say, show on map.
  • 25:14 - 25:19
    And the pin is the location
    of the Wikidata coordinates,
  • 25:19 - 25:22
    and it's matched
    this hotel building polygon.
  • 25:23 - 25:25
    So, I can go through,
    and you can see some others.
  • 25:25 - 25:26
    There's a school.
  • 25:26 - 25:28
    It's failed with the airport.
  • 25:28 - 25:31
    The airport is represented
    both as a node and as a way,
  • 25:31 - 25:33
    and it can't figure out which one to use,
  • 25:34 - 25:36
    so it isn't going to do the airport.
  • 25:36 - 25:42
    Here's a historic bank building
    that it's managed to match.
  • 25:43 - 25:46
    There's an old name tag in OpenStreetMap,
  • 25:46 - 25:49
    that it's matched the old name,
    with the name that is in Wikidata.
  • 25:50 - 25:53
    And then it's also matched up
    public library.
  • 25:53 - 25:56
    So, if I click on add wikidata tags
    to OpenStreetMap
    ,
  • 25:56 - 25:58
    it gives me a change comment field
    where I could edit it--
  • 25:58 - 26:00
    change comment if I wanted.
  • 26:00 - 26:02
    And it shows me the same matches again.
  • 26:02 - 26:04
    And I hit save, and it's edited the map,
  • 26:04 - 26:08
    and it's added
    the Wikidata tags to the map.
  • 26:10 - 26:16
    (applause)
  • 26:20 - 26:22
    ([Muhammad]) It's actually not a question.
  • 26:22 - 26:24
    But first, thank you for this project.
  • 26:25 - 26:28
    My name is [Muhammad Hidjal]
    from Palestine.
  • 26:28 - 26:32
    I am a civil engineer,
    and I do some special statistics.
  • 26:33 - 26:37
    A few months ago, a magazine in my country
    asked me to do some statistics
  • 26:37 - 26:39
    on Nobel Prize winners.
  • 26:40 - 26:43
    So, for that, I used
    Wikidata Query Service,
  • 26:43 - 26:47
    and ArcGIS program for geographic
    information system analyzation.
  • 26:48 - 26:52
    I extracted the place of birth
    for all Nobel Prize winners,
  • 26:52 - 26:56
    and projected them on the map
    using ArcGIS program,
  • 26:56 - 26:59
    and then they asked me,
    "How many of them--
  • 26:59 - 27:03
    how many of the winners were born
    in the north part of the world,
  • 27:03 - 27:06
    how many of them were born
    in the south part of the world?"
  • 27:06 - 27:09
    The problem here is
    that ArcGIS program is not free
  • 27:09 - 27:12
    and I don't want to use it anymore.
  • 27:12 - 27:15
    Can I do these statistics
    using OpenStreetMap
  • 27:15 - 27:19
    after projecting the special
    information of these?
  • 27:21 - 27:24
    Okay, so the problem is
    that what you're doing--
  • 27:24 - 27:29
    what you're trying to do is
    what we call a geospatial analysis.
  • 27:29 - 27:32
    However, OpenStreetMap is a data project.
    We provide data.
  • 27:33 - 27:37
    And what you can do is, for example,
    take the data from OpenStreetMap,
  • 27:37 - 27:41
    take the data from
    your Nobel Prize place of births,
  • 27:41 - 27:45
    and use a tool, like ArcGIS,
    which is not free,
  • 27:45 - 27:47
    but there's an open source tool,
    called QGIS,
  • 27:47 - 27:51
    which you can use to do
    that spatial analysis that you want.
  • 27:51 - 27:54
    So, you can combine, for example,
  • 27:54 - 27:58
    the boundaries for northern countries
    versus southern countries,
  • 27:58 - 28:03
    put that in QGIS, then put your data
    with the Nobel Prize place of births,
  • 28:04 - 28:07
    and then do an intersection
    tool or function.
  • 28:08 - 28:09
    So, yeah.
  • 28:10 - 28:13
    So, I think we're out of time,
    and it's lunch now.
  • 28:13 - 28:15
    Thanks, everyone.
  • 28:15 - 28:20
    (applause)
Title:
cdn.media.ccc.de/.../wikidatacon2019-13-eng-OpenStreetMap_and_Wikidata_hd.mp4
Video Language:
English
Duration:
28:26

English subtitles

Revisions