Return to Video

cdn.media.ccc.de/.../wikidatacon2019-1037-eng-All_the_worlds_GLAM_institutions_-_and_how_we_tried_to_find_them_hd.mp4

  • 0:06 - 0:09
    (chairman) Good afternoon, everybody.
    Welcome back.
  • 0:10 - 0:15
    We will start our last session
    in this room before the closing session.
  • 0:16 - 0:19
    On stage is Alicia Fagerving
    from Wikimedia Sweden,
  • 0:19 - 0:24
    who will tell us more about finding GLAMs
    all around the world.
  • 0:24 - 0:27
    Don't forget
    the collaborative note taking--
  • 0:27 - 0:30
    there is a link to the Etherpads
    on the program,
  • 0:30 - 0:33
    as you know by now, and please help us
  • 0:33 - 0:36
    all for the documentation
    for this presentation.
  • 0:36 - 0:37
    Thank you very much.
  • 0:37 - 0:38
    (Alicia) Thank you.
  • 0:39 - 0:41
    My name is Alicia Fagerving.
  • 0:41 - 0:44
    I am a developer at Wikimedia Sverige,
  • 0:44 - 0:47
    the Swedish chapter
    of the Wikimedia Foundation.
  • 0:47 - 0:52
    And today I am going to talk
    about the project
  • 0:52 - 0:55
    that, I know, some of you
    have heard about.
  • 0:56 - 1:00
    It is a project that we have been
    working on for quite a while,
  • 1:01 - 1:05
    and we've been reaching out
    to the global Wikimedia community
  • 1:05 - 1:07
    to keep you updated.
  • 1:07 - 1:11
    We have received already
    some very enthusiastic feedback,
  • 1:12 - 1:15
    both earlier on and at this conference,
  • 1:15 - 1:20
    indicating that there is
    indeed a lot of interest
  • 1:20 - 1:23
    in the potential of Wikidata
  • 1:23 - 1:29
    as the ultimate global database
  • 1:29 - 1:32
    of cultural heritage institutions.
  • 1:38 - 1:44
    The project that we are working on
    is called FindingGLAMs,
  • 1:44 - 1:47
    and its goal is, in short,
  • 1:47 - 1:52
    to put every museum, gallery,
    archive and library
  • 1:52 - 1:54
    on the Wikidata map.
  • 1:55 - 1:58
    Wikimedia Sweden
    is working on this project
  • 1:58 - 2:01
    in collaboration
    with the Wikimedia Foundation
  • 2:01 - 2:03
    and with UNESCO,
  • 2:04 - 2:08
    and with financing
    from the Swedish Postcode Foundation,
  • 2:08 - 2:13
    which is an organization that promotes
    positive social development.
  • 2:15 - 2:20
    Originally we thought that we would
    end our project in November,
  • 2:21 - 2:23
    which is why the title of the session,
  • 2:24 - 2:27
    All the world's GLAM institutions
    and how we try to find them
    ,
  • 2:27 - 2:29
    is in the past tense.
  • 2:29 - 2:34
    Because we expected that at this point
    we would be able take a look back
  • 2:34 - 2:36
    and sum up our efforts.
  • 2:39 - 2:42
    Well, it turns out we still have
    a lot of work to look forward to
  • 2:42 - 2:47
    as the project has been extended
    until the end of February 2020.
  • 2:48 - 2:52
    And this is great news as it means
    that we will be able
  • 2:52 - 2:56
    to guide and support more volunteers
  • 2:56 - 3:00
    who want to bring this project
    to their local Wikimedia communities.
  • 3:01 - 3:03
    And I'll talk about that a bit later on,
  • 3:03 - 3:07
    but first let's have a look at some facts
  • 3:07 - 3:11
    to understand why we are even doing this.
  • 3:13 - 3:18
    How many GLAMs are there on Wikidata?
  • 3:19 - 3:24
    Almost 130,000 right now.
  • 3:24 - 3:28
    And that is a lot,
    like I would be very happy
  • 3:28 - 3:32
    if I had 130,000 of anything
  • 3:32 - 3:34
    but that's obviously not enough.
  • 3:35 - 3:42
    Now, we don't really know
    how many GLAMs there are in the world.
  • 3:43 - 3:47
    But we have access
    to some estimates and service,
  • 3:48 - 3:50
    for example, IFLA,
  • 3:50 - 3:55
    the International Federation
    of Library Associations and Institutions,
  • 3:55 - 3:57
    estimates on their website
  • 3:57 - 4:04
    that there are
    about 2.5 million libraries
  • 4:05 - 4:06
    around the world.
  • 4:06 - 4:11
    If I remember correctly,
    most of these are school libraries.
  • 4:12 - 4:14
    And on Wikidata
  • 4:15 - 4:19
    there are about 57,000 items
  • 4:19 - 4:24
    for different sorts of libraries
    around the world.
  • 4:26 - 4:30
    If we look at museums in the U.S.,
  • 4:30 - 4:35
    we know that there are
    at least 30,000 museums.
  • 4:35 - 4:38
    That is in the U.S. alone,
  • 4:39 - 4:43
    while Wikidata only has items
    for a couple of thousand,
  • 4:43 - 4:47
    about 4,000 museums in the U.S.
  • 4:48 - 4:53
    So this shows
    that there is still a lot to do
  • 4:53 - 4:58
    and a lot of possibilities
    to ingest more data.
  • 5:01 - 5:06
    Obviously it's not only numbers
    that are interesting
  • 5:06 - 5:08
    but where those GLAMs are located.
  • 5:09 - 5:14
    About 60,000 of GLAMs
    on Wikidata have coordinates,
  • 5:14 - 5:17
    which means we can show them on the map.
  • 5:18 - 5:21
    And this is an interesting way
    of demonstrating
  • 5:21 - 5:26
    not only the size of the data
    but also the geographical bias.
  • 5:26 - 5:32
    There are a lot of GLAMs
    in Western and Central Europe.
  • 5:32 - 5:36
    You can see that the U.K.,
    France, Germany, also Sweden
  • 5:36 - 5:38
    are well represented.
  • 5:39 - 5:44
    We can see, there is
    quite a lot of dots in the U.S.
  • 5:44 - 5:47
    and in certain parts of South America.
  • 5:49 - 5:54
    If we look at Asia,
    we see that Japan stands out,
  • 5:54 - 5:58
    but countries like India,
    Russia or Indonesia
  • 5:59 - 6:01
    have really bad coverage,
  • 6:01 - 6:05
    despite the fact that they are
    very, very highly populated.
  • 6:07 - 6:09
    And obviously Africa--
  • 6:09 - 6:12
    I don't even have to mention
  • 6:12 - 6:15
    that the coverage there
    is very, very bad--
  • 6:15 - 6:19
    very few GLAMs in African countries.
  • 6:21 - 6:25
    And of course this distribution
    is not unique to just GLAMs,
  • 6:26 - 6:30
    it's typical for other types
    of items on Wikidata.
  • 6:30 - 6:33
    The global North is overrepresented
  • 6:33 - 6:37
    because that's where
    a lot of the volunteers come from,
  • 6:37 - 6:40
    and that's the areas
    they have a lot of interest in.
  • 6:42 - 6:45
    Now, I haven't mentioned the colors yet
  • 6:45 - 6:48
    but they are important as well.
  • 6:49 - 6:54
    We've run the queries
    to find the GLAMs in Wikidata twice--
  • 6:54 - 6:56
    once back in April
  • 6:56 - 6:59
    and, again, very recently in October.
  • 7:00 - 7:05
    And the yellow dots
    are just the April coverage,
  • 7:05 - 7:11
    and the purple ones are the new ones
    that appeared in the October results.
  • 7:12 - 7:17
    This shows that a lot of new GLAMs
    appeared in that couple of months,
  • 7:17 - 7:19
    the six months period.
  • 7:19 - 7:24
    Some of this increase,
    not all of it but some of it,
  • 7:24 - 7:27
    is thanks to our project.
  • 7:27 - 7:33
    And that's specifically
    the U.S. and Africa coverage.
  • 7:35 - 7:37
    So how has this come about?
  • 7:38 - 7:41
    A very large part of our work
  • 7:41 - 7:46
    is working with existing
    published datasets of GLAMs
  • 7:46 - 7:47
    in different countries.
  • 7:48 - 7:50
    And there are lots of them
    in different formats
  • 7:50 - 7:56
    and published by various
    regional or national entities.
  • 7:56 - 7:58
    For example, in the U.S.
  • 7:58 - 8:02
    we have the Institute of Museum
    and Library Services,
  • 8:03 - 8:08
    which publishes the results
    of its annual Public Library Survey.
  • 8:08 - 8:12
    It contains data about the country's
    public library systems
  • 8:12 - 8:14
    and public library outlets
  • 8:14 - 8:16
    that are organized in those systems.
  • 8:17 - 8:23
    And since this data is produced
    by a U.S. government agency,
  • 8:23 - 8:27
    then it's in public domain and we know
    we can use it on Wikidata.
  • 8:27 - 8:31
    So it was possible for us
    to import that data.
  • 8:33 - 8:36
    Another example of data
    that we could import
  • 8:37 - 8:43
    is data from the Advancing
    Library Visibility in Africa
    project,
  • 8:43 - 8:47
    which is carried out
    by the University of Washington.
  • 8:48 - 8:50
    And I think this project
    is very interesting
  • 8:50 - 8:53
    because they have quite a similar goal.
  • 8:53 - 8:57
    They collect data about libraries
    in several African countries,
  • 8:58 - 9:00
    and this includes basic information,
  • 9:00 - 9:04
    such as the name of the library,
    where it's located
  • 9:04 - 9:07
    and its geographical coordinates.
  • 9:08 - 9:13
    And the researchers
    licensed their data as CC0
  • 9:14 - 9:19
    to ensure that really it can be reused
    as widely as possible
  • 9:19 - 9:21
    and reach as many people as possible.
  • 9:23 - 9:25
    So because of that we could add
  • 9:25 - 9:29
    a couple of hundred
    new Wikidata items for libraries
  • 9:29 - 9:34
    on a continent that had
    an extremely poor coverage
  • 9:34 - 9:36
    of libraries and GLAMs in general.
  • 9:39 - 9:41
    These are the positive examples
  • 9:41 - 9:46
    but the truth is that most
    of the datasets we found
  • 9:47 - 9:48
    through our research
  • 9:49 - 9:52
    could not be imported into Wikidata.
  • 9:52 - 9:58
    We decided to only work with datasets
    that are explicitly licensed
  • 9:58 - 10:01
    in a way that is compatible with Wikidata,
  • 10:01 - 10:03
    such as public domain or CC0.
  • 10:05 - 10:08
    But we believe that
    even if a dataset is copyrighted,
  • 10:08 - 10:11
    it's still valuable,
  • 10:11 - 10:14
    it's still good
    for the Wikimedia community
  • 10:14 - 10:16
    to know that it exists.
  • 10:16 - 10:20
    And that's why one of the goals
    of the FindingGLAMs project
  • 10:20 - 10:23
    is collating a master list
  • 10:23 - 10:28
    of datasets of GLAM institutions
  • 10:28 - 10:33
    which you can find on Meta,
    and if you know of a dataset
  • 10:33 - 10:35
    from your country or region
    that is not there,
  • 10:36 - 10:38
    you are very welcome to add it there.
  • 10:39 - 10:42
    Simply knowing that it exists
    is very valuable.
  • 10:42 - 10:46
    For example it can be used as a source
    to write Wikipedia articles,
  • 10:47 - 10:53
    and it provides an entry point
    to possibly discuss with the publisher
  • 10:53 - 10:57
    whether it would be possible to release it
    under an open license.
  • 10:59 - 11:02
    And here is an example.
  • 11:02 - 11:05
    When researching
    GLAM data around the world,
  • 11:05 - 11:09
    we found that Archives Portal Europe
  • 11:09 - 11:14
    has a directory
    of about 2,000 archival institutions
  • 11:14 - 11:17
    all around Europe,
    including very valuable data
  • 11:17 - 11:20
    like their coordinates,
    addresses and collection sizes.
  • 11:20 - 11:22
    When we reached out to them
  • 11:22 - 11:26
    about potentially ingesting
    that dataset on Wikidata,
  • 11:26 - 11:27
    they first said that,
  • 11:27 - 11:30
    well, it's copyrighted,
    it's not open data.
  • 11:30 - 11:34
    But we told them
    about Wikidata and open data,
  • 11:34 - 11:36
    and about our project,
  • 11:36 - 11:38
    and they were actually very positive,
  • 11:38 - 11:43
    and they promised to take steps
    to release their dataset
  • 11:43 - 11:45
    under a CC0 license.
  • 11:45 - 11:49
    We are having a very positive dialog
    around that right now,
  • 11:49 - 11:55
    and we really hope
    that this will become reality very soon.
  • 11:59 - 12:03
    Now, using official datasets
    is only one way
  • 12:03 - 12:08
    in which we can improve the coverage
    of GLAM institutions on Wikidata.
  • 12:09 - 12:12
    Another one that we are
    developing in our project
  • 12:12 - 12:17
    is making it easier for people,
    for new users,
  • 12:17 - 12:20
    especially users within GLAM institutions,
  • 12:20 - 12:23
    to add new information to Wikidata.
  • 12:24 - 12:29
    We would love to see
    more GLAM professionals edit Wikidata
  • 12:29 - 12:32
    and add information
    about GLAM institutions.
  • 12:32 - 12:36
    And many of them
    have heard about Wikidata.
  • 12:36 - 12:39
    Many of them are curious
    about how they can use it
  • 12:39 - 12:41
    in their everyday work.
  • 12:41 - 12:46
    And editing Wikidata
    is a great way to learn that.
  • 12:49 - 12:53
    But anyone who has tried to introduce
  • 12:53 - 12:56
    non-Wikimedians to Wikidata
  • 12:56 - 12:59
    knows that there is
    quite a bit of a threshold
  • 13:00 - 13:04
    before they can start making useful edits,
  • 13:04 - 13:09
    like even if you don't want to delve
    into the technical details
  • 13:09 - 13:14
    and the data modeling,
    there is still a lot to learn
  • 13:14 - 13:18
    about how the data
    in Wikidata is structured,
  • 13:18 - 13:22
    how to find whatever that interests you,
  • 13:22 - 13:25
    for example how to find
    all libraries in a city.
  • 13:26 - 13:28
    And, most importantly,
  • 13:28 - 13:33
    how this particular sort of stuff
    normally is modeled
  • 13:33 - 13:37
    because there are a lot of properties
    and lots of good practice
  • 13:37 - 13:44
    but for a new user
    who just created their first blank item,
  • 13:44 - 13:47
    this is very, very hard and complex,
  • 13:47 - 13:51
    and it's not clear where to find
    this information and whom to ask.
  • 13:52 - 13:56
    So they often need
    personal guidance to learn like
  • 13:56 - 13:59
    what sort of properties
    you add to an item
  • 13:59 - 14:02
    that represents a museum or a library.
  • 14:05 - 14:08
    So that's why we are developing software
  • 14:08 - 14:11
    to make it easier for absolute beginners
  • 14:11 - 14:16
    in GLAM institutions
    to make their first Wikidata edits.
  • 14:17 - 14:20
    This app is called Monumental.
  • 14:20 - 14:24
    We developed a first version
    that was released earlier this year
  • 14:24 - 14:27
    and based on the feedback we received.
  • 14:27 - 14:30
    We are working
    on a new improved version
  • 14:30 - 14:32
    that should be online soon.
  • 14:32 - 14:37
    I don't think the usable version
    is released already.
  • 14:38 - 14:40
    And the aim of Monumental
  • 14:40 - 14:46
    is to make searching, viewing
    and editing GLAM items easier.
  • 14:47 - 14:53
    For example the user can see
    all the GLAM institutions around them
  • 14:53 - 14:55
    or in a selected location
  • 14:55 - 14:58
    without having to write SPARQL queries.
  • 15:03 - 15:05
    And when it comes to editing,
  • 15:06 - 15:08
    we think this is especially important:
  • 15:08 - 15:13
    The user does not have to guess
    which properties to use.
  • 15:14 - 15:18
    The interface displays
    a selection of properties
  • 15:18 - 15:21
    that are normally used with GLAM items,
  • 15:22 - 15:28
    and the user can see
    that even if there are no values
  • 15:29 - 15:31
    for those properties,
  • 15:32 - 15:35
    then the user knows
    that this information is missing,
  • 15:35 - 15:38
    and this is what they can add.
  • 15:38 - 15:42
    And that way they can make
    valuable contributions
  • 15:42 - 15:47
    within this very small limited model,
    which reduces confusion
  • 15:47 - 15:51
    and reduces the fear
    of doing something wrong.
  • 15:54 - 15:56
    And we want to promote this tool
  • 15:56 - 15:59
    and get as many people
    as possible involved.
  • 15:59 - 16:02
    So we are planning
    a global crowdsourcing campaign
  • 16:02 - 16:05
    targeting primarily GLAM staff.
  • 16:06 - 16:09
    That is the people who have
    the up-to-date relevant knowledge,
  • 16:11 - 16:15
    hoping that we can make it
    as easy as possible for them
  • 16:15 - 16:17
    to start editing.
  • 16:18 - 16:23
    And we will be asking people
    in the global Wikimedia community
  • 16:23 - 16:27
    to help out and spread the word
    about the campaign
  • 16:27 - 16:29
    in their local communities.
  • 16:30 - 16:35
    We imagine that there are
    people around the world
  • 16:35 - 16:38
    who have contact with GLAM institutions
  • 16:38 - 16:41
    and who have the knowledge
    about their local Wikimedia communities
  • 16:41 - 16:46
    that can help out for example
    if translating informational material
  • 16:46 - 16:51
    or with organizing edit-a-thons
    for GLAM professionals.
  • 16:53 - 16:56
    And you can find
    more information about that
  • 16:56 - 16:59
    on the Finding GLAMs site on Meta.
  • 17:04 - 17:06
    And we think that what we are doing
  • 17:06 - 17:09
    is obviously a great goal to pursue
  • 17:09 - 17:13
    because Wikidata is a powerful platform
  • 17:13 - 17:16
    which enables and encourages re-use.
  • 17:16 - 17:19
    So the data that we collect will benefit
  • 17:19 - 17:22
    not only us in the Wikimedia community
  • 17:22 - 17:26
    but also everyone who is interested
    in where there are
  • 17:26 - 17:29
    cultural heritage institutions
    in the world.
  • 17:30 - 17:33
    People such as researchers, journalists
  • 17:34 - 17:37
    will be able to see
    and re-use that information.
  • 17:38 - 17:42
    The data will also be available
    to search engines,
  • 17:42 - 17:47
    which can be especially interesting
    for smaller GLAMs
  • 17:47 - 17:51
    that do not have
    a strong presence on the Web.
  • 17:51 - 17:54
    In Wikidata they can become visible.
  • 17:57 - 17:59
    And for us in the Wikimedia project
  • 17:59 - 18:03
    getting more GLAMs on Wikidata,
  • 18:03 - 18:07
    it will enable us
    to better work with them,
  • 18:07 - 18:09
    for example on Wikipedia.
  • 18:10 - 18:15
    It would provide material
    for example for writing articles
  • 18:16 - 18:22
    and it can benefit especially
    Wikipedias in languages
  • 18:22 - 18:24
    with smaller communities
  • 18:25 - 18:30
    which rely more and more
    on data from Wikidata
  • 18:30 - 18:33
    in order to create and enrich articles.
  • 18:34 - 18:39
    And this will bring
    more information about GLAMs
  • 18:40 - 18:44
    to Wikipedias that have
    smaller communities
  • 18:44 - 18:48
    and not enough resources
    to write and maintain those articles.
  • 18:51 - 18:53
    And, as I said, you can find
  • 18:53 - 18:57
    more information
    about our project in general
  • 18:57 - 18:59
    and about the campaign
    that we are planning
  • 19:00 - 19:02
    on Meta.
  • 19:04 - 19:05
    Thank you.
  • 19:06 - 19:09
    (applause)
  • 19:13 - 19:15
    (chairman) Thank you very much.
  • 19:15 - 19:19
    Does somebody have any questions? Yeah.
  • 19:19 - 19:20
    (Alicia) Yes.
  • 19:22 - 19:23
    (man 1) Thank you.
  • 19:24 - 19:26
    Just kind of looking
  • 19:26 - 19:29
    at the the number
    of libraries you've gotten,
  • 19:29 - 19:32
    there are more libraries
    than are galleries.
  • 19:32 - 19:34
    I'm going to make
    the wild assumption that there are.
  • 19:34 - 19:36
    (Alicia) On Wikidata or in general?
  • 19:36 - 19:38
    - (man 1) In the world.
    - (Alicia) In the world?
  • 19:38 - 19:42
    Oh yeah. That's also a very...
    it's a bit of a weird number from IFLA.
  • 19:42 - 19:47
    If you got their website, they have
    like "a map of coverage of libraries,"
  • 19:48 - 19:51
    and I do think
    that most of those 2.5 million,
  • 19:51 - 19:52
    they are school libraries.
  • 19:52 - 19:53
    (man 1) Right.
  • 19:53 - 19:56
    (Alicia) They do show
    like how, what types...
  • 19:56 - 19:59
    There are significantly more
    school libraries than public libraries.
  • 19:59 - 20:01
    (man 1) But I was thinking
  • 20:01 - 20:03
    because there are
    so many more libraries
  • 20:03 - 20:04
    than there are, say, art galleries...
  • 20:04 - 20:05
    Yes.
  • 20:05 - 20:08
    (man 1) Like why did you
    limit the project to...?
  • 20:08 - 20:10
    So why is it not a libraries project,
  • 20:10 - 20:12
    for a start, or why is it a GLAM project
  • 20:12 - 20:14
    and where does GLAM end because--
  • 20:14 - 20:15
    Oh yeah.
  • 20:15 - 20:18
    (man 1) So you've got galleries,
    archives and museums, and stuff,
  • 20:18 - 20:21
    but in terms of what we consider GLAM,
  • 20:21 - 20:23
    like it's a bit more expansive
    than that,
  • 20:23 - 20:27
    so is there kind of...
    would there be any ambition
  • 20:27 - 20:30
    to perhaps, add another letter
    to the acronym or something?
  • 20:30 - 20:34
    Well, we are not not working
    with expanding the acronym
  • 20:34 - 20:37
    like we are officially sticking
    to the acronym
  • 20:37 - 20:39
    because, well, that's the easiest.
  • 20:40 - 20:43
    We got financing for that,
  • 20:43 - 20:45
    and this is the easiest way
    of explaining this
  • 20:45 - 20:48
    to non-Wikimedians, honestly.
  • 20:49 - 20:53
    But we also use the term
    cultural heritage institution.
  • 20:54 - 20:55
    And that is a bit broader.
  • 20:56 - 20:58
    We use that in communication as well.
  • 20:58 - 21:00
    For example we ingested some data
  • 21:00 - 21:05
    about Swedish local cultural
    heritage associations
  • 21:05 - 21:07
    that are technically...
  • 21:07 - 21:11
    well, they are kind of like
    archives/museums/associations
  • 21:11 - 21:13
    of people working in cultural heritage.
  • 21:13 - 21:17
    And we decided that yes,
    this comes under our umbrella.
  • 21:20 - 21:22
    [chairman] Can you pass him
    the microphone?
  • 21:22 - 21:24
    (man 2) Hello.
  • 21:24 - 21:28
    As I come from a community
    where we have the technical excuse
  • 21:28 - 21:31
    but not like this organizing...
  • 21:31 - 21:36
    Do you have any support
    from like your international team?
  • 21:36 - 21:41
    Would you like materials
    or some press releases
  • 21:41 - 21:45
    or letters to send through some contacts
  • 21:45 - 21:49
    to engage more local institutions?
  • 21:49 - 21:51
    They can join us
  • 21:51 - 21:56
    but we can actually make
    all technical stuff ourselves.
  • 21:57 - 21:58
    Yeah, this is a great question,
  • 21:58 - 22:03
    and like if you check out
    the FindingGLAMs page on Meta,
  • 22:03 - 22:06
    there is a section with information
    for regional coordinators
  • 22:06 - 22:12
    where the point we are hoping to develop
    like an information toolkit
  • 22:12 - 22:16
    that people can translate
    into their languages and use
  • 22:17 - 22:23
    and something that we would really like
    to see from other local communities
  • 22:23 - 22:26
    is communication
    about the datasets they have
  • 22:26 - 22:31
    and the institutions that can help us out.
  • 22:33 - 22:35
    (chairman) Anybody else?
  • 22:37 - 22:42
    (woman) Well, I'm just thinking
    because I come from the library world
  • 22:42 - 22:46
    and of course the IFLA statistics,
  • 22:46 - 22:49
    they come
    from the different IFLA sections
  • 22:49 - 22:52
    that feed up information to IFLA.
  • 22:52 - 22:57
    But, anyway, in our respective countries
  • 22:58 - 23:03
    we are interested
    in establishing repositories
  • 23:03 - 23:09
    about up-to-date data
    about libraries and other GLAMs
  • 23:09 - 23:15
    to end... well,
    the issues are of different kinds.
  • 23:15 - 23:18
    There is gathering data, first of all,
  • 23:18 - 23:21
    keeping them up-to-date,
    normalizing those data
  • 23:21 - 23:25
    because it goes
    to the acronyms and things like that.
  • 23:25 - 23:30
    So then it's also another issue--
  • 23:31 - 23:36
    the difference between the institution
    as corporate body inside
  • 23:36 - 23:40
    that occupies
    or where the collections are.
  • 23:40 - 23:42
    And just as an additional information,
  • 23:42 - 23:47
    you mentioned the Archives Portal Europe,
    but Europeana is doing also
  • 23:48 - 23:51
    work on this,
    that identifying their providers.
  • 23:51 - 23:53
    - Exactly.
    - So they have more
  • 23:53 - 23:58
    than 3,000 data providers,
  • 23:58 - 24:00
    and they at some point, at the beginning,
  • 24:00 - 24:05
    they didn't deal with the identification
    of the data providers.
  • 24:05 - 24:11
    But the name that each
    of the data providers gave for itself
  • 24:11 - 24:13
    or that Europeana
  • 24:14 - 24:18
    associated the provider to the collection
    that was given to it
  • 24:18 - 24:22
    was in different forms.
  • 24:22 - 24:26
    At some point they found the need
    to normalize it,
  • 24:26 - 24:29
    and so they get one
    back to the data providers
  • 24:29 - 24:33
    asking them for the name and variant name
  • 24:33 - 24:39
    so that it's disambiguation issues,
    identification issues
  • 24:39 - 24:43
    so it's not a trivial problem.
  • 24:43 - 24:47
    And I'm bringing
    that my experience from France
  • 24:47 - 24:53
    that we have at least three repositories,
    three places
  • 24:53 - 24:56
    because I saw the link to the French
  • 24:56 - 25:01
    and I was wondering
    where did you get data from.
  • 25:01 - 25:04
    There is an observatory of public lecture
  • 25:04 - 25:07
    held by the Ministry of Culture--
  • 25:07 - 25:09
    I have not yet seen it
  • 25:09 - 25:11
    There is another repository
  • 25:11 - 25:17
    held by the Union Catalog of France
  • 25:17 - 25:23
    that identifies all the libraries
    that hold collections with information
  • 25:23 - 25:28
    extended beyond the identification
    of the institution as such
  • 25:28 - 25:33
    but with opening hours
    and the collections that they hold.
  • 25:33 - 25:34
    And there is another--
  • 25:34 - 25:36
    (chairman) Quick question.
  • 25:36 - 25:39
    (woman) Well, I'm just wondering about,
  • 25:40 - 25:44
    first of all, the best data model
  • 25:44 - 25:49
    to identify these institutions,
  • 25:50 - 25:55
    and the best way to communicate
  • 25:55 - 25:59
    with the communities
    that do have those data
  • 25:59 - 26:02
    and keep them up-to-date
  • 26:02 - 26:05
    so that there is no gap
    between what you have
  • 26:05 - 26:10
    and where the information routes from.
  • 26:10 - 26:11
    (Alicia) Thank you.
  • 26:11 - 26:13
    Data modeling that you mentioned,
  • 26:13 - 26:15
    the distinction between institution
    and physical place,
  • 26:15 - 26:19
    this is a discussion that we have taken up
  • 26:19 - 26:22
    in the community discussion on Wikidata.
  • 26:22 - 26:25
    This is something that is definitely
    within the interest of the community.
  • 26:25 - 26:27
    It is being actively discussed.
  • 26:28 - 26:29
    (woman) Thank you.
  • 26:29 - 26:31
    Yes.
  • 26:31 - 26:35
    (chairman) One last very short question
    before we have to end here?
  • 26:37 - 26:38
    (man 3) Excuse me.
  • 26:39 - 26:42
    Museums and archives
    exist on a continuum--
  • 26:42 - 26:44
    at the big end you have
    the British Museum
  • 26:44 - 26:45
    and the Hermitage, and so on,
  • 26:45 - 26:46
    but at the other end
  • 26:46 - 26:49
    you have a garden shed
    with a few objects in
  • 26:49 - 26:53
    or you have glass cases
    in the foyer of a company
  • 26:53 - 26:55
    with some historical artifacts.
  • 26:55 - 26:58
    Have you a cut off point?
    And if so, where is it?
  • 26:58 - 27:01
    We don't have so much free available data
  • 27:01 - 27:04
    that we really need
    to have a cut off point.
  • 27:04 - 27:07
    It's rather the exact opposite.
  • 27:07 - 27:11
    If we get a nice, beautiful free dataset
  • 27:11 - 27:15
    that has this sort of
    very, very granular distinction,
  • 27:15 - 27:16
    then we will worry about it,
  • 27:16 - 27:20
    but it's not a problem
    that we have encountered yet practically.
  • 27:21 - 27:23
    (chairman) Okay, thanks a lot
    for your questions
  • 27:23 - 27:25
    and your reactions and answers.
  • 27:25 - 27:26
    We have to end here
  • 27:26 - 27:32
    and invite you to the classroom
    next to this one for the final session.
  • 27:32 - 27:33
    Thank you very much.
  • 27:33 - 27:35
    (applause)
Title:
cdn.media.ccc.de/.../wikidatacon2019-1037-eng-All_the_worlds_GLAM_institutions_-_and_how_we_tried_to_find_them_hd.mp4
Video Language:
English
Duration:
27:41

English subtitles

Revisions