< Return to Video

cdn.media.ccc.de/.../wikidatacon2019-1048-eng-Wikidata_for_Emerging_Communities_knowledge_gaps_hd.mp4

  • 0:06 - 0:09
    Hi, I am Satdeep. I work
    with the Foundation in Ben's team.
  • 0:10 - 0:13
    Here's my friend from India, Bodhi.
  • 0:13 - 0:15
    He's working with the Centre
    for Internet and Society,
  • 0:15 - 0:19
    but he's here in his volunteer capacity.
  • 0:20 - 0:25
    So, we're going to talk about
    knowledge gaps and Wikidata today.
  • 0:25 - 0:27
    So what are knowledge gaps?
  • 0:28 - 0:32
    As the name suggests, it's a gap
    in our existent knowledge.
  • 0:32 - 0:36
    But in terms of Wikidata,
    we're looking at knowledge gaps
  • 0:36 - 0:38
    in two different aspects.
  • 0:38 - 0:43
    One is, how can Wikidata help us
    in filling the knowledge gaps
  • 0:43 - 0:45
    in other Wikimedia projects?
  • 0:45 - 0:50
    And the second is, how do we fill
    the knowledge gaps within Wikidata?
  • 0:52 - 0:57
    For the first one,
    "Filling knowledge gaps with Wikidata."
  • 0:57 - 0:59
    Wikidata is helping in a number of ways
  • 0:59 - 1:01
    in filling knowledge gaps
    on different Wikimedia projects,
  • 1:01 - 1:03
    for example, ArticlePlaceholder,
  • 1:03 - 1:06
    or another tool called Scribe
    is being built,
  • 1:06 - 1:09
    Wikidata Infoboxes, all of them are--
  • 1:09 - 1:10
    (audience reacts)
  • 1:10 - 1:14
    Yes, there was a session about it
    early this morning or in the afternoon.
  • 1:15 - 1:19
    And there are also a lot
    of different templates
  • 1:20 - 1:22
    which use Wikidata.
  • 1:22 - 1:28
    And then there are new templates
    called [inaudible],
  • 1:29 - 1:34
    which along with this here
    are used to make lists like these.
  • 1:34 - 1:37
    And if you click on one
    of the topics on this list,
  • 1:38 - 1:40
    you get this draft article.
  • 1:40 - 1:43
    There was a presentation about this
    in this same room by [inaudible].
  • 1:43 - 1:47
    So you get a draft article
    with some sentences
  • 1:47 - 1:50
    and the infoboxes from Wikidata.
  • 1:51 - 1:55
    But this is not what we're going
    to talk about here today.
  • 1:56 - 1:59
    We're going to talk about how, in India,
  • 1:59 - 2:03
    we first have to fill
    the knowledge gaps within Wikidata,
  • 2:03 - 2:06
    so then we can do
    all these amazing things.
  • 2:06 - 2:09
    So there are knowledge gaps
    in localization.
  • 2:09 - 2:12
    We need to add a lot more labels
    in different languages.
  • 2:13 - 2:16
    There needs to build local data
    about local places, people,
  • 2:16 - 2:19
    so that we can do
    all those awesome things.
  • 2:19 - 2:23
    But the main aspect of there
    is to build community capacity
  • 2:23 - 2:25
    to do all that stuff.
  • 2:25 - 2:29
    So, that's where we come
    to The Indic Case Study,
  • 2:29 - 2:31
    which this is all about.
  • 2:31 - 2:33
    And how did it all start?
  • 2:34 - 2:37
    There is a person
    sitting right there, Asaf.
  • 2:37 - 2:40
    He is responsible for all this--
  • 2:40 - 2:43
    for bringing Wikidata to India.
  • 2:44 - 2:48
    So there was the first community
    capacity development training
  • 2:49 - 2:53
    with the Tamil community in 2016,
    where he introduced Wikidata.
  • 2:53 - 2:58
    And then there was like a bunch
    of Wikidatans, super users
  • 2:58 - 3:00
    who started contributing to Wikidata.
  • 3:00 - 3:05
    And then, in 2017, on both our requests,
  • 3:05 - 3:08
    he came to India again and did
  • 3:08 - 3:10
    (laughs) Wiki-a-Tra--
  • 3:10 - 3:13
    it's like Wiki travel in India.
  • 3:13 - 3:16
    He did that, he went
    to seven different cities,
  • 3:16 - 3:19
    seven different communities
    at least, in India,
  • 3:19 - 3:23
    where he did Wikidata workshops,
  • 3:23 - 3:26
    mostly two-days workshops
    in all those places.
  • 3:26 - 3:30
    And then, in 2018, again,
    an Advanced Wikidata workshop.
  • 3:30 - 3:35
    And that has actually helped in building
    some sort of Wikidata community
  • 3:35 - 3:37
    around India.
  • 3:40 - 3:42
    That also got the community engaged,
  • 3:42 - 3:45
    and then we started
    building WikiProject India,
  • 3:45 - 3:47
    and then some other projects
    related to that,
  • 3:47 - 3:51
    such as WikiProject West Bengal,
    Indian Railways, and Kerala,
  • 3:51 - 3:54
    which are like some
    specifics regions in India
  • 3:54 - 3:56
    where the community has been trying
    to engage themselves
  • 3:56 - 3:58
    and doing some work around it.
  • 3:58 - 4:03
    And then there have been
    some more initiatives to engage newbies
  • 4:03 - 4:08
    such as edit-a-thons,
    or labelathons, datathons,
  • 4:09 - 4:12
    with which we've been trying
    to get more and more people involved.
  • 4:12 - 4:15
    And some initiatives around education,
  • 4:15 - 4:18
    workshops in education institutions--
    Asaf also did one of those.
  • 4:21 - 4:22
    Yeah. Next, Bodhi.
  • 4:23 - 4:26
    So, there have been
    so many workshops in India,
  • 4:26 - 4:30
    throughout all of India
    from 2017 to 2019.
  • 4:30 - 4:33
    And we're also trying to engage,
    as Satdeep said,
  • 4:33 - 4:35
    we are trying to engage
    the newbies in different ways.
  • 4:35 - 4:41
    But still, the number of power users
    are not very much in India.
  • 4:42 - 4:48
    Only very few, maybe five or six people
    are doing the heavy-duty work.
  • 4:49 - 4:52
    So one of the reasons for that:
  • 4:53 - 4:59
    mostly the Wikimedia community
    is focused in India on other projects,
  • 4:59 - 5:04
    mostly in Wikipedia and somehow,
    right now, in Wikisource.
  • 5:04 - 5:08
    So, there are very few editors who are--
  • 5:08 - 5:10
    very few active editors
  • 5:10 - 5:12
    who are contributing
    to Wikidata regularly.
  • 5:13 - 5:16
    India is a multilingual country,
  • 5:16 - 5:19
    so there are around
    22 Wikimedia projects
  • 5:19 - 5:20
    running in India.
  • 5:20 - 5:23
    So the workforces are totally divided.
  • 5:24 - 5:29
    So, we don't have
    a focused group of people
  • 5:29 - 5:31
    who are working
    on specific areas of Wikidata
  • 5:31 - 5:35
    because they are so much divided
    into different projects,
  • 5:35 - 5:39
    that we have to engage--
    we're trying to actively engage them
  • 5:39 - 5:40
    in different ways.
  • 5:40 - 5:42
    And they are spread over a vast region,
  • 5:42 - 5:44
    India is the seventh
    largest country in the world,
  • 5:44 - 5:48
    and so it's quite difficult to coordinate
    the intercommunity,
  • 5:48 - 5:54
    the 22 languages communities
    to work on only one project.
  • 5:56 - 6:00
    So, we have adopted a different approach.
  • 6:00 - 6:03
    Firstly, we're targeting the data gaps,
  • 6:03 - 6:07
    which is easy because
    there are huge data gaps in India
  • 6:07 - 6:10
    on every topic, almost every topic.
  • 6:10 - 6:11
    And...
  • 6:13 - 6:14
    (chuckles)
  • 6:14 - 6:16
    ...start locally.
  • 6:16 - 6:18
    Sorry. (laughs)
  • 6:18 - 6:22
    - So, it's 1, 1, 1--
    - Everything is a priority!
  • 6:22 - 6:24
    (laughter)
  • 6:25 - 6:28
    Anyway. So we start locally.
  • 6:29 - 6:35
    So we have thought that intercountry--
  • 6:35 - 6:38
    the data ingestion
    of intercountries is quite difficult.
  • 6:38 - 6:41
    And there are huge databases for India,
  • 6:41 - 6:45
    for example, the science databases,
    the election databases.
  • 6:45 - 6:49
    And if we work on the intercountry,
  • 6:49 - 6:54
    then it'd be really impossible
    for five or six heavy-duty users.
  • 6:54 - 6:58
    So we target one place at a time.
  • 6:58 - 7:02
    So that is the map of India,
  • 7:02 - 7:06
    and you can see the bright pink color
    that is West Bengal.
  • 7:06 - 7:12
    So in October 2018 to May 2019,
    many things happened there.
  • 7:12 - 7:17
    So lots of data
    were ingested in that part.
  • 7:17 - 7:21
    And after this map was generated,
  • 7:21 - 7:24
    there is a tool for that
    called Wikidata Analysis--
  • 7:24 - 7:26
    built by [inaudible], user: [inaudible].
  • 7:26 - 7:32
    And after we got this map,
  • 7:33 - 7:36
    we shared this with other communities.
  • 7:36 - 7:40
    That "We have done this for West Bengal,
    you can do it for your country.
  • 7:40 - 7:41
    And this is really cool."
  • 7:41 - 7:44
    And people have started working--
  • 7:45 - 7:46
    that was a direct effect.
  • 7:46 - 7:51
    WikiProject Kerala was built
    just at that time,
  • 7:51 - 7:55
    and they started working
    on the schools of India--
  • 7:55 - 7:58
    schools of Kerala--
    and Kerala is situated right here--
  • 7:58 - 8:01
    and I couldn't [locate]
    that in the map right now
  • 8:01 - 8:06
    because the tool is right now down.
  • 8:07 - 8:10
    So we just started locally.
  • 8:11 - 8:15
    We're trying to inspire people
    from other parts of the country
  • 8:15 - 8:16
    to contribute.
  • 8:17 - 8:20
    And that's what happened in West Bengal,
  • 8:20 - 8:25
    around 40,000 villages
    with 2001 and 2011 census.
  • 8:25 - 8:29
    Our data was ingested--
    that's complete data.
  • 8:29 - 8:31
    Almost complete data
  • 8:31 - 8:34
    which could have been
    ingested in Wikidata.
  • 8:34 - 8:39
    And there were 11,000
    government hospitals with coordinates
  • 8:39 - 8:40
    which were ingested,
  • 8:40 - 8:45
    and there was [inaudible] approach
    to close to 1 million Bengali labels.
  • 8:46 - 8:47
    And so on.
  • 8:47 - 8:51
    There were many things happening,
    but these were the things
  • 8:51 - 8:54
    which we've done
    in West Bengal at that time.
  • 8:54 - 8:58
    So we also tried
    to create cool visualizations
  • 8:58 - 9:00
    from those works we've done
  • 9:00 - 9:03
    because census and elections,
    these are boring data.
  • 9:03 - 9:07
    These are not paintings,
    and also so we cannot--
  • 9:07 - 9:11
    like these are also not GLAM data
    and other things.
  • 9:11 - 9:13
    So these are boring data.
  • 9:13 - 9:19
    So we need to find some way
    to make it interesting for people.
  • 9:19 - 9:23
    So, we have tried some cool queries.
  • 9:23 - 9:25
    This is one of them.
    There are many others.
  • 9:25 - 9:27
    So this is the population growth
    in West Bengal
  • 9:27 - 9:32
    between our villages--
    around 36,000 villages
  • 9:32 - 9:34
    between 2001 and 2011.
  • 9:35 - 9:38
    And not only villages,
    we have uploaded census data
  • 9:38 - 9:43
    about every administrative hierarchy,
  • 9:43 - 9:48
    like community developing blocks,
    districts, municipalities, wards, etc.,
  • 9:48 - 9:50
    cities, towns.
  • 9:52 - 9:57
    This is a new tool, InteGraality,
  • 9:58 - 10:00
    and you can see
  • 10:00 - 10:05
    that this is a count of hospitals
  • 10:07 - 10:08
    in the world,
  • 10:08 - 10:11
    and India is right now
    leading in Wikidata--
  • 10:11 - 10:16
    13,466 hospitals.
  • 10:18 - 10:21
    The blue colors are the data completeness.
  • 10:22 - 10:30
    But the funny thing is--
    it's only one area of India.
  • 10:30 - 10:31
    It's West Bengal,
  • 10:31 - 10:34
    there are 11,642 hospitals right now.
  • 10:34 - 10:38
    So if we complete all these steps
    and there are more--
  • 10:40 - 10:42
    if we complete all those steps,
  • 10:42 - 10:45
    there will be a huge amount
    of data about hospitals
  • 10:45 - 10:48
    with coordinates
    which will be there in Wikidata,
  • 10:48 - 10:55
    and we have a plan to build an app
    based on that data,
  • 10:56 - 10:59
    so that when a person gets ill,
  • 11:03 - 11:05
    using that app, he may find
  • 11:05 - 11:09
    the nearest location of the hospitals.
  • 11:15 - 11:19
    So these hospitals are ranging
    from Primary Health Centers
  • 11:19 - 11:21
    to [inaudible] Health Cares,
  • 11:22 - 11:27
    with all sorts of facilities
    available for them.
  • 11:27 - 11:31
    So we've tried to ingest
    all those data in Wikidata,
  • 11:31 - 11:33
    if possible.
  • 11:33 - 11:37
    And after completing this task,
    if we build some app,
  • 11:37 - 11:42
    then maybe someone,
    a sick person in a dying urgency
  • 11:42 - 11:45
    can find the nearest government hospital.
  • 11:49 - 11:52
    - This is another--
    - (Satdeep) Go back.
  • 11:52 - 11:54
    (Bodhi) Oh, sorry.
  • 11:55 - 12:02
    Okay. So this is the work
    which was done for Indian Railways.
  • 12:02 - 12:04
    It was started there,
    also from West Bengal.
  • 12:05 - 12:09
    And you can check the color--
  • 12:09 - 12:11
    the blue color is more complete data
  • 12:11 - 12:15
    and the green color
    is slightly not complete,
  • 12:15 - 12:18
    but it's going to get completed soon.
  • 12:19 - 12:23
    And there are right now,
    9,000 Indian railway stations
  • 12:23 - 12:26
    with coordinates, obviously,
    because they are on the map.
  • 12:26 - 12:30
    Right now, they're being connected
    with Pakistan and Bangladesh railways.
  • 12:30 - 12:35
    So we have a plan to connect
    all Asian railways one day--
  • 12:35 - 12:36
    someday, maybe.
  • 12:36 - 12:37
    (laughs)
  • 12:37 - 12:39
    But, yeah, we'll do it.
  • 12:39 - 12:44
    Anyway. So, right now on the table,
  • 12:44 - 12:49
    we are in the second position
    after Japan, obviously.
  • 12:50 - 12:53
    And-- yeah. So this is another cool query.
  • 12:54 - 12:59
    Visualization showing
    the flight connections--
  • 13:00 - 13:02
    international and domestic
    flight connections from India,
  • 13:02 - 13:03
    to and from India.
  • 13:03 - 13:06
    So it's like kind of messy,
    but we can filter it
  • 13:06 - 13:10
    for domestic connections
    or international connections.
  • 13:10 - 13:11
    So, anyway.
  • 13:13 - 13:15
    We have also completed
  • 13:15 - 13:18
    everything about 2014
    Indian General Election data.
  • 13:18 - 13:23
    India general election is a kind
    of complex state of data
  • 13:23 - 13:25
    because there are
    so many political parties,
  • 13:25 - 13:28
    so many election--
    not like a two party elections.
  • 13:29 - 13:32
    So there were 6,000 political parties
    which participate in Indian--
  • 13:33 - 13:36
    I think 600 or something.
  • 13:36 - 13:39
    So, anyway.
  • 13:39 - 13:40
    So, yeah.
  • 13:41 - 13:45
    And there were so many candidates,
    you can imagine.
  • 13:45 - 13:49
    And some of them have the same name.
  • 13:50 - 13:51
    Like in one constituency,
  • 13:51 - 13:53
    there was like three people
    with the same name.
  • 13:53 - 13:54
    (laughs)
  • 13:54 - 13:57
    So that was like a funny thing.
  • 13:58 - 14:04
    But we completed those data--
    uploading those data in Wikidata.
  • 14:04 - 14:08
    Right now, only 24 Indian
    general elections have been done.
  • 14:08 - 14:13
    We don't have much users in Wikidata--
    heavy-duty users in Wikidata in India.
  • 14:14 - 14:19
    So currently we're uploading
    geoshape files of the constituencies.
  • 14:19 - 14:24
    In West Bengal, we have
    already uploaded 43 constituencies,
  • 14:24 - 14:29
    geoshape files of the constituencies,
    and also the [inaudible].
  • 14:29 - 14:32
    There is another part of India
    that has not been done,
  • 14:32 - 14:34
    so when it will be completed then--
  • 14:34 - 14:35
    when it'll be--
  • 14:35 - 14:39
    when we upload other election that are--
  • 14:39 - 14:43
    like 2009 or before that,
  • 14:43 - 14:46
    we'll create cool animations.
  • 14:46 - 14:51
    That's showing how the voters
    have changed their minds
  • 14:51 - 14:55
    from like centrist to rightist
    or leftist to rightist, anyway.
  • 14:55 - 14:59
    So in the pipeline, there are schools,
  • 14:59 - 15:03
    bank branches, post offices, geoshapes,
    elections, and many more.
  • 15:04 - 15:06
    - (man 1) Cinema.
    - Cinema, yeah.
  • 15:06 - 15:07
    (laughs)
  • 15:07 - 15:09
    - Of course, cinema.
    - And monuments.
  • 15:09 - 15:11
    And monuments.
  • 15:11 - 15:15
    And most of them will be completed
    within a few months.
  • 15:16 - 15:22
    And in a not so distant future,
    we'll try to upload weather data.
  • 15:23 - 15:28
    There are not much good property
    for weather, right now, in Wikidata,
  • 15:28 - 15:31
    that's why we're not
    touching it right now,
  • 15:31 - 15:32
    but we'll do it.
  • 15:33 - 15:36
    Also bibliographical data
  • 15:36 - 15:40
    for Indian literature data
    are also very less in Wikidata.
  • 15:41 - 15:46
    And there will be
    some institutional partnerships.
  • 15:47 - 15:49
    There were some primary talks already,
  • 15:49 - 15:52
    and maybe we'll have
    some good news in the future.
  • 15:54 - 15:56
    So other ways to engage.
  • 15:56 - 16:00
    We have created
    some subpages of WikiProject India.
  • 16:00 - 16:05
    We have created a skillshare initiative--
    started a skillshare initiative
  • 16:05 - 16:09
    where people who have
    slightly more knowledge in Wikidata
  • 16:09 - 16:12
    can share something with other people,
  • 16:12 - 16:17
    on a one-to-one basis
    approaching online or offline way.
  • 16:17 - 16:20
    We have also started a newsletter,
    a quarterly newsletter,
  • 16:20 - 16:26
    the first issue has been published
    in October [2018],
  • 16:26 - 16:31
    and we are showcasing
    cool visualizations in social media
  • 16:31 - 16:34
    in Facebook and Twiter channels
    of Wikidata India, every day.
  • 16:35 - 16:38
    So these are the links.
  • 16:38 - 16:40
    You can find them there.
  • 16:42 - 16:44
    Thank you so much for the...
  • 16:45 - 16:49
    As most of you can already guess,
  • 16:50 - 16:55
    Bodhi is from that part of India,
    the West Bengal,
  • 16:55 - 16:57
    where they've done all that work.
  • 16:57 - 16:59
    (laughs)
  • 16:59 - 17:05
    So the West Bengali community in India
    has been really doing this amazing work,
  • 17:05 - 17:08
    and this needs to go
    to other parts of India
  • 17:08 - 17:11
    which need more capacity development,
  • 17:11 - 17:16
    which need more trainings,
    also more coordination in India.
  • 17:16 - 17:20
    And, okay, I would like to end this
  • 17:20 - 17:23
    with how you can help in identifying
    some of the knowledge gaps
  • 17:23 - 17:25
    and taking that conversation forward,
  • 17:25 - 17:27
    which is not directly
    related with this topic.
  • 17:27 - 17:31
    But there is a Wiki project,
    "Identifying knowledge gaps,"
  • 17:31 - 17:34
    you can join that and share your thoughts.
  • 17:34 - 17:39
    We are also trying to use--
    how can we use property P5008,
  • 17:39 - 17:43
    which is on the focus list
    for a specific project--
  • 17:43 - 17:48
    how we can use that to surface
    certain topics for contest
  • 17:48 - 17:50
    or other events.
  • 17:51 - 17:55
    And in the end, we'd like to thank you.
  • 17:55 - 18:01
    Also, we'd like to thank Asaf
    and Mahir and Tito
  • 18:01 - 18:06
    who are another
    two power users of Wikidata.
  • 18:06 - 18:08
    We'd like to sincerely thank everyone.
  • 18:08 - 18:10
    Thank you so much.
  • 18:10 - 18:12
    (applause)
  • 18:15 - 18:16
    Questions.
  • 18:17 - 18:19
    (woman 1) Mark here says, "Hi."
  • 18:19 - 18:20
    (laughs)
  • 18:20 - 18:23
    (moderator) So we have only five minutes
    for questions and answers.
  • 18:28 - 18:30
    There. There's a question there.
  • 18:30 - 18:32
    (woman 1) Do I need the microphone?
  • 18:34 - 18:36
    (woman 1) Thank you so much
    for your presentation.
  • 18:36 - 18:39
    Is this census data--
    what exactly kind of data is that,
  • 18:39 - 18:40
    that you've been ingesting?
  • 18:40 - 18:43
    It's not for individuals, is it?
  • 18:43 - 18:46
    It's more like populations
    and stuff like that?
  • 18:46 - 18:50
    It's population data, mainly.
    Demographic data.
  • 18:50 - 18:53
    (woman 1) Are there any other things
    that have been asked in the census?
  • 18:56 - 18:59
    (man 1) For village, gender--
  • 19:01 - 19:04
    (man 2) I was a little involved with that,
    so I remember what the data looks like.
  • 19:04 - 19:08
    Per settlement in India, per village town.
  • 19:08 - 19:12
    You have the total population,
    the masculine versus feminine population,
  • 19:12 - 19:16
    the literate versus illiterate population.
  • 19:16 - 19:19
    Within that, you have also
    a separation by gender,
  • 19:19 - 19:21
    so you know how many
    illiterate males there are
  • 19:21 - 19:23
    versus so many
    illiterate females there are.
  • 19:23 - 19:25
    It's actually quite detailed.
  • 19:25 - 19:29
    There are hundreds and hundreds
    of pieces of data per village.
  • 19:29 - 19:33
    Only some of them
    have been modeled on Wikidata.
  • 19:37 - 19:40
    Just, of course,
    no individual census data.
  • 19:42 - 19:44
    (woman 1) Sometimes countries get weird.
  • 19:45 - 19:49
    (woman 2) So I wanted to ask you
    about the label ingestion
  • 19:49 - 19:51
    or the translations of labels you do.
  • 19:52 - 19:54
    How did you do that? Do you use tools?
  • 19:54 - 19:57
    How do you get people to add it
    in their native language
  • 19:57 - 19:58
    and translate the labels.
  • 19:59 - 20:03
    So, mostly TABernacle,
  • 20:04 - 20:07
    and QuickStatements.
  • 20:07 - 20:08
    Those we can use, QuickStatements.
  • 20:08 - 20:10
    (woman 2) Alright. Cool.
  • 20:10 - 20:14
    But also at the same time,
    like using labelathons as an activity
  • 20:14 - 20:18
    to engage more and more people
    to do that activity.
  • 20:20 - 20:21
    Asaf.
  • 20:21 - 20:22
    The hero.
  • 20:25 - 20:26
    (Asaf) A note on TABernacle.
  • 20:26 - 20:29
    I just want to mention for anyone
    who may be not aware,
  • 20:30 - 20:33
    all of us here use Wikidata-related tools
  • 20:33 - 20:37
    which means all of us
    have used tools by Magnus,
  • 20:37 - 20:38
    the amazing tool builder.
  • 20:38 - 20:41
    I just wanted to point out
    that he's here at the conference.
  • 20:41 - 20:43
    So if you haven't had a chance yet
  • 20:43 - 20:48
    to thank him for his amazing work
    that enables so much impact--
  • 20:48 - 20:49
    do so today.
  • 20:49 - 20:52
    I'm not sure he is into hugs,
    but you can just thank him.
  • 20:52 - 20:53
    (laughs)
  • 21:05 - 21:08
    (man 3) Was the skillshare working?
  • 21:08 - 21:11
    What do you do? What are the results?
  • 21:12 - 21:14
    So, the response is [still no].
  • 21:15 - 21:22
    But, yeah. We have five or six people
    have already requested,
  • 21:22 - 21:23
    and we have completed those.
  • 21:24 - 21:26
    (Satdeep) That's going on--
  • 21:26 - 21:30
    Like, we just need to surface
    the value of Wikidata.
  • 21:30 - 21:32
    I think we haven't
    really been able to do that.
  • 21:32 - 21:35
    Also, we haven't been able
    to connect with other projects
  • 21:35 - 21:37
    that they are already doing,
  • 21:37 - 21:39
    like, for example,
    Wikisource or Wikipedia.
  • 21:39 - 21:42
    Like how we need to communicate
    that in a better way
  • 21:42 - 21:45
    to the larger community
    who is contributing.
  • 21:45 - 21:49
    It was just like getting up
    and creating a Wiki periodical.
  • 21:49 - 21:52
    Like how do we involve them
    and bring them here.
  • 21:52 - 21:54
    That's still a problem.
  • 21:54 - 21:57
    And Bodhi is showing the census data.
  • 21:57 - 21:59
    Bodhi, can you please explain?
  • 22:02 - 22:07
    (Bodhi) So this is population data
    from the 2011 census,
  • 22:07 - 22:12
    5007 in 2001 in the census data.
  • 22:13 - 22:15
    This is one village.
  • 22:15 - 22:18
    So there are like 36,000 villages
    or 40,000 villages.
  • 22:19 - 22:21
    This is the male population,
    female population,
  • 22:22 - 22:23
    number of households,
  • 22:23 - 22:28
    illiterate population with male,
    female, population qualifiers,
  • 22:29 - 22:32
    literate population
    and illiterate populations,
  • 22:32 - 22:33
    and so on.
  • 22:33 - 22:36
    And this is the census code
    for 2001 and 2011.
  • 22:40 - 22:44
    (woman 3) Okay. I just want to say
    that I loved your presentation,
  • 22:44 - 22:47
    and I wanted to talk nearly
    about the same thing tomorrow,
  • 22:47 - 22:52
    so it'll be great because tomorrow--
    I will just [stay] watch from this one,
  • 22:52 - 22:54
    so making my life easier.
  • 22:56 - 22:58
    What I wanted to do or to talk about--
  • 22:58 - 23:02
    but I think the WikiProject
    you're starting on Wikidata
  • 23:02 - 23:04
    will do that--
  • 23:04 - 23:08
    is all to engage people
    not working about India directly,
  • 23:08 - 23:13
    but like I have tools, names,
    but I don't deal with Indian names
  • 23:13 - 23:17
    because I am not sure I understand
    all there are on them,
  • 23:17 - 23:20
    and I don't want to do
    something massively wrong,
  • 23:20 - 23:21
    so better to be careful.
  • 23:21 - 23:27
    But I just need to ask with someone
    who understand all the problems,
  • 23:27 - 23:29
    and I can add an automated tool
  • 23:29 - 23:33
    and deal with thousands
    upon thousands of items.
  • 23:33 - 23:36
    And I think they are many, many tools
  • 23:36 - 23:40
    already doing some automated description
    and things like that
  • 23:40 - 23:46
    for which we don't actually
    need people every day,
  • 23:46 - 23:51
    we just need like 10 minutes time
    for someone to tell me
  • 23:51 - 23:55
    or to say family names in those languages,
  • 23:55 - 23:57
    and then it just added to the tool.
  • 23:57 - 24:02
    And you probably know
    [automated] description tool,
  • 24:03 - 24:07
    but if you just ask the people
    who are using it massively
  • 24:07 - 24:09
    to just add Indian languages,
  • 24:09 - 24:13
    then you have all Wikidatans
    doing the same work for you,
  • 24:13 - 24:15
    and actually, it is a problem.
  • 24:17 - 24:21
    I am helping an African community
    build up their Wikidata
  • 24:21 - 24:24
    in Wikipedia, so it's not
    the same problem,
  • 24:24 - 24:26
    but nearly the same problem.
  • 24:26 - 24:29
    And that's the problem we have
  • 24:29 - 24:32
    which is actually bridging the gap
  • 24:32 - 24:35
    between the biggest Wikidatans--
  • 24:36 - 24:39
    I am doing works in languages
    I don't know a word of,
  • 24:41 - 24:45
    but it's this kind of adoption system,
  • 24:45 - 24:49
    like I need a native speaker to tell me
  • 24:49 - 24:52
    what I can do with all the problems
    on all the complicated cases.
  • 24:52 - 24:56
    And everything
    that I can automate, I will automate.
  • 24:56 - 25:00
    And it's just an idea,
    but do you think it will be like
  • 25:00 - 25:05
    a good idea to create not so specific
    Wiki knowledge gap on Wikidata,
  • 25:05 - 25:09
    but a matching system
  • 25:09 - 25:16
    like, "Hey I am working on this subject,
    do you want to ask me for that?"
  • 25:16 - 25:21
    - Like, yeah, a matching tool, like to--
    - Connect people.
  • 25:21 - 25:23
    - (woman 3) To connect people
    across languages.
  • 25:24 - 25:27
    Yeah. So that was my idea
    because I think
  • 25:27 - 25:30
    some of the African communities
    I am helping,
  • 25:30 - 25:33
    would really, really love
    what you're doing,
  • 25:33 - 25:39
    but none of them speak Indian,
    and we just need to have pivot people
  • 25:40 - 25:41
    to create the link
  • 25:42 - 25:45
    and make all this even more powerful.
  • 25:45 - 25:47
    And I really, really love
    what you're doing. So thank you.
  • 25:47 - 25:49
    Thank you so much.
  • 25:49 - 25:51
    Thanks to Bodhi for all the awesome work.
  • 25:51 - 25:53
    (laughs)
  • 25:53 - 25:54
    And the larger Indian community.
  • 25:54 - 25:57
    But that's a really good idea,
    I think we should take that up.
  • 25:58 - 26:03
    As a movement, we have not been doing
    the sharing thing pretty good.
  • 26:03 - 26:05
    We need to figure out how to do that.
  • 26:05 - 26:07
    Because there are awesome tools,
  • 26:07 - 26:09
    one is built, but the others
    don't know about.
  • 26:09 - 26:11
    That's a larger problem,
  • 26:11 - 26:14
    and that's a piece that fits
    into the larger problem.
  • 26:14 - 26:16
    We should be solving someplace.
  • 26:16 - 26:18
    Let's figure out where we can do that.
  • 26:19 - 26:21
    Thank you.
  • 26:21 - 26:23
    (applause)
Title:
cdn.media.ccc.de/.../wikidatacon2019-1048-eng-Wikidata_for_Emerging_Communities_knowledge_gaps_hd.mp4
Video Language:
English
Duration:
26:32

English subtitles

Revisions