< Return to Video

cdn.media.ccc.de/.../wikidatacon2019-15-eng-Keynote_Questioning_Wikidata_hd.mp4

  • 0:06 - 0:08
    Hello. Just to check.
  • 0:08 - 0:10
    Can everyone hear me?
  • 0:10 - 0:12
    Grand.
    I've never understood
  • 0:12 - 0:14
    why that's such a phenomenon
    when people give talks
  • 0:14 - 0:17
    because if you can't,
    what are you meant to say?
  • 0:17 - 0:18
    (laughter)
  • 0:18 - 0:21
    But yes, so as said, I'm Os.
  • 0:21 - 0:24
    I'm a PhD student
    at the University of Washington,
  • 0:24 - 0:26
    where, according to the slide,
  • 0:26 - 0:30
    I study "Gender, Infrastructure
    and (Counter)Power."
  • 0:30 - 0:33
    I'd ask you all to do me the indulgence
    of pretending that
  • 0:33 - 0:37
    that's some very explicit, nuanced,
    thoughtful, academic description
  • 0:37 - 0:39
    and not just what I write as a catch-all,
  • 0:39 - 0:42
    because I kind of study
    a thousand different things
  • 0:42 - 0:46
    and fitting them all
    into a few words is hard.
  • 0:47 - 0:48
    But most of the things I study
  • 0:48 - 0:52
    are around how systems
    of knowledge enforce particular ideas
  • 0:52 - 0:54
    of how the world works,
  • 0:54 - 0:56
    and particular relationships of power
  • 0:56 - 0:58
    with a specific focus on gender.
  • 0:59 - 1:01
    I'm also an ex-Wikipedian.
  • 1:01 - 1:03
    I spent 15 years as an editor
  • 1:03 - 1:06
    which is maybe where my interest
    in the nature of knowledge started,
  • 1:07 - 1:10
    and I really can't express
    how happy I was to be invited
  • 1:10 - 1:13
    and how glad I am to be here
    with all of you,
  • 1:13 - 1:15
    but particularly James Forrester
  • 1:15 - 1:17
    who is probably the only person qualified
  • 1:17 - 1:20
    to countersign
    my passport renewal application,
  • 1:21 - 1:23
    cause it's running out soon
    and I've been trying to work out...
  • 1:23 - 1:24
    (laughter)
  • 1:24 - 1:26
    You move to Seattle.
    Everything is great.
  • 1:26 - 1:28
    Then you're like,
    "Oh, the UK government
  • 1:28 - 1:32
    requires me to find an ex-priest,
    civil servant, or member of parliament,
  • 1:32 - 1:33
    who's known me for at least 2 years
  • 1:33 - 1:35
    and who I can ship paperwork to."
  • 1:35 - 1:37
    That sounds plausible.
  • 1:37 - 1:38
    (laughter)
  • 1:38 - 1:40
    Anyway, but...
  • 1:40 - 1:44
    So I'm here as someone
    who has spent a lot of time of...
  • 1:45 - 1:46
    a number of years--
  • 1:46 - 1:49
    which I don't like to think about
    because it makes me feel incredibly old--
  • 1:49 - 1:52
    wrestling with the nature of knowledge
  • 1:52 - 1:53
    and the idea of knowledge--
  • 1:53 - 1:55
    to talk to you about
  • 1:55 - 1:57
    what Wikidata looks like
  • 1:57 - 2:01
    to someone from my background
    and with my research interests.
  • 2:02 - 2:05
    And I'm not going to spend much time
    on the story of Wikidata itself,
  • 2:05 - 2:08
    because if you're here,
    having spent 24 hours
  • 2:08 - 2:10
    having it brain dumped into you,
  • 2:10 - 2:12
    you're familiar with it.
  • 2:12 - 2:13
    It's a big semantic data store
  • 2:13 - 2:15
    that aims to provide
    machine-readable knowledge
  • 2:15 - 2:17
    in a centralized way.
  • 2:17 - 2:21
    And what this looks like
    is a series of items
  • 2:21 - 2:24
    with associated properties or statements.
  • 2:24 - 2:27
    So the item for "apple"
    has the property "fruit."
  • 2:27 - 2:28
    I mean, probably.
  • 2:28 - 2:31
    It's a Wiki so there's probably
    a long-running edit war
  • 2:31 - 2:33
    of whether an apple is a fruit,
  • 2:33 - 2:36
    and there's 50 people
    running 300 accounts between them,
  • 2:36 - 2:38
    and it's been going for years,
  • 2:38 - 2:41
    and at this point,
    if you mention the word apple on Wikidata,
  • 2:41 - 2:45
    you're preemptively banned
    as someone who, you know,
  • 2:45 - 2:46
    is secretly a sock puppet
  • 2:46 - 2:49
    and running an account on one
    or another side of this.
  • 2:50 - 2:52
    So as a consequence,
  • 2:52 - 2:54
    it's also a classification system, right?
  • 2:54 - 2:57
    A way of sorting and organizing the world.
  • 2:57 - 3:00
    So, objects or people or concepts
  • 3:00 - 3:04
    are classified as worth
    having a Wikidata entry or not.
  • 3:04 - 3:05
    A fruit or not.
  • 3:06 - 3:07
    And in each case
  • 3:07 - 3:08
    a series of criterion apply
  • 3:08 - 3:11
    to determine the properties
    that an object should have,
  • 3:11 - 3:13
    and the values of these properties
  • 3:13 - 3:15
    and how the objects
    all relate to each other.
  • 3:15 - 3:18
    So Wikidata is really an attempt to build
  • 3:18 - 3:20
    a universal classification system.
  • 3:22 - 3:26
    And classification systems
    have been studied pretty extensively.
  • 3:26 - 3:29
    One prominent work
    which I'd really recommend people read
  • 3:29 - 3:32
    if they're interested in this stuff
    is Sorting Things Out,
  • 3:32 - 3:36
    which is book by Geoff Bowker
    and Susan Leigh Star.
  • 3:36 - 3:39
    And they found that
    in an ideal universe,
  • 3:39 - 3:41
    a classification system,
  • 3:41 - 3:44
    be it universal
    or over a particular domain,
  • 3:44 - 3:46
    has three attributes.
  • 3:46 - 3:50
    The first is it operates on consistent
    and unique principles.
  • 3:50 - 3:52
    So, there's a consistent pattern
  • 3:52 - 3:55
    of what should be in each category
    and for what reasons.
  • 3:56 - 3:59
    The second is all the categories
    are mutually exclusive.
  • 3:59 - 4:02
    And the third is
    that the system is complete.
  • 4:02 - 4:05
    It contains total coverage of
    what it describes.
  • 4:05 - 4:07
    And this doesn't mean
    it has to have every single object
  • 4:07 - 4:09
    that fits into the system.
  • 4:09 - 4:11
    It just means that in the situation
  • 4:11 - 4:13
    where it lacks an object
  • 4:13 - 4:15
    and that object then shows up,
  • 4:15 - 4:16
    there should be
    a consistent mechanism
  • 4:16 - 4:19
    to work out
    whether it should be added or not,
  • 4:19 - 4:21
    and how it should be described
  • 4:21 - 4:22
    and so on, and so forth.
  • 4:23 - 4:26
    There is one small problem
    with this which is that:
  • 4:27 - 4:29
    "No real-world
    working classification system
  • 4:29 - 4:32
    that we have looked at
    meets these simple requirements
  • 4:32 - 4:33
    and we doubt that any ever could."
  • 4:34 - 4:35
    Or to put it another way,
  • 4:35 - 4:37
    all classification systems fail.
  • 4:38 - 4:41
    All classification systems
    have gaps and exceptions.
  • 4:42 - 4:45
    And obviously, the same is true
    for all systems, full stop.
  • 4:45 - 4:47
    Anyone who has ever coded
  • 4:47 - 4:49
    or simply worked in an environment,
  • 4:49 - 4:52
    or studied in an environment,
    or lived in the world
  • 4:52 - 4:55
    knows that we've yet
    to design a single thing
  • 4:55 - 4:58
    that we've thought all the way through.
  • 4:58 - 5:00
    The problem is that when we take a system,
  • 5:00 - 5:02
    classification, or otherwise,
  • 5:02 - 5:03
    and put it out into the world
  • 5:03 - 5:07
    and give it power and authority,
    and integrate it into other systems,
  • 5:07 - 5:09
    that already have power
    and authority,
  • 5:09 - 5:11
    there are consequences
    for what happens
  • 5:11 - 5:14
    when the system inevitably fails,
  • 5:15 - 5:20
    for how it reinforces or undermines
    existing relationships of power,
  • 5:20 - 5:22
    for how it hurts people.
  • 5:22 - 5:25
    A universal classification system is,
    in another words,
  • 5:25 - 5:29
    not merely doomed to failure,
    it's also doomed to hurt people.
  • 5:30 - 5:32
    And the way that it is structured
  • 5:32 - 5:37
    is ultimately a series of ethical
    and political choices as a result--
  • 5:38 - 5:39
    Who do you want to hurt?
    How much?
  • 5:39 - 5:42
    What should be done
    when people are injured?
  • 5:42 - 5:45
    And those choices have real consequences.
  • 5:48 - 5:52
    And so making these choices
    often involves confronting the fact
  • 5:52 - 5:54
    that there's very rarely a single
  • 5:54 - 5:56
    simple machine-readable interpretation
  • 5:56 - 5:59
    of something that's true
    for all people throughout all history.
  • 5:59 - 6:00
    Anything in the universe
  • 6:00 - 6:03
    has multiple meanings,
    and symbolisms, and nuances
  • 6:03 - 6:07
    to different people in different contexts
    at different times.
  • 6:07 - 6:10
    But designing a classification system
    and implementing it,
  • 6:10 - 6:12
    designing a system that can make a claim
  • 6:12 - 6:15
    to having consistent principles,
  • 6:15 - 6:17
    and covering everything it discusses,
  • 6:17 - 6:21
    inevitably involves
    cutting down on this complexity
  • 6:21 - 6:25
    and making decisions about what
    "the" meaning of a thing is going to be,
  • 6:25 - 6:27
    or what array of possible meaning
  • 6:27 - 6:30
    should be presented
    and in what sequence.
  • 6:31 - 6:32
    And as a result,
  • 6:32 - 6:36
    it involves silencing voices
    or rendering voices louder.
  • 6:36 - 6:38
    Again, this has consequences.
  • 6:38 - 6:40
    And to see what I mean
    about this complexity
  • 6:40 - 6:44
    and context, and reduction,
    and the consequences of it,
  • 6:44 - 6:47
    I'd like to set through some examples
    from Wikidata itself.
  • 6:48 - 6:50
    The ones I've chosen
    are all gender-related because again,
  • 6:50 - 6:54
    gender is both professionally
    and personally sort of a key interest.
  • 6:55 - 6:59
    So, the first that I'll start with
    is transexualism
  • 6:59 - 7:01
    which is described as a "condition
  • 7:01 - 7:03
    in which an individual
    identifies with a gender
  • 7:03 - 7:07
    inconsistent or not culturally associated
    with their biological sex."
  • 7:08 - 7:10
    Fairly unobjectionable and--
  • 7:10 - 7:12
    wait, no, it's classified as a disease,
  • 7:12 - 7:15
    and a psychiatric disease at that.
  • 7:16 - 7:19
    Now, I know what you're thinking,
    which is this is appalling
  • 7:19 - 7:20
    but actually it's not as simple
  • 7:20 - 7:24
    as either of these statements
    being true or false, right?
  • 7:25 - 7:28
    They're in a category of sort of,
    "true, except."
  • 7:29 - 7:33
    So, take transsexualism
    is an instance of disease, right?
  • 7:33 - 7:36
    Technically, this is true,
  • 7:36 - 7:38
    in so far as transsexualism
  • 7:38 - 7:39
    is the name of an entry
  • 7:39 - 7:42
    under the International Classification
    of Diseases, version 10.
  • 7:43 - 7:46
    But we should add some complexity
    and nuance to that.
  • 7:46 - 7:51
    So, the ICD
    is a classification of literally
  • 7:51 - 7:54
    everything in the world
    that you could have
  • 7:54 - 7:58
    that was in any way involved at all
    in someone's injury or death.
  • 7:58 - 8:03
    It is in fact illegal to die of something
    that is not listed in the ICD.
  • 8:03 - 8:04
    (laughter)
  • 8:05 - 8:07
    So it contains kind of a lot of things,
  • 8:07 - 8:09
    and transexualism is listed in it
  • 8:09 - 8:11
    so we classify it as a disease
  • 8:11 - 8:14
    because it's in a classification
    of diseases.
  • 8:14 - 8:17
    So, here are some other things
    that the ICD also lists as diseases
  • 8:17 - 8:19
    that it has specific entries for.
  • 8:20 - 8:23
    PA80: Shot by accident.
  • 8:24 - 8:28
    PA40.0: Fell off a boat, drowned.
  • 8:28 - 8:30
    (laughter)
  • 8:30 - 8:35
    PA41.1: Fell off a boat,
    damaged the boat, and drowned.
  • 8:35 - 8:37
    (laughter)
  • 8:37 - 8:40
    PA40.1: Fell off the boat,
  • 8:40 - 8:41
    didn't damage the boat,
  • 8:41 - 8:43
    didn't drown,
  • 8:43 - 8:44
    still died of something.
  • 8:44 - 8:45
    (laughter)
  • 8:46 - 8:49
    And finally, QD50: Being poor.
  • 8:49 - 8:50
    (laughter)
  • 8:50 - 8:54
    So, if any of you
    have ever fallen off a boat,
  • 8:54 - 8:57
    I'm very sorry but you have a disease
  • 8:57 - 8:59
    which you should really
    talk to a doctor about.
  • 8:59 - 9:01
    What class of doctor,
    I'm not sure.
  • 9:01 - 9:03
    It might be a psychiatrist.
  • 9:03 - 9:05
    Who knows?
  • 9:06 - 9:08
    So you know that's disease, right?
  • 9:08 - 9:11
    What about health specialty: psychiatry?
  • 9:11 - 9:14
    Well, that's also true, sort of.
  • 9:14 - 9:16
    So, psychiatrists are the people
  • 9:16 - 9:19
    who diagnose the presence
    of gender dysphoria,
  • 9:19 - 9:21
    a disconnect between one's sense of gender
  • 9:21 - 9:24
    and one's sort of like,
    embodied or perceived gender.
  • 9:25 - 9:26
    But again, context.
  • 9:26 - 9:29
    For example,
    saying psychiatrists diagnose it
  • 9:29 - 9:31
    ignores the fact
    that none of the treatments
  • 9:31 - 9:32
    are psychiatric.
  • 9:32 - 9:34
    You might as well list the specialties
  • 9:34 - 9:38
    as specialization in hormones
  • 9:38 - 9:42
    or plastic surgery,
    or being a personal shopper.
  • 9:42 - 9:46
    All of these also have some role
    in people's life trajectories.
  • 9:46 - 9:48
    They are not listed.
  • 9:50 - 9:52
    One other useful
    potential factoid by the way,
  • 9:52 - 9:54
    is that the ICD 10 is actually
  • 9:54 - 9:57
    the old International Classification
    of Diseases,
  • 9:57 - 10:01
    and the ICD 11 no longer lists
    transsexualism at all,
  • 10:01 - 10:04
    much less as a disease.
  • 10:04 - 10:08
    But my point here is not that Wikidata
    sometimes contains outdated information
  • 10:08 - 10:11
    or sometimes contains
    false information,
  • 10:11 - 10:14
    it's that the statements
    that are constructed from that information
  • 10:14 - 10:17
    as a consequence of what they leave out
  • 10:17 - 10:18
    and what the results are,
  • 10:18 - 10:21
    drop things and add risk.
  • 10:21 - 10:23
    So, one way of structuring
  • 10:23 - 10:25
    the information that
    that entry contained is:
  • 10:25 - 10:30
    "transsexualism is a psychiatric disease."
  • 10:30 - 10:32
    And this leaves out
    a lot of complexity,
  • 10:32 - 10:34
    some of which we've discussed.
  • 10:34 - 10:36
    But the greater issue is how it interlocks
  • 10:36 - 10:40
    and resonates with existing narratives,
    and existing information.
  • 10:40 - 10:43
    For example, the idea
    of transsexualism is a disease.
  • 10:43 - 10:48
    Does anyone know why
    the ICD stops listing it as a disease?
  • 10:50 - 10:51
    Well, two reasons.
  • 10:51 - 10:56
    First is because calling
    being trans a disease is not accurate.
  • 10:56 - 10:59
    It does not meet the definition
    of being a disease.
  • 11:00 - 11:03
    In fact, the only reason
    that anything to do with being trans
  • 11:03 - 11:08
    is still in the ICD is not
    out of some objective
  • 11:08 - 11:12
    like, you know, examination
    of biology or psychiatry
  • 11:12 - 11:14
    but instead purely pragmatism.
  • 11:14 - 11:16
    That if you stop listing it,
  • 11:16 - 11:18
    then insurance companies
    in places like the U.S.
  • 11:18 - 11:21
    would stop covering medical care
  • 11:21 - 11:24
    that is associated with being trans.
  • 11:24 - 11:26
    And the second is that
  • 11:27 - 11:30
    the stigma associated
    with having something classified
  • 11:30 - 11:33
    as a disease is substantive,
  • 11:33 - 11:36
    and when you list transsexualism
    as a disease
  • 11:36 - 11:37
    and a psychiatric one at that,
  • 11:37 - 11:39
    you tap into really
    long-standing assumptions
  • 11:39 - 11:42
    and false beliefs about trans people.
  • 11:42 - 11:44
    Assumptions and beliefs
    that have a lot of power.
  • 11:44 - 11:47
    Like, if it's a disease
    there must be something wrong
  • 11:47 - 11:50
    with trans people,
    something that people should fix.
  • 11:50 - 11:51
    And if it's a psychiatric condition
  • 11:51 - 11:55
    then trans people should
    be therapized out of being trans.
  • 11:55 - 11:59
    In other words, whatever the raw truth
    or falseness of the statement,
  • 11:59 - 12:02
    stripping out its complexity
    and contextuality,
  • 12:02 - 12:05
    lets people fit it into their own notions
    of what it means.
  • 12:06 - 12:07
    And that doesn't end
  • 12:07 - 12:10
    in a neutral objective
    classification system,
  • 12:10 - 12:13
    it ends in things like conversion therapy,
  • 12:13 - 12:17
    and it being legal
    to beat people to death for being trans
  • 12:17 - 12:20
    when you find out that they're trans
    after you slept with them,
  • 12:20 - 12:22
    because, you know,
    something's wrong with them.
  • 12:22 - 12:26
    Like why would you
    be considered reasonable
  • 12:26 - 12:27
    to have done this?
  • 12:30 - 12:33
    So a more accurate framing of this
    might be this,
  • 12:33 - 12:37
    which is hard to fit into Wikidata.
  • 12:38 - 12:41
    And because we can't fit
    that into Wikidata,
  • 12:41 - 12:42
    and we strip it down,
  • 12:42 - 12:43
    and we lose all that complexity,
  • 12:43 - 12:48
    we open up the possibility to, again,
    reinforce these really dangerous notions.
  • 12:50 - 12:52
    So, let's look at another example,
    also from gender,
  • 12:52 - 12:54
    and that is the entry for non-binary.
  • 12:56 - 12:58
    So, as Wikidata informs us,
  • 12:58 - 13:01
    non-binary is a range of genders
  • 13:01 - 13:03
    that are neither exclusively man
    nor woman.
  • 13:03 - 13:07
    And there are some critiques
    I have of the "also known as" section,
  • 13:07 - 13:09
    but that's not the biggest issue here.
  • 13:09 - 13:10
    No, the biggest issue here
  • 13:10 - 13:15
    is that at no point does this entire page
    make any reference to trans people.
  • 13:15 - 13:19
    So, if you go to the entry
    for transgender woman,
  • 13:19 - 13:22
    it says, "opposite to transgender man."
  • 13:22 - 13:24
    And if you go to the entry
    for transgender man
  • 13:24 - 13:27
    it says, "opposite to transgender woman."
  • 13:27 - 13:29
    If you go to this entry,
  • 13:29 - 13:33
    it has absolutely no reference
    to trans people whatsoever.
  • 13:33 - 13:36
    There is this complete disconnect
    and distinction
  • 13:36 - 13:39
    between non-binary people
    and trans people.
  • 13:40 - 13:42
    And this might be, seems to be,
  • 13:42 - 13:44
    a pedantic thing to be concerned about
  • 13:44 - 13:48
    but it's actually a really useful example
    for a couple of reasons.
  • 13:48 - 13:53
    The first is that how non-binary people
    relates to being trans
  • 13:53 - 13:55
    is really hotly debated.
  • 13:56 - 14:00
    Individual non-binary people
    may or may not identify as trans.
  • 14:02 - 14:04
    As a consequence, it's really difficult
  • 14:04 - 14:07
    to make big categorical judgements
    about a class of people.
  • 14:09 - 14:13
    Other people would say that non-binary
    people aren't trans,
  • 14:13 - 14:16
    for whatever reason,
    or that non-binary people are trans.
  • 14:18 - 14:20
    You know, you have to
    make a decision at some point.
  • 14:20 - 14:22
    How are you going
    to categorize this entry?
  • 14:22 - 14:24
    What attributes are you going
    to associate it with?
  • 14:26 - 14:28
    But it's hard to do that in Wikidata
  • 14:28 - 14:31
    when by necessity
    the structure of the platform
  • 14:31 - 14:33
    is so categorical and so fixed,
  • 14:33 - 14:37
    that you can't really say like,
    for some people these things are related
  • 14:37 - 14:40
    and for others they aren't,
    and it's actually very politically charged
  • 14:40 - 14:41
    but you should think about it.
  • 14:42 - 14:45
    There's no objective fact to fall back on.
  • 14:45 - 14:48
    It's very contextual and complex,
    and disputed.
  • 14:50 - 14:53
    So, how do you fit this in?
  • 14:54 - 14:55
    Anyone?
  • 14:58 - 15:00
    But, this reductiveness
    isn't just a question of,
  • 15:00 - 15:02
    "Oh well, we haven't fit all
    the information in
  • 15:02 - 15:04
    so I guess it's not perfect."
  • 15:04 - 15:09
    Again, it fits into preexisting discourses
    and the preexisting world,
  • 15:09 - 15:11
    and has the potential
    to cause very real harms.
  • 15:13 - 15:14
    There's this very long history
  • 15:14 - 15:17
    of non-binary people
    not being considered trans,
  • 15:18 - 15:21
    going back to, in fact, the foundational,
  • 15:21 - 15:25
    sort of medical and academic,
    and authoritative works
  • 15:25 - 15:29
    on what being trans is
    and how trans people should be treated.
  • 15:30 - 15:31
    And what this has resulted in
  • 15:31 - 15:36
    is non-binary people being cut
    out of access to resources--
  • 15:37 - 15:41
    medical care, community membership,
    any kind of support.
  • 15:41 - 15:44
    In fact until 2013,
  • 15:44 - 15:47
    being non-binary was not a thing
    you could possibly be
  • 15:47 - 15:51
    while still getting access,
    to transition-related medical treatment.
  • 15:51 - 15:55
    If you were, and you wanted access
    you would have to go to your doctor
  • 15:55 - 15:58
    and consistently lie,
    and hopefully get away with it.
  • 16:00 - 16:03
    So, if you want that diagnosis to happen
  • 16:03 - 16:06
    so that your health insurance
    will cover things
  • 16:06 - 16:09
    or that your national health service
    will cover things,
  • 16:09 - 16:11
    you could either be a man
    or a woman,
  • 16:11 - 16:13
    and nothing else.
  • 16:14 - 16:16
    And right now there's a ton of backlash
  • 16:16 - 16:18
    to non-binary existences
  • 16:18 - 16:20
    from people who are thinking
    that we are a threat,
  • 16:20 - 16:23
    or something new and novel
  • 16:23 - 16:27
    when we've been around for just
    as long as any other kind of trans person
  • 16:27 - 16:30
    and just not discussed.
  • 16:31 - 16:32
    And again, the consequence of this
  • 16:32 - 16:37
    is that this silence is reinforcing
    those preexisting ideas
  • 16:37 - 16:42
    of being non-binary has nothing to do
    with being trans whatsoever,
  • 16:42 - 16:47
    and it creates and reinforces discourses
    that cut people off from care,
  • 16:47 - 16:50
    and cut people off from community.
  • 16:51 - 16:56
    And finally, before I stop harping
    on things about gender quite so much,
  • 16:56 - 16:57
    the hijra.
  • 16:57 - 16:59
    So, according to Wikidata
  • 16:59 - 17:02
    the hijra are the third gender
    of South Asian cultures
  • 17:02 - 17:05
    and a sub class of non-binary.
  • 17:06 - 17:07
    Now, here's the thing.
  • 17:07 - 17:11
    Yes, hijra people fall
    outside a simple man-woman binary,
  • 17:12 - 17:14
    but pretty much zero hijra people
  • 17:14 - 17:16
    would ever define themselves
    as non-binary,
  • 17:16 - 17:19
    because it just doesn't make any sense.
  • 17:19 - 17:23
    In a western context,
    non-binary people are, by definition,
  • 17:23 - 17:24
    not man or woman
  • 17:24 - 17:28
    but as a consequence
    not trans man or trans woman.
  • 17:29 - 17:31
    Hijra includes trans women,
  • 17:31 - 17:34
    and also includes all intersex people,
  • 17:34 - 17:38
    all sterile people,
    and a large number of gay people
  • 17:38 - 17:41
    while not including trans men
  • 17:41 - 17:45
    or people who are non-binary,
    and were assigned female at birth.
  • 17:47 - 17:48
    All of this is really complex
  • 17:48 - 17:50
    and there are literally books written
  • 17:50 - 17:55
    on the framework of gender
    and how that fits into it.
  • 17:55 - 17:57
    But the point is
    there's not a simple mapping
  • 17:57 - 17:59
    of western gender notions
  • 17:59 - 18:01
    to gender notions
    in the rest of the world.
  • 18:02 - 18:05
    Categorizing hijra people
  • 18:05 - 18:10
    as a subset of non-binary people
  • 18:10 - 18:14
    ignores the fact that most hijra people
    do not see themselves that way,
  • 18:14 - 18:16
    would not see themselves that way,
  • 18:16 - 18:19
    and that the definitions of hijra
    and non-binary
  • 18:19 - 18:21
    are completely incompatible.
  • 18:23 - 18:24
    But again this has the potential
  • 18:24 - 18:27
    to cause harm.
  • 18:27 - 18:28
    Because the fact of the matter
  • 18:28 - 18:32
    is that western notions of gender
    are pretty regularly
  • 18:32 - 18:35
    and over a long period of time
    exported to the rest of the world
  • 18:35 - 18:37
    often by violence.
  • 18:37 - 18:40
    We have these information systems.
  • 18:40 - 18:43
    We have classification systems.
  • 18:43 - 18:44
    We have standards.
  • 18:44 - 18:47
    We have, historically and currently, wars,
  • 18:47 - 18:49
    all of which are orientated
    around this idea
  • 18:49 - 18:52
    of the western way of doing things
    is the only good way
  • 18:52 - 18:54
    or is the best way
    and the standard way,
  • 18:54 - 18:57
    and everyone should conform.
  • 18:57 - 19:01
    And so when we have these big projects
    which are trying to fit the world
  • 19:01 - 19:04
    in to a very westernized idea
    of knowledge, because they have to,
  • 19:04 - 19:08
    because that’s how classification systems
    do universally work--
  • 19:08 - 19:11
    everything has to fit
    into one consistent scheme.
  • 19:11 - 19:14
    It is perpetuating that kind of violence.
  • 19:17 - 19:21
    So, you could respond
    to my concerns and examples,
  • 19:21 - 19:23
    and rambles with kind of a lot.
  • 19:23 - 19:25
    One line to take would be,
    "Why does this matter?"
  • 19:25 - 19:28
    Why does Wikidata participating
    and validating
  • 19:28 - 19:33
    or invalidating particular discourses
    have an impact on the world?
  • 19:33 - 19:37
    And the first answer is
    it actually doesn't matter if it matters.
  • 19:37 - 19:39
    It matters that you acknowledge it,
  • 19:39 - 19:42
    So, right now the default framing
    of Wikidata is
  • 19:42 - 19:45
    we're just collecting all of the knowledge
    in a machine-readable form,
  • 19:45 - 19:46
    but you're not.
  • 19:46 - 19:48
    You're also making decisions
  • 19:48 - 19:50
    about what should be included
    and what shouldn't,
  • 19:50 - 19:53
    and how knowledge should be represented.
  • 19:53 - 19:56
    What complexity is worth representing
    and what isn't.
  • 19:57 - 19:59
    And those are ethical
    and political choices,
  • 19:59 - 20:02
    and framing the project
    as simply the result
  • 20:02 - 20:05
    of a million anonymous,
    and interchangeable monkeys
  • 20:05 - 20:07
    with an equivalent number of typewriters
  • 20:07 - 20:09
    makes it impossible for us
    to have conversations about it.
  • 20:10 - 20:13
    Wikidata's organizers and users
    and funders must understand
  • 20:13 - 20:17
    that they're fundamentally
    making charged decisions
  • 20:17 - 20:19
    that are not neutral
    or objective at all,
  • 20:20 - 20:24
    and that is not bad but dangerous.
  • 20:26 - 20:28
    And so, okay, having accepted
  • 20:28 - 20:30
    that these are ethical
    and political decisions,
  • 20:30 - 20:33
    you could say,
    "Well, if people want their takes
  • 20:33 - 20:35
    on things included,
    they should just contribute."
  • 20:35 - 20:39
    And marginalized communities
    do contribute a lot, right?
  • 20:39 - 20:41
    There's a long history
    of queer communities,
  • 20:41 - 20:44
    particularly, being
    very early adopters of technology.
  • 20:44 - 20:48
    And so people could
    just contribute to Wikidata.
  • 20:48 - 20:53
    Like Hijra people could create accounts
    and start arguing
  • 20:53 - 20:56
    that actually the entry
    shouldn't be a subset of non-binary
  • 20:56 - 20:58
    and so, and so forth.
  • 20:59 - 21:02
    The problem is that
    this is unlikely to help
  • 21:02 - 21:04
    because they're the minority,
  • 21:04 - 21:06
    because many of the voices
    and perspectives
  • 21:06 - 21:08
    that are currently silenced,
  • 21:08 - 21:10
    in the political and ethical decisions
    being made,
  • 21:10 - 21:12
    are those of minorities.
  • 21:12 - 21:14
    So, I did some number crunching on this.
  • 21:14 - 21:17
    Wikidata has 20,000 active editors
  • 21:17 - 21:21
    from a human population
    of seven billion give or take,
  • 21:21 - 21:24
    unless you believe that maths is a lie
  • 21:24 - 21:28
    and the world governments,
    controlled by lizards under the Arctic,
  • 21:28 - 21:31
    is making everything up.
  • 21:31 - 21:33
    And there are approximately...
    Um hmm?
  • 21:33 - 21:34
    (person 1) You mean they're not?
  • 21:34 - 21:36
    (laughter)
  • 21:36 - 21:37
    Look, I'll be honest.
  • 21:37 - 21:39
    If living in the U.S.
    for the last five years
  • 21:39 - 21:40
    has taught me anything,
  • 21:40 - 21:45
    it's that any government assemblage
    large enough to try and control
  • 21:45 - 21:46
    a big chunk of the human population
  • 21:46 - 21:51
    would in no way be consistently competent
    enough to actually cover it up.
  • 21:51 - 21:52
    (laughter)
  • 21:52 - 21:53
    Like we would have found out
    in three months--
  • 21:53 - 21:55
    and it wouldn't even have been
  • 21:55 - 21:57
    because of some
    plucky investigative reporter--
  • 21:57 - 21:59
    it would have been
    because one of the lizards
  • 21:59 - 22:00
    forgot to put on
    their human suit one day
  • 22:00 - 22:03
    and accidentally went out
    to the shops for a pint of milk
  • 22:03 - 22:04
    (laughter)
  • 22:04 - 22:08
    and got caught in a TikTok video.
  • 22:08 - 22:10
    (laughter)
  • 22:11 - 22:14
    So Wikidata has 20,000 active editors--
  • 22:14 - 22:17
    of whom we will assume none are lizards
  • 22:17 - 22:18
    in human suits or otherwise--
  • 22:19 - 22:21
    from a human population of seven billion,
  • 22:22 - 22:25
    and there are approximately
    one million Hijra people in the world.
  • 22:25 - 22:27
    So if we assume a rate
    of equal participation--
  • 22:27 - 22:31
    setting aside the extreme poverty
    a lot of Hijra people live in
  • 22:31 - 22:32
    and the corresponding impact
  • 22:32 - 22:35
    on access to things
    like reliable internet coverage--
  • 22:36 - 22:41
    then the combined efforts
    of 20,000 Wikidata editors
  • 22:41 - 22:44
    would have to be overwhelmed
    by 2.85 people.
  • 22:46 - 22:49
    That doesn't seem particularly plausible.
  • 22:52 - 22:53
    Okay, so then you might say,
  • 22:53 - 22:57
    "Well, what if we just have
    other Wikibase instances
  • 22:57 - 23:00
    isn't that the whole thing
    we're building towards?
  • 23:00 - 23:03
    You can set up your own Wikibase
    with your own perspectives
  • 23:03 - 23:06
    and your own decisions
    about how to classify things,
  • 23:06 - 23:08
    and what to prioritize,
    and what not to.
  • 23:08 - 23:11
    Make your own site with your own standard
    for what constitutes knowledge
  • 23:11 - 23:13
    and what information is important."
  • 23:13 - 23:16
    And people could do precisely that.
  • 23:16 - 23:19
    But the problem is
    that Wikidata has a lot of heft behind it
  • 23:19 - 23:23
    which is why the decisions
    that Wikidata makes have so much import.
  • 23:24 - 23:26
    There's the fact that it already exists.
  • 23:26 - 23:29
    It has a first movers advantage.
  • 23:29 - 23:31
    There's the Wikimedia brand.
  • 23:31 - 23:34
    There's the funding
    from places like Google.
  • 23:34 - 23:37
    There's the relationships
    with other institutions.
  • 23:37 - 23:39
    When the strategic plan for Wikidata
  • 23:39 - 23:42
    calls for engagement
    and integration with museums,
  • 23:42 - 23:43
    that doesn't just result
  • 23:43 - 23:45
    in getting more data for Wikidata.
  • 23:45 - 23:49
    That also results in Wikidata
  • 23:49 - 23:52
    and the decisions its users make
    permeating more of reality,
  • 23:52 - 23:58
    becoming more of a standard
    of how data systems work,
  • 23:58 - 24:02
    and more of a place that is drawn from
    to populate other spaces.
  • 24:04 - 24:07
    So I keep using this line,
    "Not bad, but dangerous"
  • 24:07 - 24:10
    to describe classification systems
    or to describe Wikidata,
  • 24:11 - 24:12
    and I want to reinforce
  • 24:12 - 24:15
    that I don't think that Wikidata
    is inherently bad.
  • 24:16 - 24:19
    But I do think that its dangers are vast
  • 24:19 - 24:21
    and are not being properly attended to.
  • 24:21 - 24:23
    Just by looking at gender,
  • 24:23 - 24:27
    we saw three examples,
    which I pulled very, very quickly,
  • 24:27 - 24:32
    of situations where even setting aside
  • 24:32 - 24:35
    the sort of objective "accuracy"
  • 24:35 - 24:39
    of the information that
    a Wikidata entry might contain,
  • 24:39 - 24:44
    the information it chooses to contain
    and chooses to prioritize perpetuates
  • 24:44 - 24:47
    or silences particular discourses,
    and particular ideas
  • 24:47 - 24:52
    that have weight in the rest of the world,
    that do harm in the rest of the world.
  • 24:53 - 24:54
    And I picked those examples
  • 24:54 - 24:58
    not because they're surprising
    in any way,
  • 24:58 - 25:00
    or not because they're unique,
  • 25:00 - 25:04
    but simply to point out that
    if I could find that many problems
  • 25:04 - 25:07
    with resonances in wider violent systems
  • 25:07 - 25:09
    in such a tiny sliver of content,
  • 25:09 - 25:12
    imagine how many others
    are lurking out there.
  • 25:14 - 25:18
    And the goal of Wikidata,
  • 25:18 - 25:19
    the goal of universal classification
  • 25:19 - 25:22
    if these dangers are not attended to
  • 25:22 - 25:24
    could ultimately result,
    or will ultimately result,
  • 25:24 - 25:28
    not in simple like neutral classification,
  • 25:28 - 25:29
    but imposition.
  • 25:29 - 25:32
    In saying this is the way
    the world works
  • 25:32 - 25:33
    and if you don't like it
  • 25:33 - 25:37
    then congrats, you should try
    and fit into it.
  • 25:39 - 25:42
    And I really wish that I had
    a sort of simple answer for this.
  • 25:42 - 25:44
    I don't.
  • 25:44 - 25:45
    It's one of the advantages
  • 25:45 - 25:46
    of switching to academia
  • 25:46 - 25:48
    instead of working
    in an engineering department.
  • 25:48 - 25:49
    You can just show up places
  • 25:49 - 25:52
    and go, "Everything
    is really complicated."
  • 25:52 - 25:54
    Someone should do something about that.
  • 25:55 - 25:57
    Could I have a grant please?
  • 25:57 - 25:58
    (laughter)
  • 25:58 - 26:00
    But all I can really do
  • 26:00 - 26:03
    is point you back to
    Bowker and Star's conclusion,
  • 26:03 - 26:07
    which is that this isn't ultimately
    about Wikidata,
  • 26:07 - 26:08
    this isn't a problem with Wikidata
  • 26:08 - 26:11
    this is that the class of systems
  • 26:11 - 26:14
    that Wikidata is a part of
    has never been done safely
  • 26:14 - 26:17
    and there is no reason
    to think it could be.
  • 26:18 - 26:20
    And so my call is ultimately
  • 26:20 - 26:22
    not for a particular change,
  • 26:22 - 26:24
    or for all of you
    to just go home and give up.
  • 26:25 - 26:27
    It's for the project collectively
  • 26:27 - 26:29
    and for you all individually
  • 26:29 - 26:32
    to determine how comfortable you are
  • 26:32 - 26:36
    with participating and building a system
  • 26:36 - 26:39
    that makes a claim to universalism,
  • 26:39 - 26:42
    that makes a claim to neutrality
    and truth in data,
  • 26:44 - 26:47
    when we know that that's neither possible
  • 26:47 - 26:50
    nor harmless when it fails.
  • 26:50 - 26:53
    and if you are not comfortable
    with that, working to articulate
  • 26:53 - 26:55
    what other ways of doing this
    there might be.
  • 26:56 - 26:59
    And these could look like, for example,
  • 27:00 - 27:04
    giving primacy
    to those local Wikibase installs.
  • 27:04 - 27:06
    Saying that ultimately
  • 27:06 - 27:08
    we need to give individual communities
  • 27:08 - 27:11
    and individual contexts
    and spaces primacy
  • 27:11 - 27:13
    in defining what matters to them,
  • 27:13 - 27:15
    and how they wish to be defined.
  • 27:15 - 27:19
    And the conversation about
    which perspective should be included
  • 27:19 - 27:22
    in some central repository should wait
  • 27:22 - 27:25
    until we have
    the full range of perspectives.
  • 27:27 - 27:29
    So, that's everything from me.
  • 27:29 - 27:31
    Thank you, everyone,
    for sitting through this.
  • 27:32 - 27:35
    I think we have about 20 to 25 minutes--
  • 27:35 - 27:39
    (moderator) 25 minutes for questions,
    so, please, plentiful.
  • 27:40 - 27:41
    Thank you very much.
  • 27:42 - 27:45
    (applause)
  • 27:47 - 27:50
    (person 2) Thank you so much
    for this wonderful presentation
  • 27:50 - 27:52
    about the problems inherent
    in classification systems.
  • 27:52 - 27:55
    One of the examples you had
    is really cool
  • 27:55 - 27:56
    from a mathematical point of view,
  • 27:56 - 27:58
    when you were showing
    that transgender male
  • 27:58 - 28:01
    is the opposite of transgender female--
  • 28:01 - 28:04
    or transgender female
    is the opposite of transgender male
  • 28:04 - 28:07
    and the opposite of cisgendered female.
  • 28:07 - 28:12
    That makes cisgendered female
    be the same as transgender male,
  • 28:12 - 28:13
    because opposite of is the same--
  • 28:13 - 28:17
    if A is opposite of B
    and C is the opposite of B,
  • 28:17 - 28:18
    A and C are the same.
  • 28:18 - 28:21
    So actually that's a place
    where it should be different from
  • 28:21 - 28:23
    and not opposite of,
  • 28:23 - 28:26
    and that involves
    a lot of mathematical issues
  • 28:26 - 28:29
    when we go to actually ask queries
    of the database,
  • 28:29 - 28:32
    so it's really important
    that you've pointed out things like that.
  • 28:32 - 28:34
    Yeah, another example
    of that which I thought was fun
  • 28:34 - 28:39
    was transsexualism was defined
    in part further down--
  • 28:39 - 28:40
    which I wanted to include,
  • 28:40 - 28:42
    but couldn't find a way
    of fitting it into the flow--
  • 28:42 - 28:46
    as the same as sex-reassignment surgery.
  • 28:47 - 28:48
    Which is unintentionally hilarious
  • 28:48 - 28:51
    because a diagnosis of transsexualism
  • 28:51 - 28:55
    was historically a prerequisite
    for sex-reassignment surgery.
  • 28:55 - 28:58
    So it's not so much a chicken
    and an egg problem
  • 28:58 - 29:00
    as the chicken is carrying the egg.
  • 29:00 - 29:01
    (laughter)
  • 29:02 - 29:04
    Yeah. So yeah, these--
  • 29:04 - 29:08
    When we look at Wikidata
    and how much it uses mathematical,
  • 29:08 - 29:12
    or pseudo-mathematical language of, like,
  • 29:12 - 29:16
    opposite of, distinct from,
    in the set of...
  • 29:17 - 29:19
    Yeah, reality is more complex
  • 29:19 - 29:22
    than the mathematics
    we have to represent it.
  • 29:23 - 29:25
    I don't have a smart answer there
    except to say
  • 29:25 - 29:27
    that I used to be
    a quantitative researcher
  • 29:27 - 29:30
    and I left,
    and there is a reason for this.
  • 29:33 - 29:34
    (moderator) Next question.
  • 29:34 - 29:36
    Who raised hands?
  • 29:36 - 29:37
    I see a hand over there?
  • 29:46 - 29:47
    (person 3) Hello.
  • 29:48 - 29:50
    First of all.
    Thank you for this presentation.
  • 29:50 - 29:51
    It was very eye-opening.
  • 29:53 - 29:56
    I want to tell you,
    but first of all--
  • 29:56 - 29:59
    there's a Wikimedia--
    I don't know if you know
  • 29:59 - 30:01
    about the community LGBT+ user group.
  • 30:01 - 30:02
    So it's a user group,
  • 30:02 - 30:04
    and they have this mailing list,
  • 30:04 - 30:08
    and they discussing actually
    the issue of sex and gender in Wikidata,
  • 30:08 - 30:09
    and there is some proposals made
  • 30:09 - 30:12
    by LGBT+ people to improve it.
  • 30:12 - 30:15
    So, but it's not fully done yet.
  • 30:15 - 30:18
    So, there are some plans,
    people working on it.
  • 30:18 - 30:20
    It would be great
    if you want to chime in there
  • 30:20 - 30:22
    and give your opinion
  • 30:22 - 30:24
    because I'm pretty sure
    you're more expert than most of us.
  • 30:25 - 30:28
    But I want to give a critique
    of this thing that you said
  • 30:28 - 30:30
    about hijra people that said
  • 30:30 - 30:34
    out of 20,000 editors of Wikidata,
  • 30:34 - 30:37
    assuming 2.8 of them will be hijra
  • 30:37 - 30:40
    and they need to overcome
    all of these 20,000 people
  • 30:40 - 30:41
    but this is not true.
  • 30:41 - 30:45
    Lots of people, I say assume 20,000 people
  • 30:45 - 30:48
    are just unaware of an issue.
  • 30:48 - 30:50
    They are not bigots
  • 30:50 - 30:52
    or they are not going to actively
  • 30:52 - 30:54
    not let people do this.
  • 30:54 - 30:57
    And lots of them would help
    if you tell them.
  • 30:57 - 31:00
    Like, as you [inaudible]
    that edits Wikidata,
  • 31:00 - 31:02
    I have no idea about this issue
  • 31:02 - 31:04
    and if I knew it
    I would have fixed it.
  • 31:05 - 31:06
    So, yeah.
  • 31:06 - 31:08
    Yeah. I totally get what you mean.
  • 31:09 - 31:11
    And I want to be clear that I'm not saying
  • 31:11 - 31:13
    there are 20,000 people,
  • 31:13 - 31:14
    many of whom are in this room,
  • 31:14 - 31:16
    although only a tiny percentage
  • 31:16 - 31:20
    who are vehement bigots
    and cultural imperialists.
  • 31:20 - 31:22
    Instead what I'm getting at
    is the fact
  • 31:22 - 31:27
    that the consensus model,
    and discussion-based model
  • 31:27 - 31:31
    that the WikiProjects are based on
  • 31:31 - 31:33
    has a couple of flaws,
  • 31:33 - 31:34
    and one of the big flaws
  • 31:34 - 31:39
    is that it assumes that all of the voices
    worth representing are there
  • 31:39 - 31:42
    and are represented
    somewhat proportionately.
  • 31:42 - 31:46
    Consensus started off
    as a model in Quaker communities
  • 31:46 - 31:49
    where literally everyone impacted
    by a decision was in the room,
  • 31:49 - 31:53
    because everyone impacted
    by a decision could fit in the room.
  • 31:54 - 31:59
    And so my point
    with this 2.85 number is not to say
  • 31:59 - 32:01
    you have to argue
    with the entire population of Wikidata
  • 32:01 - 32:03
    every time you want to make any decision,
  • 32:03 - 32:08
    but instead to say
    that the consensus model
  • 32:08 - 32:11
    and the majoritarian model
    of what knowledge should be represented
  • 32:11 - 32:14
    runs fundamentally into a problem
  • 32:14 - 32:21
    when the people
    who are being underrepresented
  • 32:21 - 32:23
    are underrepresented.
  • 32:24 - 32:26
    For another example, and a real one,
  • 32:28 - 32:30
    Myanmar as a country.
  • 32:30 - 32:35
    The English Wikipedia claims
    that it was called Burma
  • 32:35 - 32:37
    until a couple of years ago.
  • 32:39 - 32:41
    And the reasoning for this
    was very simple.
  • 32:43 - 32:45
    The BBC didn't like calling it Myanmar
  • 32:45 - 32:48
    and a load of editors--
  • 32:48 - 32:49
    (person 4) [inaudible] completely wrong.
  • 32:49 - 32:50
    Sorry.
  • 32:50 - 32:52
    (laughter)
  • 32:53 - 32:56
    You run into this issue of like...
  • 32:56 - 32:58
    I know it's not the precise thing,
    but it's just...
  • 32:58 - 33:02
    - (person 4) : [inaudible] it's actually--
    - (moderator) I give you the mic, sir.
  • 33:02 - 33:03
    - Yes?
    - (person 4) I'm sorry,
  • 33:03 - 33:05
    that's just incredibly playing
    being ignorant and that...
  • 33:05 - 33:08
    - Okay. Go for it.
    - (person 4) That's an absolute terrible,
  • 33:08 - 33:10
    terrible mischaracterization
    of the political situation in Myanmar.
  • 33:10 - 33:11
    Okay. Go for it.
  • 33:12 - 33:15
    (person 4) Anyways, so basically
    what it is is that the country--
  • 33:16 - 33:17
    in the Burmese language
  • 33:17 - 33:20
    the country can be referred to as
    Myanma or Bama.
  • 33:20 - 33:21
    Yep.
  • 33:21 - 33:23
    Myanma tends to be a more
    formal register
  • 33:23 - 33:25
    and Bama tends to be
    a little bit more informal register
  • 33:25 - 33:28
    but both are acceptable terms
    for the country.
  • 33:31 - 33:35
    The term Burma came obviously
    from the term Bama,
  • 33:36 - 33:38
    but what happened was
  • 33:38 - 33:40
    there is no official...
  • 33:42 - 33:48
    The country was officially referred
    to, in English, as Burma
  • 33:48 - 33:50
    up until 1988-- 1989, excuse me,
  • 33:51 - 33:53
    when the military government
    of the country
  • 33:54 - 33:56
    basically decided,
    the military junta of the country decided
  • 33:56 - 33:59
    that the country should be
    referred to as Myanma.
  • 33:59 - 34:05
    Ostensibly, this was as an attempt
    to make the country name
  • 34:05 - 34:08
    more acceptable to minorities
    within the country.
  • 34:08 - 34:10
    However, this is a bit
    of historical revisionism
  • 34:10 - 34:13
    because Myanma and Bama
    specifically refer
  • 34:13 - 34:15
    to the majority ethnicity in the country.
  • 34:15 - 34:20
    So, it was basically the government
    of Burma at the time--
  • 34:20 - 34:23
    trying to make the people
    equivalent to the country,
  • 34:23 - 34:24
    therefore implicitly saying--
  • 34:24 - 34:26
    (person 4) Almost the opposite,
  • 34:26 - 34:27
    but in a really weird way.
  • 34:27 - 34:30
    They basically declared that
    Bama was in reference to the ethnicity
  • 34:30 - 34:33
    and Myanma was in reference
    to the country,
  • 34:33 - 34:35
    when historically they both
    represent ethnicity
  • 34:35 - 34:36
    and the country.
  • 34:36 - 34:37
    That makes sense.
  • 34:37 - 34:42
    (person 4) But what happen was
    because Democrat advocates
  • 34:42 - 34:45
    within the country
    believed that the military junta
  • 34:45 - 34:47
    did not have the power
  • 34:47 - 34:49
    to be able to change
    the name of the country
  • 34:49 - 34:50
    in any language,
  • 34:50 - 34:52
    because they were not
    empowered by the people of the country.
  • 34:52 - 34:58
    and were explicitly
    a military junta that they...
  • 34:58 - 34:59
    therefore the country should continue
  • 34:59 - 35:02
    to be referred to Burma in English.
  • 35:02 - 35:06
    Because of the fact that essentially
    to call it Myanmar is essentially to say
  • 35:06 - 35:10
    the government of Burma and Myanmar
    at the time was legitimate.
  • 35:10 - 35:13
    After the fall of the-- well not fall,
  • 35:13 - 35:17
    but after like the semi return
    of civilian government in 2014,
  • 35:18 - 35:20
    this question came up,
  • 35:20 - 35:22
    "Okay, should we call this country
    Burma or Myanmar in English?"
  • 35:22 - 35:25
    and essentially,
    the facto leader of the country,
  • 35:25 - 35:26
    Aung San Suu Kyi,
  • 35:26 - 35:29
    said that there's nothing
    in the Burmese constitution
  • 35:29 - 35:31
    that says you know,
    what you should call it in English
  • 35:31 - 35:33
    so call it whatever you want.
  • 35:33 - 35:34
    I mean the name of the country
  • 35:34 - 35:39
    is officially the Union of Myanma
    in Burmese,
  • 35:39 - 35:41
    but as far as in English
    you can call it whatever you want.
  • 35:41 - 35:45
    But generally before the return
    of the civilian government in Burma,
  • 35:46 - 35:48
    to refer to it is as Myanmar
    was essentially
  • 35:48 - 35:52
    to legitimize the military government.
  • 35:52 - 35:54
    And so therefore,
  • 35:54 - 35:57
    to call it Burma was generally considered
    to be a specific political act
  • 35:57 - 35:59
    to not give that government legitimacy.
  • 35:59 - 36:03
    Yeah. So, I'm not saying that
    that isn't a rationale for it.
  • 36:03 - 36:06
    I'm saying that
    on the English Wikipedia specifically,
  • 36:06 - 36:11
    the page went through seven requested
    move discussions
  • 36:11 - 36:15
    over four years
    and a mediation cabal decision,
  • 36:15 - 36:17
    and an attempted structured mediation,
  • 36:17 - 36:21
    and a review of one the closures
    of the move discussion,
  • 36:21 - 36:24
    and that when you look
    at the discussions,
  • 36:24 - 36:27
    most of the sort of argument
    back and forth
  • 36:27 - 36:29
    is not about
    the nuanced political situation
  • 36:29 - 36:30
    of the country
  • 36:30 - 36:34
    but it's instead about
    what is the common name in media sources
  • 36:34 - 36:36
    and what do
    different institutions call it.
  • 36:36 - 36:38
    And that when you look at the discussion,
  • 36:38 - 36:43
    you can see a clear point
    where pretty much every news organization
  • 36:43 - 36:45
    that isn't the BBC
    in the English Language,
  • 36:45 - 36:48
    that's considered like a major
    western news source
  • 36:48 - 36:50
    has switched their language sources,
  • 36:50 - 36:54
    and the debate
    essentially becomes a debate
  • 36:54 - 36:57
    of whether we should listen
    to the Wall Street Journal
  • 36:57 - 36:58
    or the BBC.
  • 36:59 - 37:03
    So the point I'm making
    is not about the specific politics
  • 37:03 - 37:05
    of the situation, but instead the fact
  • 37:05 - 37:08
    that it's really easy for those decisions
  • 37:08 - 37:13
    to actually become almost a proxy dispute
    of how much do we love the BBC,
  • 37:14 - 37:16
    and that when you look at the discussions
  • 37:16 - 37:18
    you see this really nice case study
  • 37:18 - 37:22
    in the issues of having
    those conversations
  • 37:22 - 37:26
    and having those nuanced,
    and often insider perspectives
  • 37:26 - 37:29
    when most of the discussions
    are centered around
  • 37:29 - 37:30
    how much we love the BBC
  • 37:30 - 37:34
    and are coming from people
    who are outside the context.
  • 37:34 - 37:36
    So, it's not--
  • 37:36 - 37:37
    My point in all of this is basically
  • 37:37 - 37:41
    that even if you're not fighting
    20,000 people,
  • 37:42 - 37:45
    even if you're only arguing
    with 20 people,
  • 37:45 - 37:47
    probabilistically, 19 of them
  • 37:47 - 37:51
    are going to be people
    who have very strong opinions,
  • 37:51 - 37:53
    who don't necessarily bear
    any negative consequences
  • 37:53 - 37:56
    of whichever change happens,
  • 37:56 - 38:00
    but have a particular world view
    and have decided to stick in it,
  • 38:00 - 38:04
    and so the proposals
    by the LGBTQ+ group
  • 38:04 - 38:06
    to change the Wikidata criteria
  • 38:06 - 38:09
    might be amazing, I might love them,
    I might not love them,
  • 38:09 - 38:11
    I haven't read them.
  • 38:12 - 38:15
    But the base premise of this is...
  • 38:15 - 38:18
    We got the people who show up
    on Wikidata right now,
  • 38:18 - 38:22
    and those are the representatives
    of all queer people
  • 38:22 - 38:26
    and this is the universal rule
    of what should be done
  • 38:26 - 38:28
    with the content of all queer people
  • 38:28 - 38:31
    is almost a microcosm
    of the same problem.
  • 38:32 - 38:34
    - (moderator) We have another question.
    - Yep.
  • 38:34 - 38:36
    (person 5) Hi.
  • 38:36 - 38:38
    I think there's another problem
  • 38:38 - 38:43
    with the consensus-based approach we have,
  • 38:43 - 38:46
    is that sometimes we have consensus
  • 38:46 - 38:49
    on really difficult issues
    on how to deal with that
  • 38:49 - 38:53
    and [inaudible] that on Wikidata,
    and nobody is reading the discussion.
  • 38:54 - 38:56
    Typically, the project Names,
  • 38:56 - 39:01
    which is a really, really old
    WikiProject on Wikidata--
  • 39:01 - 39:05
    and names are a really,
    really complicated issue in the world.
  • 39:05 - 39:08
    Not every people of the world
    have a given name,
  • 39:08 - 39:12
    not every people have a family name,
    not, well, you have an idea.
  • 39:12 - 39:15
    And there are so many
    writing systems out there,
  • 39:15 - 39:18
    and we have, actually, a system
  • 39:18 - 39:22
    which was working
    for many cases in the world
  • 39:22 - 39:24
    on how to use properties,
  • 39:24 - 39:26
    what items should look like,
  • 39:26 - 39:28
    how to link these together
    and everything--
  • 39:28 - 39:30
    We have eight pages--
  • 39:30 - 39:34
    nobody is reading that,
    and someone just added
  • 39:34 - 39:39
    Latin script family names
    to a Chinese researcher.
  • 39:40 - 39:44
    So, we don't have the names
    of these researchers
  • 39:44 - 39:49
    but we know for sure
    that the value added was wrong.
  • 39:49 - 39:50
    I don't have the correct value,
  • 39:50 - 39:52
    but I know this one
    is not the correct value.
  • 39:53 - 39:57
    And it's not just discussing the issue
  • 39:57 - 39:59
    because we have big discussions
  • 39:59 - 40:01
    and we have actually modeling
  • 40:01 - 40:08
    which is mostly working on
    and even qualifier on things to deal
  • 40:08 - 40:10
    with more complicated cases
  • 40:10 - 40:14
    but people are just,
    "Oh, given names suggest a property,
  • 40:14 - 40:15
    I will just add that."
  • 40:16 - 40:18
    - No.
    - Yeah.
  • 40:18 - 40:21
    I think it's not just
    how to model thing,
  • 40:21 - 40:25
    it's really how to explain
    to people the model,
  • 40:25 - 40:31
    and that's a technical part--
    we could have tools with suggestions
  • 40:31 - 40:35
    and I think the constraint thing
    which went live last year
  • 40:35 - 40:36
    is a great thing for that.
  • 40:36 - 40:40
    But even when we know to model thing,
  • 40:40 - 40:45
    it's how to make
    this model known to people.
  • 40:45 - 40:49
    That's a bit technical issue
    on how to do that better.
  • 40:54 - 40:55
    (moderator) So, there was just remark.
  • 40:55 - 40:58
    There's no real question for you?
  • 40:58 - 41:00
    Or that's a question to you?
  • 41:00 - 41:02
    - How to do that.
    - (person 5) Yeah, it's a question.
  • 41:03 - 41:06
    (person 5): Sorry,
    even if we have the discussion,
  • 41:06 - 41:07
    (moderator) Yeah, sure.
  • 41:08 - 41:11
    (person 5) My question,
    if I was not clear, is that
  • 41:11 - 41:13
    even when everyone is in agreement
  • 41:13 - 41:15
    on how to model complicated cases,
  • 41:15 - 41:20
    how do we make technically
    the model known for project
  • 41:20 - 41:22
    with the scope of Wikidata,
  • 41:22 - 41:27
    so people are not adding
    the wrong value in good faith?
  • 41:27 - 41:30
    Because our problem is both.
  • 41:30 - 41:34
    We have trouble
    modeling complicated realities,
  • 41:34 - 41:39
    and we have trouble explaining
    to users, how to follow the model
  • 41:39 - 41:41
    we actually have.
  • 41:41 - 41:42
    Yep.
  • 41:43 - 41:46
    I will say that
    if I could solve that problem
  • 41:46 - 41:48
    which is to reframe it,
  • 41:48 - 41:53
    how to reliably and consistently
    enculture new users
  • 41:53 - 41:57
    into having the same view
    and understanding
  • 41:57 - 42:00
    of the project space,
  • 42:00 - 42:02
    then they would let me graduate
  • 42:02 - 42:03
    and also give me a job.
  • 42:03 - 42:09
    It's the second oldest problem
    in internet spaces is how to do that.
  • 42:09 - 42:12
    The oldest problem is writing a system
  • 42:12 - 42:14
    that will automatically detect insults.
  • 42:16 - 42:19
    I will say that...
  • 42:19 - 42:21
    You can look back at Wikipedia,
  • 42:21 - 42:23
    or before that,
    there was the phenomenon
  • 42:23 - 42:27
    of eternal September on Usenet
  • 42:27 - 42:30
    which was, "Oh these people keep--
    AOL disks have gone everywhere
  • 42:30 - 42:31
    and now there's newcomers
  • 42:31 - 42:34
    all the time who don't know
    how things work around here,
  • 42:34 - 42:38
    and everything is drowning
    in people hitting "Reply All."
  • 42:40 - 42:42
    Generally speaking,
    the place that I would look for that
  • 42:42 - 42:48
    is there is a discipline called,
    "Computer-supported collaborative work,"
  • 42:48 - 42:50
    and one of their big questions
  • 42:50 - 42:54
    is this question of onboarding,
    and of like...
  • 42:55 - 42:58
    making the culture known to people.
  • 42:58 - 43:01
    But it may not be something
    that is directly solvable,
  • 43:01 - 43:03
    or that we want to directly solve, right?
  • 43:03 - 43:07
    So, Susan Leigh Star
    who wrote Sorting Things Out,
  • 43:07 - 43:08
    one of her other contributions
  • 43:08 - 43:13
    was generally the study of infrastructures
  • 43:13 - 43:15
    of which I would argue Wikidata
    is definitely one,
  • 43:16 - 43:19
    and of the things that she argued
  • 43:19 - 43:22
    was that infrastructures
    make themselves known
  • 43:22 - 43:23
    through using them.
  • 43:23 - 43:28
    So like, basically the only way
    to work out how a system works
  • 43:28 - 43:32
    is to engage with it,
    and trip over, and fall flat on your face,
  • 43:32 - 43:35
    and learn not to fall over that way again.
  • 43:35 - 43:40
    And I think everyone everywhere,
    including new users,
  • 43:41 - 43:43
    including people
    coming from other projects,
  • 43:43 - 43:49
    wants a way of approaching this
    where they don't have to fall over.
  • 43:49 - 43:51
    But I'm not sure if that exists,
  • 43:51 - 43:56
    and I think that a better place
    we might look is maybe to ask
  • 43:56 - 43:59
    what are the consequences
    of people screwing up
  • 43:59 - 44:03
    and how do we make screwing up
    an understandable
  • 44:03 - 44:07
    and a more expected component
    of the user experience.
  • 44:08 - 44:10
    (moderator) Okay thanks.
    Next question.
  • 44:11 - 44:12
    (person 6) Thank you.
  • 44:13 - 44:15
    So, first, thank you very much
    for your presentation to us.
  • 44:15 - 44:17
    Again, someone said, eye-opening.
  • 44:18 - 44:23
    I was looking at the specific item
    on transsexualism,
  • 44:24 - 44:27
    and it's actually even more interesting
  • 44:27 - 44:29
    because I was looking
    at different Wikipedias,
  • 44:29 - 44:32
    how they dealt with the issue.
  • 44:32 - 44:35
    And I just look at three.
  • 44:35 - 44:38
    So, apparently, what
    we are seeing on Wikidata
  • 44:38 - 44:44
    actually reflects pretty much
    what happened to some extent
  • 44:45 - 44:47
    at some level on English Wikipedia,
  • 44:47 - 44:51
    whereas if you look
    at Portuguese Wikipedia,
  • 44:51 - 44:55
    the actual item connects to transgender,
  • 44:56 - 45:02
    and on French Wikipedia
    it connects to trans identity
  • 45:03 - 45:08
    whereas transsexualism is a redirect
    in both Portuguese and French.
  • 45:09 - 45:15
    And I was looking at the history
    of editing on the Wikidata item,
  • 45:15 - 45:19
    and if you look at--
    there were several sort of wars
  • 45:19 - 45:22
    but the discussion page
    is actually only one line,
  • 45:23 - 45:26
    but there were several conflicts
    between editors,
  • 45:26 - 45:28
    particularly with the French
  • 45:28 - 45:32
    that were opposing
    the use of transsexualism.
  • 45:32 - 45:36
    If you look at the names of the items
    on each language,
  • 45:36 - 45:39
    the only one on which
    you don't have transsexualism
  • 45:39 - 45:41
    is French for trans identity,
  • 45:41 - 45:45
    and then someone came,
    and did what you said about
  • 45:45 - 45:47
    it's the opposite [inaudible],
    trans identity,
  • 45:47 - 45:51
    and then there is a different item that--
  • 45:51 - 45:52
    Oh yeah.
  • 45:52 - 45:56
    (person 6) So, it's a complete
    global fight over...
  • 45:56 - 45:59
    basically it's reverberating conflicts
  • 45:59 - 46:03
    that are apparently also
  • 46:03 - 46:08
    the manifestations of conflicts
    that happen on each Wikipedia.
  • 46:08 - 46:12
    Yes, that also reflect conflicts
    in local cultures,
  • 46:12 - 46:14
    and in different parts
    of the world, yeah...
  • 46:15 - 46:17
    And I'd argue that, I mean,
  • 46:17 - 46:21
    I'm British so I have a tendency
    to say, "Wait, fighting with the French?"
  • 46:21 - 46:22
    "Yes, Please!"
  • 46:22 - 46:23
    (laughter)
  • 46:23 - 46:27
    But I'd say there's almost something
    more fundamental than that,
  • 46:27 - 46:29
    and you can make an argument
    in the other direction.
  • 46:29 - 46:33
    I can, as a trans person, make an argument
    in the other direction and say,
  • 46:33 - 46:36
    "Actually, it's the French
    and Portuguese who have it wrong."
  • 46:36 - 46:38
    Because the actual question is
  • 46:38 - 46:40
    is the entry transsexualism about
  • 46:40 - 46:45
    the medical classification,
    or the state of being,
  • 46:45 - 46:48
    or the historic medical classification,
  • 46:48 - 46:51
    or the historic term
    for the state of being,
  • 46:51 - 46:54
    or are these different entries,
    or the same entries?
  • 46:54 - 46:56
    When are things distinct enough
    to be different objects,
  • 46:56 - 46:59
    and how do we negotiate that fight
  • 46:59 - 47:01
    between people who think
    that the medical status
  • 47:01 - 47:04
    and the identity are the same thing,
    or different things.
  • 47:05 - 47:08
    But yeah, there is no easy answer
  • 47:08 - 47:10
    but yeah, I suspect
    if you look at a lot of these examples,
  • 47:10 - 47:13
    and if you look
    at a lot of controversies,
  • 47:13 - 47:14
    generally on Wikidata
  • 47:14 - 47:18
    what you're going to see is
    these fights over...
  • 47:18 - 47:19
    These almost negotiations
  • 47:19 - 47:21
    are the local community norms,
  • 47:21 - 47:24
    and beyond that are the cultural norms.
  • 47:24 - 47:26
    Which is a problem because again,
  • 47:26 - 47:29
    when we're talking about marginalized
    or minority groups,
  • 47:29 - 47:34
    we would expect them to also
    be marginalized within Wiki communities,
  • 47:34 - 47:37
    and also within Wikidata,
  • 47:37 - 47:38
    and so Wikidata is sort of...
  • 47:40 - 47:43
    building on these
    preexisting prioritizations
  • 47:43 - 47:45
    of whose knowledge matters,
    and under what circumstances
  • 47:45 - 47:47
    and in what form.
  • 47:48 - 47:51
    (person 7): I wanted to touch
    on something you mentioned.
  • 47:52 - 47:58
    Everything is complex
    and I think modeling it right,
  • 47:58 - 48:00
    getting it right on Wikidata
  • 48:00 - 48:03
    is not the sum of the issue.
  • 48:03 - 48:05
    As you said, Wikidata is infrastructure,
  • 48:05 - 48:09
    and as [Hermione] said,
  • 48:09 - 48:13
    we have gotten it right perhaps
    in some things, in some other topics,
  • 48:13 - 48:15
    and still can't
    actually practice it right.
  • 48:15 - 48:16
    Yep.
  • 48:16 - 48:18
    (person 7): So I want to suggest that
  • 48:18 - 48:22
    this is a prevalent condition
    of the human race.
  • 48:23 - 48:28
    And however well we model something,
    even if we model gender
  • 48:29 - 48:33
    ten times more complexly
    than we do today,
  • 48:33 - 48:36
    most SPARQL queries involving gender
    would not bother
  • 48:36 - 48:38
    - with the qualifiers right?
    - Yeah.
  • 48:38 - 48:42
    And would still generate very,
    very flattened, very simplified results.
  • 48:42 - 48:47
    Google's use of our data
    in the infamous Google infoboxes
  • 48:47 - 48:50
    will also flatten the data
    and ignore qualifiers.
  • 48:50 - 48:52
    That is not going to change.
  • 48:52 - 48:55
    Wikidata will continue to be used
    in simplistic ways.
  • 48:56 - 48:57
    Indeed, the majority of use,
  • 48:57 - 48:59
    probably, will be that simplistic thing.
  • 49:00 - 49:04
    My point is, it's probably not fixable
  • 49:04 - 49:05
    and we shouldn't stop trying.
  • 49:07 - 49:09
    I mean we should try to get it right
  • 49:09 - 49:13
    and understand that a lot of the use is,
    despite our best efforts,
  • 49:13 - 49:15
    going to be simplistic and wrong.
  • 49:15 - 49:16
    Yep. I would agree with that.
  • 49:17 - 49:19
    I guess I would say that
  • 49:19 - 49:20
    you know,
    it's not about like,
  • 49:20 - 49:24
    my issue here is not about
    it being you know,
  • 49:24 - 49:26
    there is one true
    incredibly complex answer.
  • 49:28 - 49:30
    At some point I just gave up
  • 49:30 - 49:36
    even in my thesis which is about
    transness and technology
  • 49:36 - 49:38
    of defining transness.
  • 49:38 - 49:39
    I just gave up.
  • 49:40 - 49:45
    And I instead took what is referred to
    as a pragmatist view,
  • 49:45 - 49:47
    which is basically that
    it is whatever the people
  • 49:47 - 49:49
    in the situation that you're studying
    believe it to be,
  • 49:49 - 49:53
    and however they construct
    the world as if it were,
  • 49:55 - 49:57
    and what I'm getting at this
  • 49:57 - 49:59
    is not that there is
    some universal definition
  • 49:59 - 50:02
    of anything which,
    if sufficiently complicated,
  • 50:02 - 50:05
    would be enough,
  • 50:05 - 50:09
    but instead that I think
    that the scale is the problem,
  • 50:09 - 50:11
    and the universalism is the problem.
  • 50:13 - 50:15
    Maybe we should keep trying,
  • 50:15 - 50:16
    or maybe we should stop.
  • 50:16 - 50:19
    Maybe we should instead say
    that, again,
  • 50:19 - 50:23
    there should be a Wikibase install
    in every self-defined community
  • 50:23 - 50:28
    that wants it and they can define things,
    and articulate things
  • 50:28 - 50:30
    to their own satisfaction.
  • 50:31 - 50:33
    But then we end up in more political
  • 50:33 - 50:37
    and fraught debates of a reformist
    versus radical actions,
  • 50:37 - 50:40
    and how you open a box
    with a crowbar that's already inside it,
  • 50:40 - 50:42
    and I end up quoting Foucault for an hour,
  • 50:42 - 50:44
    and everyone gets sad.
  • 50:45 - 50:47
    Including me because I hate Foucault.
  • 50:47 - 50:49
    So this might be a discussion
    for elsewhere.
  • 50:49 - 50:52
    But generally agreed, I just--
  • 50:52 - 50:54
    I would raise questions about
  • 50:54 - 50:56
    whether we should keep trying
  • 50:56 - 50:58
    for a better form of universalism,
  • 50:58 - 51:00
    or whether the problem
    is that universalism.
  • 51:01 - 51:03
    I'm guessing we have
    a time for one more? Yeah.
  • 51:04 - 51:08
    (person 8): This is a short question,
    possibly complex answer.
  • 51:08 - 51:10
    One of the most popular
  • 51:10 - 51:15
    and used properties is sex
    or gender on Wikidata.
  • 51:16 - 51:18
    Could you speak to whether you find
  • 51:18 - 51:25
    that merging useful,
    productive, problematic?
  • 51:26 - 51:28
    Sure, I mean I think it's always
  • 51:28 - 51:30
    going to be reductive
    cause it's a merging.
  • 51:31 - 51:36
    But I also think
    that it is deeply tiresome
  • 51:37 - 51:39
    in a way that's kind of interesting
  • 51:39 - 51:42
    insofar as it reveals
    the limitations of Wikidata,
  • 51:42 - 51:44
    though Wikidata claims to be building
  • 51:44 - 51:47
    towards this like big objective
    set of knowledge,
  • 51:47 - 51:49
    but ultimately kind of
    smushed these things together
  • 51:49 - 51:53
    because I mean they haven't asked
  • 51:53 - 51:56
    most people who have entries
    what their gender is,
  • 51:56 - 51:57
    and/or what their sex is,
  • 51:57 - 51:59
    and so they just merge them
  • 51:59 - 52:02
    so that inference is easier.
  • 52:02 - 52:05
    But generally speaking, yeah,
    I say that the merging
  • 52:05 - 52:09
    of the two together
    is reductive and dangerous
  • 52:09 - 52:10
    but...
  • 52:12 - 52:13
    Again it's not...
  • 52:14 - 52:15
    There is no good way of doing it.
  • 52:15 - 52:18
    I think this is a particularly bad way
  • 52:18 - 52:22
    of treating them
    as interchangeable things,
  • 52:23 - 52:26
    and treating them
    as forever-linked things,
  • 52:28 - 52:32
    but I can't suggest a better way
    that remains--
  • 52:32 - 52:34
    that continues to have Wikidata
  • 52:34 - 52:36
    even tracking this information
    or the information contained
  • 52:36 - 52:38
    in that at all.
  • 52:39 - 52:41
    (moderator): Okay.
    I think we have to conclude here.
  • 52:41 - 52:42
    I still saw some raised hands
  • 52:42 - 52:44
    so hopefully you'll be around.
  • 52:44 - 52:46
    Yeah. I am a grad student.
  • 52:46 - 52:47
    I have functionally no life, so...
  • 52:47 - 52:48
    (laughter)
  • 52:48 - 52:52
    (moderator): Perfect. Okay.
    So please come and talk.
  • 52:52 - 52:54
    Thank you very much.
  • 52:54 - 52:56
    (applause)
Title:
cdn.media.ccc.de/.../wikidatacon2019-15-eng-Keynote_Questioning_Wikidata_hd.mp4
Video Language:
English
Duration:
53:02

English subtitles

Revisions