[intro music]

Herald: Wikidata for (Data) Journalists by Elisabeth Giesemann.

Elisabeth Giesemann: So our agenda for today is that we will look at the key points of data journalism. We will quickly explain what Wikidata is, what tools you can use inside of Wikidata for data visualization, and what other third-party tools there are for your research. Then we will have a look at critical research done with Wikidata. And finally, we will take a critical look at the data of Wikidata itself.

Key points of data journalism are that you want to interview a dataset, so you want to find connections, correlations and causalities behind the data. Also, you want to visualize the data in a compelling way and you want to write your own story. You want to find a new spin and a new look at the facts, and all of these things you can do with Wikidata. At Wikimedia Deutschland, we want to support evidence-based reporting; that's why we want to support you in using Wikidata. Also, data journalism helps you to tailor your story to your users or your readers. Data journalism helps you to create visual storytelling instead of walls of text. And this, again, helps you to convey facts faster and much more easily, and that makes your story much more inclusive.

So how do you get to a story with Wikidata? You want to find and recognize patterns in a dataset. You can search for geographical data, you can search for similarities and differences in the data, and you can also search for missing data, because that also exists in Wikidata. You can visualize your findings with the tools that you find in the Wikidata Query Service. And what's most important is that you can connect to the Wikidata community and find people who are working on a similar subject or have a similar research question to the one that you have.

So I included this visualization to show you that data is only the beginning of your story and the path that you will take.
We want you to use the data in Wikidata to create a compelling story and thereby contribute value and your idea about what's in the data. Because data is a lot, but it's not everything. As we've seen in the last months, many people aren't convinced by facts. Also, there is a lack of time and there is a lack of data literacy in our society. It's not always easy to understand the complexity of historical events and developments, or to understand the complexity of medical data or demographic changes. So it is important to have a storytelling aspect to your data, to have good visualizations and an easy-to-understand approach to convey the significance of your data and your story. And finally, it is important to remain transparent and clear about the use and analysis of the data.

So what is Wikidata? Wikidata is a free linked database that can be read and edited by both humans and machines, so it is a database of linked open data. That means that the data doesn't just sit there in tables. It can be connected and combined with other data found on Wikidata. As such, it is a realization of the semantic web as dreamt of by Tim Berners-Lee, and Wikidata also won a prize for its realization of the semantic web.

We just celebrated Wikidata's 8th birthday. It currently holds 90 million items and has 44,000 active users and contributors, which makes it the most edited Wikimedia project. It was initially thought of as support for the other projects of the Wikimedia ecosystem and seen as a central storage for the structured data of sister projects like Wikivoyage, Wikisource and the most famous Wikimedia project, Wikipedia. But it also has another function, which is to provide free and open data to the Internet, and that became really huge. As already said, we now have more than 90 million data items on Wikidata.
A colleague of mine created this map, and you can see here the geolocation data that is in Wikidata. We are very proud that it's distributed all over the world, but we also take it with a grain of salt, because as you can see, it's very bright in Europe and on the east and west coasts of the US, but there are very dark spots where we can't record the knowledge in the same way as we do in our Western societies. That brings us to the question of what knowledge equity is and how we can actually best serve everybody in our global society.

So how does it work? Wikidata has items, which are real things or concepts in the real world, like Berlin, Barack Obama or helium, and these items are identified with an ID, the QID. So Q76 or Q... I can't read the number now. These items have labels, descriptions, aliases and sitelinks. Labels are the names of the item in all of the languages that Wikidata currently holds; those are around 300. Descriptions are short texts that describe what the item is about, and aliases exist because sometimes one item has several names, etc.

An item also has properties. Those are used to label data, like where a person was born, their date of birth or death, or the location of a specific building. Statements hold information in properties, so P47, "shares border with", points to another country, or there is the population. Statements also have qualifiers to expand the information, and they have references, which is very important because for scientific research you want to have those references. So here we see again our item, Berlin, Q64. The property is population, and the value is 3.7 million.

So what's new about research with Wikidata is that you can ask your own questions. Before, you would go to a library, and librarians are awesome, but they would give you books with specific facts in them, and you would consume them and try to use them for your research.
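
[Editor's note: the item, property and statement structure described above can be sketched in a few lines of Python. This is a simplified illustration under stated assumptions, not the actual Wikidata JSON format or any library's API. The class names, the `values_of` and `sparql_for` helpers, the placeholder qualifier value and the example.org reference URL are all hypothetical; only Q64 (Berlin), P1082 (population), P585 (point in time), P854 (reference URL) and the 3.7 million figure from the talk are real identifiers or values.]

```python
from dataclasses import dataclass, field


@dataclass
class Statement:
    """One claim about an item: a property, a value, and its context."""
    property_id: str                                  # e.g. "P1082" (population)
    value: object
    qualifiers: dict = field(default_factory=dict)    # extra detail on the claim
    references: list = field(default_factory=list)    # where the claim comes from


@dataclass
class Item:
    """A thing or concept, identified by its QID."""
    qid: str
    labels: dict = field(default_factory=dict)        # language code -> name
    aliases: dict = field(default_factory=dict)       # language code -> other names
    statements: list = field(default_factory=list)

    def values_of(self, property_id):
        """Return every value stored for the given property."""
        return [s.value for s in self.statements if s.property_id == property_id]


def sparql_for(item, property_id):
    """Build a SPARQL query string asking for one property of one item."""
    return f"SELECT ?value WHERE {{ wd:{item.qid} wdt:{property_id} ?value . }}"


# Berlin as described in the talk. The qualifier value and the reference URL
# are placeholders, not data taken from Wikidata.
berlin = Item(
    qid="Q64",
    labels={"en": "Berlin", "de": "Berlin"},
    statements=[
        Statement(
            property_id="P1082",
            value=3_700_000,
            qualifiers={"P585": "<date of the count>"},
            references=[{"P854": "https://example.org/population-source"}],
        )
    ],
)

print(berlin.values_of("P1082"))
print(sparql_for(berlin, "P1082"))
```

[The string returned by `sparql_for` is written in SPARQL, the query language of the Wikidata Query Service, and could be pasted into query.wikidata.org to ask the live database the same question.]
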
At Wikidata you can ask very specific questions that nobody else came up with before. So for your research, you want to do your own Wikidata queries; that's what we have the Wikidata Query Service for. The good news is that you don't have to learn Python or R or become a data scientist, but you do want to learn a bit of SPARQL. We included a few resources here in this presentation, and there's also going to be a talk given by my colleague Lucas on the 29th on how to query Wikidata with SPARQL. We also have a guided tour of Wikidata on our website, which I can recommend.

OK, so, as said, once you have queried your data, you can visualize your results for more compelling storytelling. There are several ways of doing this, and I'm going to show you some of them just to give you an idea. You could, for instance, ask the query service to show you airports that are named after a person and color-code them according to gender. Gender of the person, not the airport, obviously. You can ask the query service to show you everything connected to the item Berlin. You can ask it to show you the population of the countries that border Germany and how it developed. You can also ask the query service to show you the most common cause of death among noble people. Or here it shows you a historical overview of space probes. Or all of the children and grandchildren of Genghis Khan.

So we had a look at the visualizations inside of Wikidata's Query Service, but there are also tools that use Wikidata's data for their own visualizations. And I'm going to show you some of them now. So here is Histropedia, which makes timelines of historical events using data from Wikidata. This is Inventaire. Basically, it lets you create your own private library and then uses the data from Wikidata to describe the publications. Here is "Ask me anything".
That's done by different researchers in Europe, and it lets you pose questions in natural language to Wikidata so you don't have to use the query service. That's a way to use Wikidata that's also used by a lot of voice assistants like Siri and Alexa. And here you have Scholia, which is basically a platform for scientific publications that are published under open access, and it can answer questions like who published what paper, with whom and when, or who wrote the first paper on COVID, when it was published, etc. And here we have "Sum of All Paintings". Basically, it's a database that collects all of the paintings in the world and lists their metadata so you can combine it in your own specific way.

So I showed you a couple of examples of what you could do, and I want to hint at other researchers who did great stuff with Wikidata and used it for very cool storytelling. If my slides work... OK, here we go. So, "Women's representation and voice in media coverage of the coronavirus crisis": that's a study done by a researcher called Laura Jones regarding the representation of female experts within the coverage of coronavirus. It uses evaluations of Wikipedia and Wikidata to show how much representation of female experts there was. And, as we see, it's not a lot.

Finally, there is another great example I want to tell you about. It's a project called Enslaved.org. It's a linked open data platform based on Wikibase, which is the software behind Wikidata, and it basically collects and connects data related to the transatlantic slave trade. So, the people who suffered under the slave trade and the records that were made by the people active in this slave trade, that data is collected.
It has been collected in several databases, and Enslaved built one large database to connect them and rebuild the stories, which I think is a really great way to humanize people who have been dehumanized with data. Like you can see here, they collect data from newspapers and from the slaveholders to recount the stories of individuals.

So finally, I also want to talk to you about one thing in Wikidata that is always on our minds, which is that Wikidata is not perfect. I highly recommend the talk by Os Keyes, "Questioning Wikidata", in which it is explained that all classification systems are inherently dangerous, and Wikidata is a large encyclopedic wiki classification system which makes choices, ethical and political choices, about what is notable and about how to categorize information. And these choices reduce complexity and also reduce specific forms of history, like oral history. This reduction has consequences. As you know, Wikidata is used by many programs, apps and voice assistants, and what we store in Wikidata and how we store it really matters. So we ask ourselves, what is encyclopedic knowledge? And how can we organize it in a more inclusive way? Encyclopedic knowledge is a Western concept, and we can and must do better than just using our own Western view to organize the world. But then the wiki principle also applies: we have a huge community behind Wikidata that helps us to make these decisions, and you can also become a part of this by researching Wikidata, using it for your work and also contributing your research.

So once again, I want to tell you, you can use Wikidata as a tool for your storytelling. Wikidata can help you find connections between data. Wikidata can help you build visualizations in its query service. You can ask questions about historical data correlations more critically than you could before.
But there are also downsides to Wikidata, because it is an encyclopedic way of organizing Western knowledge. So this was only a start. I'm looking forward to our Q&A session now, and if you have further questions, concerns or ideas, you can contact me and my colleagues, and you can also contact me individually. Thank you.

Herald: Hello and welcome, Elisabeth. Thank you very much for your interesting talk. That was a very great introduction.

Elisabeth: Hi. Yeah, thanks for having me. I'm happy that I was able to talk a bit about Wikidata and how you could do storytelling with it. I wanted to add that, obviously, you can ask me questions now, but I also want to hint at the great introduction to Wikidata that two of my colleagues gave yesterday, which is already online, and tomorrow there will be a query service workshop where you can learn a bit more in depth how to query Wikidata.

Herald: Yeah, that's a very good hint. There are actually two questions in the chat right now. The first one is: are your slides going to be published? Because people are interested in your links to the tutorials, obviously.

Elisabeth: Yes, I asked before. I think the talk will be published, and the slides too. Is there a Wikipaka board where I can put it? Otherwise, I can also put a link on our Twitter account, Wikimedia Deutschland. And yeah...

Herald: I think Twitter for now would probably be the best idea. I actually have to check on the Wikipaka board, but we will let you know where you can find everything.

Elisabeth: I'll put it on the Wikimedia Deutschland Twitter. It's @wmde, I think.

Herald: We will also retweet it, obviously. You will find it, I promise.

Elisabeth: OK.

Herald: There's another question. What resources would you recommend for self-studying the writing of queries for query.wikidata.org?

Elisabeth: Mhm. I put some links in the slides. There is... yeah, we have a few tutorials on Wikidata.
There was also, a couple of months ago, a very nice and very easy tutorial published by Wikimedia Israel. So we didn't do it, but I can recommend it. It's a very low-key introduction to your first queries.

Herald: OK. We will also publish that somehow. I have a question for you as well. You mentioned that Wikidata is a great way of meeting other people who are working on similar topics. So is there some kind of greater community of journalists using Wikidata?

Elisabeth: So far, the community is mostly research-based. That's also why we wanted to reach out here. So I would recommend getting in touch with the community there regarding the research topics that you have. And you can also get in touch with us and we will connect you. I have a noise in my ear, but I hope it's only me.

Herald: Well, I don't have it, so it might just be you, but I feel like there might also be an echo on the stream. That's what people in the chat are saying.

Elisabeth: Oh, OK.

Herald: So I don't have any other questions in the chat, and since there seems to be an echo on the stream, I don't want to annoy people any further. So I would suggest that everyone who has further questions for you can meet you in our BigBlueButton meetup room, which I will be posting in the chat right now, and we will continue our program here at 2:20 with another talk, about Flutter, by "The one with the braid". So I'm saying bye for now.

Elisabeth: Thanks, bye.

Herald: Bye.

[outro music]

Subtitles created by c3subtitles.de in the year 2021. Join, and help us!