[Script Info] Title: [Events] Format: Layer, Start, End, Style, Name, MarginL, MarginR, MarginV, Effect, Text Dialogue: 0,0:00:06.30,0:00:07.36,Default,,0000,0000,0000,,(Lydia) Thank you so much. Dialogue: 0,0:00:07.36,0:00:11.24,Default,,0000,0000,0000,,So, this conference,\None of the big themes is languages. Dialogue: 0,0:00:14.22,0:00:18.51,Default,,0000,0000,0000,,I want to give you an overview\Nof where we actually are currently Dialogue: 0,0:00:18.51,0:00:19.81,Default,,0000,0000,0000,,when it comes to languages Dialogue: 0,0:00:20.26,0:00:22.17,Default,,0000,0000,0000,,and where we can go from here. Dialogue: 0,0:00:29.04,0:00:32.58,Default,,0000,0000,0000,,Wikidata is all about giving more people\Nmore access to more knowledge, Dialogue: 0,0:00:32.58,0:00:37.17,Default,,0000,0000,0000,,and language is such an important part\Nof making that a reality, Dialogue: 0,0:00:38.20,0:00:43.29,Default,,0000,0000,0000,,especially since more and more\Nof our lives depends on technology. Dialogue: 0,0:00:44.11,0:00:48.87,Default,,0000,0000,0000,,And as our keynote speaker\Nearlier today was talking, Dialogue: 0,0:00:49.72,0:00:51.59,Default,,0000,0000,0000,,some of the technology\Nleaves people behind Dialogue: 0,0:00:51.59,0:00:55.02,Default,,0000,0000,0000,,simply because they can't speak\Na certain language, Dialogue: 0,0:00:55.32,0:00:57.57,Default,,0000,0000,0000,,and that's not okay. Dialogue: 0,0:00:58.63,0:01:02.10,Default,,0000,0000,0000,,So we want to do something about that. Dialogue: 0,0:01:02.93,0:01:05.84,Default,,0000,0000,0000,,And in order to change that,\Nyou need at least two things. Dialogue: 0,0:01:06.41,0:01:11.27,Default,,0000,0000,0000,,One is you need to provide content\Nto the people in their language, Dialogue: 0,0:01:11.27,0:01:12.96,Default,,0000,0000,0000,,and the second thing you need Dialogue: 0,0:01:12.96,0:01:15.91,Default,,0000,0000,0000,,is to provide them\Nwith interaction in their language Dialogue: 0,0:01:15.91,0:01:19.19,Default,,0000,0000,0000,,in those applications\Nor whatever it is you have. Dialogue: 0,0:01:20.37,0:01:25.28,Default,,0000,0000,0000,,And Wikidata helps with both of those. Dialogue: 0,0:01:25.28,0:01:28.41,Default,,0000,0000,0000,,And the first thing,\N{\i1}content in your language{\i0}, Dialogue: 0,0:01:28.41,0:01:30.88,Default,,0000,0000,0000,,that is basically what we have\Nin items and properties, Dialogue: 0,0:01:31.32,0:01:33.08,Default,,0000,0000,0000,,how we describe the world. Dialogue: 0,0:01:33.08,0:01:35.08,Default,,0000,0000,0000,,Now, this is certainly\Nnot everything you need, Dialogue: 0,0:01:35.08,0:01:39.29,Default,,0000,0000,0000,,but it gets you quite far ahead. Dialogue: 0,0:01:39.76,0:01:41.85,Default,,0000,0000,0000,,The other thing\Nis {\i1}interaction in your language{\i0}, Dialogue: 0,0:01:41.85,0:01:46.39,Default,,0000,0000,0000,,and that's where lexemes come into play Dialogue: 0,0:01:46.39,0:01:49.38,Default,,0000,0000,0000,,If you want to talk\Nto your digital personal assistant Dialogue: 0,0:01:49.38,0:01:54.92,Default,,0000,0000,0000,,or if you want to have your device\Ntranslate a text and things like that. Dialogue: 0,0:01:56.40,0:01:59.25,Default,,0000,0000,0000,,Alright, let's look into\N{\i1}content in your language.{\i0} Dialogue: 0,0:01:59.25,0:02:03.40,Default,,0000,0000,0000,,So what we have in {\i1}items{\i0} and {\i1}properties{\i0}. Dialogue: 0,0:02:05.41,0:02:09.70,Default,,0000,0000,0000,,For this, the labels in those items\Nand properties are crucial. Dialogue: 0,0:02:10.24,0:02:14.87,Default,,0000,0000,0000,,We need to know what this entity\Nis called that we're talking about. Dialogue: 0,0:02:15.66,0:02:19.99,Default,,0000,0000,0000,,And instead of talking about Q5, Dialogue: 0,0:02:19.99,0:02:22.18,Default,,0000,0000,0000,,someone who speaks English\Nknows that's a "human," Dialogue: 0,0:02:22.18,0:02:24.71,Default,,0000,0000,0000,,someone who speaks German\Nknows that's a "mensch," Dialogue: 0,0:02:24.71,0:02:26.37,Default,,0000,0000,0000,,and similar things. Dialogue: 0,0:02:26.37,0:02:29.74,Default,,0000,0000,0000,,So those labels on items and properties Dialogue: 0,0:02:29.74,0:02:33.62,Default,,0000,0000,0000,,are bridging the gap\Nbetween humans and machines. Dialogue: 0,0:02:33.62,0:02:35.44,Default,,0000,0000,0000,,And humans and humans Dialogue: 0,0:02:35.44,0:02:40.12,Default,,0000,0000,0000,,making more existing knowledge\Naccessible to them. Dialogue: 0,0:02:43.27,0:02:46.29,Default,,0000,0000,0000,,Now, that's a nice aspiration. Dialogue: 0,0:02:46.29,0:02:48.34,Default,,0000,0000,0000,,What does it actually look like? Dialogue: 0,0:02:48.34,0:02:49.61,Default,,0000,0000,0000,,It looks like this. Dialogue: 0,0:02:50.95,0:02:52.42,Default,,0000,0000,0000,,What you're seeing here Dialogue: 0,0:02:52.42,0:02:58.50,Default,,0000,0000,0000,,is that most of the items\Non Wikidata have two labels, Dialogue: 0,0:02:58.50,0:03:00.77,Default,,0000,0000,0000,,so labels in two languages. Dialogue: 0,0:03:01.70,0:03:03.85,Default,,0000,0000,0000,,And after that, it's one, and then three, Dialogue: 0,0:03:03.85,0:03:06.12,Default,,0000,0000,0000,,and then it becomes very sad. Dialogue: 0,0:03:06.78,0:03:08.58,Default,,0000,0000,0000,,(quiet laughter) Dialogue: 0,0:03:10.05,0:03:12.71,Default,,0000,0000,0000,,I think we need to do better than this. Dialogue: 0,0:03:14.18,0:03:15.32,Default,,0000,0000,0000,,But, on the other hand, Dialogue: 0,0:03:15.32,0:03:17.48,Default,,0000,0000,0000,,I was actually expecting this\Nto be even worse. Dialogue: 0,0:03:17.48,0:03:19.56,Default,,0000,0000,0000,,I was expecting the average to be one. Dialogue: 0,0:03:19.56,0:03:22.50,Default,,0000,0000,0000,,So I was quite happy \Nto see two. (chuckles) Dialogue: 0,0:03:24.92,0:03:26.19,Default,,0000,0000,0000,,Alright. Dialogue: 0,0:03:27.16,0:03:29.53,Default,,0000,0000,0000,,But it's not just interesting to know Dialogue: 0,0:03:29.53,0:03:33.74,Default,,0000,0000,0000,,how many labels our items\Nand properties have. Dialogue: 0,0:03:33.74,0:03:36.56,Default,,0000,0000,0000,,It's also interesting to see\Nin which languages. Dialogue: 0,0:03:38.04,0:03:43.76,Default,,0000,0000,0000,,Here you see a graph of the languages Dialogue: 0,0:03:43.76,0:03:46.84,Default,,0000,0000,0000,,that we have labels for on {\i1}Items{\i0}. Dialogue: 0,0:03:46.84,0:03:50.67,Default,,0000,0000,0000,,So the biggest part there is {\i1}Other{\i0}. Dialogue: 0,0:03:51.23,0:03:53.86,Default,,0000,0000,0000,,So I just took the top 100 languages Dialogue: 0,0:03:54.53,0:03:58.90,Default,,0000,0000,0000,,and everything else is {\i1}Other{\i0}\Nto make this graph readable. Dialogue: 0,0:03:59.54,0:04:02.14,Default,,0000,0000,0000,,And then there's English and Dutch, Dialogue: 0,0:04:03.00,0:04:04.25,Default,,0000,0000,0000,,French, Dialogue: 0,0:04:05.92,0:04:09.13,Default,,0000,0000,0000,,and not to forget, Asturian. Dialogue: 0,0:04:09.66,0:04:11.89,Default,,0000,0000,0000,,- (person 1) Whoo!\N- Whoo-hoo, yes! Dialogue: 0,0:04:13.90,0:04:16.95,Default,,0000,0000,0000,,So what you see here is quite an imbalance Dialogue: 0,0:04:16.95,0:04:20.11,Default,,0000,0000,0000,,and still quite a lot of focus on English. Dialogue: 0,0:04:21.24,0:04:24.37,Default,,0000,0000,0000,,Another thing is if you look\Nat the same thing for {\i1}Properties{\i0}, Dialogue: 0,0:04:24.37,0:04:25.100,Default,,0000,0000,0000,,it's actually looking better. Dialogue: 0,0:04:27.40,0:04:32.75,Default,,0000,0000,0000,,And I think part of that constituted\Njust being way less properties. Dialogue: 0,0:04:32.75,0:04:36.77,Default,,0000,0000,0000,,So even smaller communities\Nhave a chance to keep up with that. Dialogue: 0,0:04:36.77,0:04:39.17,Default,,0000,0000,0000,,But it's also a pretty\Nimportant part of Wikidata Dialogue: 0,0:04:39.17,0:04:41.16,Default,,0000,0000,0000,,to localize into your language. Dialogue: 0,0:04:41.16,0:04:42.38,Default,,0000,0000,0000,,So that's good. Dialogue: 0,0:04:45.75,0:04:47.84,Default,,0000,0000,0000,,What I want to highlight\Nhere with Asturian Dialogue: 0,0:04:47.84,0:04:53.70,Default,,0000,0000,0000,,is that a small community\Ncan really make a huge difference Dialogue: 0,0:04:54.45,0:04:57.08,Default,,0000,0000,0000,,with some dedication and work, Dialogue: 0,0:04:57.08,0:04:58.42,Default,,0000,0000,0000,,and that's really cool. Dialogue: 0,0:05:01.85,0:05:03.53,Default,,0000,0000,0000,,A small quiz for you. Dialogue: 0,0:05:03.53,0:05:05.49,Default,,0000,0000,0000,,If you take all the properties on Wikidata Dialogue: 0,0:05:05.49,0:05:07.69,Default,,0000,0000,0000,,that are not external identifiers, Dialogue: 0,0:05:07.69,0:05:10.36,Default,,0000,0000,0000,,which one has the most labels,\Nlike the most languages? Dialogue: 0,0:05:10.98,0:05:13.85,Default,,0000,0000,0000,,(audience) [inaudible] Dialogue: 0,0:05:13.85,0:05:16.79,Default,,0000,0000,0000,,I hear some agreement on {\i1}instance of{\i0}? Dialogue: 0,0:05:17.51,0:05:19.44,Default,,0000,0000,0000,,You would be wrong. Dialogue: 0,0:05:19.98,0:05:22.21,Default,,0000,0000,0000,,It's {\i1}image{\i0}. (chuckles) Dialogue: 0,0:05:23.23,0:05:26.37,Default,,0000,0000,0000,,So, yeah, that tells you,\Nif you speak one of the languages Dialogue: 0,0:05:26.37,0:05:28.62,Default,,0000,0000,0000,,where {\i1}instance of{\i0}\Ndoesn't yet have a label, Dialogue: 0,0:05:28.62,0:05:30.19,Default,,0000,0000,0000,,you might want to add it. Dialogue: 0,0:05:32.10,0:05:35.68,Default,,0000,0000,0000,,So it has 148 labels currently. Dialogue: 0,0:05:37.69,0:05:41.25,Default,,0000,0000,0000,,But that's just another slide. Dialogue: 0,0:05:42.63,0:05:44.16,Default,,0000,0000,0000,,This graph tells us something Dialogue: 0,0:05:44.16,0:05:49.32,Default,,0000,0000,0000,,about how much content we are making\Navailable in a certain language Dialogue: 0,0:05:49.32,0:05:52.04,Default,,0000,0000,0000,,and how much of that content\Nis actually used. Dialogue: 0,0:05:52.04,0:05:55.45,Default,,0000,0000,0000,,So what you're seeing is basically a curve Dialogue: 0,0:05:55.45,0:06:00.99,Default,,0000,0000,0000,,with most content having English labels,\Nbeing available in English, Dialogue: 0,0:06:01.51,0:06:04.30,Default,,0000,0000,0000,,and being used a lot. Dialogue: 0,0:06:04.30,0:06:06.45,Default,,0000,0000,0000,,And then it kind of goes down. Dialogue: 0,0:06:06.45,0:06:09.44,Default,,0000,0000,0000,,But, again, what you can see are outliers Dialogue: 0,0:06:09.44,0:06:15.33,Default,,0000,0000,0000,,who have a lot more content\Nthan you would necessarily expect, Dialogue: 0,0:06:16.90,0:06:19.54,Default,,0000,0000,0000,,and that is really, really good. Dialogue: 0,0:06:20.84,0:06:24.94,Default,,0000,0000,0000,,The problem still is it's not used a lot. Dialogue: 0,0:06:25.56,0:06:28.74,Default,,0000,0000,0000,,Asturian and Dutch should be higher, Dialogue: 0,0:06:28.74,0:06:31.99,Default,,0000,0000,0000,,and I think helping those communities Dialogue: 0,0:06:33.27,0:06:35.56,Default,,0000,0000,0000,,increase the use\Nof the data they collected Dialogue: 0,0:06:35.56,0:06:37.68,Default,,0000,0000,0000,,is a really useful thing to do. Dialogue: 0,0:06:42.91,0:06:48.11,Default,,0000,0000,0000,,What this analysis and others\Nshowed us is also a good thing though Dialogue: 0,0:06:48.30,0:06:51.38,Default,,0000,0000,0000,,is that we are seeing\Nthat highly used items Dialogue: 0,0:06:51.38,0:06:55.30,Default,,0000,0000,0000,,also tend to have more labels Dialogue: 0,0:06:55.30,0:06:58.19,Default,,0000,0000,0000,,or the other way around--\Nit's not entirely clear. Dialogue: 0,0:07:02.51,0:07:04.38,Default,,0000,0000,0000,,And then the question is, Dialogue: 0,0:07:04.81,0:07:07.01,Default,,0000,0000,0000,,are we serving\Njust the powerful languages? Dialogue: 0,0:07:07.90,0:07:11.15,Default,,0000,0000,0000,,Or are we serving everyone? Dialogue: 0,0:07:12.76,0:07:17.74,Default,,0000,0000,0000,,And what you see here\Nis a grouping of languages. Dialogue: 0,0:07:17.74,0:07:21.83,Default,,0000,0000,0000,,The languages that are grouped together\Ntend to have labels together. Dialogue: 0,0:07:26.04,0:07:28.60,Default,,0000,0000,0000,,And you see it clustering. Dialogue: 0,0:07:28.60,0:07:34.06,Default,,0000,0000,0000,,Now here's a similar clustering, colored, Dialogue: 0,0:07:34.06,0:07:39.48,Default,,0000,0000,0000,,based on how alive, how used, Dialogue: 0,0:07:40.46,0:07:43.16,Default,,0000,0000,0000,,how endangered the language is. Dialogue: 0,0:07:43.16,0:07:44.64,Default,,0000,0000,0000,,And a good thing you're seeing here Dialogue: 0,0:07:44.64,0:07:49.57,Default,,0000,0000,0000,,is that safe languages\Nand endangered languages Dialogue: 0,0:07:49.57,0:07:53.77,Default,,0000,0000,0000,,do not form two different clusters. Dialogue: 0,0:07:53.77,0:07:58.87,Default,,0000,0000,0000,,But they're all mixed together, Dialogue: 0,0:08:00.26,0:08:04.62,Default,,0000,0000,0000,,which is much better than it would be\Nthe other way around Dialogue: 0,0:08:04.62,0:08:09.38,Default,,0000,0000,0000,,where the safe languages,\Nthe powerful languages Dialogue: 0,0:08:10.20,0:08:12.16,Default,,0000,0000,0000,,are just helping each other out. Dialogue: 0,0:08:12.74,0:08:14.36,Default,,0000,0000,0000,,No, that's not the case. Dialogue: 0,0:08:14.36,0:08:17.42,Default,,0000,0000,0000,,And it's a really good thing. Dialogue: 0,0:08:17.42,0:08:20.04,Default,,0000,0000,0000,,When I saw this, \NI thought this was very good. Dialogue: 0,0:08:23.47,0:08:25.17,Default,,0000,0000,0000,,Here's a similar thing Dialogue: 0,0:08:26.24,0:08:28.80,Default,,0000,0000,0000,,where we looked at Dialogue: 0,0:08:30.23,0:08:34.22,Default,,0000,0000,0000,,the languages' status Dialogue: 0,0:08:34.22,0:08:36.22,Default,,0000,0000,0000,,and how many labels it has. Dialogue: 0,0:08:39.37,0:08:42.94,Default,,0000,0000,0000,,What you're seeing\Nis a clear win for safe languages, Dialogue: 0,0:08:42.94,0:08:44.25,Default,,0000,0000,0000,,as is expected. Dialogue: 0,0:08:45.51,0:08:46.69,Default,,0000,0000,0000,,But what you're also seeing Dialogue: 0,0:08:46.69,0:08:54.41,Default,,0000,0000,0000,,is that the languages in category 2\Nand 3 and maybe even 4 Dialogue: 0,0:08:54.41,0:08:59.28,Default,,0000,0000,0000,,are not that bad, actually, Dialogue: 0,0:08:59.28,0:09:02.37,Default,,0000,0000,0000,,in terms of their representation\Nin Wikidata and others. Dialogue: 0,0:09:03.29,0:09:06.41,Default,,0000,0000,0000,,It's a really good thing to find. Dialogue: 0,0:09:07.65,0:09:09.13,Default,,0000,0000,0000,,Now, if you look at the same thing Dialogue: 0,0:09:09.13,0:09:12.42,Default,,0000,0000,0000,,for how much of that content\Nof those labels Dialogue: 0,0:09:12.42,0:09:15.50,Default,,0000,0000,0000,,is actually used\Non Wikipedia, for example, Dialogue: 0,0:09:17.46,0:09:22.56,Default,,0000,0000,0000,,then we see a similar\Npicture emerging again. Dialogue: 0,0:09:23.60,0:09:29.81,Default,,0000,0000,0000,,And it tells us that those communities\Nare actually making good use of their time Dialogue: 0,0:09:29.81,0:09:34.50,Default,,0000,0000,0000,,by filling in labels\Nfor higher used items, for example. Dialogue: 0,0:09:36.41,0:09:40.49,Default,,0000,0000,0000,,There are outliers\Nwhere I think we can help, Dialogue: 0,0:09:41.68,0:09:48.20,Default,,0000,0000,0000,,to help those communities find the places\Nwhere their work would be most valuable. Dialogue: 0,0:09:49.31,0:09:52.66,Default,,0000,0000,0000,,But, overall, I'm happy with this picture. Dialogue: 0,0:09:54.82,0:09:59.84,Default,,0000,0000,0000,,Now, that was the items\Nand properties part of Wikidata. Dialogue: 0,0:10:00.71,0:10:03.03,Default,,0000,0000,0000,,Now, let's look at interaction\Nin your languages. Dialogue: 0,0:10:03.03,0:10:05.20,Default,,0000,0000,0000,,So the lexeme parts of Wikidata Dialogue: 0,0:10:05.20,0:10:09.39,Default,,0000,0000,0000,,where we describe words\Nand their forms and their meanings. Dialogue: 0,0:10:10.17,0:10:13.30,Default,,0000,0000,0000,,We've been doing this now\Nsince May last year, Dialogue: 0,0:10:16.46,0:10:19.13,Default,,0000,0000,0000,,and content has been growing. Dialogue: 0,0:10:20.11,0:10:22.15,Default,,0000,0000,0000,,You can see here in blue the lexemes, Dialogue: 0,0:10:22.15,0:10:25.94,Default,,0000,0000,0000,,and then in red, \Nthe forms on those lexemes Dialogue: 0,0:10:25.94,0:10:29.91,Default,,0000,0000,0000,,and yellow, the senses\Non those lexemes. Dialogue: 0,0:10:30.99,0:10:34.45,Default,,0000,0000,0000,,So some communities--\Nwe'll get to that later-- Dialogue: 0,0:10:34.45,0:10:39.79,Default,,0000,0000,0000,,have spent a lot of time creating forms\Nand senses for their lexemes, Dialogue: 0,0:10:39.79,0:10:42.75,Default,,0000,0000,0000,,which is really useful Dialogue: 0,0:10:42.75,0:10:48.24,Default,,0000,0000,0000,,because that builds\Nthe core of the data set that you need. Dialogue: 0,0:10:50.56,0:10:55.13,Default,,0000,0000,0000,,Now, we looked at all the languages Dialogue: 0,0:10:55.13,0:10:57.91,Default,,0000,0000,0000,,that have lexemes on Wikidata. Dialogue: 0,0:10:57.91,0:11:01.00,Default,,0000,0000,0000,,So words we have, Dialogue: 0,0:11:01.71,0:11:04.40,Default,,0000,0000,0000,,those are right now 310 languages. Dialogue: 0,0:11:04.88,0:11:08.29,Default,,0000,0000,0000,,Now, what do you think is the top language Dialogue: 0,0:11:08.29,0:11:11.95,Default,,0000,0000,0000,,when it comes to the number\Nof lexemes currently in Wikidata? Dialogue: 0,0:11:12.93,0:11:14.70,Default,,0000,0000,0000,,(audience) [inaudible] Dialogue: 0,0:11:19.18,0:11:20.22,Default,,0000,0000,0000,,Huh? Dialogue: 0,0:11:20.22,0:11:21.74,Default,,0000,0000,0000,,(person 2) German. Dialogue: 0,0:11:21.74,0:11:24.25,Default,,0000,0000,0000,,Sorry, I've heard it before. Dialogue: 0,0:11:24.25,0:11:25.65,Default,,0000,0000,0000,,It's Russian. Dialogue: 0,0:11:28.01,0:11:29.75,Default,,0000,0000,0000,,Russian is quite ahead. Dialogue: 0,0:11:31.90,0:11:33.83,Default,,0000,0000,0000,,And just to give you some perspective, Dialogue: 0,0:11:35.65,0:11:36.82,Default,,0000,0000,0000,,there's different opinions Dialogue: 0,0:11:36.82,0:11:42.23,Default,,0000,0000,0000,,but I've read, for example,\Nthat 1,000 to 3,000 words Dialogue: 0,0:11:42.23,0:11:45.45,Default,,0000,0000,0000,,gets you to conversation level,\Nroughly, in another language, Dialogue: 0,0:11:45.45,0:11:49.46,Default,,0000,0000,0000,,and 4,000 to 10,000 words\Nto an advanced level. Dialogue: 0,0:11:51.59,0:11:55.28,Default,,0000,0000,0000,,So, we still have a bit to catch up there. Dialogue: 0,0:11:58.48,0:12:03.28,Default,,0000,0000,0000,,One thing I want you\Nto pay attention to is Basque here Dialogue: 0,0:12:03.28,0:12:07.74,Default,,0000,0000,0000,,with 10,000, roughly, lexemes. Dialogue: 0,0:12:09.24,0:12:13.00,Default,,0000,0000,0000,,Now, if you look at the number\Nof forms for those lexemes, Dialogue: 0,0:12:14.16,0:12:16.50,Default,,0000,0000,0000,,Basque is way up there, Dialogue: 0,0:12:18.26,0:12:20.01,Default,,0000,0000,0000,,which is really cool, Dialogue: 0,0:12:20.01,0:12:24.93,Default,,0000,0000,0000,,and you should go to a talk that explains\Nto you why that is the case. Dialogue: 0,0:12:27.34,0:12:31.18,Default,,0000,0000,0000,,Now, if you look at the number\Nof senses, so what do words mean, Dialogue: 0,0:12:32.02,0:12:35.08,Default,,0000,0000,0000,,Basque even gets to the top of the list. Dialogue: 0,0:12:35.08,0:12:37.10,Default,,0000,0000,0000,,I think that deserves an applause. Dialogue: 0,0:12:37.10,0:12:38.92,Default,,0000,0000,0000,,(applause) Dialogue: 0,0:12:45.68,0:12:47.12,Default,,0000,0000,0000,,Another short quiz. Dialogue: 0,0:12:47.12,0:12:50.18,Default,,0000,0000,0000,,What's the lexeme\Nwith the most translations currently? Dialogue: 0,0:12:50.65,0:12:55.41,Default,,0000,0000,0000,,(audience) Cats, cats, [inaudible], \NDouglas Adams, [inaudible] Dialogue: 0,0:12:56.77,0:13:00.01,Default,,0000,0000,0000,,All good guesses, but no. Dialogue: 0,0:13:01.01,0:13:04.14,Default,,0000,0000,0000,,It's this, the Russian word for "water." Dialogue: 0,0:13:09.57,0:13:12.25,Default,,0000,0000,0000,,Alright, so now we talked a lot Dialogue: 0,0:13:12.25,0:13:16.41,Default,,0000,0000,0000,,about how many lexemes,\Nforms, and senses we have, Dialogue: 0,0:13:16.41,0:13:20.49,Default,,0000,0000,0000,,but that's just one thing you need. Dialogue: 0,0:13:20.49,0:13:21.52,Default,,0000,0000,0000,,The other thing you need Dialogue: 0,0:13:21.52,0:13:25.16,Default,,0000,0000,0000,,is actually describing those lexemes,\Nforms, and senses Dialogue: 0,0:13:25.16,0:13:27.65,Default,,0000,0000,0000,,in a machine-readable way. Dialogue: 0,0:13:27.65,0:13:30.04,Default,,0000,0000,0000,,And for that you have statements,\Nlike on items. Dialogue: 0,0:13:31.48,0:13:36.36,Default,,0000,0000,0000,,And one of the properties\Nyou use is usage example. Dialogue: 0,0:13:36.36,0:13:38.58,Default,,0000,0000,0000,,So whoever is using that data Dialogue: 0,0:13:38.58,0:13:42.09,Default,,0000,0000,0000,,can understand how to use\Nthat word in context, Dialogue: 0,0:13:42.09,0:13:44.16,Default,,0000,0000,0000,,so that could be a quote, for example. Dialogue: 0,0:13:45.40,0:13:47.11,Default,,0000,0000,0000,,And here, Polish rocks. Dialogue: 0,0:13:47.90,0:13:49.76,Default,,0000,0000,0000,,Good job, Polish speakers. Dialogue: 0,0:13:54.22,0:13:57.68,Default,,0000,0000,0000,,Another property\Nthat's really useful is IPA, Dialogue: 0,0:13:57.68,0:14:00.19,Default,,0000,0000,0000,,so how do you pronounce this word. Dialogue: 0,0:14:00.88,0:14:07.50,Default,,0000,0000,0000,,Russian apparently needs\Nlots of IPA statements. Dialogue: 0,0:14:10.42,0:14:13.31,Default,,0000,0000,0000,,But, again, Polish, second. Dialogue: 0,0:14:17.15,0:14:20.75,Default,,0000,0000,0000,,And last but not least\Nwe have pronunciation audio. Dialogue: 0,0:14:20.75,0:14:23.37,Default,,0000,0000,0000,,So that is links to files on Commons Dialogue: 0,0:14:23.37,0:14:25.96,Default,,0000,0000,0000,,where someone speaks the word, Dialogue: 0,0:14:25.96,0:14:29.91,Default,,0000,0000,0000,,so you can hear a native speaker\Npronounce the word Dialogue: 0,0:14:29.91,0:14:32.87,Default,,0000,0000,0000,,in case you can't read IPA, for example. Dialogue: 0,0:14:34.96,0:14:39.20,Default,,0000,0000,0000,,And there's a really nice actually\NWiki-based powered project Dialogue: 0,0:14:39.20,0:14:40.47,Default,,0000,0000,0000,,called Lingua Libre Dialogue: 0,0:14:40.88,0:14:45.17,Default,,0000,0000,0000,,where you can go and help record\Nwords in your language Dialogue: 0,0:14:45.17,0:14:47.84,Default,,0000,0000,0000,,that then can be added\Nto lexemes on Wikidata, Dialogue: 0,0:14:48.45,0:14:52.10,Default,,0000,0000,0000,,so other people can understand\Nhow to pronounce your words. Dialogue: 0,0:14:53.66,0:14:55.69,Default,,0000,0000,0000,,(person 2) [inaudible] Dialogue: 0,0:14:55.69,0:14:57.66,Default,,0000,0000,0000,,If you search for "Lingua Libre," Dialogue: 0,0:14:57.66,0:15:00.98,Default,,0000,0000,0000,,and I'm sure someone can post it\Nin the Telegram channel. Dialogue: 0,0:15:03.14,0:15:04.62,Default,,0000,0000,0000,,Those guys rock. Dialogue: 0,0:15:04.62,0:15:06.73,Default,,0000,0000,0000,,They did really cool stuff with Wikibase. Dialogue: 0,0:15:09.42,0:15:10.62,Default,,0000,0000,0000,,Alright. Dialogue: 0,0:15:12.71,0:15:17.28,Default,,0000,0000,0000,,Then the question is,\Nwhere do we go from here? Dialogue: 0,0:15:19.16,0:15:22.01,Default,,0000,0000,0000,,Based on the numbers I've just shown you, Dialogue: 0,0:15:23.03,0:15:25.17,Default,,0000,0000,0000,,we've come a long way Dialogue: 0,0:15:25.17,0:15:28.43,Default,,0000,0000,0000,,towards giving more people\Nmore access to more knowledge Dialogue: 0,0:15:28.43,0:15:31.24,Default,,0000,0000,0000,,when looking at languages on Wikidata. Dialogue: 0,0:15:32.53,0:15:36.39,Default,,0000,0000,0000,,But there is also still\Na lot of work ahead of us. Dialogue: 0,0:15:38.99,0:15:42.34,Default,,0000,0000,0000,,Some of the things\Nyou can do to help, for example, Dialogue: 0,0:15:42.34,0:15:44.92,Default,,0000,0000,0000,,is run label-a-thons Dialogue: 0,0:15:44.92,0:15:50.12,Default,,0000,0000,0000,,like get people together\Nto label items in Wikidata Dialogue: 0,0:15:50.91,0:15:55.12,Default,,0000,0000,0000,,or do an edit-a-thon\Naround lexemes in your language Dialogue: 0,0:15:55.12,0:15:59.21,Default,,0000,0000,0000,,to get the most used words\Nin your language into Wikidata. Dialogue: 0,0:16:00.77,0:16:03.28,Default,,0000,0000,0000,,Or you can use a tool like Terminator Dialogue: 0,0:16:03.28,0:16:08.49,Default,,0000,0000,0000,,that helps you find the most\Nimportant items in your language Dialogue: 0,0:16:08.49,0:16:11.55,Default,,0000,0000,0000,,that are still missing a label. Dialogue: 0,0:16:13.27,0:16:18.36,Default,,0000,0000,0000,,Most important being measured\Nby how often it is used Dialogue: 0,0:16:18.36,0:16:22.55,Default,,0000,0000,0000,,in other Wikidata items\Nas links in statements. Dialogue: 0,0:16:25.77,0:16:30.02,Default,,0000,0000,0000,,And, of course, for the lexeme part, Dialogue: 0,0:16:31.34,0:16:35.17,Default,,0000,0000,0000,,now that we've got\Na basic coverage of those lexemes, Dialogue: 0,0:16:35.17,0:16:41.16,Default,,0000,0000,0000,,it's also about building them out,\Nadding more statements to them Dialogue: 0,0:16:41.16,0:16:44.40,Default,,0000,0000,0000,,so that they actually can build the base Dialogue: 0,0:16:44.40,0:16:47.42,Default,,0000,0000,0000,,for meaningful applications\Nto build on top of that. Dialogue: 0,0:16:48.14,0:16:50.80,Default,,0000,0000,0000,,Because we're getting closer\Nto that critical mass, Dialogue: 0,0:16:50.80,0:16:53.62,Default,,0000,0000,0000,,but we're still away from that, Dialogue: 0,0:16:53.62,0:16:56.62,Default,,0000,0000,0000,,that you can build\Nserious applications on top of it. Dialogue: 0,0:16:58.28,0:17:01.68,Default,,0000,0000,0000,,And I hope all of you\Nwill join us in doing that. Dialogue: 0,0:17:02.58,0:17:07.10,Default,,0000,0000,0000,,And that already brings me Dialogue: 0,0:17:07.10,0:17:09.84,Default,,0000,0000,0000,,to a little help from our friends, Dialogue: 0,0:17:09.84,0:17:12.81,Default,,0000,0000,0000,,and Bruno, do you want to come over Dialogue: 0,0:17:13.88,0:17:16.85,Default,,0000,0000,0000,,and talk to us about lexical masks. Dialogue: 0,0:17:17.54,0:17:18.57,Default,,0000,0000,0000,,(Bruno) Thank you, Lydia, Dialogue: 0,0:17:18.57,0:17:21.52,Default,,0000,0000,0000,,thank you for giving me\Nthis short period of time Dialogue: 0,0:17:21.52,0:17:24.15,Default,,0000,0000,0000,,to present this work\Nthat we are doing at Google Dialogue: 0,0:17:24.15,0:17:29.64,Default,,0000,0000,0000,,Denny that most of you\Nprobably have heard of or know. Dialogue: 0,0:17:30.13,0:17:32.03,Default,,0000,0000,0000,,Because at Google so I'm a linguist. Dialogue: 0,0:17:32.03,0:17:36.15,Default,,0000,0000,0000,,so I'm very happy to be here\Namongst other language enthusiasts. Dialogue: 0,0:17:36.62,0:17:39.28,Default,,0000,0000,0000,,We are also building some lexicons, Dialogue: 0,0:17:39.28,0:17:41.77,Default,,0000,0000,0000,,and we have built this technology Dialogue: 0,0:17:41.77,0:17:45.59,Default,,0000,0000,0000,,or this approach that we think\Ncan be useful for you. Dialogue: 0,0:17:46.37,0:17:48.46,Default,,0000,0000,0000,,Just to give you\Na little bit of background, Dialogue: 0,0:17:48.46,0:17:52.07,Default,,0000,0000,0000,,this is my lexicographic\Nbackground talking here. Dialogue: 0,0:17:52.79,0:17:54.35,Default,,0000,0000,0000,,When we build a lexicon database, Dialogue: 0,0:17:54.35,0:17:58.62,Default,,0000,0000,0000,,there is a lot of hard time to maintain,\Nto keep them consistent Dialogue: 0,0:17:58.62,0:18:00.12,Default,,0000,0000,0000,,and to exchange data, Dialogue: 0,0:18:00.12,0:18:02.03,Default,,0000,0000,0000,,as you probably know. Dialogue: 0,0:18:02.52,0:18:05.93,Default,,0000,0000,0000,,There are several attempts\Nto unify the feature and the properties Dialogue: 0,0:18:05.93,0:18:09.18,Default,,0000,0000,0000,,that are describing\Nthose lexemes and those forms, Dialogue: 0,0:18:09.18,0:18:10.94,Default,,0000,0000,0000,,and it's not a solved problem, Dialogue: 0,0:18:10.94,0:18:13.96,Default,,0000,0000,0000,,but there are some\Nunification attempts on that side. Dialogue: 0,0:18:13.96,0:18:15.21,Default,,0000,0000,0000,,But what is really missing-- Dialogue: 0,0:18:15.21,0:18:18.73,Default,,0000,0000,0000,,and this is a problem we had\Nat the beginning of our project at Google Dialogue: 0,0:18:18.73,0:18:21.61,Default,,0000,0000,0000,,is to try to have an internal structure Dialogue: 0,0:18:22.20,0:18:25.91,Default,,0000,0000,0000,,that describes how\Na lexical entry should look like, Dialogue: 0,0:18:25.91,0:18:28.58,Default,,0000,0000,0000,,what kind of data\Nor what kind of information we have Dialogue: 0,0:18:28.58,0:18:32.24,Default,,0000,0000,0000,,and the specification that are expected. Dialogue: 0,0:18:32.24,0:18:38.19,Default,,0000,0000,0000,,So, this is what we came up\Nwith this thing called lexicon mask. Dialogue: 0,0:18:38.90,0:18:44.84,Default,,0000,0000,0000,,A lexicon mask is describing\Nwhat is expected for an entry, Dialogue: 0,0:18:44.84,0:18:47.33,Default,,0000,0000,0000,,a lexicographic entry, to be complete, Dialogue: 0,0:18:47.33,0:18:51.44,Default,,0000,0000,0000,,both in terms of the number of forms\Nyou expect for a lexeme, Dialogue: 0,0:18:51.44,0:18:55.61,Default,,0000,0000,0000,,and the number of features\Nyou expect for each of those forms. Dialogue: 0,0:18:56.40,0:18:58.33,Default,,0000,0000,0000,,Here is an example for Italian adjectives. Dialogue: 0,0:18:58.33,0:19:02.00,Default,,0000,0000,0000,,You expect, in Italian, to have\Nfour forms for your adjectives, Dialogue: 0,0:19:02.00,0:19:05.38,Default,,0000,0000,0000,,and each of these forms\Nhave a specific combination Dialogue: 0,0:19:05.38,0:19:07.95,Default,,0000,0000,0000,,of gender and number features. Dialogue: 0,0:19:08.61,0:19:12.67,Default,,0000,0000,0000,,This is what we expect\Nfor the Italian adjectives. Dialogue: 0,0:19:12.67,0:19:16.18,Default,,0000,0000,0000,,Of course, you can have\Nextremely complex masks, Dialogue: 0,0:19:16.18,0:19:20.78,Default,,0000,0000,0000,,like the French verbs conjugation,\Nwhich is quite extensive, Dialogue: 0,0:19:20.78,0:19:23.49,Default,,0000,0000,0000,,and I don't show you\Nany other Russian mask Dialogue: 0,0:19:23.49,0:19:25.38,Default,,0000,0000,0000,,because it doesn't fit the screen. Dialogue: 0,0:19:26.31,0:19:29.53,Default,,0000,0000,0000,,And we also have\Nsome detailed specifications Dialogue: 0,0:19:29.53,0:19:33.42,Default,,0000,0000,0000,,because we distinguish\Nwhat is at the form level. Dialogue: 0,0:19:33.42,0:19:37.54,Default,,0000,0000,0000,,So here you have Russian nouns\Nthat have three numbers Dialogue: 0,0:19:37.54,0:19:40.05,Default,,0000,0000,0000,,and a number of cases\Nwith different forms, Dialogue: 0,0:19:40.05,0:19:43.09,Default,,0000,0000,0000,,but they also have\Nan entry level specification Dialogue: 0,0:19:43.09,0:19:45.59,Default,,0000,0000,0000,,that says a noun particularly has Dialogue: 0,0:19:45.59,0:19:50.13,Default,,0000,0000,0000,,an inherent gender\Nand an inherent animacy feature Dialogue: 0,0:19:50.13,0:19:52.49,Default,,0000,0000,0000,,that is also specified in the mask. Dialogue: 0,0:19:54.52,0:19:58.78,Default,,0000,0000,0000,,We also want to distinguish\Nthat a mask gives a specification Dialogue: 0,0:19:58.78,0:20:01.87,Default,,0000,0000,0000,,for, in general,\Nwhat an entry should look like. Dialogue: 0,0:20:01.87,0:20:07.16,Default,,0000,0000,0000,,But you can have smaller masks\Nfor defective aspects of the form Dialogue: 0,0:20:07.16,0:20:11.28,Default,,0000,0000,0000,,or defective aspects of the lexeme\Nthat happen in language. Dialogue: 0,0:20:11.28,0:20:14.54,Default,,0000,0000,0000,,So here is the simplest version\Nof French verbs Dialogue: 0,0:20:14.54,0:20:19.73,Default,,0000,0000,0000,,that have only the 3rd person singular\Nfor all the weather verbs, Dialogue: 0,0:20:19.73,0:20:23.97,Default,,0000,0000,0000,,like "it rains" or "it snows,"\Nlike in English. Dialogue: 0,0:20:24.54,0:20:26.49,Default,,0000,0000,0000,,So we distinguish these two levels. Dialogue: 0,0:20:26.92,0:20:29.96,Default,,0000,0000,0000,,And how we use this at Google Dialogue: 0,0:20:29.96,0:20:32.64,Default,,0000,0000,0000,,is that when we have a lexicon\Nthat we want to use, Dialogue: 0,0:20:33.06,0:20:38.31,Default,,0000,0000,0000,,we use the mask to really\Nliterally throw the lexicons, Dialogue: 0,0:20:38.31,0:20:40.16,Default,,0000,0000,0000,,all the entries, through the mask Dialogue: 0,0:20:40.16,0:20:44.30,Default,,0000,0000,0000,,and see which entry has a problem\Nin terms of structure. Dialogue: 0,0:20:44.30,0:20:46.52,Default,,0000,0000,0000,,Are we missing a form?\NAre we missing a feature? Dialogue: 0,0:20:46.52,0:20:51.50,Default,,0000,0000,0000,,And when there is a problem,\Nwe do some human validation Dialogue: 0,0:20:51.50,0:20:53.75,Default,,0000,0000,0000,,or just to see if it passes the mask. Dialogue: 0,0:20:53.75,0:20:57.92,Default,,0000,0000,0000,,So it's an extremely powerful tool\Nto check the quality of the structure. Dialogue: 0,0:20:59.43,0:21:01.96,Default,,0000,0000,0000,,So what we are happy to announce today Dialogue: 0,0:21:01.96,0:21:05.41,Default,,0000,0000,0000,,is that we get the green light\Nto open source our mask. Dialogue: 0,0:21:05.95,0:21:07.57,Default,,0000,0000,0000,,So this is a schema. Dialogue: 0,0:21:07.57,0:21:09.48,Default,,0000,0000,0000,,If you want that, we can release Dialogue: 0,0:21:09.48,0:21:13.48,Default,,0000,0000,0000,,and that we will provide\Nto Wikidata as to ShEx files. Dialogue: 0,0:21:13.48,0:21:16.69,Default,,0000,0000,0000,,This is a ShEx file for German nouns, Dialogue: 0,0:21:16.69,0:21:20.43,Default,,0000,0000,0000,,and Denny is working on the conversion\Nfrom our internal specification Dialogue: 0,0:21:20.43,0:21:23.67,Default,,0000,0000,0000,,to a more open-source specification. Dialogue: 0,0:21:23.67,0:21:27.52,Default,,0000,0000,0000,,We currently cover more than 25 languages. Dialogue: 0,0:21:27.52,0:21:29.22,Default,,0000,0000,0000,,So we expect to grow on our side, Dialogue: 0,0:21:29.22,0:21:34.35,Default,,0000,0000,0000,,but we also look for this opportunity\Nto collaborate for other languages. Dialogue: 0,0:21:34.35,0:21:40.73,Default,,0000,0000,0000,,And one of the ongoing collaborations\Nalso that Denny has with Lukas. Dialogue: 0,0:21:40.73,0:21:45.05,Default,,0000,0000,0000,,Lukas has these great tools to have a UI Dialogue: 0,0:21:45.05,0:21:51.06,Default,,0000,0000,0000,,to help the user or the contributor\Nto add more forms. Dialogue: 0,0:21:51.06,0:21:54.15,Default,,0000,0000,0000,,So if you want to add\Nan adjective in French, Dialogue: 0,0:21:54.15,0:21:59.06,Default,,0000,0000,0000,,the UI is telling you\Nhow many forms are expected Dialogue: 0,0:21:59.06,0:22:01.56,Default,,0000,0000,0000,,and what kind of features\Nthis form should have. Dialogue: 0,0:22:01.56,0:22:06.27,Default,,0000,0000,0000,,So our mask will help the tool\Nto be defined and expanded. Dialogue: 0,0:22:07.24,0:22:08.38,Default,,0000,0000,0000,,That's it. Dialogue: 0,0:22:08.79,0:22:10.36,Default,,0000,0000,0000,,(Lydia) Thank you so much. Dialogue: 0,0:22:10.36,0:22:11.99,Default,,0000,0000,0000,,(applause) Dialogue: 0,0:22:14.25,0:22:16.89,Default,,0000,0000,0000,,Alright. Are there questions? Dialogue: 0,0:22:16.89,0:22:19.38,Default,,0000,0000,0000,,Do you want to talk more about lexemes? Dialogue: 0,0:22:19.82,0:22:21.48,Default,,0000,0000,0000,,- (person 3) Yes.\N- Yes. (chuckles) Dialogue: 0,0:22:33.48,0:22:35.38,Default,,0000,0000,0000,,(person 3) My question,\Nbecause you were talking Dialogue: 0,0:22:35.38,0:22:39.11,Default,,0000,0000,0000,,about giving more access\Nto more people in more languages. Dialogue: 0,0:22:39.11,0:22:42.44,Default,,0000,0000,0000,,But there are a lot of languages\Nthat can't be used in Wikidata. Dialogue: 0,0:22:42.44,0:22:44.59,Default,,0000,0000,0000,,So what solution do you have for that? Dialogue: 0,0:22:45.89,0:22:47.69,Default,,0000,0000,0000,,When you say that can't use Wikidata, Dialogue: 0,0:22:47.69,0:22:50.31,Default,,0000,0000,0000,,are you talking about entering labels? Dialogue: 0,0:22:50.31,0:22:52.58,Default,,0000,0000,0000,,- (person 3) Labels, descriptions.\N- Right. Dialogue: 0,0:22:52.58,0:22:55.50,Default,,0000,0000,0000,,So, for lexemes, it's a bit different Dialogue: 0,0:22:55.50,0:22:57.79,Default,,0000,0000,0000,,because there we don't have\Nthat restriction. Dialogue: 0,0:22:58.92,0:23:05.00,Default,,0000,0000,0000,,For labels on items and properties,\Nthere is some restriction Dialogue: 0,0:23:05.43,0:23:12.41,Default,,0000,0000,0000,,because we wanted to make sure\Nthat it's not completely Dialogue: 0,0:23:12.41,0:23:14.23,Default,,0000,0000,0000,,anyone does anything, Dialogue: 0,0:23:14.23,0:23:17.77,Default,,0000,0000,0000,,and it becomes unmanageable. Dialogue: 0,0:23:19.35,0:23:23.33,Default,,0000,0000,0000,,Even a small community who wants\None language and wants to work on that, Dialogue: 0,0:23:23.90,0:23:26.79,Default,,0000,0000,0000,,come talk to us, we will make it happen. Dialogue: 0,0:23:26.79,0:23:29.20,Default,,0000,0000,0000,,(person 3) I mean, we did this\Nat the Prague Hackathon in May, Dialogue: 0,0:23:29.20,0:23:32.46,Default,,0000,0000,0000,,and it took us until almost August\Nin order to be able to use our language. Dialogue: 0,0:23:32.46,0:23:35.14,Default,,0000,0000,0000,,- Yeah.\N- (person 3) So, it's very slow. Dialogue: 0,0:23:35.14,0:23:37.85,Default,,0000,0000,0000,,Yeah, it is, unfortunately, very slow. Dialogue: 0,0:23:37.85,0:23:39.88,Default,,0000,0000,0000,,We're currently working\Nwith the language Committee Dialogue: 0,0:23:39.88,0:23:46.05,Default,,0000,0000,0000,,on solving some fundamental... Dialogue: 0,0:23:49.54,0:23:55.45,Default,,0000,0000,0000,,Like, getting agreement on what kind\Nof languages are actually "allowed," Dialogue: 0,0:23:56.05,0:23:59.40,Default,,0000,0000,0000,,and that has taken too long, Dialogue: 0,0:23:59.99,0:24:04.18,Default,,0000,0000,0000,,which is the reason why your request\Nprobably took longer than it should have. Dialogue: 0,0:24:04.78,0:24:05.96,Default,,0000,0000,0000,,(person 3) Thanks. Dialogue: 0,0:24:06.82,0:24:07.95,Default,,0000,0000,0000,,(person 4) Thank you. Dialogue: 0,0:24:07.95,0:24:10.94,Default,,0000,0000,0000,,Lydia, if you remember\Nthe statistics that you showed, Dialogue: 0,0:24:10.94,0:24:12.89,Default,,0000,0000,0000,,the number of lexemes per language. Dialogue: 0,0:24:12.89,0:24:17.60,Default,,0000,0000,0000,,So, did you count\Nall the forms as a data point Dialogue: 0,0:24:17.60,0:24:20.03,Default,,0000,0000,0000,,or only lexemes? Dialogue: 0,0:24:21.29,0:24:22.94,Default,,0000,0000,0000,,(Lydia) Do you mean this? Dialogue: 0,0:24:22.94,0:24:24.05,Default,,0000,0000,0000,,Which one do you mean? Dialogue: 0,0:24:24.05,0:24:25.53,Default,,0000,0000,0000,,(person 4) Yes, exactly. Dialogue: 0,0:24:25.80,0:24:28.34,Default,,0000,0000,0000,,If you remember,\Ndoes this number [inaudible] Dialogue: 0,0:24:28.34,0:24:31.95,Default,,0000,0000,0000,,all the forms for all the lexemes\Nor just how many lexemes there are? Dialogue: 0,0:24:31.95,0:24:33.58,Default,,0000,0000,0000,,No, this is just a number of lexemes. Dialogue: 0,0:24:33.58,0:24:35.40,Default,,0000,0000,0000,,(person 4) Just a number of lexemes, okay. Dialogue: 0,0:24:35.40,0:24:36.80,Default,,0000,0000,0000,,So then it is a just statistic Dialogue: 0,0:24:36.80,0:24:39.39,Default,,0000,0000,0000,,because if it would then\Ncompose the forms-- Dialogue: 0,0:24:39.39,0:24:40.61,Default,,0000,0000,0000,,that's why I'm asking-- Dialogue: 0,0:24:40.61,0:24:42.82,Default,,0000,0000,0000,,then all the languages\Nwith the inflectional morphology, Dialogue: 0,0:24:42.82,0:24:45.03,Default,,0000,0000,0000,,like Russian, Serbian,\NSlovenian and et cetera, Dialogue: 0,0:24:45.03,0:24:47.62,Default,,0000,0000,0000,,they have a natural advantage\Nbecause they have so many. Dialogue: 0,0:24:47.62,0:24:51.99,Default,,0000,0000,0000,,So, this kind of kicks in here\Non this number of forms. Dialogue: 0,0:24:51.99,0:24:53.85,Default,,0000,0000,0000,,(person 4) Yeah, that was this one. \NThank you. Dialogue: 0,0:24:56.55,0:25:00.22,Default,,0000,0000,0000,,(person 5) So, I had\Na quick question about the... Dialogue: 0,0:25:00.64,0:25:06.82,Default,,0000,0000,0000,,When we're talking about\Nthe actual items and properties. Dialogue: 0,0:25:07.12,0:25:08.90,Default,,0000,0000,0000,,Like as far as I understand, Dialogue: 0,0:25:08.90,0:25:11.96,Default,,0000,0000,0000,,there is currently no way\Nto give an actual source Dialogue: 0,0:25:11.96,0:25:14.73,Default,,0000,0000,0000,,to any of the labels\Nand descriptions that are given. Dialogue: 0,0:25:14.73,0:25:18.05,Default,,0000,0000,0000,,So, for example,\Nbecause when you're talking Dialogue: 0,0:25:18.05,0:25:20.92,Default,,0000,0000,0000,,about an item property, Dialogue: 0,0:25:20.92,0:25:24.51,Default,,0000,0000,0000,,like, for example,\Nyou can get conflicting labels. Dialogue: 0,0:25:24.51,0:25:25.74,Default,,0000,0000,0000,,Yes. Dialogue: 0,0:25:25.74,0:25:27.66,Default,,0000,0000,0000,,(person 5) So this person is like... Dialogue: 0,0:25:28.40,0:25:30.78,Default,,0000,0000,0000,,We were talking about\Nindigenous things before, for example. Dialogue: 0,0:25:30.78,0:25:35.96,Default,,0000,0000,0000,,So this person is a Norwegian artist\Naccording to this source, Dialogue: 0,0:25:35.96,0:25:38.75,Default,,0000,0000,0000,,and a Sami artist,\Naccording to this source. Dialogue: 0,0:25:39.55,0:25:42.88,Default,,0000,0000,0000,,Or, for example, in Estonian,\Nwe had an issue Dialogue: 0,0:25:42.88,0:25:47.73,Default,,0000,0000,0000,,where we had to change terminology\Nto the official use terminology Dialogue: 0,0:25:47.73,0:25:49.48,Default,,0000,0000,0000,,in official lexicons, Dialogue: 0,0:25:49.48,0:25:52.26,Default,,0000,0000,0000,,but we have no way to indicate really why, Dialogue: 0,0:25:52.26,0:25:53.60,Default,,0000,0000,0000,,like what was the source of this Dialogue: 0,0:25:53.60,0:25:55.56,Default,,0000,0000,0000,,and why this was better\Nand what was there before. Dialogue: 0,0:25:55.56,0:25:57.15,Default,,0000,0000,0000,,It was just me as a random person Dialogue: 0,0:25:57.15,0:25:59.62,Default,,0000,0000,0000,,just switching the thing\Nto anyone who sees it. Dialogue: 0,0:25:59.62,0:26:02.52,Default,,0000,0000,0000,,So is there a plan\Nto make this possible in any way Dialogue: 0,0:26:02.52,0:26:06.36,Default,,0000,0000,0000,,so that we can actually have\Nproper sources for the language data? Dialogue: 0,0:26:07.04,0:26:11.57,Default,,0000,0000,0000,,So, it is partially possible. Dialogue: 0,0:26:11.57,0:26:15.96,Default,,0000,0000,0000,,So, for example, when you have\Nan item for a person, Dialogue: 0,0:26:16.97,0:26:22.72,Default,,0000,0000,0000,,you have a statement, first name,\Nlast name, and so on, of that person, Dialogue: 0,0:26:22.72,0:26:26.23,Default,,0000,0000,0000,,and then you can provide\Nthe reference for that there. Dialogue: 0,0:26:28.21,0:26:32.54,Default,,0000,0000,0000,,I'm quite hesitant to add more complexity Dialogue: 0,0:26:32.54,0:26:35.56,Default,,0000,0000,0000,,for references on labels and descriptions, Dialogue: 0,0:26:35.56,0:26:38.62,Default,,0000,0000,0000,,but if people really, really think Dialogue: 0,0:26:38.62,0:26:44.94,Default,,0000,0000,0000,,this is something that isn't covered\Nby any reference on the statement, Dialogue: 0,0:26:44.94,0:26:46.80,Default,,0000,0000,0000,,then let's talk about it. Dialogue: 0,0:26:49.08,0:26:53.30,Default,,0000,0000,0000,,But I fear it will add a lot of complexity Dialogue: 0,0:26:53.30,0:26:56.52,Default,,0000,0000,0000,,for what I hope are few cases, Dialogue: 0,0:26:57.39,0:27:00.19,Default,,0000,0000,0000,,but I'm willing to be convinced otherwise Dialogue: 0,0:27:00.19,0:27:04.09,Default,,0000,0000,0000,,if people really feel\Nvery strongly about this. Dialogue: 0,0:27:04.09,0:27:08.18,Default,,0000,0000,0000,,(person 5) I mean, if it's added\Nit probably shouldn't be the default, Dialogue: 0,0:27:08.18,0:27:12.45,Default,,0000,0000,0000,,show to all the users as a beginner,\Ninterface, in any case. Dialogue: 0,0:27:12.45,0:27:16.19,Default,,0000,0000,0000,,More like, "Click here if you need to say\Na specific thing about this." Dialogue: 0,0:27:17.63,0:27:23.37,Default,,0000,0000,0000,,Do we have a sense of how many times\Nthat would actually matter? Dialogue: 0,0:27:24.52,0:27:26.42,Default,,0000,0000,0000,,(person 5) In Estonian, for example-- Dialogue: 0,0:27:26.42,0:27:28.84,Default,,0000,0000,0000,,I expect this is true\Nof other languages as well-- Dialogue: 0,0:27:29.27,0:27:34.20,Default,,0000,0000,0000,,for example, there is an official name\Nthat is the actual legitimate translation, Dialogue: 0,0:27:34.20,0:27:36.21,Default,,0000,0000,0000,,for example, into English, Dialogue: 0,0:27:36.21,0:27:40.31,Default,,0000,0000,0000,,of, say, a specific kind of municipality. Dialogue: 0,0:27:40.61,0:27:42.18,Default,,0000,0000,0000,,That was my use case, for example, Dialogue: 0,0:27:42.18,0:27:44.41,Default,,0000,0000,0000,,where we were using the word "parish" Dialogue: 0,0:27:45.16,0:27:50.88,Default,,0000,0000,0000,,which the original Estonian word\Nwas meant kind of like church parish, Dialogue: 0,0:27:50.88,0:27:51.90,Default,,0000,0000,0000,,and that was the origin, Dialogue: 0,0:27:51.90,0:27:54.81,Default,,0000,0000,0000,,but that's not the official translation\NEstonia gets right now. Dialogue: 0,0:27:55.19,0:27:58.99,Default,,0000,0000,0000,,In this case, I would just add it\Nas official name statements Dialogue: 0,0:27:58.99,0:28:00.82,Default,,0000,0000,0000,,and add the reference there. Dialogue: 0,0:28:02.03,0:28:03.16,Default,,0000,0000,0000,,(person 5) Okay. Dialogue: 0,0:28:05.19,0:28:06.57,Default,,0000,0000,0000,,More questions, yes? Dialogue: 0,0:28:07.68,0:28:10.04,Default,,0000,0000,0000,,(person 6) I have two quick comments. Dialogue: 0,0:28:10.04,0:28:13.93,Default,,0000,0000,0000,,You specifically called out Asturian\Nas a language that does well, Dialogue: 0,0:28:13.93,0:28:16.46,Default,,0000,0000,0000,,and I think that's a false artifact. Dialogue: 0,0:28:16.46,0:28:17.72,Default,,0000,0000,0000,,Tell me about it. Dialogue: 0,0:28:17.72,0:28:19.75,Default,,0000,0000,0000,,(person 6) I think it's just a bot Dialogue: 0,0:28:19.75,0:28:24.07,Default,,0000,0000,0000,,that pasted person names,\Nlike proper names, Dialogue: 0,0:28:24.07,0:28:27.17,Default,,0000,0000,0000,,and said, "Well, this is exactly\Nlike in French or Spanish," Dialogue: 0,0:28:27.17,0:28:28.56,Default,,0000,0000,0000,,and just massively copied it. Dialogue: 0,0:28:28.56,0:28:33.32,Default,,0000,0000,0000,,One point of evidence is that\Nyou don't see that energy in Asturian Dialogue: 0,0:28:33.32,0:28:37.20,Default,,0000,0000,0000,,in things that actually\Nrequire translation, like property names, Dialogue: 0,0:28:37.20,0:28:39.65,Default,,0000,0000,0000,,or names of items\Nthat are not proper names. Dialogue: 0,0:28:39.65,0:28:41.22,Default,,0000,0000,0000,,Asaf, you break my heart. Dialogue: 0,0:28:41.22,0:28:43.20,Default,,0000,0000,0000,,(person 6) I know,\NI like raining on parades, Dialogue: 0,0:28:43.20,0:28:48.46,Default,,0000,0000,0000,,but I have good news as well,\Nwhich is about the pronunciation numbers. Dialogue: 0,0:28:49.41,0:28:53.52,Default,,0000,0000,0000,,As you probably know,\NCommons is full of pronunciation files, Dialogue: 0,0:28:53.52,0:28:54.67,Default,,0000,0000,0000,,and, for example, Dialogue: 0,0:28:54.67,0:29:01.10,Default,,0000,0000,0000,,Dutch has no less than 300,000\Npronunciation files already on Commons Dialogue: 0,0:29:01.91,0:29:05.05,Default,,0000,0000,0000,,that just need to somehow be ingested. Dialogue: 0,0:29:05.05,0:29:07.70,Default,,0000,0000,0000,,So if anyone's looking for a side project, Dialogue: 0,0:29:07.70,0:29:08.100,Default,,0000,0000,0000,,there's tons and tons Dialogue: 0,0:29:08.100,0:29:13.28,Default,,0000,0000,0000,,of classified, categorized\Npronunciation files on Commons Dialogue: 0,0:29:13.28,0:29:16.89,Default,,0000,0000,0000,,under the category\N"Pronunciation" by language. Dialogue: 0,0:29:16.89,0:29:22.84,Default,,0000,0000,0000,,So that's just waiting to be matched\Nto lexemes and put on Lexeme. Dialogue: 0,0:29:23.18,0:29:25.48,Default,,0000,0000,0000,,And I was wondering\Nif you could say something Dialogue: 0,0:29:25.48,0:29:26.58,Default,,0000,0000,0000,,about the road map, Dialogue: 0,0:29:26.58,0:29:28.76,Default,,0000,0000,0000,,something about how much investment Dialogue: 0,0:29:28.76,0:29:31.100,Default,,0000,0000,0000,,or what can we expect\Nfrom Lexeme in the coming year, Dialogue: 0,0:29:31.100,0:29:34.02,Default,,0000,0000,0000,,because I, for one, can't wait. Dialogue: 0,0:29:34.95,0:29:37.04,Default,,0000,0000,0000,,You can't wait? (chuckles) Dialogue: 0,0:29:37.04,0:29:39.12,Default,,0000,0000,0000,,- (person 6) For more.\N- Yes. (chuckles) Dialogue: 0,0:29:44.54,0:29:49.52,Default,,0000,0000,0000,,Right now, we're concentrating\Nmore on Wikibase and data quality Dialogue: 0,0:29:51.49,0:29:55.09,Default,,0000,0000,0000,,to see how much traction this gets Dialogue: 0,0:29:55.09,0:30:01.68,Default,,0000,0000,0000,,and then getting more for feeding off\Nwhere the pain points are next, Dialogue: 0,0:30:01.68,0:30:06.00,Default,,0000,0000,0000,,and then going back to improving\Nlexicographical data further. Dialogue: 0,0:30:06.90,0:30:09.79,Default,,0000,0000,0000,,And one of the things\NI'd love to hear from you Dialogue: 0,0:30:09.79,0:30:14.14,Default,,0000,0000,0000,,is where exactly do you see\Nthe next steps, Dialogue: 0,0:30:14.14,0:30:15.97,Default,,0000,0000,0000,,where do you want to see improvements Dialogue: 0,0:30:15.97,0:30:20.34,Default,,0000,0000,0000,,so that we can then figure out\Nhow to make that happen. Dialogue: 0,0:30:21.12,0:30:22.81,Default,,0000,0000,0000,,But, of course, you're right, Dialogue: 0,0:30:22.81,0:30:25.71,Default,,0000,0000,0000,,there's still so much to do\Nalso on the technical side. Dialogue: 0,0:30:30.57,0:30:35.85,Default,,0000,0000,0000,,(person 7) Okay, as we were uploading\Nthe Basque words with forms, Dialogue: 0,0:30:35.85,0:30:37.77,Default,,0000,0000,0000,,and you'll see some\Nof these kinds of things, Dialogue: 0,0:30:37.77,0:30:41.33,Default,,0000,0000,0000,,we were both like, last week we said,\N"Oh, we are the first one in something." Dialogue: 0,0:30:42.92,0:30:44.93,Default,,0000,0000,0000,,It's It appears in press, and it's like, Dialogue: 0,0:30:44.93,0:30:49.49,Default,,0000,0000,0000,,"Oh, Basque are the first time in some--\Nthey are the first in something, okay." Dialogue: 0,0:30:49.49,0:30:50.61,Default,,0000,0000,0000,,(laughs) Dialogue: 0,0:30:50.61,0:30:53.32,Default,,0000,0000,0000,,And then people ask,\N"Okay, but what is this for?" Dialogue: 0,0:30:54.68,0:30:56.85,Default,,0000,0000,0000,,We don't have a real good answer. Dialogue: 0,0:30:56.85,0:30:57.89,Default,,0000,0000,0000,,I mean it's like, okay, Dialogue: 0,0:30:57.89,0:31:01.84,Default,,0000,0000,0000,,this will help computers\Nto understand more our language, yes, Dialogue: 0,0:31:01.84,0:31:05.28,Default,,0000,0000,0000,,but what kind of tools\Ncan we make in the future? Dialogue: 0,0:31:05.28,0:31:07.47,Default,,0000,0000,0000,,And we don't have a good answer for this. Dialogue: 0,0:31:07.47,0:31:10.62,Default,,0000,0000,0000,,So I don't know\Nif you have a good answer for this. Dialogue: 0,0:31:10.62,0:31:12.74,Default,,0000,0000,0000,,(chuckles) I don't know\Nif I have a good answer, Dialogue: 0,0:31:12.74,0:31:14.75,Default,,0000,0000,0000,,but I have an answer. Dialogue: 0,0:31:15.48,0:31:20.42,Default,,0000,0000,0000,,So I think right now \Nas I was telling [inaudible], Dialogue: 0,0:31:20.42,0:31:21.92,Default,,0000,0000,0000,,we haven't reached that critical mass Dialogue: 0,0:31:21.92,0:31:25.53,Default,,0000,0000,0000,,where you can build a lot\Nof the really interesting tools. Dialogue: 0,0:31:25.53,0:31:27.71,Default,,0000,0000,0000,,But there are already some tools. Dialogue: 0,0:31:28.27,0:31:31.91,Default,,0000,0000,0000,,Just the other day,\NEsther [Pandelia], for example, Dialogue: 0,0:31:31.91,0:31:33.82,Default,,0000,0000,0000,,released a tool where you can see, Dialogue: 0,0:31:35.84,0:31:38.89,Default,,0000,0000,0000,,I think it was the words on a globe Dialogue: 0,0:31:38.89,0:31:41.90,Default,,0000,0000,0000,,where they're spoken,\Nwhere they're coming from. Dialogue: 0,0:31:42.63,0:31:44.09,Default,,0000,0000,0000,,I'm probably wrong about this, Dialogue: 0,0:31:44.09,0:31:46.35,Default,,0000,0000,0000,,but she had answered\Non the Project chat on Wikidata-- Dialogue: 0,0:31:46.35,0:31:48.98,Default,,0000,0000,0000,,you can look it up there. Dialogue: 0,0:31:49.57,0:31:51.80,Default,,0000,0000,0000,,So we have seen these first tools, Dialogue: 0,0:31:51.80,0:31:55.70,Default,,0000,0000,0000,,just like we've seen\Nback when Wikidata started. Dialogue: 0,0:31:56.85,0:31:59.60,Default,,0000,0000,0000,,First some--like just a network, Dialogue: 0,0:31:59.60,0:32:03.42,Default,,0000,0000,0000,,and like, "Hey, look, there's this thing\Nthat connects to this other thing." Dialogue: 0,0:32:04.82,0:32:07.06,Default,,0000,0000,0000,,And as we have more data, Dialogue: 0,0:32:07.06,0:32:10.35,Default,,0000,0000,0000,,and as we've reached some critical mass, Dialogue: 0,0:32:11.85,0:32:14.75,Default,,0000,0000,0000,,more powerful applications\Nbecome possible, Dialogue: 0,0:32:15.68,0:32:17.52,Default,,0000,0000,0000,,things like Histropedia, Dialogue: 0,0:32:19.13,0:32:21.99,Default,,0000,0000,0000,,things like question and answering Dialogue: 0,0:32:21.99,0:32:26.66,Default,,0000,0000,0000,,in your digital personal assistant,\NPlatypus, and so on. Dialogue: 0,0:32:26.66,0:32:29.67,Default,,0000,0000,0000,,And we're seeing\Na similar thing with lexemes. Dialogue: 0,0:32:31.20,0:32:34.65,Default,,0000,0000,0000,,We're at the stage\Nwhere you can build like these little, Dialogue: 0,0:32:34.65,0:32:37.46,Default,,0000,0000,0000,,hey, look, there's a connection\Nbetween the two things, Dialogue: 0,0:32:37.86,0:32:42.74,Default,,0000,0000,0000,,and there's a translation\Nof this word into that language stage, Dialogue: 0,0:32:42.74,0:32:47.75,Default,,0000,0000,0000,,and as we build it out\Nand as we describe more words, Dialogue: 0,0:32:47.75,0:32:49.53,Default,,0000,0000,0000,,more becomes possible. Dialogue: 0,0:32:49.53,0:32:51.80,Default,,0000,0000,0000,,Now, what becomes possible? Dialogue: 0,0:32:53.48,0:32:59.48,Default,,0000,0000,0000,,As Ben, our keynote speaker earlier\Nwas talking about translations, Dialogue: 0,0:33:00.10,0:33:03.46,Default,,0000,0000,0000,,being able to translate\Nfrom one language to another. Dialogue: 0,0:33:03.46,0:33:07.93,Default,,0000,0000,0000,,And Jens, my colleague,\Nhe's always talking about Dialogue: 0,0:33:07.93,0:33:11.45,Default,,0000,0000,0000,,the European Union\Nlooking for a translator Dialogue: 0,0:33:11.45,0:33:17.44,Default,,0000,0000,0000,,who can translate from\NI think it was Maltese to Swedish-- Dialogue: 0,0:33:17.44,0:33:19.44,Default,,0000,0000,0000,,- (person 8) Estonian.\N- Estonian. Dialogue: 0,0:33:22.02,0:33:26.21,Default,,0000,0000,0000,,And that is not a usual combination. Dialogue: 0,0:33:27.21,0:33:31.74,Default,,0000,0000,0000,,But once you have all these languages\Nin one machine-readable place, Dialogue: 0,0:33:31.74,0:33:33.14,Default,,0000,0000,0000,,you can do that, Dialogue: 0,0:33:33.14,0:33:36.86,Default,,0000,0000,0000,,you can get a dictionary Dialogue: 0,0:33:36.86,0:33:41.74,Default,,0000,0000,0000,,from Estonian to Maltese and back. Dialogue: 0,0:33:42.94,0:33:45.61,Default,,0000,0000,0000,,So covering language\Ncombinations in dictionaries Dialogue: 0,0:33:45.61,0:33:47.91,Default,,0000,0000,0000,,that just haven't been covered before Dialogue: 0,0:33:47.91,0:33:51.05,Default,,0000,0000,0000,,because there wasn't\Nenough demand for it, for example, Dialogue: 0,0:33:51.05,0:33:55.54,Default,,0000,0000,0000,,to make it financially viable\Nand to justify the work. Dialogue: 0,0:33:55.54,0:33:57.15,Default,,0000,0000,0000,,Now we can do that. Dialogue: 0,0:33:59.80,0:34:02.32,Default,,0000,0000,0000,,Then text generation. Dialogue: 0,0:34:02.32,0:34:03.65,Default,,0000,0000,0000,,Lucie was earlier talking Dialogue: 0,0:34:03.65,0:34:10.14,Default,,0000,0000,0000,,about how she's working\Nwith Hattie on generating text Dialogue: 0,0:34:10.14,0:34:14.67,Default,,0000,0000,0000,,to get Wikipedia articles\Nin minority languages started, Dialogue: 0,0:34:15.42,0:34:19.51,Default,,0000,0000,0000,,and that needs data about words, Dialogue: 0,0:34:19.51,0:34:22.59,Default,,0000,0000,0000,,and you need to understand\Nthe language to do that. Dialogue: 0,0:34:23.77,0:34:28.13,Default,,0000,0000,0000,,Yeah, and those are just some\Nthat come to my mind right now. Dialogue: 0,0:34:28.69,0:34:30.49,Default,,0000,0000,0000,,Maybe our audience has more ideas Dialogue: 0,0:34:30.49,0:34:34.35,Default,,0000,0000,0000,,what they want to do\Nwhen we have all the glorious data. Dialogue: 0,0:34:37.69,0:34:40.89,Default,,0000,0000,0000,,(person 9) Okay, I will deviate\Nfrom the lexemes topic. Dialogue: 0,0:34:40.89,0:34:42.67,Default,,0000,0000,0000,,I will ask the question, Dialogue: 0,0:34:42.67,0:34:45.63,Default,,0000,0000,0000,,how can I as a member of community Dialogue: 0,0:34:45.63,0:34:50.14,Default,,0000,0000,0000,,influence that priority is put on task, Dialogue: 0,0:34:50.14,0:34:56.64,Default,,0000,0000,0000,,that a new user comes, and he can indicate\Nwhat languages he wants to see and edit Dialogue: 0,0:34:56.64,0:35:01.14,Default,,0000,0000,0000,,without some secret verbal\Ntemplate knowledge. Dialogue: 0,0:35:02.14,0:35:05.05,Default,,0000,0000,0000,,Maybe there will be this year\Nthis technical wish list Dialogue: 0,0:35:05.05,0:35:07.04,Default,,0000,0000,0000,,without Wikipedia topics. Dialogue: 0,0:35:07.04,0:35:10.12,Default,,0000,0000,0000,,Maybe there's a hope\Nwe can all vote about Dialogue: 0,0:35:10.12,0:35:14.22,Default,,0000,0000,0000,,this thing we didn't fix for seven years. Dialogue: 0,0:35:14.22,0:35:17.61,Default,,0000,0000,0000,,So do you have any ideas\Nand comments about this? Dialogue: 0,0:35:18.22,0:35:20.33,Default,,0000,0000,0000,,So you're talking about the fact Dialogue: 0,0:35:20.33,0:35:23.52,Default,,0000,0000,0000,,that someone who is\Nnot logged into Wikidata Dialogue: 0,0:35:23.52,0:35:25.97,Default,,0000,0000,0000,,can't change their language easily? Dialogue: 0,0:35:25.97,0:35:27.84,Default,,0000,0000,0000,,(person 9) No, for [inaudible] users. Dialogue: 0,0:35:28.31,0:35:30.69,Default,,0000,0000,0000,,So, if they are logged in, Dialogue: 0,0:35:30.69,0:35:34.87,Default,,0000,0000,0000,,they can just change their language\Nat the top of the page, Dialogue: 0,0:35:35.89,0:35:38.10,Default,,0000,0000,0000,,and then it will appear Dialogue: 0,0:35:39.77,0:35:42.01,Default,,0000,0000,0000,,where the labels' description\N[inaudible] are, Dialogue: 0,0:35:42.01,0:35:43.48,Default,,0000,0000,0000,,and they can edit it. Dialogue: 0,0:35:45.66,0:35:49.01,Default,,0000,0000,0000,,(person 9) Well, actually, usually\Nmany times the workflow Dialogue: 0,0:35:49.01,0:35:52.45,Default,,0000,0000,0000,,is that if you want to have\Nmultiple languages, they are available, Dialogue: 0,0:35:52.45,0:35:55.42,Default,,0000,0000,0000,,and it's not always the case. Dialogue: 0,0:35:55.42,0:35:58.58,Default,,0000,0000,0000,,Okay, maybe we should sit down\Nafter this talk and you show me. Dialogue: 0,0:36:01.56,0:36:04.09,Default,,0000,0000,0000,,Cool. More questions? Dialogue: 0,0:36:05.53,0:36:06.54,Default,,0000,0000,0000,,Yes. Dialogue: 0,0:36:11.60,0:36:13.20,Default,,0000,0000,0000,,(person 10) Thanks for the presentation. Dialogue: 0,0:36:14.11,0:36:15.13,Default,,0000,0000,0000,,Can you comment Dialogue: 0,0:36:15.13,0:36:19.31,Default,,0000,0000,0000,,on the state of the correlation\Nwith the Wiktionary community. Dialogue: 0,0:36:19.31,0:36:22.30,Default,,0000,0000,0000,,As far as I've seen,\Nthere were some discussions Dialogue: 0,0:36:22.30,0:36:26.05,Default,,0000,0000,0000,,about importing some elements of the work, Dialogue: 0,0:36:26.05,0:36:30.84,Default,,0000,0000,0000,,but there seems to be licensing issues\Nand some disagreements, et cetera. Dialogue: 0,0:36:30.84,0:36:31.85,Default,,0000,0000,0000,,Right. Dialogue: 0,0:36:31.85,0:36:36.33,Default,,0000,0000,0000,,So, Wiktionary communities\Nhave spent a lot of time Dialogue: 0,0:36:37.32,0:36:39.47,Default,,0000,0000,0000,,building Wiktionary. Dialogue: 0,0:36:39.47,0:36:42.64,Default,,0000,0000,0000,,They have built Dialogue: 0,0:36:43.19,0:36:47.55,Default,,0000,0000,0000,,amazingly complicated\Nand complex templates Dialogue: 0,0:36:47.55,0:36:53.61,Default,,0000,0000,0000,,to build pretty tables\Nthat automatically generate forms for you Dialogue: 0,0:36:53.61,0:36:56.39,Default,,0000,0000,0000,,and all kinds of really impressive, Dialogue: 0,0:36:56.39,0:37:00.68,Default,,0000,0000,0000,,and kind of crazy stuff,\Nif you think about it. Dialogue: 0,0:37:02.31,0:37:07.99,Default,,0000,0000,0000,,And, of course, they have invested\Na lot of time and effort into that. Dialogue: 0,0:37:09.36,0:37:11.80,Default,,0000,0000,0000,,And understandably, Dialogue: 0,0:37:11.80,0:37:17.12,Default,,0000,0000,0000,,they don't just want that to be grabbed, Dialogue: 0,0:37:18.05,0:37:19.10,Default,,0000,0000,0000,,just like that. Dialogue: 0,0:37:19.10,0:37:21.79,Default,,0000,0000,0000,,So there's some of that coming from there. Dialogue: 0,0:37:22.76,0:37:25.14,Default,,0000,0000,0000,,And that's fine, that's okay. Dialogue: 0,0:37:25.74,0:37:32.09,Default,,0000,0000,0000,,Now, the first Wiktionary communities\Nare talking about turning out Dialogue: 0,0:37:32.09,0:37:34.33,Default,,0000,0000,0000,,and importing some\Nof their data into Wikidata. Dialogue: 0,0:37:34.33,0:37:39.10,Default,,0000,0000,0000,,Russian, you have seen,\Nfor example, is one of those cases Dialogue: 0,0:37:40.38,0:37:42.36,Default,,0000,0000,0000,,And I expect more of that to happen. Dialogue: 0,0:37:43.64,0:37:46.80,Default,,0000,0000,0000,,But it will be a slow process, Dialogue: 0,0:37:46.80,0:37:49.38,Default,,0000,0000,0000,,just like adoption\Nof Wikidata's data on Wikipedia Dialogue: 0,0:37:49.38,0:37:51.91,Default,,0000,0000,0000,,has been a rather slow process. Dialogue: 0,0:37:52.85,0:37:56.18,Default,,0000,0000,0000,,On the other side\Nof making it actually easier Dialogue: 0,0:37:56.18,0:37:59.13,Default,,0000,0000,0000,,to use the data that is in lexemes, Dialogue: 0,0:37:59.13,0:38:02.21,Default,,0000,0000,0000,,on Wiktionary, so that\Nthey can make use of that Dialogue: 0,0:38:02.21,0:38:05.53,Default,,0000,0000,0000,,and share data between\Nthe language Wiktionaries Dialogue: 0,0:38:05.53,0:38:08.85,Default,,0000,0000,0000,,which is super hard\Nto impossible right now, Dialogue: 0,0:38:08.85,0:38:11.56,Default,,0000,0000,0000,,which is crazy,\Njust like it was on Wikipedia. Dialogue: 0,0:38:13.86,0:38:16.32,Default,,0000,0000,0000,,Wait for the birthday present. (chuckles) Dialogue: 0,0:38:20.04,0:38:21.18,Default,,0000,0000,0000,,Yes. Dialogue: 0,0:38:22.60,0:38:24.83,Default,,0000,0000,0000,,(person 11) When I was thinking\Nthe other way around it, Dialogue: 0,0:38:24.83,0:38:28.17,Default,,0000,0000,0000,,I actually didn't want to say it\Nbecause I think this will be super silly, Dialogue: 0,0:38:28.17,0:38:32.00,Default,,0000,0000,0000,,but I think that Wiktionary\Nalready has some content, Dialogue: 0,0:38:32.00,0:38:34.98,Default,,0000,0000,0000,,and I know that\Nwe can't transfer it to Wikidata Dialogue: 0,0:38:34.98,0:38:37.05,Default,,0000,0000,0000,,because there's a difference in licenses. Dialogue: 0,0:38:37.05,0:38:39.63,Default,,0000,0000,0000,,But I was thinking maybe\Nwe can do something about that. Dialogue: 0,0:38:40.32,0:38:45.91,Default,,0000,0000,0000,,Maybe, I don't know, we can obtain\Nthe communities' permission Dialogue: 0,0:38:45.91,0:38:51.20,Default,,0000,0000,0000,,after like, I don't know,\Nhaving like a public voting Dialogue: 0,0:38:52.08,0:38:55.64,Default,,0000,0000,0000,,and for the community,\Nthe active members of the community Dialogue: 0,0:38:55.64,0:39:02.52,Default,,0000,0000,0000,,to vote and say if they would like \Nor accept or to transfer the content Dialogue: 0,0:39:02.52,0:39:05.53,Default,,0000,0000,0000,,for which they may do\Nthe Wikidata lexemes. Dialogue: 0,0:39:06.24,0:39:08.54,Default,,0000,0000,0000,,Because I just think it is such a waste. Dialogue: 0,0:39:09.57,0:39:14.44,Default,,0000,0000,0000,,So, that's definitely\Na conversation those people Dialogue: 0,0:39:14.44,0:39:18.25,Default,,0000,0000,0000,,who are in Wiktionary communities\Nare very welcome to bring up there. Dialogue: 0,0:39:18.25,0:39:24.65,Default,,0000,0000,0000,,I think it would be a bit presumptuous\Nfor us to go and force that. Dialogue: 0,0:39:25.92,0:39:31.14,Default,,0000,0000,0000,,But, yeah, I think it's definitely worth\Nhaving a conversation. Dialogue: 0,0:39:31.14,0:39:33.90,Default,,0000,0000,0000,,But I think it's also important\Nto understand Dialogue: 0,0:39:33.90,0:39:39.08,Default,,0000,0000,0000,,that there's a distinction between\Nwhat is actually legally allowed Dialogue: 0,0:39:39.08,0:39:43.15,Default,,0000,0000,0000,,and what we should be doing Dialogue: 0,0:39:43.15,0:39:45.43,Default,,0000,0000,0000,,and what those people want or do not want. Dialogue: 0,0:39:45.74,0:39:47.33,Default,,0000,0000,0000,,So even if it's legally allowed, Dialogue: 0,0:39:47.33,0:39:50.64,Default,,0000,0000,0000,,if some other Wiktionary communities\Ndo not want that, Dialogue: 0,0:39:50.64,0:39:53.54,Default,,0000,0000,0000,,I would be careful, at least. Dialogue: 0,0:39:58.89,0:40:02.49,Default,,0000,0000,0000,,I think you need the mic\Nfor the stream. Dialogue: 0,0:40:04.54,0:40:07.30,Default,,0000,0000,0000,,(person 12) So, obviously,\Nit's all very exciting, Dialogue: 0,0:40:07.98,0:40:12.32,Default,,0000,0000,0000,,and I immediately think\Nhow can I take that to my students Dialogue: 0,0:40:12.32,0:40:15.56,Default,,0000,0000,0000,,and how can I incorporate it\Nwith the courses, Dialogue: 0,0:40:15.56,0:40:18.53,Default,,0000,0000,0000,,the work that we're doing,\Neducational settings. Dialogue: 0,0:40:18.53,0:40:22.27,Default,,0000,0000,0000,,And I don't have, at the moment, Dialogue: 0,0:40:22.87,0:40:24.12,Default,,0000,0000,0000,,first of all, enough knowledge, Dialogue: 0,0:40:24.12,0:40:27.28,Default,,0000,0000,0000,,but I think the documentation\Nthat we do have Dialogue: 0,0:40:27.81,0:40:30.08,Default,,0000,0000,0000,,could be maybe improved. Dialogue: 0,0:40:30.08,0:40:33.44,Default,,0000,0000,0000,,So that's a kind of request\Nto make cool videos Dialogue: 0,0:40:33.44,0:40:35.90,Default,,0000,0000,0000,,that explain how it works Dialogue: 0,0:40:35.90,0:40:39.95,Default,,0000,0000,0000,,because if we have it, we can then use it, Dialogue: 0,0:40:39.95,0:40:41.98,Default,,0000,0000,0000,,and we can have students on board, Dialogue: 0,0:40:41.98,0:40:47.07,Default,,0000,0000,0000,,and we can make people understand\Nhow awesome it all is. Dialogue: 0,0:40:47.07,0:40:52.00,Default,,0000,0000,0000,,And yeah, just think about documentation\Nand think about education, please. Dialogue: 0,0:40:52.00,0:40:54.48,Default,,0000,0000,0000,,Because I think a lot could be done. Dialogue: 0,0:40:54.48,0:40:58.58,Default,,0000,0000,0000,,These are like many tasks\Nthat could be done even with... Dialogue: 0,0:41:00.12,0:41:02.03,Default,,0000,0000,0000,,well, I wouldn't say primary schools, Dialogue: 0,0:41:02.03,0:41:05.50,Default,,0000,0000,0000,,but certainly, even younger students. Dialogue: 0,0:41:05.92,0:41:10.87,Default,,0000,0000,0000,,And so I would really like to see\Nthat potential being tapped into, Dialogue: 0,0:41:10.87,0:41:15.27,Default,,0000,0000,0000,,and, as of now, I personally\Ndon't understand enough Dialogue: 0,0:41:15.27,0:41:19.50,Default,,0000,0000,0000,,to be able to create tasks\Nor to create like... Dialogue: 0,0:41:20.43,0:41:22.16,Default,,0000,0000,0000,,to do something practical with it. Dialogue: 0,0:41:22.16,0:41:25.77,Default,,0000,0000,0000,,So any help, any thoughts\Nanyone here has about that, Dialogue: 0,0:41:25.77,0:41:29.65,Default,,0000,0000,0000,,I would be very happy to hear\Nyour thoughts, and yours as well. Dialogue: 0,0:41:30.51,0:41:32.13,Default,,0000,0000,0000,,Yeah, let's talk about that. Dialogue: 0,0:41:35.47,0:41:37.14,Default,,0000,0000,0000,,More questions? Dialogue: 0,0:41:37.81,0:41:39.20,Default,,0000,0000,0000,,Someone else raised a hand. Dialogue: 0,0:41:39.20,0:41:40.50,Default,,0000,0000,0000,,I forgot where it was. Dialogue: 0,0:41:45.74,0:41:49.100,Default,,0000,0000,0000,,(person 13) So, if we can't import\Nfrom Wiktionary, Dialogue: 0,0:41:49.100,0:41:55.77,Default,,0000,0000,0000,,is there some concerted effort\Nto find other public domain sources, Dialogue: 0,0:41:55.77,0:41:57.46,Default,,0000,0000,0000,,maybe all the data, Dialogue: 0,0:41:58.77,0:42:03.17,Default,,0000,0000,0000,,and kind of prefilter it, organize it Dialogue: 0,0:42:03.17,0:42:08.47,Default,,0000,0000,0000,,so that it's easy to be checked\Nby people for import? Dialogue: 0,0:42:09.09,0:42:11.18,Default,,0000,0000,0000,,So there are first efforts. Dialogue: 0,0:42:11.18,0:42:14.77,Default,,0000,0000,0000,,My understanding is that Basque\Nis one of those efforts. Dialogue: 0,0:42:14.77,0:42:17.47,Default,,0000,0000,0000,,Maybe you want to say\Na bit more about it? Dialogue: 0,0:42:18.43,0:42:20.13,Default,,0000,0000,0000,,(person 14) [inaudible] Dialogue: 0,0:42:23.17,0:42:27.15,Default,,0000,0000,0000,,Okay, the actual answer \Nis paying for that... Dialogue: 0,0:42:28.37,0:42:33.38,Default,,0000,0000,0000,,I mean, we have an agreement\Nwith a contractor we usually work with. Dialogue: 0,0:42:34.80,0:42:38.72,Default,,0000,0000,0000,,They do dictionaries-- Dialogue: 0,0:42:40.32,0:42:42.46,Default,,0000,0000,0000,,lots of stuff, but they do dictionaries. Dialogue: 0,0:42:42.46,0:42:47.47,Default,,0000,0000,0000,,So we agreed with them\Nto make free the students' dictionary, Dialogue: 0,0:42:47.47,0:42:52.78,Default,,0000,0000,0000,,we would [cast] the most common words\Nand start uploading it Dialogue: 0,0:42:52.78,0:42:55.59,Default,,0000,0000,0000,,with an external identifier\Nand the scheme of things. Dialogue: 0,0:42:56.42,0:43:02.90,Default,,0000,0000,0000,,But there was some discussion\Nabout leaving it on CC0 Dialogue: 0,0:43:03.21,0:43:05.32,Default,,0000,0000,0000,,because they have\Nthe dictionary with CC by it, Dialogue: 0,0:43:06.54,0:43:10.33,Default,,0000,0000,0000,,and they understood\Nwhat the difference was. Dialogue: 0,0:43:10.33,0:43:13.87,Default,,0000,0000,0000,,So there was some discussion. Dialogue: 0,0:43:13.87,0:43:19.71,Default,,0000,0000,0000,,But I think that we can provide some tools\Nor some examples in the future, Dialogue: 0,0:43:19.71,0:43:21.76,Default,,0000,0000,0000,,and I think that there will be\Nother dictionaries Dialogue: 0,0:43:21.76,0:43:24.02,Default,,0000,0000,0000,,that we can handle, Dialogue: 0,0:43:24.02,0:43:29.27,Default,,0000,0000,0000,,and also I think Wiktionary\Nshould start moving in that direction, Dialogue: 0,0:43:29.27,0:43:32.26,Default,,0000,0000,0000,,but that's another great discussion. Dialogue: 0,0:43:33.28,0:43:34.49,Default,,0000,0000,0000,,And on top of that, Dialogue: 0,0:43:34.49,0:43:38.84,Default,,0000,0000,0000,,Lea is also in contact\Nwith people from Occitan Dialogue: 0,0:43:38.84,0:43:41.83,Default,,0000,0000,0000,,who work on Occitan dictionaries, Dialogue: 0,0:43:41.83,0:43:45.14,Default,,0000,0000,0000,,and they're currently working\Non a Sumerian collaboration. Dialogue: 0,0:43:51.64,0:43:53.36,Default,,0000,0000,0000,,More questions? Dialogue: 0,0:44:01.49,0:44:05.35,Default,,0000,0000,0000,,(person 15) Hi! We are the people\Nwho want to import Occitan data. Dialogue: 0,0:44:05.35,0:44:06.58,Default,,0000,0000,0000,,Aha! Perfect! Dialogue: 0,0:44:06.58,0:44:08.37,Default,,0000,0000,0000,,(person 15) And we have a small problem. Dialogue: 0,0:44:09.19,0:44:14.22,Default,,0000,0000,0000,,We don't know how to represent\Nthe variety of all lexemes. Dialogue: 0,0:44:14.22,0:44:17.89,Default,,0000,0000,0000,,We have six dialects, Dialogue: 0,0:44:17.89,0:44:24.01,Default,,0000,0000,0000,,and we want to indicate for Lexeme\Nin which dialect it's used, Dialogue: 0,0:44:24.01,0:44:27.28,Default,,0000,0000,0000,,and we don't have a proper\NC0 statement to do that. Dialogue: 0,0:44:27.28,0:44:31.10,Default,,0000,0000,0000,,So as long as the segment doesn't exist, Dialogue: 0,0:44:31.64,0:44:34.46,Default,,0000,0000,0000,,it prevents us from [inaudible] Dialogue: 0,0:44:34.46,0:44:37.60,Default,,0000,0000,0000,,because we will need to do it again Dialogue: 0,0:44:37.60,0:44:42.08,Default,,0000,0000,0000,,when we will be able\Nto [export] the statement. Dialogue: 0,0:44:42.08,0:44:44.55,Default,,0000,0000,0000,,And it's complicated\Nbecause it's a statement Dialogue: 0,0:44:44.55,0:44:47.80,Default,,0000,0000,0000,,which won't be asked by many people Dialogue: 0,0:44:47.80,0:44:53.44,Default,,0000,0000,0000,,because it's a statement\Nwhich concerns mostly minority languages. Dialogue: 0,0:44:53.44,0:44:56.93,Default,,0000,0000,0000,,So you will have one person to ask this. Dialogue: 0,0:44:56.93,0:45:00.02,Default,,0000,0000,0000,,But as our colleagues Basque, Dialogue: 0,0:45:00.02,0:45:06.08,Default,,0000,0000,0000,,it can be one person\Nwho will power thousands of others, Dialogue: 0,0:45:06.08,0:45:10.88,Default,,0000,0000,0000,,so it might not be asking a lot, Dialogue: 0,0:45:10.88,0:45:14.14,Default,,0000,0000,0000,,but it will be very important for us. Dialogue: 0,0:45:14.87,0:45:17.60,Default,,0000,0000,0000,,Do you already have\Na new property proposal up, Dialogue: 0,0:45:17.60,0:45:19.47,Default,,0000,0000,0000,,or do you need help creating it? Dialogue: 0,0:45:21.52,0:45:24.30,Default,,0000,0000,0000,,(person 15) We asked four months ago. Dialogue: 0,0:45:24.72,0:45:28.76,Default,,0000,0000,0000,,Alright, then let's get some people\Nto help out with this property proposal. Dialogue: 0,0:45:30.16,0:45:33.09,Default,,0000,0000,0000,,I'm sure there are enough people\Nin this room to make this happen. Dialogue: 0,0:45:33.36,0:45:35.45,Default,,0000,0000,0000,,(person 15) Property proposal\N[speaking in French]. Dialogue: 0,0:45:35.45,0:45:36.96,Default,,0000,0000,0000,,(person 16) We didn't have an answer. Dialogue: 0,0:45:36.96,0:45:39.77,Default,,0000,0000,0000,,(person 15) We didn't have any answer,\Nand we don't know how to do this Dialogue: 0,0:45:39.77,0:45:42.95,Default,,0000,0000,0000,,because we aren't \Nin the Wikidata community. Dialogue: 0,0:45:44.69,0:45:48.82,Default,,0000,0000,0000,,Yup, so there are people here\Nwho can help you. Dialogue: 0,0:45:48.82,0:45:52.13,Default,,0000,0000,0000,,Maybe someone raises their hand to take-- Dialogue: 0,0:45:52.57,0:45:53.64,Default,,0000,0000,0000,,(person 14) I'm for that. Dialogue: 0,0:45:53.64,0:45:55.51,Default,,0000,0000,0000,,But I think this is quite interesting Dialogue: 0,0:45:55.51,0:45:59.06,Default,,0000,0000,0000,,that only the variant of form Dialogue: 0,0:45:59.06,0:46:02.61,Default,,0000,0000,0000,,also can handle it geographically, Dialogue: 0,0:46:02.61,0:46:04.100,Default,,0000,0000,0000,,with coordinates or some kind of mapping. Dialogue: 0,0:46:05.60,0:46:07.82,Default,,0000,0000,0000,,Also having different pronunciations, Dialogue: 0,0:46:07.82,0:46:11.84,Default,,0000,0000,0000,,and I think this is something\Nthat happens in lots of languages. Dialogue: 0,0:46:12.61,0:46:16.26,Default,,0000,0000,0000,,We should start making\Nit happen [inaudible], Dialogue: 0,0:46:16.26,0:46:18.86,Default,,0000,0000,0000,,and I'm going to search for the property. Dialogue: 0,0:46:19.78,0:46:20.93,Default,,0000,0000,0000,,Cool. Dialogue: 0,0:46:20.93,0:46:24.45,Default,,0000,0000,0000,,So you will get backing\Nfor your property proposal. Dialogue: 0,0:46:26.14,0:46:27.30,Default,,0000,0000,0000,,Thank you. Dialogue: 0,0:46:28.15,0:46:30.26,Default,,0000,0000,0000,,Alright, more questions? Dialogue: 0,0:46:32.41,0:46:33.47,Default,,0000,0000,0000,,Finn. Dialogue: 0,0:46:33.97,0:46:35.06,Default,,0000,0000,0000,,Finn is one of those people Dialogue: 0,0:46:35.06,0:46:38.03,Default,,0000,0000,0000,,who builds stuff\Non top of lexicographical data. Dialogue: 0,0:46:38.03,0:46:40.08,Default,,0000,0000,0000,,(Finn) It's just a small question, Dialogue: 0,0:46:40.40,0:46:44.23,Default,,0000,0000,0000,,and that's about spelling variations. Dialogue: 0,0:46:44.90,0:46:48.00,Default,,0000,0000,0000,,It seems to be difficult to put them in... Dialogue: 0,0:46:48.53,0:46:53.37,Default,,0000,0000,0000,,You could, of course,\Nhave multiple forms for the same word. Dialogue: 0,0:46:56.33,0:46:58.45,Default,,0000,0000,0000,,I don't know, it seems to be... Dialogue: 0,0:46:59.56,0:47:03.54,Default,,0000,0000,0000,,If you don't do it that way,\Nit seems to be difficult to specify... Dialogue: 0,0:47:04.77,0:47:05.89,Default,,0000,0000,0000,,or I don't know whether Dialogue: 0,0:47:05.89,0:47:09.73,Default,,0000,0000,0000,,this is just a minor technical issue\Nor whether... Dialogue: 0,0:47:09.73,0:47:11.25,Default,,0000,0000,0000,,Let's look at it together. Dialogue: 0,0:47:11.64,0:47:15.23,Default,,0000,0000,0000,,I would love to see an example. Dialogue: 0,0:47:17.48,0:47:18.48,Default,,0000,0000,0000,,Asaf. Dialogue: 0,0:47:26.89,0:47:28.40,Default,,0000,0000,0000,,(Asaf) Thank you. Dialogue: 0,0:47:29.39,0:47:33.68,Default,,0000,0000,0000,,I can give a very concrete example\Nfrom my mother tongue, Hebrew. Dialogue: 0,0:47:34.20,0:47:38.84,Default,,0000,0000,0000,,Hebrew has two main variants Dialogue: 0,0:47:38.84,0:47:42.79,Default,,0000,0000,0000,,for expressing almost every word Dialogue: 0,0:47:42.79,0:47:47.64,Default,,0000,0000,0000,,because the traditional spelling Dialogue: 0,0:47:47.64,0:47:50.04,Default,,0000,0000,0000,,leaves out many of the vowels. Dialogue: 0,0:47:50.93,0:47:55.21,Default,,0000,0000,0000,,And, therefore, in modern editions\Nof the Bible and of poetry, Dialogue: 0,0:47:55.21,0:47:57.46,Default,,0000,0000,0000,,diacritics are used. Dialogue: 0,0:47:57.46,0:48:02.67,Default,,0000,0000,0000,,However, those diacritics\Nare never used for modern prose Dialogue: 0,0:48:02.67,0:48:05.97,Default,,0000,0000,0000,,or newspaper writing or street signs. Dialogue: 0,0:48:05.97,0:48:11.21,Default,,0000,0000,0000,,So the average daily casual use\Nputs in extra vowels Dialogue: 0,0:48:12.17,0:48:13.52,Default,,0000,0000,0000,,and doesn't use the diacritics Dialogue: 0,0:48:13.52,0:48:15.61,Default,,0000,0000,0000,,because they are,\Nof course, more cumbersome Dialogue: 0,0:48:15.61,0:48:17.89,Default,,0000,0000,0000,,and have all kinds of rules\Nand nobody knows the rules. Dialogue: 0,0:48:18.63,0:48:20.53,Default,,0000,0000,0000,,So there are basically two variants. Dialogue: 0,0:48:20.53,0:48:25.32,Default,,0000,0000,0000,,There's the everyday casual prose variant, Dialogue: 0,0:48:25.32,0:48:27.83,Default,,0000,0000,0000,,and there's the Bible or poetry, Dialogue: 0,0:48:27.83,0:48:32.20,Default,,0000,0000,0000,,which always come\Nin this traditional diacriticized text. Dialogue: 0,0:48:32.20,0:48:33.30,Default,,0000,0000,0000,,To be useful, Dialogue: 0,0:48:33.30,0:48:37.43,Default,,0000,0000,0000,,Lexeme would have to recognize\Nboth varieties of every single word Dialogue: 0,0:48:37.43,0:48:39.75,Default,,0000,0000,0000,,and every single form\Nof every single word. Dialogue: 0,0:48:40.68,0:48:43.39,Default,,0000,0000,0000,,So that's a very comprehensive use case Dialogue: 0,0:48:43.39,0:48:46.34,Default,,0000,0000,0000,,for official stable variants. Dialogue: 0,0:48:46.34,0:48:48.94,Default,,0000,0000,0000,,It's not dialect, it's not regions, Dialogue: 0,0:48:49.33,0:48:53.63,Default,,0000,0000,0000,,it's basically two coexisting\Nmorphological systems. Dialogue: 0,0:48:54.54,0:48:58.93,Default,,0000,0000,0000,,And I too don't know exactly\Nhow to express that in Lexeme today, Dialogue: 0,0:48:58.93,0:49:02.80,Default,,0000,0000,0000,,which is one thing that is keeping me\Nin partial answer to Magnus' question Dialogue: 0,0:49:02.80,0:49:05.24,Default,,0000,0000,0000,,from uploading the parts that are ready Dialogue: 0,0:49:05.24,0:49:09.39,Default,,0000,0000,0000,,from the biggest Hebrew dictionary,\Nwhich is public domain Dialogue: 0,0:49:09.39,0:49:13.14,Default,,0000,0000,0000,,and which I have been digitizing\Nfor several years now. Dialogue: 0,0:49:13.14,0:49:14.80,Default,,0000,0000,0000,,A good portion of it is ready, Dialogue: 0,0:49:14.80,0:49:16.55,Default,,0000,0000,0000,,but I'm not putting it on Lexeme right now Dialogue: 0,0:49:16.55,0:49:20.24,Default,,0000,0000,0000,,because I don't know exactly\Nhow to solve this problem. Dialogue: 0,0:49:20.24,0:49:23.39,Default,,0000,0000,0000,,Alright, let's solve\Nthis problem here. (chuckles) Dialogue: 0,0:49:24.50,0:49:26.02,Default,,0000,0000,0000,,That has to be possible. Dialogue: 0,0:49:30.04,0:49:32.05,Default,,0000,0000,0000,,Alright, more questions? Dialogue: 0,0:49:37.17,0:49:39.74,Default,,0000,0000,0000,,If not, then thank you so much. Dialogue: 0,0:49:40.60,0:49:42.68,Default,,0000,0000,0000,,(applause)