< Return to Video

36C3 Wikipaka WG: Wikimedia Cloud Services introduction

  • 0:00 - 0:19
    Music
  • 0:19 - 0:24
    Herald: Hello everybody, we are ready to
    get started we have Lucas and Amir here
  • 0:24 - 0:29
    and they want to give us a quick
    introduction of a project from the
  • 0:29 - 0:34
    wikimedia foundation called "cloud
    services" and how it might be may be
  • 0:34 - 0:39
    useful to all of us. So let's give a round
    of welcoming applause to Lucas and Amir.
  • 0:39 - 0:43
    Applause
  • 0:43 - 0:49
    Lucas: Thanks! yea, hello. So "wikimedia
    cloud services" is basically this big
  • 0:49 - 0:55
    collection of all kinds of different
    things which are useful if you want to do
  • 0:55 - 0:59
    taking your things in the wikimedia
    universe like with wikipedia or other
  • 0:59 - 1:06
    projects and you get them free of charge
    or you can just use them and the only
  • 1:06 - 1:10
    requirement is that you use them for
    something that's a kind of relevant to the
  • 1:10 - 1:14
    mission of wikimedia of promoting free
    knowledge and that kind of stuff and it's
  • 1:14 - 1:18
    kind of split into the things that you can
    do with your regular wikimedia account
  • 1:18 - 1:23
    which any registered user can do and then
    there's also things you need a special
  • 1:23 - 1:26
    account for on a different system called
    wiki tech and Amir is going to talk more
  • 1:26 - 1:30
    about those later but first let's just
    look into some of the things you can do
  • 1:30 - 1:34
    with your regular wikimedia account. And
    if you want to follow any of these links
  • 1:34 - 1:39
    there's a shortcut here. I was about to
    switch the next tab, so let's just stay
  • 1:39 - 1:45
    here for a few seconds yeah. So the first
    thing is the API sandbox which is if you
  • 1:45 - 1:52
    want to use the MediaWiki API to figure
    out what you have on a page or to make
  • 1:52 - 1:56
    edits or any kind of stuff. The API
    sandbox is a special page that's really
  • 1:56 - 2:01
    useful to find out how to use the API for
    example here's all the different actions I
  • 2:01 - 2:08
    can use that say query is the kind of
    general catch-all action that's here and
  • 2:08 - 2:13
    then I get down here a list of all the
    parameters I can use with queries such as:
  • 2:13 - 2:20
    I want to have all the user info and what
    kind of user info do I want? I want
  • 2:20 - 2:24
    options, blablabla. I would like to have
    some different format versions. So it
  • 2:24 - 2:29
    gives you all these nice inputs for
    figuring out exactly how to use the API
  • 2:29 - 2:33
    what's valid what's not valid and then you
    can make the API request and there you get
  • 2:33 - 2:38
    a response and we can't read anything
    because it's zoomed in way too much. But
  • 2:38 - 2:43
    it's very helpful when trying to use the
    API and then in the end here you can see
  • 2:43 - 2:49
    what you need to do in your own code to
    make the same API request. And for
  • 2:49 - 2:55
    anything that you can't do with the normal
    API - so if you want to do some kind of
  • 2:55 - 3:00
    more expensive analysis - you can often do
    that with Quarry, which is a tool that
  • 3:00 - 3:05
    lets you write SQL queries against
    databases that are almost like the ones in
  • 3:05 - 3:09
    production like you don't have user
    passwords and stuff but you'll have all
  • 3:09 - 3:16
    the database tables with page metadata and
    connections between them and the logs and
  • 3:16 - 3:20
    all kinds of stuff and you can just write
    your SQL here send it and you get the
  • 3:20 - 3:25
    results for example here's the number of
    lexemes published a days so it's some kind
  • 3:25 - 3:31
    of selecting from the page where the
    namespace is the lexeme namespace and
  • 3:31 - 3:41
    grouping that by the date and then we get
    something like all the way down to
  • 3:41 - 3:46
    September which is apparently when I ran
    this query there were here there were 116
  • 3:46 - 3:53
    lexemes created in this day. Or here
    someone had a list of edits to JavaScript
  • 3:53 - 3:58
    and CSS pages on Hungarian Wikipedia so
    you can run these queries against any Wiki
  • 3:58 - 4:07
    you like, like this here in wikipedia one.
    And if you can't get by with just SQL what
  • 4:07 - 4:13
    you also have is this thing called Paws,
    which gives you a Jupiter(?) instance if
  • 4:13 - 4:19
    you've heard of that you can basically
    write your own Python code here and do it
  • 4:19 - 4:24
    in a very convenient way because there's
    all kinds of auto-completion and helpful
  • 4:24 - 4:37
    things. So i can just try to copy this and
    run the code (then I needed a new cell
  • 4:37 - 4:43
    below it… there we go, Thanks!) and if I
    type item I should get helpful hints what
  • 4:43 - 4:50
    I can do with the item (if it's not
    hanging or something or the tab control
  • 4:50 - 4:59
    space no oh there we go yeah) and it's
    also a very useful way to work with py-
  • 4:59 - 5:07
    wiki-bot or you can also directly get
    normal shell here. And one thing (oops did
  • 5:07 - 5:12
    I click and wrong thing? I would like to
    have oh no I don't want a bash notebook I
  • 5:12 - 5:21
    want a new terminal that's what I want).
    And here you have for example database
  • 5:21 - 5:33
    dumps in (where was it?) public/dumps/
    something public again… So if you want to
  • 5:33 - 5:41
    do some kind of analysis here on the data
    dumps you can get them here and then have
  • 5:41 - 5:48
    all the computing that you want I guess to
    analyze the wiki more thoroughly and all
  • 5:48 - 5:52
    of this is hosted in the Wikimedia Cloud
    for you and you don't need your own server
  • 5:52 - 5:57
    or anything. Oh yeah I had two more
    examples of that, for example here: I use
  • 5:57 - 6:02
    that too so there were a lot of items on
    Wikipedia where there was some encoding
  • 6:02 - 6:06
    error, this should be an apostrophe like
    down here and instead it was this kind of
  • 6:06 - 6:13
    I with an accent and I hacked together
    some ugly Java/Python code to make all of
  • 6:13 - 6:17
    these edits and it was already logged in
    as well I didn't need to worry about
  • 6:17 - 6:21
    logging in or having a password or
    anything. So it's a very convenient way to
  • 6:21 - 6:29
    make edits as well. Or you can build
    something nicer here you can insert like
  • 6:29 - 6:37
    markdown cells to explain what you're
    doing and how the code works and build
  • 6:37 - 6:42
    nice notebooks like that, which are almost
    self-explanatory. And those are some of
  • 6:42 - 6:45
    the things you can do just with your
    Wikimedia account and now Amir is going to
  • 6:45 - 6:49
    talk about some other things.
    Amir: Thanks Lucas! So the thing that we
  • 6:49 - 6:55
    can do is that maybe some of you like me
    think that doing thing in browser is for
  • 6:55 - 7:00
    kids I need to do things in terminal I
    need to do connected system and then you
  • 7:00 - 7:05
    can access for a wiki tech account which
    you can just make a wiki tech account in
  • 7:05 - 7:12
    this place called wiki tech. (where is the
    li… no no but I do'… the main thing, the
  • 7:12 - 7:21
    main list. yeah okay) And so in here so
    and then you make a wiki tech account and
  • 7:21 - 7:25
    it gets approved quickly and then you get
    the shell and then you can just quickly go
  • 7:25 - 7:30
    there (where is yer…) and you can go to
    this shell and just log in and then you
  • 7:30 - 7:35
    have access to day a big set of nodes in
    the cloud and you can just do whatever you
  • 7:35 - 7:41
    want. Also you have access to the core
    dumps and you have access to the replica
  • 7:41 - 7:59
    database. Let me show it to you.
    [mumbling] So for example you can go to LS
  • 7:59 - 8:14
    /public/dumps/public/wikidatawiki/ and
    then you get - oh there's like all sorts
  • 8:14 - 8:19
    of time and everything that you want to,
    but if you also… you can do something else
  • 8:19 - 8:33
    is that you can just do SQL wikidatawiki
    and then you go inside the wikidata's
  • 8:33 - 8:36
    database, I mean it does you don't have
    the rights you can you cannot write to
  • 8:36 - 8:40
    their replica because it's a replica and
    also it's sanitized so it doesn't have
  • 8:40 - 8:50
    their like hash of user password and stuff
    like that but still you can do just select
  • 8:50 - 9:09
    varies from recent changes limits five and
    yeah and then you get all of the things
  • 9:09 - 9:15
    that you want you cannot even describe
    anything you want to directly into their
  • 9:15 - 9:20
    system and then there is also we have
    something called the job grid so you can
  • 9:20 - 9:25
    just put a crown and anything that you
    want to or just it's something run
  • 9:25 - 9:31
    something directly and you goes to the a
    big note of cloud kubernetes and then just
  • 9:31 - 9:36
    runs everything that you want to in its
    here there's a more information about it
  • 9:36 - 9:43
    in here there's a like a long help that it
    says like oh I used to run this job and
  • 9:43 - 9:48
    then job of what it does and you can get
    this so you just need to it's a bash
  • 9:48 - 9:55
    command you can run any bash command and
    send it okay return me this output to this
  • 9:55 - 10:00
    place and the other places one thing that
    you can do is also there's a web server
  • 10:00 - 10:05
    that you can access everything directly so
    you can just put a PHP file there and into
  • 10:05 - 10:13
    the Apache and then yet for example this
    is this is an example that we built
  • 10:13 - 10:19
    together I think two two Christmases ago
    but this was like you can just see this is
  • 10:19 - 10:24
    a piece before the source code is
    available and you just copy pasted that
  • 10:24 - 10:28
    source code into like a directory and it
    was there and every time we click on it
  • 10:28 - 10:32
    and you get most of the edits that happen
    on description wiki data that might be
  • 10:32 - 10:38
    vandalism and we can fix it also a this is
    not just the only thing that you can do
  • 10:38 - 10:45
    with this is that you can also put a
    Python flask application is this the file
  • 10:45 - 10:51
    implants and then this can be just a
    Python application and you can just have
  • 10:51 - 10:58
    the file there and also know JSON Java
    there's so many of them also you can have
  • 10:58 - 11:01
    own database like I have something that
    has its own database for example quick
  • 11:01 - 11:10
    categories in here has jobs that are here
    this is this tool for its own built-in
  • 11:10 - 11:15
    database inside our select cloud services
    and its uses it just fine you can do that
  • 11:15 - 11:22
    as well and also there's a cloud VPS that
    it doesn't do any kubernetes it just you
  • 11:22 - 11:27
    can make a VPS of your own and then do
    whatever you want with it so for example
  • 11:27 - 11:32
    and you get a project and you get the
    quota it's a slightly more limited but
  • 11:32 - 11:36
    also you have access to the whole VPS you
    have sudo rights on it you can do whatever
  • 11:36 - 11:40
    you feel like about it so we have like for
    example this project in here and it's
  • 11:40 - 11:47
    called tools and then there's proxies and
    you can for example go into that instance
  • 11:47 - 11:52
    and reboot it and do whatever you want and
    you can make new instance and look at your
  • 11:52 - 11:59
    culture and look at everything else there
    and also you can also make it even a wiki
  • 11:59 - 12:06
    on one of those cloud VPS systems which is
    for example we did in here in here if you
  • 12:06 - 12:11
    look at it it's just a wiki and the
    difference is that for other ones for
  • 12:11 - 12:15
    example for the vandalism dashboard you
    have tools that wmf labs org and then
  • 12:15 - 12:22
    slash WD w VD which is the tool itself but
    in here we get our own subdomain so which
  • 12:22 - 12:29
    will be wiki data - like seam that flew
    out the wmf labs org and you can even put
  • 12:29 - 12:36
    all sorts of add subdomains for the wmf
    labs or as long it's not taken so you can
  • 12:36 - 12:42
    build a media week instance instance or
    you can just complete a new software
  • 12:42 - 12:48
    anything you can put a word processor who
    cares and then you can use it it's very
  • 12:48 - 12:59
    simple your own thing and you can help
    lots of experience. Anything else?
  • 12:59 - 13:00
    Lucas: I don't think so. Most
  • 13:00 - 13:06
    important I would say is tool Forge to run
    your websites or if that's not enough for
  • 13:06 - 13:11
    you cloud VPS and then you get your own
    VMware you can do absolutely anything you
  • 13:11 - 13:20
    want as long as it matches those rules and
    stuff and I think that's it are there any questions?
  • 13:20 - 13:25
    Herald: Hello thank you very much for
    the talk that was very quick so maybe
  • 13:25 - 13:35
    anybody has a question here I'll give you
    my microphone to ask it. I don't see any
  • 13:35 - 13:42
    hands nope okay I don't think we have
    questions but if you're just too shy to
  • 13:42 - 13:47
    ask I think these guys always hanging
    around here around the wikipaka wiki so
  • 13:47 - 13:53
    if you have anything you want to talk
    about you'll find them later okay then
  • 13:53 - 13:56
    give a round of applause again
    for Lucas and Amir.
  • 13:56 - 13:59
    Applause
  • 13:59 - 14:26
    Music
Title:
36C3 Wikipaka WG: Wikimedia Cloud Services introduction
Description:

more » « less
Video Language:
English
Duration:
14:26

English subtitles

Revisions