Return to Video

https:/.../2020-06-29_gov355m_14.g_conclusion.mp4

  • 0:06 - 0:11
    Alright, this is the last segment
    for the module but also the
  • 0:11 - 0:13
    last segment for the course.
  • 0:13 - 0:19
    We have gone through a tremendous
    amount in what feels like a very short
  • 0:19 - 0:20
    period of time.
  • 0:20 - 0:25
    This is all recorded in a summer session
    so you know that everyone has been
  • 0:25 - 0:32
    working overtime to go to learn a lot
    of this and for those without some
  • 0:32 - 0:34
    data or statistical background.
  • 0:34 - 0:40
    There's a lot of material here
    so if you're here and you're
  • 0:40 - 0:45
    watching this still, great job
    with everything and recognize
  • 0:45 - 0:47
    the incredible progress.
  • 0:48 - 0:51
    If you're at the point where
    you can even do some of this
  • 0:51 - 0:59
    basic webscraping, text analysis
    and mapping, you are in a much
  • 0:59 - 1:03
    better position than so many
    other students especially that
  • 1:03 - 1:07
    come out of the liberal arts in terms
    of [inaudible] and hard skills.
  • 1:07 - 1:11
    Now, the hard skills, I just wanted
    to comment a few things but you
  • 1:11 - 1:14
    know, the hard skills like this
    are incredibly important but
  • 1:14 - 1:19
    it's not to say that the broader
    liberal arts education is not --
  • 1:19 - 1:26
    as someone who teaches courses
    that are very much about coming
  • 1:26 - 1:32
    at things with this broad kind of
    liberal arts mindset, very analytical
  • 1:32 - 1:36
    and critical synthetic, that type
    of thing, I'm a huge believer in
  • 1:36 - 1:41
    all of that but I also think that
    getting these hard skills is
  • 1:41 - 1:46
    incredibly important and it's
    not really kind of a 'you do this
  • 1:46 - 1:51
    or you do that' approach,
    it's really one in which you should
  • 1:51 - 1:59
    develop that broad, strong liberal
    arts mindset by getting some of
  • 1:59 - 2:03
    these hard skills will hopefully
    help you to apply that and
  • 2:03 - 2:08
    apply it in a more sophisticated way,
    maybe not more sophisticated, in
  • 2:08 - 2:13
    an asophisticated way or one
    sophisticated way, there's
  • 2:13 - 2:15
    lots of sophisticated ways to
    get at things.
  • 2:15 - 2:19
    This is one sophisticated way
    to help enhance your liberal
  • 2:19 - 2:24
    arts education and be able to do
    a lot of things hopefully that you
  • 2:24 - 2:27
    couldn't have done otherwise
    and that will hopefully set you
  • 2:27 - 2:32
    apart from someone else that
    comes out with a strong liberal
  • 2:32 - 2:35
    arts education but you know
    just doesn't have the ability to
  • 2:35 - 2:41
    take on a lot of these difficult
    political, social, and economic
  • 2:41 - 2:49
    challenges of the world armed
    with data and some ability to
  • 2:49 - 2:57
    program and manipulate data,
    clean data, get it in some format
  • 2:57 - 3:03
    that you can then analyze it
    either through basic statistics and/or
  • 3:03 - 3:10
    visualization, mapping, text analysis
    and so forth and so our hope here
  • 3:10 - 3:15
    and what we set out to do originally
    was where we started on the opening
  • 3:15 - 3:20
    day was to say this course is not
    about making you a computer
  • 3:20 - 3:21
    science expert.
  • 3:21 - 3:26
    We can't replace computer science
    department, not going to try, the
  • 3:26 - 3:33
    goal here was to hopefully
    introduce a bunch of substantive
  • 3:33 - 3:39
    challenges in the world, again
    political, social, economic challenges
  • 3:39 - 3:46
    or problems at least in brief
    and then use those as examples
  • 3:46 - 3:51
    for a lot of the r and excel skills
    that you learned in the course.
  • 3:51 - 3:56
    So you came for data science,
    and a lot of it has to focus on
  • 3:56 - 4:01
    the code and the data and so
    we hope that the political, social
  • 4:01 - 4:03
    economic side didn't get lost in
    all of that.
  • 4:03 - 4:08
    We certainly tried to keep it there
    but again, we kind of had to delve
  • 4:08 - 4:15
    into data early and often and
    deeply in order to develop enough
  • 4:15 - 4:19
    skills so we could competently
    engage with these substantive
  • 4:19 - 4:24
    issues and so that was at least
    the goal and the hope with trying
  • 4:24 - 4:28
    to in each session work with
    real data, work with real problems
  • 4:28 - 4:33
    but then in the process try to
    teach you enough code that
  • 4:33 - 4:35
    you can get around various things.
  • 4:35 - 4:43
    So, to get really not just proficient
    but to excel in data science, it's
  • 4:43 - 4:47
    going to require a lot more work
    as with most courses you take,
  • 4:47 - 4:51
    probably all courses you take,
    you've got to continue to apply
  • 4:51 - 4:54
    things in the future and so I'm not
    going to give you a big pep talk here
  • 4:54 - 5:00
    but what I will say is just to the extent
    that you think that a lot of this can
  • 5:00 - 5:03
    be useful in the future, I just
    strongly encourage you to
  • 5:03 - 5:10
    keep applying this stuff early
    enough and to give you some
  • 5:10 - 5:14
    context on this, when I go, I've
    been doing this stuff for quite
  • 5:14 - 5:20
    a long time now, I think I revealed
    my age on webscrape when I entered
  • 5:20 - 5:22
    my birthday to see what the billboard
    hot 100 said.
  • 5:22 - 5:29
    At any rate, I've been in this for
    15-20 years now and when I
  • 5:29 - 5:34
    do a statistical or when I do any
    sort of like data work, I'm always
  • 5:34 - 5:37
    looking up stuff still and told you
    here that when I was trying to
  • 5:37 - 5:40
    parse that time stamp on the twitter
    data, it took me a little bit of time
  • 5:40 - 5:45
    and so someday you might get
    to the point where you never have to
  • 5:45 - 5:49
    look up anything but I've met very
    few people in that situation.
  • 5:49 - 5:50
    I've got to look it up a lot.
  • 5:50 - 5:55
    I know Mike Denly does, others
    do, the key though is to get enough
  • 5:55 - 6:01
    of the understanding of the
    infrastructure, the infrastructure
  • 6:01 - 6:07
    in terms of like r and our studio,
    the way it approaches, conceptualizes
  • 6:07 - 6:13
    how it deals with data, which is
    thinking about things like different
  • 6:13 - 6:19
    types of data, their classes, their types,
    all of that, how many [inaudible] stuff
  • 6:19 - 6:25
    kind of like etched in is really important
    like it, that you deal with the character
  • 6:25 - 6:29
    variable different that you deal
    with the numeric, I mean those
  • 6:29 - 6:33
    things, you'll get to the point
    where that stuff is super basic
  • 6:33 - 6:37
    in time and you just always
    kind of know when you can
  • 6:37 - 6:41
    work through that stuff very
    quickly, and then undersanding
  • 6:41 - 6:44
    enough the basics on each topic
    that you can jump in and do a lot
  • 6:44 - 6:48
    of it on your own but then you
    know, you know when you need
  • 6:48 - 6:51
    to look when you need a little bit
    more help with something and
  • 6:51 - 6:53
    that's where most people are at
    that point and so should you go
  • 6:53 - 6:57
    on and apply this, guaranteed
    you're going to spend time googling,
  • 6:57 - 7:03
    you're going to spend time on
    various different web clearing
  • 7:03 - 7:08
    houses and otherwise, but that's
    part of the game to the extent
  • 7:08 - 7:11
    that some of this could be useful
    in the future, just encourage you
  • 7:11 - 7:13
    to keep going on some of this stuff.
  • 7:13 - 7:17
    To the extent that you don't end up
    doing a lot of this and you just sort
  • 7:17 - 7:21
    of say that it's useful to have
    some basic literacy in terms
  • 7:21 - 7:25
    of like especially if you're in
    an organization where maybe
  • 7:25 - 7:29
    you're not the data scientist
    in your organization but
  • 7:29 - 7:33
    someone else is, if you have
    enough literacy of this that
  • 7:33 - 7:36
    you can engage competently
    with that person, goes so far,
  • 7:36 - 7:40
    it absolutely goes so far
    in terms of helping you
  • 7:40 - 7:46
    be able to be the type of person
    on the team that can both engage
  • 7:46 - 7:51
    with, whatever the data scientists,
    or the non-data scientists or whatever
  • 7:51 - 7:56
    like being that person that can engage
    fully in all of that can be super useful
  • 7:56 - 8:00
    as well so just having some of that basic
    literacy and trying to keep up some of
  • 8:00 - 8:05
    that you know can be important
    in various ways so at the end of
  • 8:05 - 8:10
    the day though, the goal was to
    have some of these substantive
  • 8:10 - 8:14
    problems that shape things teach
    you enough to be able to learn
  • 8:14 - 8:17
    some new things about those
    substantive problems and
  • 8:17 - 8:21
    at the end of the day, hopefully
    have a few tips and tricks up
  • 8:21 - 8:24
    your sleeve like being able to do
    some maps which is like very
  • 8:24 - 8:32
    cool, maybe basic webscraping and text
    analysis, which can often be a really nice
  • 8:32 - 8:39
    way to go above and beyond what someone
    could just standardly do with the
  • 8:39 - 8:41
    [inaudible] file in Excel or something.
  • 8:41 - 8:44
    And so hopefully, you know have some
    higher order skills to go with this as
  • 8:44 - 8:53
    well, so finishing up then, and moving
    to the final exam, just a note on this
  • 8:53 - 8:59
    which is just plan for, understand
    and you should understand this
  • 8:59 - 9:03
    by now of course but the material
    course is fairly cumulative, right,
  • 9:03 - 9:09
    it's tough to do a text analysis
    without doing material from
  • 9:09 - 9:10
    module 7, 8, and 9.
  • 9:10 - 9:16
    We use [inaudible] here in
    text analysis that was brought
  • 9:16 - 9:21
    up in module 7 and like the spread
    and gather which come up earlier
  • 9:21 - 9:26
    in modules 8 and 9 or filtering
    and mutating, stuff like that
  • 9:26 - 9:30
    so when it comes to the final
    exam, just a final note that
  • 9:30 - 9:37
    it's somewhat cumulative
    but we'll also have a weighted
  • 9:37 - 9:45
    towards like the third, the last
    third of this course, so you're
  • 9:45 - 9:48
    going to want to give special
    emphasis to the last third
  • 9:48 - 9:52
    but remember, again, a lot that
    earlier material is important to
  • 9:52 - 9:57
    pay attention to and be prepared to
    work on and illustrate for the final exam
  • 9:57 - 10:03
    so with that we'll go ahead and close
    and just say, just let us know if any
  • 10:03 - 10:08
    questions, concerns, challenges
    and then just a note to please
  • 10:08 - 10:13
    keep in touch, if not in office hours
    in the next little bit in coming years
  • 10:13 - 10:17
    especially if some of the data
    science ends up coming in handy
  • 10:17 - 10:19
    for future courses or jobs or
    internships.
  • 10:19 - 10:23
    I would absolutely love to hear about
    it, if it turns out that it's absolutely
  • 10:23 - 10:26
    not useful in any way, shape, or form,
    we'd love to hear about that too,
  • 10:26 - 10:30
    honestly truly, it would be really
    nice to just get feedback down the
  • 10:30 - 10:36
    road of what you guys find useful
    and not useful, and so that we can
  • 10:36 - 10:41
    make sure that this course stays
    optimally beneficial for students
  • 10:41 - 10:42
    in the future.
  • 10:42 - 10:45
    So with that, thanks everyone
    for being in the class and we
  • 10:45 - 10:46
    will be in touch.
Title:
https:/.../2020-06-29_gov355m_14.g_conclusion.mp4
Video Language:
English
Duration:
11:02

English subtitles

Revisions