Return to Video

Data Sources - Intro to Hadoop and MapReduce

  • 0:01 - 0:04
    Organisations have been generating
    data since way back
  • 0:05 - 0:06
    But as time goes on, more & more
    data is being generated.
  • 0:07 - 0:12
    IBM estimates that 90% of world's data
    was created in the last two years alone.
  • 0:12 - 0:15
    This is a simple example.
    Think of your cellphone.
  • 0:15 - 0:18
    Whenever your cellphone is turned on,
    it's connected to the cell towers.
  • 0:19 - 0:22
    As you move around,
    it'll connect to different towers
  • 0:22 - 0:24
    in a different signal streaks.
  • 0:25 - 0:26
    All of that connection data is collected
    by the phone company & it's being logged.
  • 0:27 - 0:29
    They use that information to find
    dead spots in the coverage
  • 0:29 - 0:32
    & know which towers are busiest &
    need increased capacity.
  • 0:32 - 0:37
    They can even trace you if you make an
    emergency phone call but don't get your
  • 0:37 - 0:39
    exact phone location.
  • 0:39 - 0:41
    This is an enormous amount of data we have.
  • 0:41 - 0:42
    Another example as you when visit a
    website like Amazon or Netflix,
  • 0:42 - 0:49
    everything, you do there is logged: which
    pages you view, how long you spend there,
  • 0:49 - 0:54
    where you coming from.
  • 0:54 - 0:59
    They can even capture things like what
    browser you are using.
  • 1:01 - 1:02
    Again this is a huge amount of data.
  • 1:03 - 1:03
    Phone data & website logs are just examples.
  • 1:04 - 1:05
    In addition, things like X-rays are
    creating huge amounts of data.
  • 1:06 - 1:09
    & people doing research to detect
    similarities in tumors.
  • 1:10 - 1:11
    The increase in amount of data we're
    generating opens up huge possibilities.
  • 1:12 - 1:13
    But it comes with problems too.
  • 1:14 - 1:15
    Where do we've to store all this data?
    & process it too?
Tytuł:
Data Sources - Intro to Hadoop and MapReduce
Video Language:
English
Team:
Udacity
Projekt:
ud617 - Intro to Hadoop and Mapreduce
Duration:
01:16

English subtitles

Revisions Compare revisions