Finishing the Web Crawler Solution - Intro to Computer Science

  • 0:00 - 0:04
    So the answer is we should use the "add_page_to_index" procedure we just defined,
  • 0:04 - 0:06
    and we should pass in the index.
  • 0:06 - 0:10
    We should pass in the page, that's the URL that identifies the location,
  • 0:10 - 0:12
    and we should pass in the content.
  • 0:12 - 0:14
    And that's all we need.
  • 0:14 - 0:16
    So we're done with our web crawler.
  • 0:16 - 0:18
    From a seed, we can find a set of pages.
  • 0:18 - 0:23
    Starting from that seed and following all the links on the pages that we find,
  • 0:23 - 0:27
    for each page, we're going to add the content that we find on that page to an index,
  • 0:27 - 0:29
    and we're going to return that index.
  • 0:29 - 0:33
    And we've already written code that, given the index, can do a lookup.
  • 0:33 -
    So for any word we want to look up, we'll find the list of URLs for the pages that contain that word.
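
The finished crawler described in the transcript can be sketched roughly as follows. This is a minimal sketch under stated assumptions, not the course's own code: the in-memory PAGES dictionary, the example.test URLs, the regex-based get_all_links, and the dictionary-based index are all hypothetical stand-ins chosen so the example runs without network access. Only the names add_page_to_index, the (index, page, content) argument order, and the idea of a lookup come from the transcript.

```python
import re

# Hypothetical in-memory "web" standing in for real HTTP fetches (assumption).
PAGES = {
    "http://example.test/a": 'hello world <a href="http://example.test/b">b</a>',
    "http://example.test/b": 'hello again <a href="http://example.test/a">a</a>',
}


def get_page(url):
    """Return the content of a page, or an empty string if it is unknown."""
    return PAGES.get(url, "")


def get_all_links(content):
    """Extract link targets with a naive regex (assumption, not a real HTML parser)."""
    return re.findall(r'href="([^"]+)"', content)


def add_to_index(index, keyword, url):
    """Record that `url` contains `keyword`, avoiding duplicate entries."""
    urls = index.setdefault(keyword, [])
    if url not in urls:
        urls.append(url)


def add_page_to_index(index, url, content):
    """Add every whitespace-separated token of `content` to the index under `url`."""
    for word in content.split():
        add_to_index(index, word, url)


def crawl_web(seed):
    """Follow links starting from `seed`, index each page once, return the index."""
    to_crawl = [seed]
    crawled = set()
    index = {}
    while to_crawl:
        page = to_crawl.pop()
        if page in crawled:
            continue
        content = get_page(page)
        # The step from the transcript: pass in the index, the page URL, and the content.
        add_page_to_index(index, page, content)
        to_crawl.extend(get_all_links(content))
        crawled.add(page)
    return index


def lookup(index, keyword):
    """Return the list of URLs whose pages contain `keyword` (empty if none)."""
    return index.get(keyword, [])


if __name__ == "__main__":
    index = crawl_web("http://example.test/a")
    print(lookup(index, "hello"))
```

Because the split is on whitespace only, fragments of markup also end up as keywords in this sketch; a real crawler would strip tags before indexing.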
Video Language:
English
Team:
Udacity
Project:
CS101 - Intro to Computer Science
Duration:
0:40
