0:00:01.000,0:00:04.000 This Web Archiving Service video tutorial
0:00:04.000,0:00:06.000 will show you how to utilize the various
0:00:06.000,0:00:11.000 report options available to you.
0:00:11.000,0:00:12.000 The reports you can run are listed
0:00:12.000,0:00:14.000 under the "Reports" tab
0:00:14.000,0:00:35.000 when viewing the overview of an individual capture.
0:00:35.000,0:00:37.000 Each report has a brief description
0:00:37.000,0:00:40.000 explaining its function.
0:00:40.000,0:00:41.000 The most important report is
0:00:41.000,0:00:45.000 the "Crawl Log."
0:00:45.000,0:00:48.000 This report gives you detailed information
0:00:48.000,0:00:50.000 about your capture and can help you determine
0:00:50.000,0:00:52.000 any errors that were encountered
0:00:52.000,0:00:54.000 during the crawl.
0:00:54.000,0:00:55.000 You can search for a particular item
0:00:55.000,0:00:58.000 to see whether or not it was captured.
0:00:58.000,0:01:00.000 Use Ctrl-f on a PC or Command-f on a Mac
0:01:00.000,0:01:04.000 to search for the filename.
0:01:04.000,0:01:06.000 Then find the corresponding Heritrix
0:01:06.000,0:01:08.000 or HTTP status code
0:01:08.000,0:01:11.000 that is in the second column.
0:01:11.000,0:01:13.000 The main Heritrix codes that you will encounter
0:01:13.000,0:01:18.000 are "1," which means successful DNS lookup performed;
0:01:18.000,0:01:23.000 "200," which indicates that the item was successfully captured;
0:01:23.000,0:01:25.000 "403," which tells you that the item requires
0:01:25.000,0:01:28.000 authorization to be viewed, and therefore
0:01:28.000,0:01:29.000 was not captured;
0:01:29.000,0:01:32.000 "404," which means that the item could not be found,
0:01:32.000,0:01:34.000 and therefore was not captured;
0:01:34.000,0:01:39.000 "-9998," which means that there was a robots.txt exclusion
0:01:39.000,0:01:42.000 for this item, and it was not captured.
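If you save a copy of the Crawl Log as text, the same lookup can be scripted. The following is only a rough sketch, not part of the Web Archiving Service itself: it assumes the log is whitespace-separated with the status code in the second column, as described above. The function name find_in_crawl_log and the sample log lines are made up for illustration, and the meanings are the ones given in this tutorial.

```python
# Status code meanings as stated in this tutorial; anything else is reported as unknown.
STATUS_MEANINGS = {
    "1": "successful DNS lookup",
    "200": "item was successfully captured",
    "403": "item requires authorization; not captured",
    "404": "item could not be found; not captured",
    "-9998": "robots.txt exclusion; not captured",
}


def find_in_crawl_log(log_text, filename):
    """Return a (code, meaning) pair for every log line that mentions filename.

    Hypothetical helper: assumes whitespace-separated columns with the
    status code in the second column.
    """
    results = []
    for line in log_text.splitlines():
        if filename not in line:
            continue  # same effect as searching the page with Ctrl-f
        fields = line.split()
        if len(fields) < 2:
            continue  # not a usable log line
        code = fields[1]
        meaning = STATUS_MEANINGS.get(code, "code not covered in this tutorial")
        results.append((code, meaning))
    return results


# Made-up sample lines, only to show the assumed layout.
sample_log = (
    "2011-05-01T12:00:00.000Z 1 55 dns:www.example.org P http://www.example.org/ text/dns\n"
    "2011-05-01T12:00:01.000Z 200 5120 http://www.example.org/report.pdf L http://www.example.org/ application/pdf\n"
    "2011-05-01T12:00:02.000Z -9998 - http://www.example.org/private/data.csv L http://www.example.org/ unknown\n"
)

for code, meaning in find_in_crawl_log(sample_log, "report.pdf"):
    print(code, meaning)
```

Because the search matches anywhere in the line, just like a browser search, a filename that also appears in another column of a different entry will be listed too, so check each match against the log itself.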
0:01:42.000,0:01:43.000 These tools are great for browsing
0:01:43.000,0:01:46.000 and troubleshooting on your own,
0:01:46.000,0:01:48.000 but know that we're happy to work with you
0:01:48.000,0:01:50.000 to research any errors or problems
0:01:50.000,0:01:52.000 that you come across.
0:01:52.000,0:01:54.000 In addition to the "Reports" tab,
0:01:54.000,0:01:57.000 we also have a useful page of quality assurance tools.
0:01:57.000,0:02:00.000 Using the dropdown menu beneath the "Captures" tab
0:02:00.000,0:02:02.000 to choose "QA Tools" will take you to the list
0:02:02.000,0:02:05.000 of quality assurance tools.
0:02:05.000,0:02:07.000 These tools will help pinpoint areas
0:02:07.000,0:02:10.000 that are causing problems within your captures.
0:02:10.000,0:02:13.000 Each tool has a brief explanation.
0:02:13.000,0:02:16.000 For example, checking the list of redirected seed URLs
0:02:16.000,0:02:19.000 will clue you in as to which sites
0:02:19.000,0:02:20.000 may need updated URLs in order to continue
0:02:20.000,0:02:24.000 capturing correctly in the future.
0:02:24.000,0:02:26.000 In order to update a URL,
0:02:26.000,0:02:28.000 simply click on the "Edit Site" link,
0:02:28.000,0:02:31.000 and then change the seed URL information
0:02:31.000,0:02:33.000 on the "Edit Site" screen.
0:02:33.000,0:02:36.000 This has been a tutorial on analyzing reports.
0:02:36.000,0:02:38.000 Check out our additional tutorials
0:02:38.000,0:02:40.000 on analyzing capture results,
0:02:40.000,0:02:41.000 and comparing captures,
0:02:41.000,0:02:44.000 to better understand your capture results.
0:02:44.000,0:02:46.000 As always, if you have questions,
0:02:46.000,0:02:50.000 feel free to contact us at washelp@ucop.edu