WEBVTT 00:00:01.000 --> 00:00:04.000 This Web Archiving Service video tutorial 00:00:04.000 --> 00:00:06.000 will show you how utilize the various 00:00:06.000 --> 00:00:11.000 report options available to you. 00:00:11.000 --> 00:00:12.000 The reports you can run are listed 00:00:12.000 --> 00:00:14.000 under the "Reports" tab 00:00:14.000 --> 00:00:35.000 when viewing the overview of an individual capture. 00:00:35.000 --> 00:00:37.000 Each report has a brief description 00:00:37.000 --> 00:00:40.000 explaining its function. 00:00:40.000 --> 00:00:41.000 The most important report is 00:00:41.000 --> 00:00:45.000 the "Crawl Log." 00:00:45.000 --> 00:00:48.000 This report gives you detailed information 00:00:48.000 --> 00:00:50.000 about your capture and can help you determine 00:00:50.000 --> 00:00:52.000 any errors that were encountered 00:00:52.000 --> 00:00:54.000 during the crawl. 00:00:54.000 --> 00:00:55.000 You can search for a particular item 00:00:55.000 --> 00:00:58.000 to see whether or not it was captured. 00:00:58.000 --> 00:01:00.000 Use Ctrl-f on a PC or Command-f on a Mac 00:01:00.000 --> 00:01:04.000 to search for the filename. 00:01:04.000 --> 00:01:06.000 Then find the corresponding Heritrix 00:01:06.000 --> 00:01:08.000 or HTTP status code 00:01:08.000 --> 00:01:11.000 that is in the second column. 00:01:11.000 --> 00:01:13.000 The main Heritrix codes that you will encounter 00:01:13.000 --> 00:01:18.000 are "1," which means successful DNS lookup performed; 00:01:18.000 --> 00:01:23.000 "200," which indicates that the item was successfully captured; 00:01:23.000 --> 00:01:25.000 "403," which tells you that the item requires 00:01:25.000 --> 00:01:28.000 authorization to be viewed, and therefore 00:01:28.000 --> 00:01:29.000 was not captured; 00:01:29.000 --> 00:01:32.000 "404," which means that the item could not be found, 00:01:32.000 --> 00:01:34.000 and therefore was not captured; 00:01:34.000 --> 00:01:39.000 "-9998," which means that there was a robots.txt exclusion 00:01:39.000 --> 00:01:42.000 for this item, and it was not captured. 00:01:42.000 --> 00:01:43.000 These tools are great for browsing 00:01:43.000 --> 00:01:46.000 and troubleshooting on your own, 00:01:46.000 --> 00:01:48.000 but know that we're happy to work with you 00:01:48.000 --> 00:01:50.000 to research any errors or problems 00:01:50.000 --> 00:01:52.000 that you come across. 00:01:52.000 --> 00:01:54.000 In addition to the "Reports" tab, 00:01:54.000 --> 00:01:57.000 we also a useful page of quality assurance tools. 00:01:57.000 --> 00:02:00.000 Using the dropdown menu beneath the "Captures" tab 00:02:00.000 --> 00:02:02.000 to choose "QA Tools" will take you to the list 00:02:02.000 --> 00:02:05.000 of quality assurance tools. 00:02:05.000 --> 00:02:07.000 These tools will help pinpoint areas 00:02:07.000 --> 00:02:10.000 that are causing problems within your captures. 00:02:10.000 --> 00:02:13.000 Each tool has a brief explanation. 00:02:13.000 --> 00:02:16.000 For example, checking the list of redirected seed URLs 00:02:16.000 --> 00:02:19.000 will clue you in as to which sites 00:02:19.000 --> 00:02:20.000 may need updated URLs in order to continue 00:02:20.000 --> 00:02:24.000 capturing correctly in the future. 00:02:24.000 --> 00:02:26.000 In order to update a URL, 00:02:26.000 --> 00:02:28.000 simply click on the "Edit Site" link, 00:02:28.000 --> 00:02:31.000 and then change the seed URL information 00:02:31.000 --> 00:02:33.000 on the "Edit Site" screen. 00:02:33.000 --> 00:02:36.000 This has been a tutorial on analyzing reports. 00:02:36.000 --> 00:02:38.000 Check out our additional tutorials 00:02:38.000 --> 00:02:40.000 on analyzing capture results, 00:02:40.000 --> 00:02:41.000 and comparing captures, 00:02:41.000 --> 00:02:44.000 to better understand your capture results. 00:02:44.000 --> 00:02:46.000 As always, if you have questions, 00:02:46.000 --> 00:02:50.000 feel free to contact us at washelp@ucop.edu