WEBVTT 00:00:06.080 --> 00:00:08.760 Interpreter: If you create video and audio content 00:00:08.760 --> 00:00:10.120 or host events, 00:00:10.120 --> 00:00:12.520 there are two reasons to care about providing 00:00:12.520 --> 00:00:16.320 high quality, same language captioning and transcription. 00:00:16.320 --> 00:00:18.260 The first is obvious, 00:00:18.260 --> 00:00:20.700 that there are millions of people like myself, 00:00:20.700 --> 00:00:22.940 who get accessibility through captions. 00:00:22.940 --> 00:00:24.130 But another reason 00:00:24.130 --> 00:00:27.210 is that you will reach a much larger audience 00:00:27.210 --> 00:00:29.130 and increase your return on investment. 00:00:29.130 --> 00:00:33.280 80% of people who use captions are not deaf. 00:00:33.280 --> 00:00:36.100 Captions help foreign language learners, 00:00:36.100 --> 00:00:40.430 people in gyms, or bars, or sound sensitive environments, 00:00:40.430 --> 00:00:43.750 and assist when the speaker has a strong accent, 00:00:43.750 --> 00:00:48.140 or when content is complicated and difficult to understand. 00:00:48.140 --> 00:00:51.690 Captions increase the viewership by 40%. 00:00:51.690 --> 00:00:55.210 90% of videos with captions are watched to completion. 00:00:55.210 --> 00:00:56.390 Digiday states that 00:00:56.390 --> 00:00:59.980 85% of Facebook videos are viewed in mute mode, 00:00:59.980 --> 00:01:02.700 so captioning has become a necessity. 00:01:02.700 --> 00:01:05.040 Growing up, I didn't have captioning access 00:01:05.040 --> 00:01:07.350 until I was in eighth grade. 00:01:07.350 --> 00:01:09.370 That's when my dad brought home a box 00:01:09.370 --> 00:01:11.400 called a captioning decoder. 00:01:11.400 --> 00:01:14.070 We were amazed by this magic box 00:01:14.070 --> 00:01:17.260 that made captions show up on almost all the TV channels. 00:01:17.260 --> 00:01:21.010 Now captioning support is included in all TV models 00:01:21.030 --> 00:01:22.600 and video players, 00:01:22.600 --> 00:01:25.420 and there's no need for external decoders anymore. 00:01:25.420 --> 00:01:29.250 Captions are now not limited even to TV and videos, 00:01:29.250 --> 00:01:31.770 they can also be provided in real-time 00:01:31.770 --> 00:01:34.780 for breaking news, webinars, events, 00:01:34.780 --> 00:01:37.670 classes, work meetings, and so on. 00:01:37.670 --> 00:01:40.650 Once I was given access to captioned material, 00:01:40.650 --> 00:01:42.380 my life completely changed 00:01:42.380 --> 00:01:45.000 and new worlds opened up for me. 00:01:45.000 --> 00:01:46.170 But I'm not alone. 00:01:46.170 --> 00:01:49.470 There are over 48 million deaf and hard of hearing people 00:01:49.470 --> 00:01:50.720 in United States. 00:01:50.720 --> 00:01:54.720 And over 466 million worldwide. 00:01:54.720 --> 00:01:57.150 The number is increasing rapidly as well, 00:01:57.150 --> 00:02:00.170 especially because the number one cause of hearing loss now 00:02:00.170 --> 00:02:03.323 is constant exposure to loud noises. 00:02:05.160 --> 00:02:09.470 Captions also helped me master English as my third language. 00:02:09.470 --> 00:02:12.330 Even my parents, who have perfect hearing, 00:02:12.330 --> 00:02:15.793 love captions because English is not their native language. 00:02:18.180 --> 00:02:20.425 Quality is critical 00:02:20.425 --> 00:02:23.030 when you're considering captions. 00:02:23.030 --> 00:02:25.980 While there are now many speech technologies 00:02:25.980 --> 00:02:27.950 that can translate automatically 00:02:27.950 --> 00:02:29.450 spoken words into captions, 00:02:29.450 --> 00:02:31.530 they're often not accurate. 00:02:31.530 --> 00:02:34.503 Many deaf people call them "craptions". 00:02:34.503 --> 00:02:35.336 (Laughter) 00:02:35.336 --> 00:02:36.870 Under ideal conditions, 00:02:36.870 --> 00:02:40.837 the accuracy rate is usually somewhere between 80 and 90%. 00:02:40.837 --> 00:02:43.890 And it gets even worse with complex content, 00:02:43.890 --> 00:02:46.070 or accents, background noise, 00:02:46.070 --> 00:02:48.120 when speakers are speaking too fast, 00:02:48.120 --> 00:02:49.970 or overlap each other. 00:02:49.970 --> 00:02:52.460 And bad captions are not better than nothing. 00:02:52.460 --> 00:02:55.240 Garbled captions are hard to read. 00:02:55.240 --> 00:02:58.490 And it's just like listening to garbled audio. 00:02:58.490 --> 00:03:00.790 Caption errors make it hard to understand 00:03:00.790 --> 00:03:03.250 or retain any information. 00:03:03.250 --> 00:03:07.570 Poor captions affect your SEO, search engine optimization, 00:03:07.570 --> 00:03:10.750 and decrease your CPM, your clicks per mille, 00:03:10.750 --> 00:03:14.763 for videos and your ROI, return on investment. 00:03:15.830 --> 00:03:18.490 Google is deaf like me 00:03:18.490 --> 00:03:21.100 and doesn't like videos with auto-captioning, 00:03:21.100 --> 00:03:24.140 so it just does not index those videos. 00:03:24.140 --> 00:03:26.200 So Google penalizes producers 00:03:26.200 --> 00:03:27.970 for turning on the auto-captions 00:03:27.970 --> 00:03:30.770 instead of actually adding proper captioning themselves. 00:03:32.320 --> 00:03:34.610 Even if a machine could translate 00:03:34.610 --> 00:03:37.070 every spoken word to captions accurately, 00:03:37.070 --> 00:03:40.310 it still cannot follow certain quality guidelines. 00:03:40.310 --> 00:03:43.380 For example, you may see text exactly as it's heard, 00:03:43.380 --> 00:03:44.873 for example this: 00:03:46.090 --> 00:03:47.673 Lets eat grandma. 00:03:48.593 --> 00:03:49.670 (Laughter) 00:03:49.670 --> 00:03:52.280 Now, does that mean that we should go ahead 00:03:52.280 --> 00:03:53.930 and consume grandma, 00:03:53.930 --> 00:03:55.520 or that we should dine with her? 00:03:55.520 --> 00:03:57.140 It's not clear, right? 00:03:57.140 --> 00:03:58.706 Now that's much better. 00:03:58.706 --> 00:04:00.360 (Laughter) 00:04:00.360 --> 00:04:03.320 So proper grammar and punctuation are critical 00:04:03.320 --> 00:04:05.700 for optimal experience in reading captions. 00:04:05.700 --> 00:04:08.450 Just like speech intonations are. 00:04:08.450 --> 00:04:10.760 Quality guidelines are long and complex, 00:04:10.760 --> 00:04:14.020 so I'm just sharing a few basic tips with you today. 00:04:14.020 --> 00:04:17.019 Transcripts and captions need to be verbatim 00:04:17.019 --> 00:04:19.240 They need to follow proper grammatical rules, 00:04:19.240 --> 00:04:21.410 include non-speech elements, 00:04:21.410 --> 00:04:23.660 use upper and lower case letters, 00:04:23.660 --> 00:04:26.580 have good font size and color contrast, 00:04:26.580 --> 00:04:28.520 and translate profanity -- 00:04:29.480 --> 00:04:30.330 word for word, 00:04:30.330 --> 00:04:32.480 assuming it's not bleeped out in the audio. 00:04:34.920 --> 00:04:38.690 Captions and transcripts need to be completed 00:04:38.690 --> 00:04:41.710 before you publish or share videos and podcasts. 00:04:44.010 --> 00:04:46.810 So everyone can enjoy them at the same time. 00:04:46.810 --> 00:04:48.430 And if you advertise in advance 00:04:48.430 --> 00:04:50.640 that events will have real-time captions, 00:04:50.640 --> 00:04:52.363 you will get a larger audience. 00:04:54.380 --> 00:04:56.180 With advances in technologies, 00:04:56.180 --> 00:04:58.430 it is much easier, faster,and cheaper 00:04:58.430 --> 00:05:01.690 to create captions now more than ever before. 00:05:01.690 --> 00:05:04.820 For example, YouTube offers a great free tool 00:05:04.820 --> 00:05:06.980 to create and edit captions. 00:05:06.980 --> 00:05:11.080 You can save and download your captions as an SRT caption file 00:05:11.120 --> 00:05:14.530 and import it onto Vimeo, Facebook, 00:05:14.530 --> 00:05:16.220 or other video platforms 00:05:16.220 --> 00:05:18.630 that offer captioning support. 00:05:18.630 --> 00:05:21.240 Here's a short example. 00:05:21.240 --> 00:05:23.313 So you can see how easy it is. 00:05:25.400 --> 00:05:27.063 Yes, it's really easy. 00:05:28.340 --> 00:05:30.740 And you can clean up the auto captions if you like, 00:05:30.740 --> 00:05:32.500 or you can create them from scratch. 00:05:32.500 --> 00:05:34.740 Just please do not turn on auto captions 00:05:34.740 --> 00:05:36.600 and just leave them as is. 00:05:36.600 --> 00:05:38.610 And please add video captions 00:05:38.610 --> 00:05:40.680 on all platforms that you use, 00:05:40.680 --> 00:05:44.720 and couple those captions with transcripts for videos, 00:05:44.720 --> 00:05:48.940 because those are useful for people who cannot see the video 00:05:48.940 --> 00:05:50.853 or that want to skim the information. 00:05:51.820 --> 00:05:53.610 If your organization can do so, 00:05:53.610 --> 00:05:56.080 consider hiring an experienced specialist 00:05:56.080 --> 00:05:59.650 who is familiar with quality standards and guidelines. 00:05:59.650 --> 00:06:02.510 They can produce your captions and transcripts faster 00:06:02.510 --> 00:06:05.200 and make them look more professional and easy to read. 00:06:05.200 --> 00:06:08.520 The costs for transcripts and captions 00:06:08.520 --> 00:06:12.200 vary from a dollar all the way to $10 or $15 per minute. 00:06:12.200 --> 00:06:14.790 And the rates really depend 00:06:14.790 --> 00:06:17.823 on experience and the skills of the vendor, 00:06:19.160 --> 00:06:21.290 the audio quality, 00:06:21.290 --> 00:06:25.600 the content complexity, among other factors. 00:06:25.600 --> 00:06:29.380 Low cost may result in bad quality. 00:06:29.380 --> 00:06:32.450 Some vendors may use auto-captioning themselves 00:06:32.450 --> 00:06:34.660 or unskilled labor 00:06:34.660 --> 00:06:36.860 that will result in low caption quality 00:06:36.860 --> 00:06:38.810 and end up wasting your money. 00:06:38.810 --> 00:06:42.090 Always review the selected vendor's prior work 00:06:42.090 --> 00:06:43.533 to assess the quality. 00:06:44.660 --> 00:06:47.660 If you have budget limitations, 00:06:47.660 --> 00:06:49.650 there are some creative ways to find money 00:06:49.650 --> 00:06:52.363 like asking sponsors to cover expenses. 00:06:53.930 --> 00:06:57.430 I work with media producers, corporations, 00:06:57.430 --> 00:07:01.780 educational institutions, business owners, event organizers, 00:07:01.780 --> 00:07:05.930 and I help make their audio, video, and events accessible 00:07:05.930 --> 00:07:08.200 at a reasonable cost. 00:07:08.200 --> 00:07:11.440 There's some guidance and information along with my personal story 00:07:11.440 --> 00:07:13.660 that can be found in my book. 00:07:13.660 --> 00:07:16.480 If nobody asks for speech to text access, 00:07:16.480 --> 00:07:18.580 that does not mean that there's no demand. 00:07:19.420 --> 00:07:23.220 It needs to be provided for all types of aural information 00:07:23.220 --> 00:07:25.800 even without a prior request. 00:07:25.800 --> 00:07:28.800 High quality, same language captions and transcripts 00:07:28.800 --> 00:07:29.900 are good for business 00:07:29.900 --> 00:07:32.700 and a necessity for millions of people. 00:07:32.700 --> 00:07:34.914 Let's think outside the ears! 00:07:34.914 --> 00:07:35.747 (Laughter) 00:07:35.747 --> 00:07:37.610 Let's caption everything! 00:07:37.610 --> 00:07:39.840 Get your message to more people. 00:07:39.840 --> 00:07:40.822 Thank you. 00:07:40.822 --> 00:07:42.456 (Applause)