How computers translate human language - Ioannis Papachimonas
-
0:07 - 0:11How is it that so many
intergalactic species in movies and TV -
0:11 - 0:14just happen to speak perfect English?
-
0:14 - 0:18The short answer is that no one
wants to watch a starship crew -
0:18 - 0:22spend years compiling an alien dictionary.
-
0:22 - 0:23But to keep things consistent,
-
0:23 - 0:27the creators of Star Trek
and other science-fiction worlds -
0:27 - 0:31have introduced the concept
of a universal translator, -
0:31 - 0:35a portable device that can instantly
translate between any languages. -
0:35 - 0:39So is a universal translator
possible in real life? -
0:39 - 0:42We already have many programs
that claim to do just that, -
0:42 - 0:46taking a word, sentence,
or entire book in one language -
0:46 - 0:49and translating it into almost any other,
-
0:49 - 0:52whether it's modern English
or Ancient Sanskrit. -
0:52 - 0:56And if translation were just a matter
of looking up words in a dictionary, -
0:56 - 1:00these programs would run circles
around humans. -
1:00 - 1:03The reality, however,
is a bit more complicated. -
1:03 - 1:07A rule-based translation program
uses a lexical database, -
1:07 - 1:10which includes all the words
you'd find in a dictionary -
1:10 - 1:13and all grammatical forms they can take,
-
1:13 - 1:19and set of rules to recognize the basic
linguistic elements in the input language. -
1:19 - 1:22For a seemingly simple sentence like,
"The children eat the muffins," -
1:22 - 1:27the program first parses its syntax,
or grammatical structure, -
1:27 - 1:30by identifying the children
as the subject, -
1:30 - 1:32and the rest of the sentence
as the predicate -
1:32 - 1:34consisting of a verb "eat,"
-
1:34 - 1:37and a direct object "the muffins."
-
1:37 - 1:40It then needs to recognize
English morphology, -
1:40 - 1:45or how the language can be broken down
into its smallest meaningful units, -
1:45 - 1:46such as the word muffin
-
1:46 - 1:50and the suffix "s,"
used to indicate plural. -
1:50 - 1:52Finally, it needs to understand
the semantics, -
1:52 - 1:56what the different parts of the sentence
actually mean. -
1:56 - 1:58To translate this sentence properly,
-
1:58 - 2:02the program would refer to a different set
of vocabulary and rules -
2:02 - 2:05for each element of the target language.
-
2:05 - 2:07But this is where it gets tricky.
-
2:07 - 2:12The syntax of some languages
allows words to be arranged in any order, -
2:12 - 2:17while in others, doing so could make
the muffin eat the child. -
2:17 - 2:20Morphology can also pose a problem.
-
2:20 - 2:23Slovene distinguishes between
two children and three or more -
2:23 - 2:27using a dual suffix absent
in many other languages, -
2:27 - 2:31while Russian's lack of definite articles
might leave you wondering -
2:31 - 2:34whether the children are eating
some particular muffins, -
2:34 - 2:37or just eat muffins in general.
-
2:37 - 2:40Finally, even when the semantics
are technically correct, -
2:40 - 2:43the program might miss their finer points,
-
2:43 - 2:46such as whether the children
"mangiano" the muffins, -
2:46 - 2:48or "divorano" them.
-
2:48 - 2:52Another method is
statistical machine translation, -
2:52 - 2:56which analyzes a database
of books, articles, and documents -
2:56 - 2:59that have already
been translated by humans. -
2:59 - 3:03By finding matches between source
and translated text -
3:03 - 3:05that are unlikely to occur by chance,
-
3:05 - 3:09the program can identify corresponding
phrases and patterns, -
3:09 - 3:12and use them for future translations.
-
3:12 - 3:15However, the quality
of this type of translation -
3:15 - 3:18depends on the size
of the initial database -
3:18 - 3:21and the availability of samples
for certain languages -
3:21 - 3:23or styles of writing.
-
3:23 - 3:27The difficulty that computers have
with the exceptions, irregularities -
3:27 - 3:31and shades of meaning
that seem to come instinctively to humans -
3:31 - 3:35has led some researchers to believe
that our understanding of language -
3:35 - 3:39is a unique product
of our biological brain structure. -
3:39 - 3:43In fact, one of the most famous
fictional universal translators, -
3:43 - 3:46the Babel fish from
"The Hitchhiker's Guide to the Galaxy", -
3:46 - 3:50is not a machine at all
but a small creature -
3:50 - 3:54that translates the brain waves
and nerve signals of sentient species -
3:54 - 3:57through a form of telepathy.
-
3:57 - 4:00For now, learning a language
the old fashioned way -
4:00 - 4:05will still give you better results than
any currently available computer program. -
4:05 - 4:07But this is no easy task,
-
4:07 - 4:09and the sheer number
of languages in the world, -
4:09 - 4:13as well as the increasing interaction
between the people who speak them, -
4:13 - 4:18will only continue to spur greater
advances in automatic translation. -
4:18 - 4:21Perhaps by the time we encounter
intergalactic life forms, -
4:21 - 4:25we'll be able to communicate with them
through a tiny gizmo, -
4:25 - 4:29or we might have to start compiling
that dictionary, after all.
- Title:
- How computers translate human language - Ioannis Papachimonas
- Speaker:
- Ioannis Papachimonas
- Description:
-
View full lesson: http://ed.ted.com/lessons/how-computers-translate-human-language-ioannis-papachimonas
Is a universal translator possible in real life? We already have many programs that claim to be able to take a word, sentence, or entire book in one language and translate it into almost any other. The reality, however, is a bit more complicated. Ioannis Papachimonas shows how these machine translators work, and explains why they often get a bit mixed up.
Lesson by Ioannis Papachimonas, animation by NOWAY Video Club.
- Video Language:
- English
- Team:
closed TED
- Project:
- TED-Ed
- Duration:
- 04:45
![]() |
Jessica Ruby approved English subtitles for How computers translate human language | |
![]() |
Jessica Ruby edited English subtitles for How computers translate human language | |
![]() |
Jessica Ruby accepted English subtitles for How computers translate human language | |
![]() |
Jessica Ruby edited English subtitles for How computers translate human language | |
![]() |
Jessica Ruby edited English subtitles for How computers translate human language | |
![]() |
Jennifer Cody edited English subtitles for How computers translate human language | |
![]() |
Jennifer Cody edited English subtitles for How computers translate human language |