Machine translation taking a quantum leap forward

Steven Pinker points out in The Language Instinct that the potential ambiguities in any sentence make programming computers to understand language quite difficult: humans can quickly determine the appropriate interpretation through context, but computers can't understand context, so they flounder when trying to translate texts. The sentence "Time flies like an arrow," for example, can be interpreted in five different ways. Here are just two of them:

When timing houseflies, time them in the same manner in which you time arrows
A type of fly, a "time fly," enjoys the company of a particular arrow

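To see how quickly the readings pile up, here is a minimal sketch using the NLTK toolkit with a small made-up grammar (the grammar is illustrative, not Pinker's analysis). Because "time," "flies," and "like" can each fill more than one part of speech, even this toy grammar licenses several distinct parse trees for the sentence, including the two readings listed above.

```python
import nltk

# Toy grammar (invented for illustration): "time", "flies", and "like" can
# each fill more than one part of speech, so the sentence parses several ways.
grammar = nltk.CFG.fromstring("""
S  -> NP VP | VP
NP -> N | N N | Det N | NP PP
VP -> V NP | V PP | V NP PP
PP -> P NP
N  -> 'time' | 'flies' | 'arrow'
V  -> 'time' | 'flies' | 'like'
P  -> 'like'
Det -> 'an'
""")

parser = nltk.ChartParser(grammar)
for tree in parser.parse("time flies like an arrow".split()):
    print(tree)  # one tree per distinct reading, including the two above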
While it's striking to realize the potential for ambiguity in such a simple sentence, the problem is compounded in longer sentences: in an analysis of 891 sentences ranging in length from 1 to 25 words, a team led by Kathryn Baker found an average of 27 possible parses per sentence. When translating between two languages, software such as Google Language Tools faces similar difficulties in both the source and the target language.

So how is the problem being addressed? Wired has an excellent article discussing one new technology:

In [previous attempts to handle the problem], called statistical-based MT, algorithms analyze large collections of previous translations, or what are technically called parallel corpora - sessions of the European Union, say, or newswire copy - to divine the statistical probabilities of words and phrases in one language ending up as particular words or phrases in another.
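
As a cartoon of that statistical idea, the sketch below counts how often a source word co-occurs with each target word across a tiny invented set of aligned sentence pairs and turns the counts into translation probabilities. Real systems induce word alignments (typically with EM) over millions of sentence pairs; the three-pair "corpus" here is only a stand-in.

```python
from collections import Counter, defaultdict

# Tiny invented stand-in for a parallel corpus (e.g. EU proceedings).
parallel_corpus = [
    ("la casa roja", "the red house"),
    ("la casa grande", "the big house"),
    ("una casa", "a house"),
]

# Count how often each source word appears in a sentence pair with each
# target word. Real systems learn word alignments instead of this crude
# everything-with-everything pairing.
cooccurrence = defaultdict(Counter)
for source, target in parallel_corpus:
    for s_word in source.split():
        for t_word in target.split():
            cooccurrence[s_word][t_word] += 1

def translation_probabilities(s_word):
    """Relative frequency of each target word seen alongside s_word."""
    counts = cooccurrence[s_word]
    total = sum(counts.values())
    return {t_word: count / total for t_word, count in counts.items()}

print(translation_probabilities("casa"))  # "house" dominates, as it should
```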

In the new system described in the article, instead of relying on parallel texts, a bilingual dictionary is used to generate all possible translations of a small chunk of the source text into the target language -- say, English. These candidates are then compared against a 150 GB database of English phrases to identify likely real-language equivalents.
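
A loose sketch of that generate-and-check step is below. The miniature dictionary, the Spanish chunk, and the phrase counts standing in for the 150 GB database are all invented for illustration; they are not Meaningful Machines' actual resources or code.

```python
import itertools
from collections import Counter

# Invented word-for-word bilingual dictionary.
dictionary = {
    "nuestra": ["our"],
    "responsabilidad": ["responsibility", "liability"],
    "de": ["of", "for", "from"],
}

# Invented stand-in for the 150 GB English phrase database: phrase -> count.
phrase_counts = Counter({
    "our responsibility for": 120,
    "our liability for": 15,
    "our responsibility of": 3,
})

def candidate_translations(chunk):
    """Generate every translation of the chunk the dictionary allows."""
    options = [dictionary[word] for word in chunk.split()]
    return [" ".join(combo) for combo in itertools.product(*options)]

def score(candidate):
    """Score a candidate by how often it occurs as a real English phrase."""
    return phrase_counts[candidate]  # Counter returns 0 for unseen phrases

chunk = "nuestra responsabilidad de"
best = max(candidate_translations(chunk), key=score)
print(best)  # "our responsibility for" wins on real-language evidence
```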

Next, the software slides its window one word to the right, repeating the flooding process with another five- to eight-word chunk: "nuestra responsabilidad de lo que ha ocurrido en." Using what Meaningful Machines calls the decoder, it then rescores the candidate translations according to the amount of overlap between each chunk's translation options and the ones before and after it. If "We declare our responsibility for what has happened" overlaps with "declare our responsibility for what has happened in" which overlaps with "our responsibility for what has happened in Madrid," the translation is judged accurate.
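
The overlap test itself can be sketched in a few lines. The candidate chunks below are taken from the quoted example, but the scoring is only a loose approximation of whatever the real decoder does, which would keep and rescore many candidates per window.

```python
def overlap(left, right):
    """Longest suffix of chunk `left` that is also a prefix of chunk `right`."""
    left_words, right_words = left.split(), right.split()
    best = 0
    for k in range(1, min(len(left_words), len(right_words)) + 1):
        if left_words[-k:] == right_words[:k]:
            best = k
    return best

# One chosen candidate per sliding window, from the example in the article.
chain = [
    "We declare our responsibility for what has happened",
    "declare our responsibility for what has happened in",
    "our responsibility for what has happened in Madrid",
]

scores = [overlap(chain[i], chain[i + 1]) for i in range(len(chain) - 1)]
print(scores)  # [7, 7]: heavy overlap, so the chained translation looks consistent
```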

The result is a system that's more accurate and requires less data and processing time than previous efforts. The whole article is highly readable and highly recommended. Mind Hacks also has a great summary of the article.

"You can't put too much water in a nuclear reactor."

By David Group (not verified) on 05 Dec 2006 #permalink

Sounds like an advance.

I use a free text-to-speech system called Festival. While it takes some getting used to, it seems to know parts of speech, like nouns and verbs, and generally gets them right. However, it totally punted on this:

I live for live music.

which it pronounced:

I liive for liv music.

Now, it knows that 'live' can be a verb or an adjective, and it knows that they are not pronounced the same. It just made a mistake.

So, this approach might work for text-to-speech too. You shouldn't have to compare to 150 GB of text. That stuff should be reduced to a word linkage database.
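
Something like the sketch below, maybe; the neighbor counts are made-up numbers standing in for what you'd harvest from a part-of-speech-tagged corpus, and the two senses of 'live' are the only homograph it knows about.

```python
from collections import Counter

# Made-up neighbor counts per sense; in practice you'd harvest these from a
# POS-tagged corpus rather than comparing against a raw 150 GB text dump.
neighbor_counts = {
    "verb": Counter({("i", "live"): 900, ("live", "for"): 400}),
    "adjective": Counter({("live", "music"): 800, ("for", "live"): 300}),
}

PRONUNCIATIONS = {"verb": "lɪv", "adjective": "laɪv"}

def pronounce_live(prev_word, next_word):
    """Pick a pronunciation for 'live' from its immediate neighbors."""
    scores = {
        sense: counts[(prev_word, "live")] + counts[("live", next_word)]
        for sense, counts in neighbor_counts.items()
    }
    return PRONUNCIATIONS[max(scores, key=scores.get)]

# "I live for live music"
print(pronounce_live("i", "for"))      # lɪv  (verb)
print(pronounce_live("for", "music"))  # laɪv (adjective)
```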

Computationally, 150 GB sounds like a lot. And it is. But this calendar year, I picked up a 160 GB hard disk for $60, new.