Saturday, September 03, 2005

Learning Language Automatically

In the future, perhaps we will be able to write code in our own natural language, thanks to researchers at Cornell University and Tel Aviv University. They have developed a method that lets a computer program scan text in any of a number of languages, including English and Chinese, and infer the underlying rules of grammar autonomously, without any prior information. The rules can then be used to generate new and meaningful sentences. The method also works on other kinds of data, such as sheet music or protein sequences.

The new algorithm is called Automatic Distillation of Structure (ADIOS), and it successfully identifies complex patterns in raw text. It discovers the patterns by repeatedly aligning sentences and looking for overlapping parts.
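
To get a feel for what "aligning sentences and looking for overlapping parts" might mean, here is a toy sketch in Python. It is not the ADIOS algorithm itself, only a rough illustration of the idea: the function name `shared_patterns`, the `min_len` parameter, and the example sentences are all made up for this post, and the alignment is done with the standard library's `difflib.SequenceMatcher` rather than anything the researchers describe.

```python
from collections import Counter
from difflib import SequenceMatcher
from itertools import combinations


def shared_patterns(sentences, min_len=2):
    """Count word runs that overlap when sentences are aligned pairwise.

    Hypothetical helper for illustration only; not the ADIOS method.
    """
    # Split each sentence into lowercase words.
    tokenized = [s.lower().split() for s in sentences]
    counts = Counter()
    # Align every pair of sentences and keep the runs they share.
    for a, b in combinations(tokenized, 2):
        matcher = SequenceMatcher(None, a, b, autojunk=False)
        for block in matcher.get_matching_blocks():
            # Ignore very short overlaps (and the zero-length end marker).
            if block.size >= min_len:
                counts[tuple(a[block.a:block.a + block.size])] += 1
    return counts


sentences = [
    "the cat sat on the mat",
    "the dog sat on the rug",
    "a bird sat on the fence",
]
patterns = shared_patterns(sentences)
print(patterns.most_common(1))
```

In this small example the run "sat on the" turns up in every pairwise alignment, while the words around it vary. A recurring frame with interchangeable slots is the kind of overlap one could treat as a candidate pattern, although a real grammar learner would need to do far more than count.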

