About NLTT

NLTT is one of the oldest research groups in PARC, and it has been the source of innovations in computational linguistics and linguistic theory that have been broadly influential. Past achievements include: Our current activities center on well-engineered, comprehensive systems that provide efficient implementations for linguistic processing. The speed and space reductions that we achieved in the finite-state arena are what enabled the large array of commercial products. Our ambiguity management techniques are embedded and exploited in the XLE system; whereas alternative systems typically exhibit exponential behavior and can only operate on short sentences or with specialized grammars, XLE performs well on long, real-world sentences and on full, broad-coverage grammars with semantic specifications.

The XLE system makes it (relatively) easy to write computational grammars for different languages. At Xerox we have produced large-scale LFG grammars of Chinese, English and French, and our collaborators in the Parallel Grammar Project have produced substantial grammars for Arabic, German, Japanese, and Norwegian. These and other grammars, including Turkish and Urdu, are continually under development.

These grammars and the XLE implementation are resources that can be combined with other modules to make up various applications. To take an example, we have constructed a semantic-based question answering system that makes use of the ambiguity-management features of our algorithms to map text into semantic (and then Knowledge Representation) structures using the XLE parser and ordered rewrite system. These structures are input to our scalable, high-performance search engine. When a user enters a query into the system, the linguistic structure of the query is compared to those in the index, and the system finds the most relevant matches based on the meaning of the text. These candidate passages are then aligned using a unification-based algorithm that selects those that are responsive to the query and these are returned to the user. (See also our work on Machine Translation.)

To summarize, we have developed a suite of concepts, theories, algorithms, and implementations that make it easy to construct descriptions of new, even "exotic" languages and to perform efficient, deep analysis and generation with those descriptions in order to solve practical language processing problems. More information about these issues and our approach to them can be found in the selected bibliography of NLTT publications.



Natural Language Theory and Technology
Palo Alto Research Center
3333 Coyote Hill Rd.
Palo Alto, CA 94304 USA

fax: (650) 812-4374

Last updated: Wednesday, 11-Oct-2006 08:57:58 PDT.