Grammar Engineering
Ling 239E Topics in Computational Linguistics
Logistics
| Instructors: | Ron Kaplan | Tracy Holloway King |
| kaplan "at" parc.com | thking "at" parc.com |
| 650 812-4348 | 650 812-4808 |
| Class time: | Thursday 2:15-5:05 (winter 2004) |
| Class location: |
Meyer 143 (Mac lab) |
| Office hours: | Monday 4.00-4.30 (room 460-110
(linguistics dept.)) and
by appointment. |
| We can stay later as necessary to answer
questions; send email if you need to arrive before or
after this time. |
| Also, you can ask questions after class on
Thursday or by email or phone. |
Course Description
Topics in Computational Linguistics: Grammar Engineering -- Hands-on
introduction to basic techniques for implementing large-scale
linguistic grammars drawing on a combination of sound grammatical
theory and engineering principles. Morphological and syntactic
specifications within a description-based lexicalist framework.
Integration of shallow and deep parsing techniques. Engineering
issues in multilingual parallel grammar development. Students will
incrementally extend a small grammar for English.
Prerequisite: basic
knowledge of syntactic theory or Ling120.
No prior programming skills
required.
Provides an appropriate background for Ling239A.
Weekly topics and assignments
- Week of Jan 5 :
Introduction to grammar engineering
Introduction to LFG and XLE
- Week of Jan 12
Engineering and linguistic generalizations
Formal devices:
equations,
lexicons,
templates,
lexical rules
- Slides, notation slides
- Assignment
- Readings:
- XLE documentation: Grammar notations sections:
1. Regular predicates for c-structure rules
4.3 Boolean combinations
5. Functional templates and lexical rules
[You may wish to skim through the other sections in Grammar
notations.]
- Week of Jan 19
Configurations and declarations
Metarulemacro
- Week of Jan 26
Ambiguity
Coordination
- Slides
- Assignment
- Readings:
- Ron Kaplan and John T. Maxwell III. 1995.
Constituent Coordination in Lexical-Functional Grammar. In M. Dalrymple,
R. M. Kaplan, J. T. Maxwell, and A. Zaenen (eds.), Formal Issues in
Lexical-Functional Grammar, Stanford, CA: CSLI Publications. (Originally
appeared in Proceedings of COLING-88, vol. 1 (Budapest, 1998),
303--305.) We will provide hard copies.
- Week of Feb 2
Context-free parsing
Functional Uncertainty
Complex categories/rule parameterization (time permitting)
- Slides: Functional Uncertainty, CFP and XLE computation
- Assignment
- Readings
- Ronald M. Kaplan and Annie Zaenen. 1989. Long-distance dependencies,
constituent structure, and functional uncertainty. (ps) In Mark Baltin and
Anthony Kroch (editors), Alternative Conceptions of Phrase Structure,
pp. 17-42. Chicago University Press. Reprinted in Dalrymple et
al. (editors), Formal Issues in Lexical-Functional Grammar. CSLI, 1995.
- Week of Feb 9
Integration of FST technologies:
tokenizers,
morphological analyzers
- Week of Feb 16
Ambiguity and robustness I:
OT marks
- Week of Feb 23
Ambiguity and robustness II:
fragments,
xle performance settings, shallow markup
- Week of Mar 1
Generation
Testing and evaluation
Stochastic disambiguation
- Slides: Generation,
Testing/Evaluation, Stochastic disambiguation
- Assignment
- Readings:
- Statistical disambiguation: Stefan Riezler, Tracy H. King, Ronald
M. Kaplan, Richard Crouch, John T. Maxwell III, and Mark
Johnson. 2002. Parsing the Wall Street Journal using a
Lexical-Functional Grammar and discriminative estimation
techniques. In Proceedings of the 40th Annual Meeting of the
Association for Computational Linguistics (ACL'02), Philadephia, PA.
- Generation: Ronald M. Kaplan and Juergen Wedekind. 2000. LFG Generation Produces Context-free
Languages. COLING.
- Extra reading for those interested in documentation: Stefanie Dipper. 2003. Implenenting and Documenting Large-Scale Grammars ---
German LFG Chapter 4. Ph.D. thesis IMS, University of
Stuttgart. Arbeitspapiere des Instituts für Maschinelle
Sprachverarbeitung (AIMS), Vol. 9(1). [If you want to read the entire
dissertation, it is available on her web page.]
- Week of Mar 8
Applications and transfer
Multilingual grammar development: the ParGram project
- Slides
- Assignment: no assignment this week
- Readings:
And one of:
- Week of Mar 15
No class/No final
Grading
Grades will be determined based on the nine weekly assignments (each worth
10% of the grade) and class participation (worth 10% of the grade). There
is no final (contrary to what the time table says).
Class materials
There will be assigned readings. These will be available directly from this page.
Two books that are recommended as being of interest are:
They are
available from the Stanford Bookstore.
There are two copies of Butt et al. and one of Farghaly on reserve in
Green Library.
Created: October 2003
Modified: March 2004