Technical reports:
Lexicalized Well-Founded Grammars: Learnability and Merging
Smaranda Muresan; Tudor Muresan; Judith L. Klavans
Downloads:
- Title:
- Lexicalized Well-Founded Grammars: Learnability and Merging
- Author(s):
-
Muresan, Smaranda
Muresan, Tudor
Klavans, Judith L. - Date:
- 2005
- Type:
- Technical reports
- Department:
- Computer Science
- Permanent URL:
- http://hdl.handle.net/10022/AC:P:29389
- Series:
- Columbia University Computer Science Technical Reports
- Part Number:
- CUCS-027-05
- Publisher:
- Department of Computer Science, Columbia University
- Publisher Location:
- New York
- Abstract:
- This paper presents the theoretical foundation of a new type of constraint-based grammars, Lexicalized Well-Founded Grammars, which are adequate for modeling human language and are learnable. These features make the grammars suitable for developing robust and scalable natural language understanding systems. Our grammars capture both syntax and semantics and have two types of constraints at the rule level: one for semantic composition and one for ontology-based semantic interpretation. We prove that these grammars can always be learned from a small set of semantically annotated, ordered representative examples, using a relational learning algorithm. We introduce a new semantic representation for natural language, which is suitable for an ontology-based interpretation and allows us to learn the compositional constraints together with the grammar rules. Besides the learnability results, we give a principle for grammar merging. The experiments presented in this paper show promising results for the adequacy of these grammars in learning natural language. Relatively simple linguistic knowledge is needed to build the small set of semantically annotated examples required for the grammar induction.
- Subject(s):
- Computer science
- Item views:
- 98