



COMLEX English Syntax Lexicon

Item Name: COMLEX English Syntax Lexicon
Authors: Catherine Macleod, Adam Meyers and Ralph Grishman
LDC Catalog No.: LDC98L21
ISBN: 1-58563-147-7
Data Type: lexicon
Data Source(s): newswire, varied
Application(s): natural language processing
Language(s): English
Language ID(s): ENG
Distribution: Web Download
Member fee: $0 for 1994, 1995, 1996, 1998 members
Non-member Fee: US $2250.00
Reduced-License Fee: US $1125.00
Extra-Copy Fee: N/A
Non-member License: yes
Member License: yes
Online documentation: yes
Licensing Instructions: Subscription Members, Standard Members, Non-Members
Citation: Catherine Macleod, Adam Meyers and Ralph Grishman. 1998. COMLEX English Syntax Lexicon. Linguistic Data Consortium, Philadelphia.

Introduction

This is a moderately broad-coverage English lexicon (about 38,000 lemmas) developed at New York University under LDC sponsorship. It contains detailed information about the syntactic characteristics of each lexical item and is particularly detailed in its treatment of subcategorization (complement structures).

Data

In the current dictionary, nouns have nine possible features and nine possible complements; adjectives have seven features and 14 complements; verbs have five features and 92 complements. The entries for 750 frequent verbs each contain 100 tags, where each tag includes a pointer to an instance of that verb in a corpus and the subcategorization appropriate for that instance.
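As a rough illustration of the structure described above, the following Python sketch models an entry as a lemma with features, complements, and corpus tags. It does not reflect the actual file format of the distribution; the class names, field names, corpus pointers, and complement labels are all hypothetical placeholders chosen for the example.

```python
from collections import Counter
from dataclasses import dataclass, field
from typing import List


@dataclass
class Tag:
    """One tagged corpus instance of a verb (hypothetical layout)."""
    corpus_pointer: str  # placeholder for a pointer to the instance in a corpus
    subcat: str          # subcategorization appropriate for that instance


@dataclass
class Entry:
    """One lexical item (hypothetical layout, not the real file format)."""
    lemma: str
    pos: str                                   # "noun", "adjective" or "verb"
    features: List[str] = field(default_factory=list)
    complements: List[str] = field(default_factory=list)
    tags: List[Tag] = field(default_factory=list)


def complement_counts(entry: Entry) -> Counter:
    """Count how often each subcategorization occurs among an entry's tags."""
    return Counter(tag.subcat for tag in entry.tags)


# Illustrative data only; the labels and pointers below are invented.
entry = Entry(
    lemma="give",
    pos="verb",
    complements=["NP", "NP-NP"],
    tags=[
        Tag("doc1:17", "NP-NP"),
        Tag("doc2:4", "NP"),
        Tag("doc3:9", "NP-NP"),
    ],
)
counts = complement_counts(entry)
```

Under this layout, the tags on a frequent verb could be used to estimate how often each complement structure occurs for that verb in the tagged corpus.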

Some references for the syntax and semantics work:

Ralph Grishman, Catherine Macleod and Adam Meyers. Comlex Syntax: Building a Computational Lexicon. Proc. 15th Int'l Conf. Computational Linguistics (COLING 94), Kyoto, Japan, August 1994.

Catherine Macleod, Adam Meyers and Ralph Grishman. The Influence of Tagging on the Classification of Lexical Complements. Proc. 16th Int'l Conf. Computational Linguistics (COLING 96), Copenhagen, Denmark, August 1996.

Here is a sample page from the lexicon.

Updates

There are no updates at this time.



Contact: ldc@ldc.upenn.edu

(c) 1992-2010 Linguistic Data Consortium, University of Pennsylvania. All Rights Reserved.