|

|
|

License Agreements
The license agreements page contains a list of corpora that require corpus-specific user licenses along with links to these license agreements. The page is organized by year to mirror the organization of our Corpus Catalog
. For an alphabetical list of license agreements, please click here.
For Members, use of most data is governed by the membership agreement. Corpora with additional usage restrictions are listed below; click on the Member link for the corpus-specific user license agreement for these corpora.
For Non-members, use of most data is governed by the LDC User Agreement for Non-members.
This user license agreement permits non-commercial linguistic education and research use of data. Only those corpora with additional usage restrictions are listed below; click on the Non-member link for the corpus-specific user license agreement.
Those corpora without corresponding Non-member license agreements are for Members-Only.
All completed user licenses can be faxed to +1 215 573-2175 or scanned and emailed to LDC.
|2011|2010|2009|2008|2007|2006|2005|2004|2002|2001|2000|1999|1998
|1997|1996|1995|1994|1993|
2011 [ top ]
LDC2011T04 Indian Language Part-of-Speech Tagset: Sanskrit
Member, Non-member
2010 [ top ]
LDC2010T06 Chinese Web 5-gram Version 1
Member, Non-member
LDC2010T16 Indian Language Part-of-Speech Tagset: Bengali
LDC2010T24 Indian Language Part-of-Speech Tagset: Hindi
Member, Non-member
LDC2010L01 LDC Standard Arabic Morphological Analyzer (SAMA) Version 3.1
Member
2009 [ top ]
LDC2009V01 Audiovisual Database of Spoken American English
Member, Non-member
LDC2009T04 BioProp Version 1.0
Member, Non-member
LDC2009S01
CSLU: Numbers Version 1.3
LDC2009S03
CSLU: S4X Release 1.2
Member, Non-member
LDC2009T08 Japanese Web N-gram Version 1
Member, Non-member
LDC2009T25 Web 1T 5-gram, 10 European Languages Version 1
Member, Non-member
2008 [ top ]
LDC2008T13 BLLIP North American News Text, Complete
Member
LDC2008T14
BLLIP North American News Text, General Release
Member, Non-member
LDC2008S06 CSLU: Alphadigit Version 1.3
LDC2008S07 CSLU: ISOLET Spoken Letter Database Version 1.3
LDC2008S02 CSLU: National Cellular Telephone Speech Release 2.3
LDC2008S01 CSLU: Portland Cellular Telephone Speech Version 1.3
Member, Non-member
LDC2008T22 Czech Academic Corpus 2.0
Member, Non-member
LDC2008L02 Hindi WordNet
Member, Non-member
LDC2008T01 Hungarian-English Parallel Text, Version 1.0
Member, Non-member
LDC2008T15 North American News Text, Complete
Member
LDC2008T16
North American News Text, General Release
Non-member
LDC2008T19 The New York Times Annotated Corpus
Member, Non-member
2007 [ top ]
LDC2007T22 2001 Topic Annotated Enron Email Data Set
Member, Non-member
LDC2007S08
CSLU: Foreign Accented English Release 1.2
LDC2007S18
CSLU: Kids` Speech Version 1.1
LDC2007S13
CSLU: Apple Words and Phrases
LDC2007S05
CSLU: Yes/No Version 1.2
Member, Non-member
LDC2007S09 Mandarin Affective Speech
Member, Non-member
LDC2007T19 MITRE 1997 Mandarin Broadcast News Speech Translations(Hub-4NE)
Member
LDC2007S15 Nationwide Speech Project
Member, Non-member
2006
[
top ]
LDC2006S15
CSLU: Spelled and Spoken Words
LDC2006S14
CSLU: Stories v 1.2
LDC2006S35
CSLU: Multilanguage Telephone Speech Version 1.2
LDC2006S39
CSLU: Names Release 1.3
LDC2006S26
CSLU: Speaker Recognition Version 1.1
LDC2006S16
CSLU: Spoltech Brazilian Portuguese Version 1.0
LDC2006S01
CSLU: Voices
Member, Non-member
LDC2006T03 Korean Propbank
Member, Non-member
LDC2006T09 Korean Treebank Annotations
Version 2.0
Member, Non-member
LDC2006S13 N4 NATO Native and Non-Native Speech
Member, Non-member
LDC2006T01 Prague Dependency Treebank 2.0
Member, Non-member
LDC2006T13 Web 1T 5-gram Version 1
Member, Non-member
2005
[
top ]
LDC2005T35 American National Corpus (ANC)
Second Release
Member, Non-member
2004
[ top ]
LDC2004T23 Prague Arabic Dependency Treebank 1.0
Member, Non-member
LDC2004T25 Prague Czech-English Dependency Treebank 1.0
Member, Non-member
2002
[ top ]
LDC2002S11 1997 HUB4 English Evaluation Speech and Transcripts
Member, Non-member
LDC2002T26 Korean English Treebank Annotations
Member, Non-member
2001
[ top ]
LDC2001T62 CETEMpublico
Member, Non-member
2000
[ top ]
LDC2000S86 1998 HUB4 Broadcast News Evaluation English Test
Material
Member
LDC2000T43 BLLIP 1987-89 WSJ Corpus Release 1
Member, Non-member
LDC2000T52 TREC Mandarin
Member, Non-member
LDC2000T51 TREC Spanish
Member, Non-Member
1999
[ top ]
LDC99T34 Japanese Business News Text Supplement
Member
LDC99S82 USC Marketplace Broadcast News Speech
LDC99T36 USC Marketplace Broadcast News Transcripts
Member, Non-member
1998
[
top ]
LDC98T31 1996 CSR HUB4 Language Model
Member
LDC98S73 1997 Mandarin Broadcast News Speech (HUB4-NE)
LDC98T241997 Mandarin Broadcast News Transcripts (HUB4-NE)
Member
LDC98L21 COMLEX English Syntax Lexicon
Member, Non-member
LDC98T30 North American News Text Supplement
Member
LDC98T25 TDT Pilot Study Corpus
Member, Non-member
1997 [ top ]
LDC97S66 1996 English Broadcast News Dev and
Eval (HUB4)
LDC97S44 1996 English Broadcast News Speech (HUB4)
LDC97T22 1996 English Broadcast News Transcripts (HUB4)
Member
LDC97L20
CALLHOME American English Lexicon (PRONLEX)
LDC97L18CALLHOME German Lexicon
Member, Non-member
LDC97S63 The CMU Kids Corpus
Member, Non-member
1996 [ top ]
LDC96L17 CALLHOME Japanese Lexicon
LDC96L15 CALLHOME Mandarin Chinese Lexicon
LDC96L16 CALLHOME Spanish Lexicon
Member, Non-member
LDC96L14 CELEX2
Member, Non-member
LDC96S33 CSR-IV HUB3
Member
LDC96S31 CSR-IV HUB4
Member, Non-member
LDC96T10 Message Understanding Conference
(MUC) 6 Additional News Text
Member, Non-member
1995
[ top ]
LDC95T6 CSR-III Text
Member
LDC95T11 European Language Newspaper Text
Member
LDC95T8 Japanese Business News Text
Member
LDC95S28 LATINO-40 Spanish Read News
Member, Non-member
LDC95T13 Mandarin Chinese News Text
Member, Non-member
LDC95T21 North American News Text Corpus
Member
LDC95T9 Spanish News Text
Member
1994
[ top ]
LDC94T5 ECI Multilingual Text
Member, Non-member
LDC94T4A UN Parallel Text (Complete)
LDC94T4B-1 UN Parallel Text (English)
LDC94T4B-2 UN Parallel Text (French)
LDC94T4B-3 UN Parallel Text (Spanish)
Member, Non-member
1993
[ top ]
LDC93T1 ACL/DCI
Member,
Non-member
LDC93T3A TIPSTER Complete
LDC93T3B TIPSTER Volume 1
LDC93T3C TIPSTER Volume 2
LDC93T3D TIPSTER Volume 3
Member,
Non-member
License agreements in alphabetical order. Note that these link to the Member license for each database.
|
|