![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
||||
|
|
License AgreementsThe license agreements page contains a list of corpora that require corpus-specific user licenses along with links to these license agreements. The page is organized by year to mirror the organization of our Corpus Catalog. For Members, use of most data is governed by the membership agreement. Corpora with additional usage restrictions are listed below; click on the Member link for the corpus-specific user license agreement for these corpora. For Non-members, use of most data is governed by the LDC User Agreement for Non-members. This user license agreement permits non-commercial linguistic education and research use of data. Only those corpora with additional usage restrictions are listed below; click on the Non-member link for the corpus-specific user license agreement. Those corpora without corresponding Non-member license agreements are for Members-Only. All completed user licenses can be faxed to +1 215 573-2175 or scanned and emailed to LDC. [ top ] Member, Non-member [ top ] Member, Non-member LDC2012T11 American English Nickname Collection Member, Non-member LDC2012S03 Digital Archive of Southern Speech Commercial Member LDC2012S05 USC-SFI MALACH Interviews and Transcripts English Member, Non-member [ top ] Member, Non-member [ top ] Member, Non-member LDC2010T16 Indian Language Part-of-Speech Tagset: Bengali LDC2010T24 Indian Language Part-of-Speech Tagset: Hindi Member, Non-member LDC2010L01 LDC Standard Arabic Morphological Analyzer (SAMA) Version 3.1 Member [ top ] Member, Non-member LDC2009T04 BioProp Version 1.0 Member, Non-member LDC2009S01 CSLU: Numbers Version 1.3 LDC2009S03 CSLU: S4X Release 1.2 Member, Non-member LDC2009T08 Japanese Web N-gram Version 1 Member, Non-member LDC2009T25 Web 1T 5-gram, 10 European Languages Version 1 Member, Non-member [ top ] Member LDC2008T14 BLLIP North American News Text, General Release Member, Non-member LDC2008S06 CSLU: Alphadigit Version 1.3 LDC2008S07 CSLU: ISOLET Spoken Letter Database Version 1.3 LDC2008S02 CSLU: National Cellular Telephone Speech Release 2.3 LDC2008S01 CSLU: Portland Cellular Telephone Speech Version 1.3 Member, Non-member LDC2008T22 Czech Academic Corpus 2.0 Member, Non-member LDC2008L02 Hindi WordNet Member, Non-member LDC2008T01 Hungarian-English Parallel Text, Version 1.0 Member, Non-member LDC2008T15 North American News Text, Complete Member LDC2008T16 North American News Text, General Release Non-member LDC2008T19 The New York Times Annotated Corpus Member, Non-member [ top ] Member, Non-member LDC2007S08 CSLU: Foreign Accented English Release 1.2 LDC2007S18 CSLU: Kids` Speech Version 1.1 LDC2007S13 CSLU: Apple Words and Phrases LDC2007S05 CSLU: Yes/No Version 1.2 Member, Non-member LDC2007S09 Mandarin Affective Speech Member, Non-member LDC2007T19 MITRE 1997 Mandarin Broadcast News Speech Translations(Hub-4NE) Member LDC2007S15 Nationwide Speech Project Member, Non-member [ top ] LDC2006S15 CSLU: Spelled and Spoken Words LDC2006S14 CSLU: Stories v 1.2 LDC2006S35 CSLU: Multilanguage Telephone Speech Version 1.2 LDC2006S39 CSLU: Names Release 1.3 LDC2006S26 CSLU: Speaker Recognition Version 1.1 LDC2006S16 CSLU: Spoltech Brazilian Portuguese Version 1.0 LDC2006S01 CSLU: Voices Member, Non-member LDC2006T03 Korean Propbank Member, Non-member LDC2006T09 Korean Treebank Annotations Version 2.0 Member, Non-member LDC2006S13 N4 NATO Native and Non-Native Speech Member, Non-member LDC2006T01 Prague Dependency Treebank 2.0 [ top ] Member, Non-member [ top ] LDC2004T23 Prague Arabic Dependency Treebank 1.0 Member, Non-member LDC2004T25 Prague Czech-English Dependency Treebank 1.0 Member, Non-member [ top ] LDC2002S11 1997 HUB4 English Evaluation Speech and Transcripts Member, Non-member LDC2002T26 Korean English Treebank Annotations Member, Non-member [ top ] Member, Non-member [ top ] Member LDC2000T43 BLLIP 1987-89 WSJ Corpus Release 1 Member, Non-member LDC2000T52 TREC Mandarin Member, Non-member LDC2000T51 TREC Spanish Member, Non-Member [ top ] Member LDC99S82 USC Marketplace Broadcast News Speech LDC99T36 USC Marketplace Broadcast News Transcripts Member, Non-member [ top ] Member LDC98S73 1997 Mandarin Broadcast News Speech (HUB4-NE) LDC98T241997 Mandarin Broadcast News Transcripts (HUB4-NE) Member LDC98L21 COMLEX English Syntax Lexicon Member, Non-member LDC98T30 North American News Text Supplement Member LDC98T25 TDT Pilot Study Corpus Member, Non-member [ top ] LDC97S44 1996 English Broadcast News Speech (HUB4) LDC97T22 1996 English Broadcast News Transcripts (HUB4) Member LDC97L20 CALLHOME American English Lexicon (PRONLEX) LDC97L18CALLHOME German Lexicon Member, Non-member LDC97S63 The CMU Kids Corpus Member, Non-member [ top ] LDC96L15 CALLHOME Mandarin Chinese Lexicon LDC96L16 CALLHOME Spanish Lexicon Member, Non-member LDC96L14 CELEX2 Member, Non-member LDC96S33 CSR-IV HUB3 Member LDC96S31 CSR-IV HUB4 Member, Non-member LDC96T10 Message Understanding Conference (MUC) 6 Additional News Text Member, Non-member [ top ] Member LDC95T11 European Language Newspaper Text Member LDC95T8 Japanese Business News Text Member LDC95S28 LATINO-40 Spanish Read News Member, Non-member LDC95T13 Mandarin Chinese News Text Member, Non-member LDC95T21 North American News Text Corpus Member LDC95T9 Spanish News Text Member [ top ] Member, Non-member LDC94T4A UN Parallel Text (Complete) LDC94T4B-1 UN Parallel Text (English) LDC94T4B-2 UN Parallel Text (French) LDC94T4B-3 UN Parallel Text (Spanish) Member, Non-member [ top ] Member, Non-member LDC93T3A TIPSTER Complete LDC93T3B TIPSTER Volume 1 LDC93T3C TIPSTER Volume 2 LDC93T3D TIPSTER Volume 3 Member, Non-member |
|||
|
About LDC | Members | Catalog | Projects | Papers | LDC Online | Search / Help | Contact Us | UPenn | Home | Obtaining Data | Creating Data | Using Data | Providing Data
Contact ldc@ldc.upenn.edu |
||||