IARPA Babel Lao Language Pack IARPA-babel203b-v3.1a
Multi-Language Conversational Telephone Speech 2011 -- Turkish
Phrase Detectives Corpus
The EventStatus Corpus
2010 NIST Speaker Recognition Evaluation Test Set
BOLT Egyptian Arabic SMS/Chat and Transliteration
CHiME2 Grid
BOLT Chinese Discussion Forum Parallel Training Data
IARPA Babel Swahili Language Pack IARPA-babel202b-v1.0d
Noisy TIMIT Speech
GALE English-Chinese Parallel Aligned Treebank -- Training
First-Year Law Students' Court Memoranda
IARPA Babel Haitian Creole Language Pack IARPA-babel201b-v0.2b
GALE Phase 3 Arabic Broadcast News Speech Part 2
GALE Phase 3 Arabic Broadcast News Transcripts Part 2
Arabic Speech Recognition Pronunciation Dictionary
IARPA Babel Vietnamese Language Pack IARPA-babel107b-v0.7
MWE-Aware English Dependency Corpus
GALE Phase 3 and 4 Chinese Web Parallel Text
Bamanankan Lexicon
IARPA Babel Tagalog Language Pack IARPA-babel106-v0.2g
TAC KBP Spanish Cross-lingual Entity Linking - Comprehensive Training and Evaluation Data 2012-2014
GALE Phase 4 Arabic Newswire Parallel Sentences
JANA: A Human-Human Dialogues Corpus for Egyptian Dialect
Multi-Language Conversational Telephone Speech 2011 -- Slavic Group
IARPA Babel Georgian Language Pack IARPA-babel404b-v1.0a
GALE Phase 3 and 4 Chinese Newswire Parallel Text