Search the LDC Catalog
Publication Name:
Author:
Catalog Number:
Find keywords in corpus description:
Language(s):
Albanian
Arabic
Bengali
Berber
Bulgarian
Canadian French
Cantonese
Chinese
Croatian
Czech
Danish
Dari
Dutch
Egyptian Arabic
English
Estonian
Farsi
French
Gaelic
German
Gulf Arabic
Gullah
Hindi
Hungarian
Indian English
Indonesian
Iraqi Arabic
Italian
Japanese
Korean
Kurdish
Latin
Levantine Arabic
Lithuanian
Lucumi
Mahou
Mandarin Chinese
Mesopotamian Arabic
Min Nan Chinese
Modern Greek
Modern Standard Arabic
Ngomba
North Levantine Arabic
North Mesopotamian Arabic
Northern Uzbek
Norwegian
Norwegian Bokmaal
Norwegian Nynorsk
Pashto
Polish
Portuguese
Putonghua
Romanian
Russian
Serbian
Slovenian
South Levantine Arabic
Spanish
Standard Malay
Swahili
Swedish
Tagalog
Taiwan Mandarin
Tamil
Thai
Trinidadian
Turkish
Urdu
Uzbek
Vervet Monkey Calls
Vietnamese
Western Farsi
Wu Chinese
Yemba
Yoruba
Yue Chinese
Member year(s):
1993
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
Corpus type(s):
lexicon
lexicon, speech, text
speech
speech and text
speech and transcripts
text
transcripts
video
video frames
video frames, transcripts
Data source(s):
broadcast conversation
broadcast news
dictionaries
email
field recordings
government documents
journal articles
meeting speech
microphone conversation
microphone speech
news magazine
newsgroups
newswire
telephone conversations
telephone speech
text chat conversations
transcribed speech
varied
video
web collection
weblogs
Research project(s):
ACE
AQUAINT
ATIS
Communicator
DARPA-CSR
DASL
EARS
GALE
GENOA
Hub4
Hub5-LVCSR
JANUS
LID
MT08
MUC
NIST Automatic Meeting Recognition
NIST LRE
NIST MT
NIST SRE
REFLEX-MTE
ROAR
SID
SPINE
Talkbank
TDT
TIDES
Tipster
TREC
Recommended application(s):
anaphora resolution
automatic content extraction
bibliometrics
content-based retrieval from digital video
cross-lingual information retrieval
discourse analysis
discourse parsing
distillation
finite state technology
gesture recognition
gesture synthesis
information detection
information extraction
information extraction from video
information retrieval
instruction
language generation
language identification
language modeling
language teaching
linguistic analysis
machine learning
machine translation
meeting summarization
message understanding
metadata extraction
morphology
morphology learning
named entity recognition
natural language processing
nominal expression generation
parsing
part of speech tagging
phonetics
phonology
pragmatics
pronunciation modeling
prosody
psycholinguistics
question-answering
sociolinguistics
speaker identification
speaker segmentation and tracking
speaker verification
speech recognition
speech synthesis
spoken dialogue modeling
spoken dialogue systems
standards
subjectivity analysis
summarization
syntactic parsing
tagging
temporal parsing
temporal reasoning
topic detection and tracking
video event detection
Search Options:
Within Fields: or / and
Between Fields: and / or
The ten (10) criteria above are available for searching the Catalog Database. Any criteria left blank are ignored. For the text fields, you may enter full or partial names, and use the underscore ('_') character to stand in for any character you are unsure of. More information about LDC Catalog Numbers is available on the Search By Year page.
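The text-field matching described above can be sketched in Python. This is only an illustration, not the catalog's actual implementation: the function name `text_field_matches` and the sample titles are hypothetical, and the sketch assumes that each underscore stands for exactly one character, that a partial name may match anywhere in the field, and that matching ignores case.

```python
import re


def text_field_matches(query: str, value: str) -> bool:
    """Return True if `query` matches somewhere inside `value`.

    Assumptions (not confirmed by the catalog page): '_' stands for
    exactly one character, partial names match as substrings, and
    letter case is ignored.
    """
    # Escape every literal character so it matches itself; turn each
    # underscore into '.', which matches any single character.
    pattern = "".join("." if ch == "_" else re.escape(ch) for ch in query)
    return re.search(pattern, value, flags=re.IGNORECASE) is not None


# Hypothetical publication titles, used only for illustration.
print(text_field_matches("Broadc_st", "Example Broadcast News Corpus"))
print(text_field_matches("news", "Example Broadcast News Corpus"))
print(text_field_matches("Br_adcast", "Example Bradcast Corpus"))
```

Under these assumptions the first two calls match and the third does not, because the underscore must consume one character. An empty query matches every value, which mirrors the rule that blank criteria are ignored.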
For criteria in the selection boxes above, you can select more than one value within the field and choose whether the search returns publications with at least one of the values (e.g. Membership Year = 1996 OR 1998), which is the default behavior, or only those publications with all of the chosen values (e.g. Membership Year = 1996 AND 1998). NOTE: A publication can have only one data source, so "OR" will be assumed if you select more than one value.
If you wish to search by more than one criterion (between fields), you can have the search return only those publications that match the criteria set for all of the fields (e.g. Membership Year = 1996 AND Language = English), which is the default behavior, or publications that match the criteria for at least one of the fields (e.g. Membership Year = 1996 OR Language = English).
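
The within-field and between-field options can be illustrated with a short Python sketch. It is not the catalog's code: the function names, the field keys (`member_year`, `language`, `data_source`) and the sample publication are hypothetical, and the treatment of a search with every field blank (every publication matches) is an assumption.

```python
from typing import Dict, Set


def field_matches(selected: Set[str], actual: Set[str], mode: str = "or") -> bool:
    """Compare the values selected for one field with a publication's values."""
    if mode == "and":
        return selected <= actual      # every selected value must be present
    return bool(selected & actual)     # at least one selected value is present


def publication_matches(
    criteria: Dict[str, Set[str]],
    publication: Dict[str, Set[str]],
    within: str = "or",     # default within a field, per the page
    between: str = "and",   # default between fields, per the page
) -> bool:
    results = []
    for name, selected in criteria.items():
        if not selected:
            continue  # criteria left blank are ignored
        # A publication has only one data source, so OR is always assumed there.
        mode = "or" if name == "data_source" else within
        results.append(field_matches(selected, publication.get(name, set()), mode))
    if not results:
        return True  # assumption: a search with no criteria matches everything
    return all(results) if between == "and" else any(results)


# Hypothetical publication record, used only for illustration.
pub = {
    "member_year": {"1996"},
    "language": {"English", "Spanish"},
    "data_source": {"newswire"},
}
query = {"member_year": {"1996", "1998"}, "language": {"English"}}

print(publication_matches(query, pub))                               # True
print(publication_matches(query, pub, within="and"))                 # False
print(publication_matches(query, pub, within="and", between="or"))   # True
```

With the defaults, the publication matches because 1996 is one of the selected years and English is one of its languages. Switching the within-field option to "and" requires both 1996 and 1998, so the year field fails, and the publication is returned only if the between-field option is relaxed to "or".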