Introduction
Korean Telephone Conversations Lexicon was produced by the Linguistic Data
Consortium (LDC). Its catalog number is LDC2003L02 and its ISBN is 1-58563-265-1.
Korean Telephone Conversations Lexicon consists of 25,251 words, and contains
separate fields with phonological, morphological, and frequency
information for each word.
The lexicon covers the tokens occurring in 100 telephone
conversations transcribed and published as Korean Telephone Conversations Transcripts. The token
coverage is 100%. The corresponding speech is published as Korean Telephone Conversations Speech.
Data
The lexicon contains five tab-separated information fields:
- orthographic form in Hangul (head-word), encoded in the KSC-5601
(Wansung) system
- orthographic form in Yale romanization
- pronunciation
- frequency of the word in Korean Telephone Conversations Transcripts
- morphological analysis of the word
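The field layout above can be read with a short script. The following is a minimal Python sketch, not an official loader. It assumes that each entry occupies one line, that the frequency field is a plain integer, and that the file can be decoded with Python's "euc_kr" codec (the usual byte encoding for KS C 5601 Wansung text). The names LexiconEntry, parse_lexicon_lines, and read_lexicon, and the path passed to read_lexicon, are hypothetical.

```python
from typing import Iterable, Iterator, List, NamedTuple


class LexiconEntry(NamedTuple):
    """One lexicon line, with fields in the order listed above."""
    hangul: str         # orthographic form in Hangul (head-word)
    yale: str           # orthographic form in Yale romanization
    pronunciation: str  # pronunciation field, kept as an opaque string
    frequency: int      # frequency in the transcripts (assumed to be an integer)
    morphology: str     # morphological analysis, kept as an opaque string


def parse_lexicon_lines(lines: Iterable[str]) -> Iterator[LexiconEntry]:
    """Yield one LexiconEntry per non-empty, tab-separated line."""
    for number, line in enumerate(lines, start=1):
        line = line.rstrip("\r\n")
        if not line:
            continue  # skip blank lines
        fields = line.split("\t")
        if len(fields) != 5:
            raise ValueError(
                f"line {number}: expected 5 fields, found {len(fields)}"
            )
        hangul, yale, pronunciation, frequency, morphology = fields
        yield LexiconEntry(hangul, yale, pronunciation, int(frequency), morphology)


def read_lexicon(path: str, encoding: str = "euc_kr") -> List[LexiconEntry]:
    """Read a whole lexicon file; the default encoding is an assumption."""
    with open(path, encoding=encoding) as handle:
        return list(parse_lexicon_lines(handle))
```

Keeping the pronunciation and morphology fields as uninterpreted strings avoids guessing at their internal notation. If decoding fails on some bytes, trying the "cp949" codec instead is a reasonable next step, since it is a superset of EUC-KR.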
Please follow this link for a sample page from the lexicon: txt | gif.
Updates
There are no updates available at this time.
Content Copyright
Portions © 2003 Trustees of the University of Pennsylvania.