Obtaining DataUsing DataProviding DataCreating Data
About LDCMembersCatalogProjectsPapersLDC OnlineSearchContact UsUPennHome

LDC Catalog | By Type and Source | By Year | Top Ten | Projects | Catalog Search



Korean Broadcast News Speech

Item Name: Korean Broadcast News Speech
Authors: Stephanie Strassel, Nii Martey, and David Graff
LDC Catalog No.: LDC2006S42
ISBN: 1-58563-398-4
Release Date: Aug 24, 2006
Data Type: speech
Sample Rate: 16000 Hz
Sampling Format: pcm
Data Source(s): broadcast news
Application(s): machine translation, natural language processing
Language(s): Korean
Language ID(s): KOR
Distribution: 1 DVD
Member fee: $0 for 2006 members
Non-member Fee: US $700.00
Reduced-License Fee: US $350.00
Extra-Copy Fee: US $200.00
Non-member License: yes
Online documentation: yes
Licensing Instructions: Subscription Members, Standard Members, Non-Members
Citation: Stephanie Strassel, Nii Martey, and David Graff
2006
Korean Broadcast News Speech
Linguistic Data Consortium, Philadelphia

Introduction

This data set consists of 18 audio files recorded by LDC in January 2000 and February 2000 from Voice of America (VOA) satellite radio news broadcasts in Korean.

Data

The recordings, captured from a dedicated satellite receiver, are stored as 16-bit PCM, 16-kHz, single-channel, in NIST SPHERE format. The duration of each recording is either 30 minutes or 60 minutes, depending on the VOA broadcast schedule. The date (YYYYMMDD), start-time and end-time (HHMM, Eastern Standard Time) for each recording are indicated in its file name. The sample data is not compressed.

Transcripts for these recordings are available as a separate corpus from the LDC: Korean Broadcast News Transcripts, LDC2006T14.

Samples

For an example of the data contained in this corpus, please listen to this audio sample (wav format).

Content Copyright

Portions © 2000, 2006 Trustees of the University of Pennsylvania


About LDC | Members | Catalog | Projects | Papers | LDC Online | Search / Help | Contact Us | UPenn | Home | Obtaining Data | Creating Data | Using Data | Providing Data

Contact: ldc@ldc.upenn.edu

(c) 1992-2010 Linguistic Data Consortium, University of Pennsylvania. All Rights Reserved.