



Boston University Radio Speech Corpus

Item Name: Boston University Radio Speech Corpus
Authors: Mari Ostendorf, Patti Price, and Stefanie Shattuck-Hufnagel
LDC Catalog No.: LDC96S36
ISBN: 1-58563-060-8
Data Type: speech
Sample Rate: 16000 Hz
Sampling Format: 1-channel pcm
Data Source(s): microphone speech
Application(s): prosody, speech recognition, speech synthesis
Language(s): English
Language ID(s): eng
Distribution: 1 DVD
Member fee: $0 for 1996, 1997 members
Non-member Fee: US $1200.00
Reduced-License Fee: US $600.00
Extra-Copy Fee: US $200.00
Non-member License: yes
Online documentation: yes
Licensing Instructions: Subscription Members, Standard Members, Non-Members
Citation: Mari Ostendorf, Patti Price, and Stefanie Shattuck-Hufnagel. 1996. Boston University Radio Speech Corpus. Linguistic Data Consortium, Philadelphia.

The Boston University Radio Speech Corpus was collected primarily to support research in text-to-speech synthesis, particularly the generation of prosodic patterns. The corpus consists of professionally read radio news data, including speech and accompanying annotations, suitable for speech and language research.

The corpus includes speech from seven FM radio news announcers (four male, three female) associated with WBUR, a public radio station. The main radio news portion of the corpus consists of over seven hours of news stories recorded in the WBUR radio studio during broadcasts over a two-year period. The announcers were also recorded in a laboratory at Boston University. For this lab news portion, the announcers read a total of 24 stories from the radio news portion. They were first asked to read the stories in their non-radio style and then, 30 minutes later, to read the same stories in their radio style.

Each story read by an announcer was digitized in paragraph-sized units, which typically include several sentences. The files were digitized at a 16 kHz sample rate using a 16-bit A/D converter. The paragraphs were annotated with orthographic transcriptions, phonetic alignments, part-of-speech tags, and prosodic markers. The orthographic transcripts were generated by hand and indicate where the speaker took a breath. The phonetic alignments and part-of-speech tags were generated automatically and then hand-corrected. The prosodic labels were marked by hand and are available only for a subset of the corpus.
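
As a rough illustration of what these recording parameters imply for storage, the following Python sketch estimates the raw audio size of the radio news portion. It assumes headerless, uncompressed PCM at the values listed in this entry (16000 Hz, 16-bit samples, 1 channel); the actual on-disc file format and any compression are not described here, and "over seven hours" is treated as exactly seven hours, so the result is only a lower-bound estimate. The helper name pcm_bytes is hypothetical.

```python
# Recording parameters taken from the catalog entry.
SAMPLE_RATE_HZ = 16_000   # 16 kHz sample rate
BYTES_PER_SAMPLE = 2      # 16-bit samples
CHANNELS = 1              # 1-channel PCM


def pcm_bytes(duration_seconds: float) -> int:
    """Size in bytes of raw, uncompressed PCM audio of the given duration.

    Assumption: no file headers and no compression.
    """
    return int(duration_seconds * SAMPLE_RATE_HZ * BYTES_PER_SAMPLE * CHANNELS)


# Lower-bound estimate: "over seven hours" treated as exactly seven hours.
radio_news_bytes = pcm_bytes(7 * 3600)
print(radio_news_bytes)  # 806400000
```

Under these assumptions, one second of audio occupies 32,000 bytes, and seven hours comes to roughly 0.8 GB before annotations and the lab news recordings are counted.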

A compressed sample file, example.zip, is available. Please be aware that this file is slightly larger than 1 MB (1,278,998 bytes).

An additional sample file, LDC1996.tgz, is also available.




Contact: ldc@ldc.upenn.edu

(c) 1992-2010 Linguistic Data Consortium, University of Pennsylvania. All Rights Reserved.