


Fisher English Training Part 2, Transcripts

Item Name: Fisher English Training Part 2, Transcripts
Authors: Christopher Cieri, David Graff, Owen Kimball, Dave Miller, and Kevin Walker
LDC Catalog No.: LDC2005T19
ISBN: 1-58563-336-4
Release Date: Apr 15, 2005
Data Type: text
Data Source(s): telephone conversations
Project(s): EARS, GALE
Application(s): speech recognition
Language(s): English
Language ID(s): ENG
Distribution: 1 CD
Member fee: $0 for 2005 members
Non-member Fee: US$1500.00
Reduced-License Fee: US$750.00
Extra-Copy Fee: US$150.00
Non-member License: yes
Online documentation: yes
Licensing Instructions: Subscription Members, Standard Members, Non-Members
Citation: Christopher Cieri, et al. 2005. Fisher English Training Part 2, Transcripts. Linguistic Data Consortium, Philadelphia.

Introduction

This corpus represents the second half of a collection of conversational telephone speech (CTS) that was created at the LDC during 2003. It contains 5,849 audio files, each one containing a full conversation of up to ten minutes. Additional information regarding the speakers involved and types of telephones used can be found in the companion text corpus of transcripts (Fisher English Training Text Data, Part 2 -- LDC2005T19).

Data

The first half of the collection (Fisher English Training Speech Data, Part 1) was released by the LDC in 2004 (LDC2004S13 for speech data, LDC2004T19 for transcripts). Taken as a whole, the two parts comprise 11,699 recorded telephone conversations.

The individual audio files are presented in NIST SPHERE format, and contain two-channel mu-law sample data; "shorten" compression has been applied to all files.
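As a rough illustration of the file format mentioned above, the following Python sketch reads the plain-text header that begins a NIST SPHERE file. It assumes the usual SPHERE layout: a "NIST_1A" line, a line giving the header size in bytes, then "name -type value" fields terminated by "end_head". The function name parse_sphere_header and the sample header values are made up for illustration and are not taken from the corpus. The sketch reads the header only; decoding the shorten-compressed sample data requires a separate tool.

```python
def parse_sphere_header(data: bytes) -> dict:
    """Parse the ASCII header at the start of a NIST SPHERE file.

    Assumes the conventional layout: "NIST_1A", then the header size
    in bytes, then "name -type value" lines ending with "end_head".
    """
    first_lines = data.split(b"\n", 2)
    if len(first_lines) < 3 or first_lines[0].strip() != b"NIST_1A":
        raise ValueError("not a NIST SPHERE header")
    header_size = int(first_lines[1].decode("ascii").strip())
    text = data[:header_size].decode("ascii", errors="replace")

    fields = {}
    for line in text.split("\n")[2:]:
        line = line.strip()
        if line == "end_head":
            break
        if not line or line.startswith(";"):  # skip blanks and comments
            continue
        parts = line.split(None, 2)
        if len(parts) < 3:
            continue
        name, ftype, value = parts
        if ftype == "-i":        # integer field
            fields[name] = int(value)
        elif ftype == "-r":      # real-valued field
            fields[name] = float(value)
        else:                    # "-sN": string field of length N
            fields[name] = value
    return fields


# Hypothetical header for demonstration only; real files may differ.
SAMPLE_HEADER = (
    "NIST_1A\n"
    "   1024\n"
    "channel_count -i 2\n"
    "sample_rate -i 8000\n"
    "sample_coding -s27 ulaw,embedded-shorten-v2.00\n"
    "end_head\n"
).encode("ascii").ljust(1024, b" ")

if __name__ == "__main__":
    for key, value in parse_sphere_header(SAMPLE_HEADER).items():
        print(key, "=", value)
```

With a real file, the same function could be applied to the first bytes read from disk, for example parse_sphere_header(open(path, "rb").read(1024)), where path is a placeholder for a local SPHERE file and 1024 is the common, but not guaranteed, header size.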

Data collection and transcription were sponsored by DARPA and the U.S. Department of Defense, as part of the EARS project for research and development in automatic speech recognition.

Samples

To see an example of this corpus, please examine this sample.

Copyright

Portions © 2005 Trustees of the University of Pennsylvania



Contact: ldc@ldc.upenn.edu

(c) 1992-2008 Linguistic Data Consortium, University of Pennsylvania. All Rights Reserved.