


TRAINS Spoken Dialog Corpus

Item Name: TRAINS Spoken Dialog Corpus
Authors: James Allen and Peter A. Heeman
LDC Catalog No.: LDC95S25
ISBN: 1-58563-057-8
Data Type: speech
Sample Rate: 16000 Hz
Sampling Format: 1-channel pcm compressed
Data Source(s): microphone conversation
Application(s): discourse analysis, speech recognition, spoken dialogue systems
Language(s): English
Language ID(s): eng
Distribution: 1 CD
Member Fee: $0 for 1995 members
Non-member Fee: US $500.00
Reduced-License Fee: US $250.00
Extra-Copy Fee: US $150.00
Non-member License: yes
Readme File: yes
Online documentation: yes
Licensing Instructions: Subscription Members, Standard Members, Non-Members
Citation: James Allen and Peter A. Heeman. 1995. TRAINS Spoken Dialog Corpus. Linguistic Data Consortium, Philadelphia.

This CD-ROM contains a corpus of task-oriented spoken dialogs, collected in 1993 at the University of Rochester Department of Computer Science as part of the TRAINS project. The project aims to develop a conversationally proficient planning assistant that helps a user construct a plan to achieve a task involving the manufacturing and shipment of goods in a railroad freight system. The collection procedure was designed to make the setting as close to human-computer interaction as possible, but it was not a "wizard" scenario, in which one person pretends to be a computer. These dialogs thus provide a snapshot of an ideal human-computer interface, one that would be able to engage in fluent conversation.

Altogether, this corpus includes 98 dialogs, collected using 20 different tasks and 34 different speakers. This amounts to six and a half hours of speech, about 5,900 speaker turns and 55,000 transcribed words.

Content Copyright

Portions © 1995 Trustees of the University of Pennsylvania



Contact: ldc@ldc.upenn.edu

(c) 1992-2010 Linguistic Data Consortium, University of Pennsylvania. All Rights Reserved.