This CD-ROM contains a corpus of task-oriented spoken dialogs. These dialogs
were collected in 1993 at the University of Rochester Department of Computer
Science as part of the TRAINS project, an effort to develop a conversationally
proficient planning assistant that helps a user construct a plan to achieve
some task involving the manufacture and shipment of goods in a railroad
freight system. The collection procedure was designed to make the setting as
close to human-computer interaction as possible, but it was not a "wizard"
scenario, in which one person pretends to be a computer. Thus these dialogs
provide a snapshot of an ideal human-computer interface, one that would be
able to engage in fluent conversation.

Altogether, this corpus includes 98 dialogs, collected using 20 different tasks
and 34 different speakers. This amounts to six and a half hours of speech,
about 5,900 speaker turns, and 55,000 transcribed words.
Portions © 1995 Trustees of the University of Pennsylvania