The 1998 HUB5 English Transcripts corpus was produced by the
Linguistic Data Consortium (LDC), catalog number LDC2003T02
and ISBN 1-58563-253-8.
This publication contains transcripts for 20 CALLHOME English and
20 Switchboard telephone conversations. These 40 conversations were used in NIST's
1998 HUB5 English evaluation, and are published as 1998 HUB5 English
There are 40 data files in .txt format.
Please follow this link for a sample transcript: example.
There are no updates at this time.
Portions © 2003 Trustees of the University of Pennsylvania.