Introduction
The 1998 HUB5 English Evaluation, Linguistic Data Consortium (LDC)
catalog number LDC2002S10 and ISBN 1-58563-226-0, is part of an ongoing
series of periodic evaluations conducted by NIST. These evaluations
provide an important contribution to the direction of research efforts
and the calibration of technical capabilities. They are intended to be
of interest to all researchers working on the general problem of
conversational speech recognition. To this end the evaluation was
designed to be simple, to focus on core speech technology issues, to be
fully supported, and to be accessible.
Additional documentation is available at the 2000 NIST Evaluation Plan
for Recognition of Conversational Speech Over the Telephone website.
Data
This publication contains 40 sphere files encoded in two channel
interleaved mulaw for a total of 635,416,388 bytes (605 Mbytes) of
sphere data. The sphere headers have been modified from the original
Evaluation data by the addition of sample checksums to the 20 CALLHOME
data files.
An included documentation table contains information on the speech
segments to be processed as follows:
...
Updates
There are no updates at this time.
Content Copyright
Portions © 1997 Trustees of the University of Pennsylvania |