Meeting Recording Transcription
As part of NIST's Rich Transcription 2007 Meeting Recognition Evaluation, LDC is currently producing 18 hours of quick transcripts for training data, to support automatic speech-to-text transcription and speaker segmentation in the meeting domain.
The links below
provide additional information about LDC's transcription efforts for Meeting Recognition.
This method produces accurate time-aligned
transcripts within a
limited timeframe. The QTR method excludes manual segmentation,
special markup and multiple quality checks in favor of a single focused
transcription pass. LDC followed the QTR method for the transcription
of more than ten hours of training data for the RT-07 project.
The
careful transcription effort involves multiple passes over the
data. Annotators first manually segment speaker turns and (for
broadcast data) story boundaries. Annotators then create a verbatim
transcript by listening to each segment in turn. A second pass
verifies the accuracy of the segment boundaries and transcript itself,
revisits difficult sections, and adds information like speaker
identity, background noise conditions, plus special markup for
mispronounced words, proper names, acronyms, partial words and other
phenomena. Further scans over the data identify common errors, conduct
spelling and syntax checks, and standardize the spelling of personal,
organization and other names across the transcripts. This
technique was used for transcription of the evaluation corpora for the Rich Transcription 2005, 2006, and 2007 Spring Meeting Recognition Evaluations.
mlglenn@ldc.upenn.edu
Last modified: May 7 2007