Speaker Recognition Corpus Release 1.1 Center for Spoken Language Understanding UPDATED: 29 March 2002 This document describes the file naming conventions used for this distribution and gives a brief description of the various file formats used. File Naming Convention ---------------------- Each filename in /speech and /trans directories encodes information about the call number, utterance type, and file type. A typical filename will look like: 00381aa1.wav The first four digits of the file name identify the caller. The fifth digit represents the session number during which the utterance was recorded. The two letters are a symbol indicating the prompt given the caller. The final number is the occurrence of the utterance for the session (some utterances were generated more than once per session) The final three-letter extension indicates the file type. The following types are in this distribution: wav The speech data txt The text-based transcription of the speech data File Formats ------------ The "wav" files contain speech data and use the RIFF standard file format. This file format is 16-bit linearly encoded. The "trans" file in the /docs directory is a list of all of the non-time-aligned word-level transcriptions (which comply with the conventions in the CSLU Labeling Guide). Each file transcription is on a separate line. Each line Is divided into several fields: * an individual identification number for the caller * the session number during which the utterance was recorded * a symbol representing the prompt given the caller * the occurrence of the utterance for the session (some utterances were generated more than once per session) * the gender of the caller * the approximate age range of the caller * an internal code * the actual transcription of the utterance FOR THIS VERSION OF THE CORPUS, THE CONTENTS OF THE TRANS.TXT FILE HAVE BEEN EXTRACTED INDIVIDUALLY INTO THE /TRANS DIRECTORY. The /trans directory file structure exactly parallels the structure of the /speech directory. Each file in the /trans directory is in .txt format and contains a line, as described in the previous paragraph, that uniquely defines each corresponding sound file.