Speaker Recognition Corpus
Release 1.1
Center for Spoken Language Understanding
UPDATED: 29 March 2002


Directory Structure
-------------------

This document describes the directory structure of this release:

readme.txt   General information regarding the corpus.

docs/        The documentation directory. This directory contains
             further documentation for the Speaker Recognition corpus.

labels/      Phonetic labeling directory. This directory would contain
             phonetic labeling information if it existed. For this
             corpus, it is empty.

misc/        Miscellaneous directory, possibly containing software
             tools and scripts.

speech/      The speech directory contains the actual .wav files.
             There are many numbered subdirectories within the speech
             directory.

trans/       The transcriptions directory. This directory contains
             non-time-aligned word-level transcriptions of each of the
             speech files.

This corpus requires approximately 4.8GB of disk space.

Visually, the directory structure looks something like this:

                          spkrec
                             |
   ----------------------------------------------------
   |           |        |        |        |          |
readme.txt   /docs   /labels   /misc   /speech    /trans

This release contains a /speech directory that holds all of the
recorded utterances. The /speech directory has one subdirectory for
each speaker. Within each speaker subdirectory is another layer of
subdirectories, one for each recording session. The file names also
encode the speaker id and session number. See the formats.txt file in
the /docs directory for more information.

Along with the speech files, Release 1.1 contains transcriptions of
nearly all of the utterances, plus gender and age information for each
speaker. All of this information is found in the trans.txt file in the
/docs directory. In addition, the individual transcriptions have been
placed in the /trans directory, whose structure exactly parallels that
of the /speech directory.
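
Because /trans parallels /speech, the transcription for a given speech
file can be located by swapping the top-level directory. The Python
sketch below illustrates this under stated assumptions; it is not part
of the release. The function names (parallel_trans_path,
iter_utterances) are hypothetical, and the ".txt" transcription
extension is an assumption: check formats.txt in /docs for the actual
naming before relying on it. Since not every utterance is transcribed,
test that the returned path exists before opening it.

```python
from pathlib import Path


def parallel_trans_path(wav_path, corpus_root, trans_ext=".txt"):
    """Map speech/<speaker>/<session>/<name>.wav to the parallel trans/ path.

    trans_ext is an ASSUMPTION; the real extension is given in docs/formats.txt.
    """
    corpus_root = Path(corpus_root)
    # Path of the .wav file relative to the speech directory.
    rel = Path(wav_path).relative_to(corpus_root / "speech")
    # Same relative path under trans/, with the assumed extension.
    return (corpus_root / "trans" / rel).with_suffix(trans_ext)


def iter_utterances(corpus_root):
    """Yield (speaker_dir, session_dir, wav_path, trans_path) for each .wav.

    Relies on the two-level layout described above:
    speech/<speaker>/<session>/<file>.wav
    """
    corpus_root = Path(corpus_root)
    for wav in sorted((corpus_root / "speech").glob("*/*/*.wav")):
        speaker_dir, session_dir = wav.parts[-3], wav.parts[-2]
        yield speaker_dir, session_dir, wav, parallel_trans_path(wav, corpus_root)
```

For example, looping over iter_utterances("spkrec") and keeping only
the entries whose trans_path.exists() is true gives the subset of
utterances that have an individual transcription file.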