July 1997

This release of CallHome English data consists of 80 training, 20 devtest,
and 20 evaltest transcripts.

Two subdirectories can be found here:

    doc/
    transcrp/

Each of these contains the following:

doc/ contains eight files that describe the transcript corpus:

    ch_eng.doc    - describes English speech and transcripts
    callinfo.doc  - describes callinfo.tbl
    callinfo.tbl  - provides audit information for each channel
    devtest.ids   - provides a list of callids for the devtest set
    evaltest.ids  - provides a list of callids for the evaltest set
    spkrinfo.doc  - describes spkrinfo.tbl
    spkrinfo.tbl  - provides demographic information on the telephone
                    call originators
    train.ids     - provides a list of callids for the training set

transcrp/ contains three subdirectories:

    train/     contains 80 transcript files
    devtest/   contains 20 transcript files
    evaltest/  contains 20 transcript files
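
The layout described above can be checked after unpacking with a short script. The following is only a minimal sketch, not part of the release: the function name check_release and the root argument are hypothetical, and because this file does not state the transcript file extension, the sketch assumes that every regular file in train/, devtest/, and evaltest/ is a transcript.

```python
from pathlib import Path

# File names and counts are taken from the description in this README.
EXPECTED_DOC_FILES = [
    "ch_eng.doc",
    "callinfo.doc",
    "callinfo.tbl",
    "devtest.ids",
    "evaltest.ids",
    "spkrinfo.doc",
    "spkrinfo.tbl",
    "train.ids",
]
EXPECTED_TRANSCRIPT_COUNTS = {"train": 80, "devtest": 20, "evaltest": 20}


def check_release(root):
    """Return a list of layout problems; the list is empty if all checks pass.

    `root` is the directory that holds doc/ and transcrp/ (hypothetical
    argument; pass wherever the release was unpacked).
    """
    root = Path(root)
    problems = []

    # Each of the eight documentation files should exist under doc/.
    for name in EXPECTED_DOC_FILES:
        if not (root / "doc" / name).is_file():
            problems.append(f"missing doc/{name}")

    # Each transcript subdirectory should hold the stated number of files.
    # Assumption: every regular file in the subdirectory is a transcript.
    for subdir, expected in EXPECTED_TRANSCRIPT_COUNTS.items():
        directory = root / "transcrp" / subdir
        if not directory.is_dir():
            problems.append(f"missing transcrp/{subdir}/")
            continue
        found = sum(1 for entry in directory.iterdir() if entry.is_file())
        if found != expected:
            problems.append(
                f"transcrp/{subdir}/ has {found} files, expected {expected}"
            )

    return problems
```

Calling check_release(".") from the directory that contains doc/ and transcrp/ returns an empty list when the layout matches this description, and otherwise one message per discrepancy.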