ATIS3 Training Data
| Item Name: | ATIS3 Training Data |
| Authors: | Deborah A. Dahl, Madeleine Bates, Michael Brown, William Fisher, Kate Hunicke-Smith, David Pallett, Christine Pao, Alexander Rudnicky, Elizabeth Shriberg, John Garofolo, Jonathan Fiscus, Denise Danielson, Enrico Bocchieri, Bruce Buntschuh, Beverly Schwartz, Sandra Peters, Robert Ingria, Robert Weide, Yuzong Chang, Eric Thayer, Lynette Hirschman, Joe Polifroni, Bruce Lund, Goh Kawai, Tom Kuhn, and Lew Norton |
| LDC Catalog No.: | LDC94S19 |
| NIST Catalog No.: | 17-1.1 through 17-3.1 |
| ISBN: | 1-58563-028-4 |
| Data Type: | speech |
| Sample Rate: | 16000 Hz |
| Sampling Format: | 1-channel pcm compressed |
| Data Source(s): | microphone speech |
| Project(s): | ATIS |
| Application(s): | speech recognition, spoken dialogue systems |
| Language(s): | English |
| Language ID(s): | eng |
| Distribution: | 1 DVD |
| Member fee: | $0 for 1994 members |
| Non-member Fee: | US$750.00 |
| Reduced-License Fee: | US$375.00 |
| Extra-Copy Fee: | US$200.00 |
| Non-member License: | yes |
| Readme File: | yes |
| Online documentation: | yes |
| Licensing Instructions: | Subscription Members, Standard Members, Non-Members |
| Citation: | Deborah A. Dahl, et al. 1994. ATIS3 Training Data. Linguistic Data Consortium, Philadelphia. |
The ATIS3 corpus, on three CD-ROMs, includes over 774 scenarios
completed by 137 subjects, yielding a total of over 7,300 utterances.
All utterances are transcribed, and 2,900 of them have been categorized
and annotated with canonical reference answers.

The relational database for this dataset included flight information
for 46 cities and 52 airports. Data was collected at BBN, CMU, MIT,
and SRI using their own ATIS systems, and at NIST using systems
provided by BBN and SRI.

Two 1,000-utterance test sets were set aside from the data pooled by
the collection sites. The first set was used in a December 1993 ARPA
test and is included in ATIS3. The second has been reserved for
future testing.