Obtaining DataUsing DataProviding DataCreating Data
About LDCMembersCatalogProjectsPapersLDC OnlineSearchContact UsUPennHome

LDC Catalog | By Type and Source | By Year | Top Ten | Projects | Catalog Search



ATIS3 Training Data

Item Name: ATIS3 Training Data
Authors: Deborah A. Dahl, Madeleine Bates, Michael Brown, William Fisher, Kate Hunicke-Smith, David Pallett, Christine Pao, Alexander Rudnicky, Elizabeth Shriberg, John Garofolo, Jonathan Fiscus, Denise Danielson, Enrico Bocchieri, Bruce Buntschuh, Beverly Schwartz, Sandra Peters, Robert Ingria, Robert Weide, Yuzong Chang, Eric Thayer, Lynette Hirschman, Joe Polifroni, Bruce Lund, Goh Kawai, Tom Kuhn, and Lew Norton
LDC Catalog No.: LDC94S19
NIST Catalog No.: 17-1.1 through 17-3.1
ISBN: 1-58563-028-4
Data Type: speech
Sample Rate: 16000 Hz
Sampling Format: 1-channel pcm compressed
Data Source(s): microphone speech
Project(s): ATIS
Application(s): speech recognition, spoken dialogue systems
Language(s): English
Language ID(s): eng
Distribution: 1 DVD
Member fee: $0 for 1994 members
Non-member Fee: US$750.00
Reduced-License Fee: US$375.00
Extra-Copy Fee: US$200.00
Non-member License: yes
Readme File: yes
Online documentation: yes
Licensing Instructions: Subscription Members, Standard Members, Non-Members
Citation: Deborah A. Dahl, et al.
1994
ATIS3 Training Data
Linguistic Data Consortium, Philadelphia

The ATIS3 corpus, on three CD-ROMs, includes over 774 scenarios completed by 137 subjects, yielding a total of over 7,300 utterances. All utterances are transcribed and 2,900 of them have been categorized and annotated with canonical reference answers.

The relational database for this dataset included flight information for 46 cities and 52 airports. Data was collected at BBN, CMU, MIT and SRI, using their own ATIS systems and at NIST using systems provided by BBN and SRI.

Two 1,000-utterance test sets were set aside from the data pooled by the collection sites. The first set was used in a December 1993 ARPA test and is included in ATIS3. The second has been reserved for future testing.

Samples


About LDC | Members | Catalog | Projects | Papers | LDC Online | Search / Help | Contact Us | UPenn | Home | Obtaining Data | Creating Data | Using Data | Providing Da ta

Contact: ldc@ldc.upenn.edu

(c) 1992-2008 Linguistic Data Consortium, University of Pennsylvania. All Rights Reserved.