Obtaining DataUsing DataProviding DataCreating Data
About LDCMembersCatalogProjectsPapersLDC OnlineSearchContact UsUPennHome

LDC Catalog | By Type and Source | By Year | Top Ten | Projects | Catalog Search



Articulation Index

Item Name: Articulation Index
Authors: Jonathan Wright
LDC Catalog No.: LDC2005S22
ISBN: 1-58563-346-1
Release Date: Sep 15, 2005
Data Type: speech
Sample Rate: 16000 Hz
Sampling Format: pcm
Data Source(s): microphone speech
Application(s): language identification, language modeling, natural language processing, parsing, pronunciation modeling, pronunciation modeling
Language(s): English
Language ID(s): eng
Distribution: 1 CD, 1 DVD
Member fee: $0 for 2005 members
Non-member Fee: US $1500.00
Reduced-License Fee: US $750.00
Extra-Copy Fee: US $400.00
Non-member License: yes
Online documentation: yes
Licensing Instructions: Subscription Members, Standard Members, Non-Members
Citation: Jonathan Wright
2005
Articulation Index
Linguistic Data Consortium, Philadelphia

Introduction

This file contains documentation on the Articulation Index, Linguistic Data Consortium (LDC) catalog number LCD2005S22 and ISBN 1-58563-346-1.

The Articulation Index Corpus was partly inspired by the work of Harvey Fletcher, who performed a number of perceptual experiments involving English syllables during the first half of the 20th century. His term "articulation index" meant something like "perceptual index of syllables," where those syllables were not necessarily words, and reflected how well speakers could correctly identify syllables in the presence of noise. This corpus was created to facilitate similar experiments, as well as to potentially facilitate new methods in speech recognition research.

The basic concept behind the corpus is to record speakers pronouncing syllables of English, some of which might be real words, but most of which are nonsense syllables. The goal was to have each speaker say a set of 2,000 syllables common to all speakers, as well as a set of 20 syllables unique to that speaker. This goal was nearly met, but not precisely; see below for a description of the syllable inventory.

Samples

For an example of this corpus, please review this audio sample.

Content Copyright

© 2005 Trustees of the University of Pennsylvania


About LDC | Members | Catalog | Projects | Papers | LDC Online | Search / Help | Contact Us | UPenn | Home | Obtaining Data | Creating Data | Using Data | Providing Data

Contact: ldc@ldc.upenn.edu

(c) 1992-2010 Linguistic Data Consortium, University of Pennsylvania. All Rights Reserved.