Obtaining DataUsing DataProviding DataCreating Data
About LDCMembersCatalogProjectsPapersLDC OnlineSearchContact UsUPennHome

LDC Catalog | By Type and Source | By Year | Top Ten | Projects | Catalog Search



VAHA (POLYPHONE II)

Item Name: VAHA (POLYPHONE II)
Authors: Yeshwant K. Muthusamy
LDC Catalog No.: LDC96S41
ISBN: 1-58563-107-8
Data Type: speech
Sample Rate: 8000 Hz
Sampling Format: 1-channel ulaw
Data Source(s): telephone speech
Application(s): speech recognition
Language(s): Spanish
Language ID(s): spa
Distribution: 1 DVD
Member fee: $0 for 1996 members
Non-member Fee: US $4200.00
Reduced-License Fee: US $2100.00
Extra-Copy Fee: US $200.00
Non-member License: yes
Readme File: yes
Online documentation: yes
Licensing Instructions: Subscription Members, Standard Members, Non-Members
Citation: Yeshwant K. Muthusamy
1996
VAHA (POLYPHONE II)
Linguistic Data Consortium, Philadelphia

Introduction

Voice Across Hispanic America (VAHA) is a corpus of Spanish telephone speech, recorded digitally from 915 native speakers of Spanish in various parts of the United States. With nearly 39,000 recorded and transcribed utterances, VAHA will be useful for a variety of research studies, but it is intended primarily for speech technology research and development in telecommunications applications. It is patterned after Macrophone (1), an American English corpus (LDC94S21) which is widely used for this purpose.

Data

This corpus was collected by Texas Instruments in Dallas, TX for the Linguistic Data Consortium at the University of Pennsylvania.

Updates

There are no updates at this time.

Copyright


About LDC | Members | Catalog | Projects | Papers | LDC Online | Search / Help | Contact Us | UPenn | Home | Obtaining Data | Creating Data | Using Data | Providing Data

Contact: ldc@ldc.upenn.edu

(c) 1992-2010 Linguistic Data Consortium, University of Pennsylvania. All Rights Reserved.