Obtaining DataUsing DataProviding DataCreating Data
About LDCMembersCatalogProjectsPapersLDC OnlineSearchContact UsUPennHome

LDC Catalog | By Type and Source | By Year | Top Ten | Projects | Catalog Search



Levantine Arabic QT Training Data Set 4 (Speech + Transcripts)

Item Name: Levantine Arabic QT Training Data Set 4 (Speech + Transcripts)
Authors: Mohamed Maamouri (project head), Tim Buckwalter, and Hubert Jin
LDC Catalog No.: LDC2005S14
ISBN: 1-58563-342-9
Release Date: Jun 15, 2005
Data Type: speech
Sample Rate: 8000 Hz
Data Source(s): telephone conversations
Project(s): EARS, GALE
Language(s): Levantine Arabic, North Levantine Arabic, South Levantine Arabic
Language ID(s): AJP, APC
Distribution: 2 DVD
Member fee: $0 for 2005 members
Non-member Fee: US $3000.00
Reduced-License Fee: US $1500.00
Extra-Copy Fee: US $400.00
Non-member License: yes
Online documentation: yes
Licensing Instructions: Subscription Members, Standard Members, Non-Members
Citation: Mohamed Maamouri (project head), Tim Buckwalter, and Hubert Jin
2005
Levantine Arabic QT Training Data Set 4 (Speech + Transcripts)
Linguistic Data Consortium, Philadelphia


Introduction

This file contains documentation on the Levantine Arabic QT Training Data Set 4 (Speech + Transcripts), Linguistic Data Consortium (LDC) catalog number LDC2005S14 and ISBN 1-58563-342-9.

This release contains 901 calls and the total speech is 133.6 hours of telephone conversation in Levantine Arabic. Both audio and transcription files are included in this package.

The majority of speakers in this corpus are Lebanese. The data is similar to the training data in Set 3 [LDC2005S07, speech and LDC2005T03, transcripts]. The dialects are distributed as follows:

  • 171 JOR
  • 1373 LEB
  • 229 PAL
  • 29 SYR

Samples

For an example of this corpus, please review this audio sample.

Content Copyright

Portions © 2005 Trustees of the University of Pennsylvania


About LDC | Members | Catalog | Projects | Papers | LDC Online | Search / Help | Contact Us | UPenn | Home | Obtaining Data | Creating Data | Using Data | Providing Data

Contact: ldc@ldc.upenn.edu

(c) 1992-2010 Linguistic Data Consortium, University of Pennsylvania. All Rights Reserved.