Obtaining DataUsing DataProviding DataCreating Data
About LDCMembersCatalogProjectsPapersLDC OnlineSearchContact UsUPennHome

LDC Catalog | By Type and Source | By Year | Top Ten | Projects | Catalog Search



Russian through Switched Telephone Network (RuSTeN)

Item Name: Russian through Switched Telephone Network (RuSTeN)
Authors: Anrey Raev, Serguei Koval, Natalia Smirnova, Daria Khitrova, and Vitaly Stepanov
LDC Catalog No.: LDC2006S34
ISBN: 1-58563-388-7
Release Date: Jul 21, 2006
Data Type: speech
Sample Rate: 11025 Hz
Sampling Format: 1-channel pcm
Data Source(s): telephone conversations
Application(s): speaker identification, speaker verification, speech recognition
Language(s): Russian
Language ID(s): rus
Distribution: 1 DVD
Member fee: $0 for 2006 members
Non-member Fee: US $1000.00
Reduced-License Fee: US $500.00
Extra-Copy Fee: US $200.00
Non-member License: yes
Online documentation: yes
Licensing Instructions: Subscription Members, Standard Members, Non-Members
Citation: Anrey Raev, et al.
2006
Russian through Switched Telephone Network (RuSTeN)
Linguistic Data Consortium, Philadelphia

Introduction

This file contains documentation on the Russian through Switched Telephone Network (RuSTeN), Linguistic Data Consortium (LDC) catalog number LDC2006S34 and ISBN 1-58563-388-7.

This corpus was developed as part of ?Trawl? (Automatic Voice Identification System in Telephone Channel). The purpose of the project was to develop software for automatic identification of speakers based on voice samples acquired through telephone channels. The training of the system was performed with the telephone speech corpus RuSTeN.

Data

The RuSTeN (Russian through Switched Telephone Network) database was recorded between March 2001 and February 2003 by Speech Technology Center using the "forget-me-not" professional telephone recording and archiving software package developed by STC. The files were recorded with sample frequency 11025 Hz, one-channel, 16-bit linear.

Each of the speakers made at least five calls from different locations and/or telephone sets. Most of the calls were made from home or an office environment with uncontrolled noise level. Additionally, one call per speaker was made from a public telephone (with either street or metro station noise in the background). The recordings are spontaneous (sometimes guided by the near-end speaker) conversations between the caller and the speech database collector on various subjects (the weather, the caller's biography, hobbies, etc.) and include approximately 150 seconds of the far-end and at least five seconds of the near-end speaker. Besides, each time the caller was asked to utter the usual digits set (0-9) and the words "yes" and "no."

The time interval between two successive sessions is at least two days. The database contains 125 speakers (far-end), 58 male and 67 female. Each far-end speaker is represented by at least five speech files. The sound files are in the wav-format. The speech filenames contain the following information: FFF (far-end speaker number) and SS (session number).

Samples

For an example of the data in this corpus, please review this audio sample.

Content Copyright

Portions © 2001 Speech Technology Center Limited, © 2006 Trustees of the University of Pennsylvania


About LDC | Members | Catalog | Projects | Papers | LDC Online | Search / Help | Contact Us | UPenn | Home | Obtaining Data | Creating Data | Using Data | Providing Data

Contact: ldc@ldc.upenn.edu

(c) 1992-2010 Linguistic Data Consortium, University of Pennsylvania. All Rights Reserved.