Introduction
The CALLFRIEND project
supports the development of language identification technology.
Data
The corpus consists of 60 unscripted telephone conversations,
lasting between 5-30 minutes. The corpus also includes
documentation describing speaker information (sex, age, education,
callee telephone number) and call information (channel quality, number
of speakers).
For each conversation, both the caller and callee are native speakers
of Mandarin Chinese from Mainland China. All calls are domestic and
were placed inside the continental United States and Canada.
Callers in the "Mainland" and "Taiwan" collections of CALLFRIEND
Mandarin were identified primarily on the basis of specific attributes
in their speech characteristic of geographic origin.
Updates
There are no updates at this time.
Content Copyright |