Chinese News Translation Text Part 1 was produced by the Linguistic Data Consortium (LDC),
catalog number LDC2005T06 and ISBN 1-58563-329-1.
To support the development of automatic machine translation systems, the LDC was sponsored to solicit
English translations for a single set of Chinese source materials.
The source Chinese text was selected and its English translations were
produced in different LDC projects between February 2003 and
January 2005. About 474K Chinese characters in total were selected from two
sources, Xinhua and AFP, and seven translation agencies
provided the translations. Each Chinese news story was translated
All stories and their translations were created as
training data for TIDES Machine Translation, following roughly
the same guidelines and procedures.
Portions © 2002-2004 Xinhua News Agency, © 2002-2004 Agence France-Presse,
© 2005 Trustees of the University of Pennsylvania