Obtaining DataUsing DataProviding DataCreating Data
About LDCMembersCatalogProjectsPapersLDC OnlineSearchContact UsUPennHome

LDC Catalog | By Type and Source | By Year | Top Ten | Projects | Catalog Search



Chinese News Translation Text Part 1

Item Name: Chinese News Translation Text Part 1
Authors: Xiaoyi Ma
LDC Catalog No.: LDC2005T06
ISBN: 1-58563-329-1
Release Date: Mar 15, 2005
Data Type: text
Data Source(s): newswire
Project(s): GALE, TIDES
Application(s): cross-lingual information retrieval, language teaching, machine translation
Language(s): English, Mandarin Chinese
Language ID(s): cmn, eng
Distribution: Web Download
Member fee: $0 for 2005 members
Non-member Fee: US$2000.00
Reduced-License Fee: US$1000.00
Extra-Copy Fee: N/A
Non-member License: yes
Online documentation: yes
Licensing Instructions: Subscription Members, Standard Members, Non-Members
Citation: Xiaoyi Ma
2005
Chinese News Translation Text Part 1
Linguistic Data Consortium, Philadelphia

Introduction

Chinese News Translation Text Part 1 was produced by Linguistic Data Consortium (LDC) catalog number LDC2005T06 and ISBN 1-58563-329-1.

To support the development of automatic machine translation systems, the LDC was sponsored to solicit English translations for a single set of Chinese source materials.

The source Chinese text and its English translations were selected and translated in different LDC projects during the time period of February 2003 to January 2005. A total of about 474K Chinese characters were selected from two sources, namely Xinhua and AFP, and translation services were provided by seven translation agencies. Each Chinese news story was translated once.

All stories and its translations were created for TIDES Machine Translation as training data, following roughly the same guidelines and procedures.

Samples

To see an example of this corpus, please examine this translation file.

Copyright

Portions © 2002-2004 Xinhua News Agency, 2002-2004 Agence France-Presse, © 2005 Trustees of the University of Pennsylvania


About LDC | Members | Catalog | Projects | Papers | LDC Online | Search / Help | Contact Us | UPenn | Home | Obtaining Data | Creating Data | Using Data | Providing Da ta

Contact: ldc@ldc.upenn.edu

(c) 1992-2008 Linguistic Data Consortium, University of Pennsylvania. All Rights Reserved.