Obtaining DataUsing DataProviding DataCreating Data
About LDCMembersCatalogProjectsPapersLDC OnlineSearchContact UsUPennHome

LDC Catalog | By Type and Source | By Year | Top Ten | Projects | Catalog Search



English Chinese Translation Treebank v 1.0

Item Name: English Chinese Translation Treebank v 1.0
Authors: Ann Bies, Martha Palmer, Justin Mott, Colin Warner
LDC Catalog No.: LDC2007T02
ISBN: 1-58563-408-5
Release Date: Jan 22, 2007
Data Type: text
Data Source(s): newswire
Project(s): TIDES
Application(s): machine translation, natural language processing, parsing, tagging
Language(s): English
Language ID(s): eng
Distribution: Web Download
Member fee: $0 for 2007 members
Non-member Fee: US $500.00
Reduced-License Fee: US $250.00
Extra-Copy Fee: N/A
Non-member License: yes
Online documentation: yes
Licensing Instructions: Subscription Members, Standard Members, Non-Members
Citation: Ann Bies, et al.
2007
English Chinese Translation Treebank v 1.0
Linguistic Data Consortium, Philadelphia

Description

This release of English Chinese Translation Treebank v. 1.0 consists of 146,300 words in 325 files of individual news stories from Xinhua News Agency (corresponding to the Xinhua data in Chinese Treebank 5.0, LDC Catalog No. LDC2005T01) that are translated into English, part-of-speech tagged and treebanked. The files were compressed using gzip.

The source files for the treebank annotation contain the final updated translation of these files. Translation errors that prevented complete treebank annotation have been corrected. This translation and annotation were completed in October 2004 and supersede any earlier translation.

This publication was compiled under Natinal Science Foundation Grant #IIS-0325646.

Samples

For an example of the data in this publication, please view this sample.

Copyright

Portions © 1994-1998 Xinhua News Agency, © 2004, 2007 Trustees of the University of Pennsylvania


About LDC | Members | Catalog | Projects | Papers | LDC Online | Search / Help | Contact Us | UPenn | Home | Obtaining Data | Creating Data | Using Data | Providing Data

Contact: ldc@ldc.upenn.edu

(c) 1992-2010 Linguistic Data Consortium, University of Pennsylvania. All Rights Reserved.