Prague Dependency Treebank 2.0
| |
| Item Name: | Prague Dependency Treebank 2.0 |
| Authors: | Jan Hajič, Jarmila Panevová, Eva Hajičová, Petr Sgall, Petr Pajas, Jan Štěpánek, Jiří Havelka, and Marie Mikulová |
| LDC Catalog No.: | LDC2006T01 |
| ISBN: | 1-58563-370-4 |
| Release Date: | Jul 21, 2006 |
| Data Type: | text |
| Data Source(s): | journal articles, news magazine, newswire |
| Application(s): | information extraction, information retrieval, language modeling, language teaching, parsing, tagging |
| Language(s): | Czech |
| Distribution: | 1 CD |
| Member fee: | $0 for 2006 members |
| Non-member Fee: | US $100.00 |
| Reduced-License Fee: | US $100.00 |
| Extra-Copy Fee: | US $100.00 |
| Non-member License: | yes |
| Member License: | yes |
| Online documentation: | yes |
| Licensing Instructions: | Subscription Members, Standard Members, Non-Members |
| Citation: | Jan Hajič, et al. 2006. Prague Dependency Treebank 2.0. Linguistic Data Consortium, Philadelphia. |
|
Introduction
The Prague Dependency Treebank 2.0 (PDT 2.0) contains a
large amount of Czech text with complex, interlinked
annotation at three levels: morphological (2 million words),
syntactic (1.5 million words) and complex semantic
(0.8 million words). In addition, certain properties of
sentence information structure and coreference relations
are annotated at the semantic level.
PDT 2.0 is based on the long-standing Praguian linguistic
tradition, adapted to the needs of current computational
linguistics research. The corpus itself uses the latest
annotation technology. Software tools for corpus search,
annotation and language analysis are included. Extensive
documentation (in English) is provided as well.
Samples
For an example of the data in this publication, please examine these samples.
Content Copyright
Portions © 1991, 1994, 1995 Lidové noviny
daily newspapers, © 1992 Mladá fronta Dnes daily newspapers, © 1994
Českomoravský Profit business weekly, © 1992-1993 Vesmír scientific
magazine, Academia Publishers, © 1996-2005 Institute of Formal and
Applied Linguistics and Center for Computational Linguistics, Faculty of
Mathematics and Physics, Charles University, © 2006 Trustees of the
University of Pennsylvania