Discourse Graphbank
| Item Name: | Discourse Graphbank |
| Authors: | Florian Wolf, Edward Gibson, Amy Fisher, and Meredith Knight |
| LDC Catalog No.: | LDC2005T08 |
| ISBN: | 1-58563-320-8 |
| Release Date: | Mar 15, 2005 |
| Data Type: | text |
| Project(s): | EARS, GALE |
| Application(s): | discourse analysis, information retrieval, summarization |
| Language(s): | English |
| Language ID(s): | eng |
| Distribution: | Web Download |
| Member Fee: | $0 for 2005 members |
| Non-member Fee: | US $500.00 |
| Reduced-License Fee: | US $250.00 |
| Extra-Copy Fee: | N/A |
| Non-member License: | yes |
| Licensing Instructions: | Subscription Members, Standard Members, Non-Members |
| Citation: | Florian Wolf, et al. 2005. Discourse Graphbank. Linguistic Data Consortium, Philadelphia. |
Introduction
The Discourse Graphbank was developed as part of Florian Wolf's Ph.D. thesis, which aimed to define a descriptively adequate data structure for representing discourse coherence structures. The project also investigated the impact of discourse coherence structures on other linguistic processes and natural language applications (e.g., anaphor resolution, summarization, information retrieval), and developed and tested discourse parsing algorithms.
Data
The data consists of 135 texts from AP Newswire and the Wall Street Journal, annotated with coherence relations. The source texts come from UPenn TIPSTER.
Samples
A screenshot of the output of the annotator tool has been provided as an example of this corpus.
Copyright
Portions © 1988-1990 Associated Press, © 1986-1989 Dow Jones & Company, Inc., © 2005 Trustees of the University of Pennsylvania