



Discourse Graphbank

Item Name: Discourse Graphbank
Authors: Florian Wolf, Edward Gibson, Amy Fisher, and Meredith Knight
LDC Catalog No.: LDC2005T08
ISBN: 1-58563-320-8
Release Date: Mar 15, 2005
Data Type: text
Project(s): EARS, GALE
Application(s): discourse analysis, information retrieval, summarization
Language(s): English
Language ID(s): eng
Distribution: Web Download
Member fee: $0 for 2005 members
Non-member Fee: US$500.00
Reduced-License Fee: US$250.00
Extra-Copy Fee: N/A
Non-member License: yes
Licensing Instructions: Subscription Members, Standard Members, Non-Members
Citation: Florian Wolf, et al.
2005
Discourse Graphbank
Linguistic Data Consortium, Philadelphia

Introduction

The Discourse Graphbank was developed as Florian Wolf's Ph.D. thesis project. It aimed to define a descriptively adequate data structure for representing discourse coherence structures. The project also investigated the impact of discourse coherence structures on other linguistic processes and natural language applications (e.g., anaphor resolution, summarization, and information retrieval), and developed and tested discourse parsing algorithms.

Data

The data consists of 135 texts from AP Newswire and the Wall Street Journal, annotated with coherence relations. The source was UPenn TIPSTER.
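
As an illustration only, coherence annotations of this kind can be pictured as a labeled directed graph over discourse segments. The Python sketch below is not the corpus file format or the annotator tool; the class names, the example segments, and the relation labels are all hypothetical. It shows one way a graph, unlike a tree, lets a single segment receive more than one incoming coherence relation.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class CoherenceRelation:
    """A labeled, directed link between two discourse segments (hypothetical)."""
    source: int  # index of the segment the relation starts from
    target: int  # index of the segment the relation points to
    label: str   # illustrative label only, e.g. "elaboration"


@dataclass
class DiscourseGraph:
    """Discourse segments plus the coherence relations that connect them."""
    segments: list
    relations: list = field(default_factory=list)

    def add_relation(self, source, target, label):
        # Reject links that point outside the list of segments.
        for index in (source, target):
            if not 0 <= index < len(self.segments):
                raise IndexError(f"no segment with index {index}")
        self.relations.append(CoherenceRelation(source, target, label))

    def incoming(self, target):
        # All relations that point at the given segment.
        return [r for r in self.relations if r.target == target]

    def is_tree_like(self):
        # A tree allows at most one incoming relation per segment.
        return all(len(self.incoming(i)) <= 1 for i in range(len(self.segments)))


# Invented example text, not taken from the corpus.
graph = DiscourseGraph(segments=[
    "The company reported higher profits.",
    "Sales rose in every region.",
    "Its shares gained in early trading.",
])
graph.add_relation(1, 0, "elaboration")
tree_like_before = graph.is_tree_like()
graph.add_relation(2, 0, "cause-effect")
tree_like_after = graph.is_tree_like()
```

In this sketch the first segment ends up with two incoming relations, so the structure no longer satisfies the single-parent constraint of a tree. Storing relations as a flat edge list keeps that case representable without special handling.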

Samples

A screenshot of the output of the annotator tool has been provided as an example of this corpus.

Copyright

Portions © 1988-1990 Associated Press, © 1986-1989 Dow Jones & Company, Inc., © 2005 Trustees of the University of Pennsylvania



Contact: ldc@ldc.upenn.edu

(c) 1992-2008 Linguistic Data Consortium, University of Pennsylvania. All Rights Reserved.