Curation and Distribution Services

LDC offers a range of services that meet NSF’s requirements for data management plans and can be customized for a project’s particular needs.

Data Curation

The curation process is composed of four basic steps.

Implementing DMPs for Language Resources

For research communities working with language resources, it is clear that the NSF DMP covers all raw data and annotations, where raw data refers to any observation of linguistic behavior whether recorded as text or audiovisual media, and annotation includes transcription, translation and any tagging or coding of language form and meaning.

Details of NSF DMP Requirements

All applicants for National Science Foundation funding must include a Data Management Plan (DMP) that describes how a proposal will implement sharing and dissemination of research data and results.

The NSF Data Sharing Policy reads:

Advantages of Data Center Distribution

The sharing requirement of data management plans may be satisfied in a number of ways. Individual investigators and researchers can choose their personal websites as a distribution medium, the investigator’s institution may provide some sharing infrastructure in the form of a web site or archive and researchers can deposit funded data with established repositories such as data centers, to name a few.

Data Management Plans

Long before researchers were required to develop data management plans for their National Science Foundation (NSF) proposals, they deposited their research data at LDC in accordance with NSF’s long-standing desire that data generated with program funds should be readily accessible at a reasonable cost. Well known data sets in the series include The Santa Barbara Corpus of Spoken American English (multiple parts), Propbank and Grassfields Bantu Fieldwork.


Subscribe to Linguistic Data Consortium RSS