LDC Supports Data Management Plans


US funding agencies such as the National Science Foundation (NSF) increasingly require researchers to deposit data in an accessible, trustworthy archive. The Consortium’s expertise in data curation, distribution and management and its commitment to the broad accessibility of linguistic data make it the repository of choice for NSF-funded data.

LDC administers data management plans by providing archiving services and making data publicly available at a reasonable cost while protecting intellectual property rights and accommodating privacy concerns. In addition, LDC has infrastructure and processes in place for reviewing, storing and distributing resources over the long term, a key element of data management plans in general.

Data sets developed and/or distributed with NSF funding include Arabic Broadcast News Speech and Transcripts, Grassfields Bantu Fieldwork, Penn Discourse Treebank, Propbank, SLX Corpus of Classic Sociolinguistic Interviews, Subglottal Resonances Database, The Santa Barbara Corpus of Spoken American English (multiple parts), Translanguage English Database and Speech in Noisy Environments (SPINE) (multiple releases).

To learn more about how LDC can assist researchers in developing and implementing data management plans, visit our website, consult our data sheet, or contact LDC Data Management Plans.

LDC User Accounts


The key to accessing LDC data is your user account. There are three types of accounts within the LDC system: guest, organization user and administrator.

Guest status is the initial step in registering a new organization or joining an organization’s existing LDC account. Guests cannot download or license data. When a guest account is created and the guest has indicated an affiliation with an existing organization account, a request for approval is sent to that account’s administrator(s). Once approved, guest status changes to organization user. Individuals who join as guests will eventually be affiliated with a new or existing organization account if data is requested.

Organization users can download and license data and view the licensing history of their organization.

Administrators have the additional authority to approve, create and remove user and guest accounts; they can also view and print invoices and quotes.

To learn more about user account roles, visit our website or register your new account today.

Confidence in LDC Language Resources


The LDC Catalog is recognized as a trustworthy data repository under the CoreTrustSeal certification established by the ICSU World Data System and the Data Seal of Approval. This means that the Catalog meets a series of high standards regarding data access, rights management, curation and archival storage. As researchers and funding authorities increasingly require that data remain accessible, useful and meaningful into the future, data repositories are more important than ever. The Catalog has served as a permanent archive for language resources since its inception. LDC joins the other certified organizations in the commitment to promote and maintain sustainable and trustworthy data infrastructures.

Penn GEF Americas Workshop

International Workshop on Data Intensive Research on Languages of the Americas

Organized by LDC with the support of the Penn Global Engagement Fund, this two-day workshop in Mexico City on May 24-25, 2018, brought together linguists and computer scientists from Mexico, Brazil, Chile, Argentina, and the United States to discuss the opportunities and challenges of constructing and sharing language resources in the languages of the Americas, e.g., Spanish, Portuguese, Caribbean dialects, and indigenous languages.