What's New:

LDC conducted its fourth user survey in December 2015. This survey built on the previous surveys conducted in 2006, 2007 and 2012 to assess user sentiment, and also asked respondents to evaluate key LDC-related topics, including:
  • Opinions on the new website and usability of the Catalog
  • Use of and satisfaction with the enhanced user services and e-commerce system
  • LDC’s Data Management Plan capabilities
  • Suggestions for future publications and preferred data delivery methods
  • Use of web services for data access and processing
Overall, survey respondents were satisfied with LDC’s data, membership options, website, Catalog and enhanced user services. Participants cited the top five most useful corpora received between 2012 and 2015 as OntoNotes Release 5.0, TIMIT, TAC KBP Reference Knowledge Base, Penn Discourse Treebank V 2.0, and Multi-Channel WSJ Audio. Three-fourths of respondents prefer digital delivery of data, and the top three languages for current research demands were identified as English, Chinese and Spanish.
We thank everyone who participated in this survey. Responses will benefit the future of the Consortium and will help LDC to better meet the needs of our members and data licensees.

Applications are now being accepted through Thursday, September 15, 2016 for the Fall 2016 LDC Data Scholarship program. The LDC Data Scholarship program provides university students with access to LDC data at no cost.

This program is open to students pursuing both undergraduate and graduate studies in an accredited college or university. LDC Data Scholarships are not restricted to any particular field of study; however, students must demonstrate a well-developed research agenda and a bona fide inability to pay. The selection process is highly competitive.

The application consists of two parts:
(1) Data Use Proposal. Applicants must submit a two-page proposal describing their intended use of the data. The proposal should state which data the student plans to use, how the data will benefit their research project, the proposed methodology or algorithm which will be used and how success will be measured.

Applicants should consult the Catalog for a complete list of data distributed by LDC. Due to certain restrictions, a handful of LDC corpora are available only to members of the Consortium. Applicants are advised to select one or two databases at most.

(2) Letter of Support. Applicants must submit one letter of support from their thesis adviser or department chair. The letter must be signed and printed on letterhead, describe the student and the research, evaluate the probability of success and confirm that the department or university lacks the funding to pay the full non-member fee for the data. 

For further information on application materials and program rules, please visit the LDC Data Scholarship page.

Web pages about data management plans (DMPs) describe the Consortium’s capabilities to develop and implement project-specific proposals. To satisfy requirements from funders like the National Science Foundation (NSF) that researchers deposit data in an accessible, trustworthy repository, LDC provides archiving services and makes data publicly available at a reasonable cost while protecting intellectual property rights and privacy.

Browse the pages to learn more about the advantages of data center distribution, the details of NSF DMP requirements and the infrastructures and processes LDC has in place for storing and distributing resources over the long term.


We've revamped our user services to make it easier than ever to access LDC data. Now you can become an LDC member, request corpora, sign agreements and submit payment online directly from your LDC user account.

You’ll receive email notifications at key points in the transaction, for instance when an order is created, agreements are signed, payment is received and data is shipped. You can also track the status of a transaction from your user account.

Visit the new Managing Your LDC Account page to learn more about user accounts and their privileges and the steps for online transactions.

As always, thanks to our members, sponsors, collaborators and licensees for your continued support.

Podcasts from the complete set of staff interviews conducted as part of LDC's 20th Anniversary can be accessed from the LDC Blog. Hear what long-time staffers had to say about their experiences at LDC.

Christopher Cieri, Executive Director -- Chris reflects on the road that brought him to LDC, some of his early responsibilities and Consortium activities. 

Mohamed Maamouri, Senior Researcher -- Mohamed recounts his personal and professional experiences and comments on Arabic resource development at LDC.

David Graff, Lead Programmer -- Dave was one of LDC's first staff members and offers some insights on LDC's early days.

Yiwola Awoyale, Moussa Bamba, Researchers -- Yiwola and Moussa discuss how they came to LDC, their work on West African languages and how it benefits multiple communities.

Natalia Bragilevskaya, Business Manager; Ilya Ahtaridis, Membership Coordinator; Marian Reed, Marketing Coordinator -- Natalia, Ilya and Marian recall the early days of LDC and the development of its interactions with the University of Pennsylvania, sponsors, members, licensees and collaborators.