What's New:

There is still time to save on 2017 membership fees. Through March 1, all organizations receive a discount on the 2017 membership fee (up to 10%) when they choose to join or renew.   

Plans for MY2017 publications are in progress. Among the expected releases are:

  • 2010 NIST SRE Speaker Recognition Evaluation Set
  • UCLA High Speed Laryngeal Database: audio recordings and high speed video-endoscopic images of the vocal folds while sustaining vowels
  • Noisy TIMIT: TIMIT with added artificial noise
  • CHiME Shared Task Data: noisy read WSJ Speech
  • BOLT: source, parallel and word-aligned data in all languages
  • Multilanguage conversational telephone speech in related languages/dialects: e.g., South Asian, Central European, English, Spanish
  • RATS Keyword Spotting data set 
  • First Year Law Students’ Memoranda: memos to a hypothetical court with annotation
  • IARPA Babel Language Packs: languages include Vietnamese, Haitian Creole, Zulu, Kazakh, Lithuanian

For more information on membership benefits, visit Join LDC

LDC Director Mark Liberman is the 2017 recipient of the IEEE James L. Flanagan Speech and Audio Processing Award. Established in 2002, this annual award recognizes an individual for his or her outstanding contribution to the advancement of speech and/or audio processing. Liberman’s pioneering contributions and continued leadership in robust, replicable, and data-driven speech and language science and engineering have fueled the development and advancement of human language technologies including speech and speaker recognition, machine translation, and semantic analysis. As LDC’s founder, Mark has shepherded the Consortium from a small organization to the largest developer of shared language resources, distributing more than 120,000 copies of over 2,000 databases covering 91 different languages to more than 3,600 organizations in over 70 countries. 

Liberman will receive the award at ICASSP 2017 in New Orleans (March 5-9). LDC will be an exhibitor at Booth 43. Please stop by and say hello. We hope to see you there.    

Congratulations to the recipients of LDC's Spring 2017 data scholarship:

Umad Ul Hassan and Muhammad Awais Zulfiqar: National University of Sciences and Technology (Pakistan); BS Computer Science. Hassan and Zulfiqar are awarded copies of CSLU: Kids’ Speech Version 1.1 and The CMU Kids Corpus for their research in speech recognition for children with learning difficulties. 

For information about the program, visit the Data Scholarship page.

Web pages about data management plans (DMPs) describe the Consortium’s capabilities to develop and implement project specific proposals. To satisfy requirements from funders like the National Science Foundation (NSF) that researchers deposit data in an accessible, trustworthy repository, LDC provides archiving services and makes data publicly available at a reasonable cost while protecting intellectual property rights and privacy concerns.

Browse the pages to learn more about the advantages of data center distribution, the details of NSF DMP requirements and the infrastructures and processes LDC has in place for storing and distributing resources over the long-term. 


We've revamped our user services to make it easier than ever to access LDC data. Now you can become an LDC member, request corpora, sign agreements and submit payment online directly from your LDC user account.

You’ll receive email notifications of key points in the transaction, when for instance, an order is created, agreements are signed, payment is received and data is shipped. You can also track the status of a transaction from your user account. 

Visit the new Managing Your LDC Account page to learn more about user accounts and their privileges and the steps for online transactions.

As always, thanks to our members, sponsors, collaborators and licensees for your continued support.

Podcasts from the complete set of staff interviews conducted as part of LDC's 20th Anniversary can be accessed from the LDC Blog. Hear what long-time staffers had to say about their experiences at LDC.

Christopher Cieri, Executive Director -- Chris reflects on the road that brought him to LDC, some of his early responsibilities and Consortium activities. 

Mohamed Maamouri, Senior Researcher -- Mohamed recounts his personal and professional experiences and comments on Arabic resource development at LDC.

David Graff, Lead Programmer -- Dave was one of LDC's first staff members and offers some insights on LDC's early days.

Yiwola Awoyale, Moussa Bamba, Researchers -- Yiwola and Moussa discuss how they came to LDC, their work on West African langauges and how it benefits multiple communities.

Natalia Bragilveskaya, Business Manager; Ilya Ahtaridis, Membership Coordinator; Marian Reed, Marketing Coordinator -- Natalia, Ilya and Marian recall the early days of LDC and the development of its interactions with the University of Pennsylvania, sponsors, members, licensees and collaborators.