What's New:

LDC took linguistics to the street on May 2, offering fun language games to the community as a participant in the Philadelphia Science Festival. The all-day carnival drew 50,000 people and featured over 170 hands-on science activities from more than 90 organizations. At LDC’s booth, staff led hundreds of participants through three educational activities: a game to guess the language being spoken, a game to identify words spoken in different American dialects and an exercise that generated name tags for visitors in any of six different alphabets. It was a fun day for all.

The Philadelphia Science Festival is a nine-day, community-wide celebration of science that takes place annually in April, featuring lectures, debates, hands-on activities, special exhibitions and a variety of other informal science education experiences for Philadelphians of all ages. 

Not-for-profit and government organizations can now create a custom data collection from among LDC’s 2013 releases. The 2013 Data Pack allows users to license eight corpora published in 2013 for a flat rate of US$3500. Organizations acquire perpetual rights to the corpora licensed through the pack. The Data Pack is not a membership, and organizations must request all eight data sets at the time of purchase. The 2013 Data Pack is available for a limited time only, through September 15.

Options for the Data Pack include the following publications:

  • Greybeard
  • NIST 2012 Open Machine Translation (OpenMT) evaluation and progress sets
  • Chinese Treebank 8.0
  • GALE Arabic and Chinese speech and text releases
  • 1993-2007 United Nations Parallel Text
  • MADCAT training data
  • CSC Deceptive Speech 

To license the Data Pack and select eight corpora, log in or register for an LDC user account and add the 2013 Data Pack and each of the eight data sets to your bin. Follow the check-out procedure, sign all applicable user agreements and select payment via wire transfer, purchase order or check. LDC will adjust the invoice total to reflect the Data Pack fee.

To pay via credit card, add the 2013 Data Pack to your bin and check out using the system prompts. At the completion of the transaction, send an email to ldc@ldc.upenn.edu indicating the eight data sets to include in your order.


We've revamped our user services to make it easier than ever to access LDC data. Now you can become an LDC member, request corpora, sign agreements and submit payment online directly from your LDC user account.

You’ll receive email notifications at key points in the transaction, for instance when an order is created, agreements are signed, payment is received and data is shipped. You can also track the status of a transaction from your user account.

Visit the new Managing Your LDC Account page to learn more about user accounts and their privileges and the steps for online transactions.

As always, thanks to our members, sponsors, collaborators and licensees for your continued support.

Podcasts from the complete set of staff interviews conducted as part of LDC's 20th Anniversary can be accessed from the LDC Blog. Hear what long-time staffers had to say about their experiences at LDC.

Christopher Cieri, Executive Director -- Chris reflects on the road that brought him to LDC, some of his early responsibilities and Consortium activities. 

Mohamed Maamouri, Senior Researcher -- Mohamed recounts his personal and professional experiences and comments on Arabic resource development at LDC.

David Graff, Lead Programmer -- Dave was one of LDC's first staff members and offers some insights on LDC's early days.

Yiwola Awoyale and Moussa Bamba, Researchers -- Yiwola and Moussa discuss how they came to LDC, their work on West African languages and how it benefits multiple communities.

Natalia Bragilevskaya, Business Manager; Ilya Ahtaridis, Membership Coordinator; Marian Reed, Marketing Coordinator -- Natalia, Ilya and Marian recall the early days of LDC and the development of its interactions with the University of Pennsylvania, sponsors, members, licensees and collaborators.