
Linguistic Resources  
Milestones and Celebrations

Information on our landmark corpora distributions and celebrations for our 10th and 15th anniversary years.

65,000th LDC Corpus Distributed! - February 22, 2010

LDC has recently reached another milestone.  Two years after having distributed our 50,000th corpus, we have just distributed our 65,000th!  To help us celebrate, we took the names of all the organizations that had licensed data on the day we distributed our 65,000th corpus and tossed them into a Phillies baseball cap. 

We then randomly drew a name, and the winner is ... Swarthmore College and Universidad Carlos III de Madrid!  That's not a typo: we have two lucky winners!  We are celebrating our 65,000th distribution by awarding Swarthmore College and Universidad Carlos III de Madrid a benefit of US$2000 each. The benefit can be used towards membership or data licensing fees at any time this year.

Swarthmore College and Universidad Carlos III de Madrid join our other recipients of landmark corpora distributions:

  •     Helsinki University of Technology, Adaptive Informatics Research Centre (AIRC) - licensed our 50,000th distribution in January 2008.
  •     Instituto de Engenharia de Sistemas e Computadores (INESC) - licensed our 40,000th distribution in November 2006.
  •     University of Hawai'i, Manoa, Language Analysis and Experimentation Laboratories - licensed our 15,000th distribution in April 2002.

We would like to thank both members and non-members for helping the LDC reach this landmark distribution. The unceasing demand for LDC data from over 2800 organizations supports our mission to develop and share resources for research in human language technologies. 


About our winners:

Swarthmore College ~ The Department of Computer Science offers courses that emphasize the fundamental concepts of computer science, treating today's languages and systems as current examples of the underlying concepts. By educating students to think conceptually, we are preparing them to adapt to developments in this dynamic field.

Universidad Carlos III de Madrid ~ The Multimedia Processing Group aims to make a significant research contribution to the field of multimedia processing, especially focusing on combining signal analysis tools with emerging machine learning methods. Projects include automatic multimedia indexing, automatic speech recognition, and last-generation video coding.



50,000th LDC Corpus Distributed! - January 22, 2008

Last year marked the LDC's 15th Anniversary Year and it proved to be an exciting one for the LDC. We commemorated this anniversary with a Fidelity Celebration which rewarded our loyal members who continually support the consortium through membership. Additionally, we provided our list serve readers with a glimpse into the research activities at the LDC through each of our monthly Spotlights.

At the very end of our anniversary year, the LDC observed another significant milestone: the distribution of our 50,000th publication! This corpus was licensed by Helsinki University of Technology, Adaptive Informatics Research Centre (AIRC). AIRC's research includes basic algorithmic analysis, multimodal interfaces (speech, vision and language), bioinformatics, neuroinformatics and computational cognitive systems. In appreciation, the LDC is offering Helsinki University of Technology a US$2000 benefit to be used towards membership or data licensing fees.

We would like to thank both members and nonmembers for helping the LDC reach this landmark distribution. Your persistent demand for LDC data supports our mission to develop and share resources for research in human language technologies.


LDC Fidelity Celebration Winners! - July 17, 2007

On July 2, 2007, we conducted a blind selection process to determine the winners of the 15th Anniversary Fidelity Celebration. We are pleased to announce the following three winners:

* 3-4 year member – University of Ljubljana, Slovenia
* 5-9 year member – University of Arizona, Library Systems
* 10-15 year member – Institute for Language, Speech and Hearing, University of Sheffield, UK

Congratulations to these three universities. We would especially like to thank all long- and short-term members of the LDC, who continue to foster a strong linguistic community.

About our winners:

University of Ljubljana – The Laboratory of Artificial Perception, Systems and Cybernetics (LUKS) pursues the goal of promoting research and education in the areas of signal processing, pattern classification, analysis and understanding, speech technologies, biometrics, intelligent systems and information theory and coding.

University of Arizona, Library Systems - University of Arizona Library provides Internet access to university students via its online gateway to library materials and to resources on the World Wide Web. In addition, the Library furnishes public access to these and other information resources from personal computers located throughout the Main and branch libraries.

Institute for Language, Speech and Hearing, ILASH – ILASH aims to provide a focus and common resource for interdisciplinary research in language, speech and hearing by fostering collaboration in these and related areas. Projects include information retrieval, natural language processing, auditory modeling, artificial intelligence, linguistics, speech science, speech technology, computational psychology and neural networks.



LDC Fidelity Celebration - June 20, 2007

In honor of our 15th anniversary, the LDC is holding a Fidelity Celebration to thank faithful members for their support. Organizations that have been LDC members for three consecutive years or more are eligible for benefits that can be used in 2007 for corpora purchases, including reduced-license and extra copy fees, or for membership discounts.

    If your organization has been a consecutive member for:

    • 3-4 years, 2007 inclusive, you are eligible to receive a $250 benefit

    • 5-9 years, 2007 inclusive, Non-Profit members are eligible for a $500 benefit and For-Profit members are eligible for a $1500 benefit

    • 10-15 years, 2007 inclusive, Non-Profit members are eligible for a $3500 benefit and For-Profit members are eligible to receive a $7500 benefit


    Eligible organizations will be notified by June 20, 2007. We will hold blind drawings on July 2, 2007 and winners will be notified within 24 hours.

    Thank you to all who have supported the LDC throughout the past 15 years!



    LDC Celebrates its Fifteenth Anniversary! - April 18, 2007



    April 15, 2007 marked the start of the LDC's 15th Anniversary year!  We have many milestones to celebrate this year, including the growth of our staff to over 40 full-time employees and an online catalog that includes over 350 linguistic databases.  Since 1992, no fewer than 2,300 organizations from over 80 different nations have licensed LDC data.  This data has been made available through donations, funded projects at LDC or elsewhere, community initiatives, and, increasingly, LDC initiatives.  Over the past fifteen years, the LDC has grown from an organization that shares existing language technology resources to one that is also at the forefront of creating new data resources, software tools, and standards.

    In each of our monthly newsletters for the remainder of the year, we will highlight one aspect of the LDC - from our work in human subject collections, to our progress in Arabic treebanking, to the technical challenges of collecting and storing high volumes of broadcast news. 

    As we celebrate throughout the year, look for new membership offerings and announcements.  And be sure to join us as we count down to the much anticipated distribution of our 50,000th publication.


    40,000th LDC Corpus Distributed! - November 17, 2006


    In 2002, the LDC celebrated its tenth anniversary and the distribution of our 15,000th corpus.  At that time, the LDC recognized the continued support of its constituent members by offering a free membership to the university which had licensed the 15,000th corpus. Four short years and many requests for data later, we are excited to have recently distributed our 40,000th corpus!   This corpus was licensed by Instituto de Engenharia de Sistemas e Computadores (INESC), a research institute in Lisbon, Portugal.  In appreciation, the LDC is offering INESC a free 2007 Standard Not-for-Profit Membership. 

    We would like to thank all members and nonmembers for helping the LDC reach this landmark distribution.  The growing demand for LDC data from over 2000 organizations supports our mission to develop and share resources for research in linguistic technologies.  At the increased rate that we are distributing corpora, we anticipate the swift observance of our 50,000th distribution.  Stay tuned...


    15,000th LDC Corpus Distributed! - April 16, 2002


    As mentioned previously, 2002 marks the tenth year of LDC's operations! For the record, our official 'birthday' was April 15th and we're trying to work in some numerology here. So, yesterday was April 15, the day after US TAX day (the FIFTEENTH), and we've sent out our FIFTEEN thousandth corpus, and we're announcing a HUB FIVE publication on our TENTH anniversary ... which adds up to FIFTEEN. All of which means something we're sure ... we're just not sure what.

    In any event, to help celebrate this anniversary, we will be making a series of  announcements all year long to keep our membership informed of LDC news and activities.

    LDC is very pleased to announce that we have recently distributed our 15,000th corpus.  The recipient of this corpus is the University of Hawai'i, Manoa.  The University of Hawai'i, Manoa is a new member of the LDC, having joined for the first time for Membership Year 2002.

    The member unit is the Language Analysis and Experimentation Laboratories.

    In appreciation of our members, whose support ensures the LDC's mission is realized, and in light of our 10th anniversary, the LDC has extended a free membership for Membership Year 2003 to the University of Hawai'i. We would like to thank the University of Hawai'i for accepting this offer and our entire membership for your continued assistance.


    LDC Celebrates its Tenth Anniversary - February 25, 2002


    Since the Linguistic Data Consortium was founded at the University of Pennsylvania in 1992, we have grown from one employee to 27, and from a desk in the corner of the Phonetics Lab to a floor of a building in the University City Science Center. We have published more than 210 linguistic databases, and distributed more than 15,000 copies to research institutions around the world. With support from DARPA, NSF and hundreds of LDC members, customers and data providers, our mission has grown to include linguistic data collection and annotation for an increasing number of areas of language research and engineering, as well as the development of language-related standards and tools. As researchers extend the frontiers of language-related science, technology and education, we renew our commitment to providing the linguistic resources they need.




Contact ldc@ldc.upenn.edu
© 1992-2010 Linguistic Data Consortium, University of Pennsylvania. All Rights Reserved.