Current Projects

LDC is involved in a number of projects that support language-related education, research and technology development.

AIDA (Active Interpretation of Disparate Alternatives) (DARPA)

AIDA’s goal is to develop a multi-hypothesis semantic engine that generates explicit alternative interpretations of events, situations and trends from a variety of unstructured sources. LDC supports AIDA by collecting, creating and annotating multimodal linguistic resources in multiple languages.

Computational Cultural Understanding (DARPA)

The goal of the Computational Cultural Understanding program (CCU) is to create a cross-cultural understanding service to improve situational awareness and the ability to effectively interact with diverse international audiences. It seeks to develop natural language processing technologies that recognize, adapt to, and recommend how to operate within the emotional, social, and cultural norms that differ across societies, languages, and communities. LDC supports CCU by collecting, creating and annotating multimodal linguistic resources.

Disaster and Refugee Relief Research

LDC is releasing Ukrainian data it developed in the DARPA AIDA program, the NIST Language Recognition Evaluation (LRE) series and the DARPA LORELEI program under a special no-cost, limited license for disaster and refugee relief research. 

KAIROS (Knowledge-directed Artificial Intelligence Reasoning Over Schemas) (DARPA)

KAIROS seeks to develop a schema-based AI system that can identify complex events and bring them to the attention of users. It aims to understand complex events described in multimedia inputs by developing a semi-automated system that identifies, links, and temporally sequences their subsidiary elements, the participants involved, as well as the complex event type. LDC supports KAIROS by collecting, creating and annotating linguistic resources in multiple languages.

LRE (Language Recognition Evaluation) (NIST)

LDC develops linguistic resources to support the NIST LRE series.

NIEUW (Novel Incentives and Workflows in Linguistic Data Collection and Annotation) (NSF)

NIEUW is an LDC project supported by an NSF CISE Research Infrastructure planning grant. The goal is to build a framework to develop multilingual language resources employing crowdsourcing techniques proven to work in multiple scientific disciplines. 

SRE (Speaker Recognition Evaluation) (NIST)

LDC develops linguistic resources to support the NIST Speaker Recognition Evaluation (SRE) series.