Current Projects

LDC is involved in a number of projects that support language-related education, research and technology development.

AIDA (Active Interpretation of Disparate Alternatives) (DARPA)

AIDA’s goal is to develop a multi-hypothesis semantic engine that generates explicit alternative interpretations of events, situations and trends from a variety of unstructured sources. LDC supports AIDA by collecting, creating and annotating multimodal linguistic resources in multiple languages.

CCU (Computational Cultural Understanding) (DARPA)

The goal of the CCU program is to create a cross-cultural understanding service to improve situational awareness and the ability to effectively interact with diverse international audiences. It seeks to develop natural language processing technologies that recognize, adapt to, and recommend how to operate within the emotional, social, and cultural norms that differ across societies, languages, and communities. LDC supports CCU by collecting, creating and annotating multimodal linguistic resources.

KAIROS (Knowledge-directed Artificial Intelligence Reasoning Over Schemas) (DARPA)

KAIROS seeks to develop a schema-based AI system that can identify complex events and bring them to the attention of users. It aims to understand complex events described in multimedia inputs by developing a semi-automated system that identifies, links, and temporally sequences their subsidiary elements, the participants involved, and the complex event type. LDC supports KAIROS by collecting, creating and annotating linguistic resources in multiple languages.

KMASS (Knowledge Management at Scale and Speed) (DARPA)

The KMASS program aims to research, develop, integrate, evaluate, and demonstrate underlying technology that will enable effective use of documented knowledge, acquisition of new knowledge as part of regular workflows, and application of useful knowledge when and where it is required and with the necessary granularity. LDC supports KMASS by collecting, creating and annotating multimodal linguistic resources focusing on the medical emergency and contingency operations domains.

LRE (Language Recognition Evaluation) (NIST)

LDC develops linguistic resources to support the NIST Language Recognition Evaluation (LRE) series.

NIEUW (Novel Incentives and Workflows in Linguistic Data Collection and Annotation) (NSF)

NIEUW is an LDC project supported by an NSF CISE Research Infrastructure planning grant. Its goal is to build a framework for developing multilingual language resources using crowdsourcing techniques that have proven effective in multiple scientific disciplines.

SRE (Speaker Recognition Evaluation) (NIST)

LDC develops linguistic resources to support the NIST Speaker Recognition Evaluation (SRE) series.