Current Projects | Linguistic Data Consortium

LDC is involved in a number of projects that support language-related education, research and technology development.

CCU (Computational Cultural Understanding) (DARPA)

The goal of the CCU program is to create a cross-cultural understanding service to improve situational awareness and the ability to effectively interact with diverse international audiences. It seeks to develop natural language processing technologies that recognize, adapt to, and recommend how to operate within the emotional, social, and cultural norms that differ across societies, languages, and communities. LDC supports CCU by collecting, creating and annotating multimodal linguistic resources.

LRE (Language Recognition Evaluation) (NIST)

LDC develops linguistic resources to support the NIST Language Recognition Evaluation (LRE) series.

SRE (Speaker Recognition Evaluation) (NIST)

LDC develops linguistic resources to support the NIST Speaker Recognition Evaluation (SRE) series.

U-GAIN Reading (Using Generative Artificial Intelligence for Reading R&D Center)

Sponsored by the US Department of Education and led by the nonprofit Digital Promise, U-GAIN Reading investigates how generative AI can improve elementary school reading instruction for English learners. U-GAIN builds on an existing research-based tutoring platform, Amira Learning, that is used by more than 1 million students each year. LDC, along with Penn’s Graduate School of Education and Department of Computer and Information Science, contributes expertise in computational linguistics, computer science, and learning analytics. Data developed in the project will be shared with the community through the LDC Catalog.