NIST/USF Evaluation Resources for the VACE Program - Meeting Data Test Set
Part 2, Linguistic Data Consortium (LDC) catalog number LDC2011V04 and isbn
1-58563-585-5, was developed by researchers at the Department
of Computer Science and Engineering, University of South Florida (USF),
Tampa, Florida and the Multimodal Information
Group at the National Institute of Standards and Technology (NIST). It contains
approximately thirteen hours of meeting room video data collected in 2001 and
2002 at NISTs Meeting Data Collection Laboratory and used in the VACE (Video
Analysis and Content Extraction) 2005 evaluation.
The VACE program was established to develop novel algorithms for automatic
video content extraction, multi-modal fusion, and event understanding. During
VACE Phases I and II, the program made significant progress in the automated
detection and tracking of moving objects including faces, hands, people, vehicles
and text in four primary video domains: broadcast news, meetings, street surveillance,
and unmanned aerial vehicle motion imagery. Initial results were also obtained
on automatic analysis of human activities and understanding of video sequences.
Three performance evaluations were conducted under the auspices of the VACE
program between 2004 and 2007. The 2005 evaluation was administered by
USF in collaboration with NIST and guided by an advisory forum including the
LDC has previously released NIST/USF Evaluation Resources for the VACE Program -- Meeting Data Training Set Part 1 LDC2011V01, NIST/USF Evaluation Resources for the VACE Program -- Meeting Data Training Set Part 2 LDC2011V02 and NIST/USF Evaluation Resources for the VACE Program -- Meeting Data Test Set Part 1 LDC2011V03.
NISTs Meeting Data Collection Laboratory is designed to collect corpora to support research, development and evaluation in meeting recognition technologies. It is equipped to look and sound like a conventional meeting space. The data collection facility includes five Sony EV1-D30 video cameras, four of which have stationary views of a center conference table (one view from each surrounding wall) with a fixed focus and viewing angle, and an additional floating camera which is used to focus on particular participants, whiteboard or conference table depending on the meeting forum. The data is captured in a NIST-internal file format. The video data was extracted from the NIST format and encoded using the MPEG-2 standard in NTSC format. Further information concerning the video data parameters can found in the documentation included with this corpus. Note: due to a last moment update, the file lists on the published media are inaccurate. For up to date lists, please see the online documentation for this corpus.
For an example of the data in this corpus, please view this short video sample (mpeg format).
Portions © 2011 Trustees of the University of Pennsylvania