Obtaining DataUsing DataProviding DataCreating Data
About LDCMembersCatalogProjectsPapersLDC OnlineSearchContact UsUPennHome

LDC Catalog | By Type and Source | By Year | Top Ten | Projects | Catalog Search



TRECVID 2005 Keyframes & Transcripts

Item Name: TRECVID 2005 Keyframes & Transcripts
Authors: Peter Wilkins, Christian Petersohn, Kevin Walker
LDC Catalog No.: LDC2007V01
ISBN: 1-58563-437-9
Release Date: Mar 16, 2007
Data Type: video, text
Data Source(s): broadcast news
Project(s): TDT, TREC
Language(s): English, Mandarin Chinese, Modern Standard Arabic
Language ID(s): arb, eng
Distribution: 1 DVD
Member fee: $0 for 2007 members
Non-member Fee: US$500.00
Reduced-License Fee: US$250.00
Extra-Copy Fee: US$200.00
Non-member License: yes
Online documentation: yes
Licensing Instructions: Subscription Members, Standard Members, Non-Members
Citation: Peter Wilkins, Christian Petersohn, Kevin Walker
2007
TRECVID 2005 Keyframes & Transcripts
Linguistic Data Consortium, Philadelphia

Introduction

This file contains documentation for TRECVID 2005 Keyframes & Transcripts, Linguistic Data Consortium (LDC) catalog number LDC2007V01 and isbn 1-58563-437-9.

TREC Video Retrieval Evaluation (TRECVID) is sponsored by the National Institute of Standards and Technology (NIST) to promote progress in content-based retrieval from digital video via open, metrics-based evaluation. The keyframes in this release were extracted for use in the NIST TRECVID 2005 Evaluation.

TRECVID is a laboratory-style evaluation that attempts to model real world situations or significant component tasks involved in such situations. In 2005 there were four main tasks with associated tests:

  • shot boundary determination
  • low-level feature extraction
  • high-level feature extraction
  • search (interactive, manual, and automatic)

For a detailed description of the TRECVID Evaluation Tasks, please refer to the NIST TRECVID 2005 Evaluation Description.

Data

The source data is Arabic, Chinese and English language broadcast programming collected in November 2004 from the following sources: Lebanese Broadcasting Corp. (Arabic); China Central TV and New Tang Dynasty TV (Chinese); and CNN and MSNBC/NBC (English).

Shots are fundamental units of video, useful for higher-level processing. To create the master list of shots, the video was segmented. The results of this pass are called subshots. Because the master shot reference is designed for use in manual assessment, a second pass over the segmentation was made to create the master shots of at least 2 seconds in length. These master shots are the ones used in submitting results for the feature and search tasks in the evaluation. In the second pass, starting at the beginning of each file, the subshots were aggregated, if necessary, until the currrent shot was at least 2 seconds in duration, at which point the aggregation began anew with the next subshot.

The keyframes were selected by going to the middle frame of the shot boundary, then parsing left and right of that frame to locate the nearest I-Frame. This then became the keyframe and was extracted. Keyframes have been provided at both the subshot (NRKF) and master shot (RKF) levels.

In a small number of cases (all of them subshots) there was no I-Frame within the subshot boundaries. When this occured, the middle frame was selected. There is one anomaly: at the end of the first video in the test collection, a subshot occurs outside a master shot.)

The emphasis in the common shot boundary reference is on the shots, not the transitions. The shots are contiguous. There are no gaps between them. They do not overlap. The media time format is based on the Gregorian day time (ISO 8601) norm. Fractions are defined by counting pre-specified fractions of a second.

Samples

The Keyframe below is a sample of the data contained in this corpus.

For information about this frame, please examine this annotation file.

Content Copyright

Portions © 2004 Cable News Network, LP, LLLP, © 2004 China Central TV, © 2004 National Broadcasting Company, Inc., © 2004 New Tang Dynasty TV, © 2004 PAC, Ltd., © 2004, 2005, 2007 Trustees of the University of Pennsylvania


About LDC | Members | Catalog | Projects | Papers | LDC Online | Search / Help | Contact Us | UPenn | Home | Obtaining Data | Creating Data | Using Data | Providing Da ta

Contact: ldc@ldc.upenn.edu

(c) 1992-2008 Linguistic Data Consortium, University of Pennsylvania. All Rights Reserved.