TRECVID 2006 Keyframes, Linguistic Data Consortium
(LDC) catalog id LDC2010V02 and isbn 1-58563-554-5, was developed as
a collaborative effort between researchers at LDC,
Dublin City University.
TREC Video Retrieval Evaluation (TRECVID) is sponsored by the
National Institute of Standards and Technology (NIST) to promote
progress in content-based retrieval from digital video via open,
metrics-based evaluation. The keyframes in this release were
extracted for use in the NIST TRECVID 2006 Evaluation.
TRECVID is a laboratory-style evaluation that attempts to model real
world situations or significant component tasks involved in such
situations. In 2006 TRECVID completed a 2-year cycle on English,
Arabic, and Chinese news video. The evalaution consisted of three
system tasks and associated tests:
shot boundary determination
high-level feature extraction
search (interactive, manually-assisted, and/or fully automatic)
The 2006 evaluation also included a rushes exploitation exploratory
task, but the material associated with that task is not included in
For a detailed description of the TRECVID Evaluation Tasks, please
refer to the
TRECVID 2006 Evaluation Description.
The video stills that compose this corpus are drawn from
approximately 158.6 hours of English, Arabic, and Chinese language
data collected by LDC from NBC ("NBC
Nightly News"), CNN ("Live From..",
"Anderson Cooper 360"), MSNBC
("MSNBC News live"), New Tang Dynsaty TV ("Economic
Frontier", "Focus Interactive"), Phoenix TV
("Good Morning China"),
Lebanese Broadcasting Corp. ("Naharkum Saiid", "News
on LBC"), Alhurra TV ("Alhurra News") and China
Central TV ("CCTV_News").
Shots are fundamental units of video, useful for higher-level
processing. To create the master list of shots, the video was
segmented. The results of this pass are called subshots. Because the
master shot reference is designed for use in manual assessment, a
second pass over the segmentation was made to create the master shots
of at least 2 seconds in length. These master shots are the ones to
be used in submitting results for the feature and search tasks. In
the second pass, starting at the beginning of each file, the subshots
were aggregated, if necessary, until the currrent shot was at least 2
seconds in duration, at which point the aggregation began anew with
the next subshot.
The keyframes were selected by going to the middle frame of the shot
boundary, then parsing left and right of that frame to locate the
nearest I-Frame. This then became the keyframe and was extracted.
Keyframes have been provided at both the subshot (NRKF) and master
shot (RKF) levels.
In a small number of cases (all of them subshots) there was no I-Frame
within the subshot boundaries. When this occurred, the middle frame
The emphasis in the common shot boundary reference is
on the shots, not the transitions. The shots are contiguous. There
are no gaps between them.
They do not overlap. The media time format is based on the Gregorian day
time (ISO 8601) norm. Fractions are defined by counting pre-specified
fractions of a second. In our case, the frame rate will likely be
29.97. One fraction of a second is thus specified as
The video id has the format of "XXX" and shot id
"shotXXX_YYY". The "XXX" is the sequence number
of video onto which the video file name is mapped this will be listed
in the "collection.xml" file. The "YYY" is the
sequence number of the shot. Keyframes are identified as by a suffix
"_RKF" for the main keyframe (one per shot) or
"_NKRF" for additional keyframes derived from subshots that
were merged so that shots have a minimum duration of 2 seconds.
Samples of data available in this corpus:
Keyframe (video still)
Shots metadata (mp7 markup)
No updates are available at this time.
Cable News Network, LP, LLLP, © 2005 China Central TV, ©
2005 National Broadcasting Company, Inc., © 2005 New Tang
Dynasty TV, © 2005 PAC, Ltd., © 2005 Phoenix
TV, © 2005, 2006, 2010 Trustees of the University of