TY - GEN
T1 - Robust video fingerprinting via structural graphical models
AU - Li, Mu
AU - Monga, Vishal
PY - 2012
Y1 - 2012
N2 - Applications of video fingerprinting range from traditional video retrieval and authentication to the more recent problem of anti-piracy search brought about by the emergence of video websites such as Youtube. Video fingerprints offer the potential of identifying in a robust and scalable manner - illegal or undesirable uploads of copyrighted video content. The principal challenge in video fingerprinting is to extract reduced dimensionality descriptors that can withstand incidental spatial and temporal distortions to the video while still allowing the discrimination of distinct videos. To address this fundamental problem, we propose to first represent a video as a graphical structure which can encode temporal relationships between video shots that are crucial to uniquely identifying the video. Next, we leverage ideas from graph theory, namely the normalized cuts graph partitioning method to divide the video representation into sub-graphs. Robust dimensionality reduction applied to these sub-graphs yields the final video hash/fingerprint. Experimental results in the form of receiver operating characteristic (ROC) curves on video databases acquired from YouTube reveal that the proposed video fingerprinting can enable a much more favorable robustness vs. discriminability trade-off over state-of-the art algorithms in video hashing.
AB - Applications of video fingerprinting range from traditional video retrieval and authentication to the more recent problem of anti-piracy search brought about by the emergence of video websites such as Youtube. Video fingerprints offer the potential of identifying in a robust and scalable manner - illegal or undesirable uploads of copyrighted video content. The principal challenge in video fingerprinting is to extract reduced dimensionality descriptors that can withstand incidental spatial and temporal distortions to the video while still allowing the discrimination of distinct videos. To address this fundamental problem, we propose to first represent a video as a graphical structure which can encode temporal relationships between video shots that are crucial to uniquely identifying the video. Next, we leverage ideas from graph theory, namely the normalized cuts graph partitioning method to divide the video representation into sub-graphs. Robust dimensionality reduction applied to these sub-graphs yields the final video hash/fingerprint. Experimental results in the form of receiver operating characteristic (ROC) curves on video databases acquired from YouTube reveal that the proposed video fingerprinting can enable a much more favorable robustness vs. discriminability trade-off over state-of-the art algorithms in video hashing.
UR - http://www.scopus.com/inward/record.url?scp=84875871449&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84875871449&partnerID=8YFLogxK
U2 - 10.1109/ICIP.2012.6466842
DO - 10.1109/ICIP.2012.6466842
M3 - Conference contribution
AN - SCOPUS:84875871449
SN - 9781467325332
T3 - Proceedings - International Conference on Image Processing, ICIP
SP - 249
EP - 252
BT - 2012 IEEE International Conference on Image Processing, ICIP 2012 - Proceedings
T2 - 2012 19th IEEE International Conference on Image Processing, ICIP 2012
Y2 - 30 September 2012 through 3 October 2012
ER -