članak: 1 od 1  
Computer Science and Information Systems / ComSIS
2010, vol. 7, br. 1, str. 85-98
jezik rada: engleski
članak
doi:10.2298/CSIS1001085S

Multi-video summarization using complex graph clustering and mining
(naslov ne postoji na srpskom)
aCollege of Computer Science, Zhejiang University, China
bZhejiang Radio & TV Group, China

e-adresa: jshao@cs.zju.edu.cn, dmjiang1985@163.com, wmr628@m

Sažetak

(ne postoji na srpskom)
Multi-video summarization is a great theoretical and technical challenge due to the wider diversity of topics in multi-video than singlevideo as well as the multi-modality nature of multi-video over multidocument. In this paper, we propose an approach to analyze both visual and textual features across a set of videos and to create a so-called circular storyboard composed of topic-representative keyframes and keywords. We formulate the generation of circular storyboard as a problem of complex graph clustering and mining, in which each separated shot from visual data and each extracted keyword from speech transcripts are first structured into a complex graph and grouped into clusters; hidden topics in the representative keyframes and keywords are then mined from clustered complex graph while at the same time maximizing the coverage of the summary over the original video set. We also design experiments to evaluate the effectiveness of our approach and the proposed approach shows a better performance than two other storyboard baselines.

Ključne reči

multi-video summarization; complex graph clustering and mining; circular storyboard

Reference

Chen, B.W., Wang, J.C., Wang, J.F. (2009) A novel video summarization based on mining the story-structure and semantic relations among concept entities. IEEE Tran. Multimedia, vol. 11, br. 2
Christel, M.G., Hauptmann, A.G., Lin, W.H., Chen, M.Y., Yang, J., Maher, B., Baron, R.V. (2008) Exploring the utility of fast-forward surrogates for BBC rushes. u: TRECVID Summarization Workshop, proceeding, Vancouver, British Columbia, Canada
Christel, M.G. (2006) Evaluation and user studies with respect to video summarization and browsing. u: IS&T/SPIE Symposium on Electronics Imaging, San Jose, CA
Deodhar, M., Ghosh, J., Gupta, G., Cho, H., Dhillon, I. (2009) A scalable framework for discovering coherent co-clusters in noisy data. u: ICML '09., 17
Ding, W. (1999) Multimodal surrogates for video browsing. u: Proc. ACM Digital Lib, Berkeley, CA, 85-93
Liu, Y.N., Wu, F., Zhuang, Y.T., Xiao, J. (2008) Active post-refined multi-modality video semantic concept detection with tensor representation. u: ACM Multimedia 2008, 91-100
Long, B., Zhang, Z.F., Yu, P.S. (2008) Clustering on complex graph. u: AAAI Conference on artificial intelligence, (23rd), proceeding
Odobez, J.M., Gatica-Perez, D. (2003) Video shot clustering using spectral methods. u: Int. workshop on Content-based multimedia indexing, (3rd), Rennes, France, 94-102
Wan, X.J., Yang, J.W. (2006) Improved affinity graph based multi-document summarization. u: Human language technology Conf. of the NAACL, proceeding, 181- 184
Wang, F., Merialdo, B. (2009) Multi-document video summarization. u: ICME 09, proceeding
Xia, D.Y., Wu, F., Zhang, X.Q., Zhuang, Y.T. (2008) Local and global approaches of affinity propagation clustering for large scale data. Journal of Zhejiang University Science, vol. 9 br. 10 str. 1373~1381
Ye, Z.Y., Wu, F. (2003) A robust fusion algorithm for shot boundary detection. Journal of Computer Aided Design and Computer Graphics (In Chinese with English Abstract), vol. 15, br. 11, 950-955. 23
Yeung, M.M., Yeo, B.L. (1997) Video visualization for compact presentation and fast browsing of pictorial content. IEEE Transactions on Circuits and Systems for Video Technology, 7(5): 771
Young, S., Everman, G., i dr. (2007) The HTK book (for HTK version 3. 4. http://htk.eng.cam.ac.uk