Movie scenes detection with MIGSOM based on shots semi-supervised clustering

被引:6
作者
Ayadi, Thouraya [1 ]
Ellouze, Mehdi [1 ]
Hamdani, Tarek M. [1 ]
Alimi, Adel M. [1 ]
机构
[1] Univ Sfax, Natl Engn Sch Sfax ENIS, Res Grp Intelligent Machines, REGIM, Sfax 3038, Tunisia
关键词
Unsupervised learning; Multilevel interior growing self-organizing map; Movie scenes detection; Video browsing; GROWING CELL STRUCTURES; SEGMENTATION; VIDEO; NETWORK;
D O I
10.1007/s00521-012-0930-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The segmentation into scenes helps users to browse movie archives and to select the interesting ones. In a given movie, we have two kinds of scenes: action scenes and non-action scenes. To detect action scenes, we rely on tempo features as motion and audio energy. However, to detect non-action scenes, we have to use the content information. In this paper, we present a new approach to detect non-action movie scenes. The main idea is the use of a new dynamic variant of the self-organizing maps called MIGSOM (Multilevel Interior Growing self-organizing maps) to detect agglomerations of shots in movie scenes. The originality of MIGSOM model lies in its architecture for evolving the structure of the network. The MIGSOM algorithm is generated by a growth process by adding nodes where it is necessary, whether from the boundaries or the interior of the map. In addition, the advantage of the proposed MIGSOM algorithm is their ability to find the best structure of the output space through the training process and to represent better the semantics of the data. Our system is tested on a varied database and compared to the classical SOM and others works. The obtained results show the merit of our approach in term of recall and precision rates and that our assumptions are well founded.
引用
收藏
页码:1387 / 1396
页数:10
相关论文
共 40 条
[31]   Data mining using rule extraction from Kohonen self-organising maps [J].
Malone, J ;
McGarry, K ;
Wermter, S ;
Bowerman, C .
NEURAL COMPUTING & APPLICATIONS, 2006, 15 (01) :9-17
[32]  
Oh JH, 2000, PROC SPIE, V3969, P254
[33]   Detection and representation of scenes in videos [J].
Rasheed, Z ;
Shah, M .
IEEE TRANSACTIONS ON MULTIMEDIA, 2005, 7 (06) :1097-1105
[34]  
Sundaram H, 2000, 2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, P1145, DOI 10.1109/ICME.2000.871563
[35]   Shot clustering techniques for story browsing [J].
Tavanapong, W ;
Zhou, JY .
IEEE TRANSACTIONS ON MULTIMEDIA, 2004, 6 (04) :517-527
[36]   Scene extraction in motion pictures [J].
Truong, BT ;
Venkatesh, S ;
Dorai, C .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2003, 13 (01) :5-15
[37]  
Wali A, 2010, LECT NOTES COMPUT SC, V6475, P110, DOI 10.1007/978-3-642-17691-3_11
[38]   Segmentation of video by clustering and graph analysis [J].
Yeung, M ;
Yeo, BL ;
Liu, B .
COMPUTER VISION AND IMAGE UNDERSTANDING, 1998, 71 (01) :94-109
[39]  
Yu Y, 2006, INT C COMP INT MOD C
[40]  
Zhao L., 2001, IEEE INT C MULT EXP, P1171