Movie scenes detection with MIGSOM based on shots semi-supervised clustering

被引:6
作者
Ayadi, Thouraya [1 ]
Ellouze, Mehdi [1 ]
Hamdani, Tarek M. [1 ]
Alimi, Adel M. [1 ]
机构
[1] Univ Sfax, Natl Engn Sch Sfax ENIS, Res Grp Intelligent Machines, REGIM, Sfax 3038, Tunisia
关键词
Unsupervised learning; Multilevel interior growing self-organizing map; Movie scenes detection; Video browsing; GROWING CELL STRUCTURES; SEGMENTATION; VIDEO; NETWORK;
D O I
10.1007/s00521-012-0930-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The segmentation into scenes helps users to browse movie archives and to select the interesting ones. In a given movie, we have two kinds of scenes: action scenes and non-action scenes. To detect action scenes, we rely on tempo features as motion and audio energy. However, to detect non-action scenes, we have to use the content information. In this paper, we present a new approach to detect non-action movie scenes. The main idea is the use of a new dynamic variant of the self-organizing maps called MIGSOM (Multilevel Interior Growing self-organizing maps) to detect agglomerations of shots in movie scenes. The originality of MIGSOM model lies in its architecture for evolving the structure of the network. The MIGSOM algorithm is generated by a growth process by adding nodes where it is necessary, whether from the boundaries or the interior of the map. In addition, the advantage of the proposed MIGSOM algorithm is their ability to find the best structure of the output space through the training process and to represent better the semantics of the data. Our system is tested on a varied database and compared to the classical SOM and others works. The obtained results show the merit of our approach in term of recall and precision rates and that our assumptions are well founded.
引用
收藏
页码:1387 / 1396
页数:10
相关论文
共 40 条
[1]   Dynamic self-organizing maps with controlled growth for knowledge discovery [J].
Alahakoon, D ;
Halgamuge, SK ;
Srinivasan, B .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2000, 11 (03) :601-614
[2]  
Amarasiri R., 2004, Fourth International Conference on Hybrid Intelligent Systems, P216, DOI 10.1109/ICHIS.2004.52
[3]  
[Anonymous], P 2 ANN IEEE INT C N
[4]  
[Anonymous], Pattern Recognition with Fuzzy Objective Function Algorithms
[5]  
Ayadi T., 2011, 2011 5th International Symposium on Computational Intelligence and Intelligent Informatics (ISCIII), P121, DOI 10.1109/ISCIII.2011.6069754
[6]  
Ayadi T, 2007, IEEE 6 INT C MACH LE, P397
[7]  
Ayadi T, 2010, IEEE SYS MAN CYBERN
[8]  
BLACKMORE J, 1993, 1993 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS 1-3, P450, DOI 10.1109/ICNN.1993.298599
[9]   A survey on the automatic indexing of video data [J].
Brunelli, R ;
Mich, O ;
Modena, CM .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 1999, 10 (02) :78-112
[10]   Movie scene segmentation using background information [J].
Chen, Liang-Hua ;
Lai, Yu-Chun ;
Liao, Hong-Yuan Mark .
PATTERN RECOGNITION, 2008, 41 (03) :1056-1065