Very Deep Recurrent Convolutional Neural Network for Object Recognition

Cited by: 7
Authors
Brahimi, Sourour [1]
Ben Aoun, Najib [1, 2]
Ben Amar, Chokri [1]
Affiliations
[1] Univ Sfax, Natl Sch Engineers Sfax, REGIM Lab Res Grp Intelligent Machines, Sfax, Tunisia
[2] Al BAHA Univ, Coll Comp Sci & Informat Technol, Dept Comp Sci, Al Bahah, Saudi Arabia
Source
NINTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2016) | 2017 / Vol. 10341
Keywords
Object recognition; very deep convolutional neural network; recurrent convolutional neural network; recurrent connection layers; Soft-max
DOI
10.1117/12.2268672
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In recent years, computer vision has become a very active field. This field includes methods for processing, analyzing, and understanding images. The most challenging problems in computer vision are image classification and object recognition. This paper presents a new approach to the object recognition task. The approach builds on the success of the Very Deep Convolutional Neural Network for object recognition and improves its convolutional layers by adding recurrent connections. The proposed approach was evaluated on two object recognition benchmarks: Pascal VOC 2007 and CIFAR-10. The experimental results show the efficiency of the method in comparison with state-of-the-art methods.
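
The abstract does not give the exact formulation of the recurrent connections, so the following is only an illustrative sketch under stated assumptions. It assumes the commonly used recurrent convolutional layer form, in which the layer state at each step is the ReLU of a feed-forward convolution of the input plus a recurrent convolution of the previous state. The signal is 1-D and single-channel for brevity, no normalization is applied, and every function and parameter name is hypothetical rather than taken from the paper.

```python
def conv1d_same(signal, kernel):
    """Zero-padded 'same' 1-D cross-correlation with an odd-length kernel."""
    half = len(kernel) // 2
    out = []
    for i in range(len(signal)):
        acc = 0.0
        for k, w in enumerate(kernel):
            j = i + k - half
            if 0 <= j < len(signal):  # positions outside the signal count as zero
                acc += w * signal[j]
        out.append(acc)
    return out


def relu(values):
    """Element-wise rectified linear unit."""
    return [max(0.0, v) for v in values]


def recurrent_conv_layer(inputs, ff_kernel, rec_kernel, steps):
    """Assumed recurrent convolutional layer (not the paper's exact method).

    Step 0 is a plain convolutional layer. Each later step adds a
    convolution of the previous state to the same feed-forward response.
    """
    feedforward = conv1d_same(inputs, ff_kernel)  # computed once, reused every step
    state = relu(feedforward)
    for _ in range(steps):
        recurrent = conv1d_same(state, rec_kernel)
        state = relu([f + r for f, r in zip(feedforward, recurrent)])
    return state


if __name__ == "__main__":
    # Hypothetical toy values chosen only to make the recurrence visible.
    print(recurrent_conv_layer([1.0, 2.0, 3.0], [0.0, 1.0, 0.0], [0.0, 0.5, 0.0], steps=2))
```

With `steps=0` the function reduces to an ordinary convolution followed by ReLU, which is one way to see the recurrent connections as an extension of a standard convolutional layer: unrolling more steps lets each unit draw on a progressively wider neighbourhood without adding new weights.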
Pages: 5