Enhancing the identification accuracy of deep learning object detection using natural language processing

Cited by: 10
Authors
Tsai, Ming-Fong [1]
Tseng, Hung-Ju [1]
Affiliations
[1] Natl United Univ, Dept Elect Engn, Miaoli, Taiwan
Keywords
Natural language processing; Deep learning and object detection
DOI
10.1007/s11227-020-03525-2
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
In recent years, object detection technology with artificial intelligence has been applied in many fields. This study uses a deep learning method to train an identification model to classify and browse pictures of the 600 different kinds of birds in Taiwan. To enhance the accuracy of identification and classification of these birds, we propose an automatic extraction system that can obtain training data by visiting public social media pages. We also develop mobile apps that allow users to take pictures of birds and upload them to an identification server to enable real-time identification and provide training data. These mobile apps are sent candidate bird pictures by the identification server to allow users to confirm and give feedback when the confidence level of identification is within a critical range. The bird pictures are then used as training data, and the identification model is periodically retrained to optimise the model. We also use natural language processing technology to enhance the level of confidence in image identification. The features of the birds' appearance are described in words and candidate birds are obtained through image identification and used to readjust the adopted weight values. The proposed identification system gives a relatively high identification accuracy due to the use of deep learning object detection.
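The abstract does not say how the textual description readjusts the weights of the candidate birds, or what the critical confidence range is. The following is a minimal sketch under stated assumptions, not the authors' method: it assumes each candidate carries a set of appearance keywords, scores the description by keyword overlap, and blends that score linearly with the detector confidence. The names `Candidate`, `rerank`, `needs_user_feedback`, the `text_weight` parameter, and the 0.4 to 0.7 range are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    species: str
    confidence: float    # detector confidence in [0, 1]
    traits: frozenset    # hypothetical appearance keywords for this species


def rerank(candidates, description, text_weight=0.3):
    """Blend detector confidence with keyword overlap from a description.

    Assumption: a linear blend; the paper's actual weighting is not given
    in the abstract.
    """
    words = set(description.lower().split())
    scored = []
    for c in candidates:
        # Fraction of the species' keywords mentioned in the description.
        overlap = len(words & c.traits) / len(c.traits) if c.traits else 0.0
        score = (1.0 - text_weight) * c.confidence + text_weight * overlap
        scored.append((score, c.species))
    # Highest blended score first.
    return sorted(scored, reverse=True)


def needs_user_feedback(confidence, low=0.4, high=0.7):
    """Return True when confidence falls in the (assumed) critical range."""
    return low <= confidence <= high


if __name__ == "__main__":
    candidates = [
        Candidate("species_a", 0.55, frozenset({"blue", "long", "tail"})),
        Candidate("species_b", 0.50, frozenset({"red", "crest"})),
    ]
    for score, species in rerank(candidates, "red bird with crest"):
        print(f"{species}: {score:.3f}")
```

In this sketch a candidate that the detector ranks slightly lower can move ahead when the description matches its keywords, and `needs_user_feedback` marks the borderline cases that would be sent back to the user for confirmation.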
Pages: 6676-6691
Number of pages: 16