Enhancing the identification accuracy of deep learning object detection using natural language processing

被引:1
作者
Ming-Fong Tsai
Hung-Ju Tseng
机构
[1] National United University,Department of Electronic Engineering
来源
The Journal of Supercomputing | 2021年 / 77卷
关键词
Natural language processing; Deep learning and object detection;
D O I
暂无
中图分类号
学科分类号
摘要
In recent years, object detection technology with artificial intelligence has been applied in many fields. This study uses a deep learning method to train an identification model to classify and browse pictures of the 600 different kinds of birds in Taiwan. To enhance the accuracy of identification and classification of these birds, we propose an automatic extraction system that can obtain training data by visiting public social media pages. We also develop mobile apps that allow users to take pictures of birds and upload them to an identification server to enable real-time identification and provide training data. These mobile apps are sent candidate bird pictures by the identification server to allow users to confirm and give feedback when the confidence level of identification is within a critical range. The bird pictures are then used as training data, and the identification model is periodically retrained to optimise the model. We also use natural language processing technology to enhance the level of confidence in image identification. The features of the birds’ appearance are described in words and candidate birds are obtained through image identification and used to readjust the adopted weight values. The proposed identification system gives a relatively high identification accuracy due to the use of deep learning object detection.
引用
收藏
页码:6676 / 6691
页数:15
相关论文
共 13 条
[1]  
Xiao D(2019)A target detection model based on improved tiny-Yolov3 under the environment of mining truck IEEE Access 7 123757-123764
[2]  
Shan F(2020)A cloud-based face video retrieval system with deep learning J Supercomput undefined undefined-undefined
[3]  
Li Z(2020)An end-to-end deep learning model for human activity recognition from highly sparse body sensor data in internet of medical things environment J Supercomput undefined undefined-undefined
[4]  
Le B(undefined)undefined undefined undefined undefined-undefined
[5]  
Liu X(undefined)undefined undefined undefined undefined-undefined
[6]  
Li X(undefined)undefined undefined undefined undefined-undefined
[7]  
Lin F(undefined)undefined undefined undefined undefined-undefined
[8]  
Ngo H(undefined)undefined undefined undefined undefined-undefined
[9]  
Dow C(undefined)undefined undefined undefined undefined-undefined
[10]  
Hassan M(undefined)undefined undefined undefined undefined-undefined