Fruit freshness detection based on multi-task convolutional neural network

被引:3
作者
Zhang, Yinsheng [1 ]
Yang, Xudong [2 ]
Cheng, Yongbo [3 ]
Wu, Xiaojun [4 ]
Sun, Xiulan [5 ]
Hou, Ruiqi [1 ]
Wang, Haiyan [1 ]
机构
[1] Zhejiang Gongshang Univ, Zhejiang Food & Drug Qual & Safety Engn Res Inst, Hangzhou 310018, Peoples R China
[2] Zhejiang Gongshang Univ, Sch Management & E business, Hangzhou 310018, Peoples R China
[3] Nanjing Univ Finance & Econ, Sch Management Sci & Engn, Nanjing 210023, Peoples R China
[4] Jiangnan Univ, Inst Sci & Technol, Wuxi 214122, Peoples R China
[5] Jiangnan Univ, Sch Food Sci & Technol, Wuxi 214122, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi -task learning; Depthwise separable convolution; Fruit freshness; Convolutional neural network; CLASSIFICATION;
D O I
10.1016/j.crfs.2024.100733
中图分类号
TS2 [食品工业];
学科分类号
0832 ;
摘要
Background: Fruit freshness detection by computer vision is essential for many agricultural applications, e.g., automatic harvesting and supply chain monitoring. This paper proposes to use the multi-task learning (MTL) paradigm to build a deep convolutional neural work for fruit freshness detection. Results: We design an MTL model that optimizes the freshness detection (T1) and fruit type classification (T2) tasks in parallel. The model uses a shared CNN (convolutional neural network) subnet and two FC (fully connected) task heads. The shared CNN acts as a feature extraction module and feeds the two task heads with common semantic features. Based on an open fruit image dataset, we conducted a comparative study of MTL and single-task learning (STL) paradigms. The STL models use the same CNN subnet with only one specific task head. In the MTL scenario, the T1 and T2 mean accuracies on the test set are 93.24% and 88.66%, respectively. Meanwhile, for STL, the two accuracies are 92.50% and 87.22%. Statistical tests report significant differences between MTL and STL on T1 and T2 test accuracies. We further investigated the extracted feature vectors (semantic embeddings) from the two STL models. The vectors have an averaged 0.7 cosine similarity on the entire dataset, with most values lying in the 0.6-0.8 range. This indicates a between-task correlation and justifies the effectiveness of the proposed MTL approach. Conclusion: This study proves that MTL exploits the mutual correlation between two or more relevant tasks and can maximally share their underlying feature extraction process. we envision this approach to be extended to other domains that involve multiple interconnected tasks.
引用
收藏
页数:9
相关论文
共 29 条
[1]   FruitQ: a new dataset of multiple fruit images for freshness evaluation [J].
Abayomi-Alli, Olusola O. O. ;
Damasevicius, Robertas ;
Misra, Sanjay ;
Abayomi-Alli, Adebayo .
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (04) :11433-11460
[2]   Multi-task learning model for agricultural pest detection from crop-plant imagery: A Bayesian approach [J].
Amrani, Abderraouf ;
Diepeveen, Dean ;
Murray, David ;
Jones, Michael G. K. ;
Sohel, Ferdous .
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2024, 218
[3]   DCDS-Net: Deep transfer network based on depth-wise separable convolution with residual connection for diagnosing gastrointestinal diseases [J].
Asif, Sohaib ;
Zhao, Ming ;
Tang, Fengxiao ;
Zhu, Yusen .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 90
[4]  
Ba J, 2014, ACS SYM SER
[5]   Supervised classification of bruised apples with respect to the time after bruising on the basis of hyperspectral imaging data [J].
Baranowski, Piotr ;
Mazurek, Wojciech ;
Pastuszka-Wozniak, Joanna .
POSTHARVEST BIOLOGY AND TECHNOLOGY, 2013, 86 :249-258
[6]   Complexity of training ReLU neural network [J].
Boob, Digvijay ;
Dey, Santanu S. ;
Lan, Guanghui .
DISCRETE OPTIMIZATION, 2022, 44
[7]   Xception: Deep Learning with Depthwise Separable Convolutions [J].
Chollet, Francois .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1800-1807
[8]   Cosine similarity as a sample size-free measure to quantify phase clustering within a single neurophysiological signal [J].
Chou, Elizabeth P. ;
Hsu, Shen-Mou .
JOURNAL OF NEUROSCIENCE METHODS, 2018, 295 :111-120
[9]  
Howard AG, 2017, Arxiv, DOI [arXiv:1704.04861, 10.48550/arXiv.1704.04861]
[10]  
Glorot X., 2010, P 13 INT C ART INT S, P249, DOI DOI 10.1109/LGRS.2016.2565705