ViST: A Ubiquitous Model with Multimodal Fusion for Crop Growth Prediction

被引:2
作者
Li, Junsheng [1 ]
Wang, Ling [1 ]
Liu, Jie [1 ]
Tang, Jinshan [2 ]
机构
[1] Harbin Inst Technol, Dept Comp Sci & Technol, 92 Xidazhi St, Harbin 150000, Heilongjiang, Peoples R China
[2] George Mason Univ, Coll Publ Hlth, Dept Hlth Adm & Policy, Hlth Informat, Fairfax, VA 22033 USA
关键词
Crop growth prediction; ubiquitous model; multimodal learning; transformer module; cross-attention mechanism; NEURAL-NETWORK; VEGETATION INDEXES; GRAIN-YIELD; TRANSFORMER; SYSTEM;
D O I
10.1145/3627707
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Crop growth prediction can help agricultural workers to make accurate and reasonable decisions on farming activities. Existing crop growth prediction models focus on one crop and train a single model for each crop. In this article, we develop a ubiquitous growth prediction model for multiple crops, aiming at training a single model for multiple crops. A ubiquitous vision and sensor transformer (ViST) model for crop growth prediction with image and sensor data is developed to achieve the goals. In the proposed model, a cross-attention mechanism is proposed to facilitate the fusion of multimodal feature maps to reduce computational costs and balance the interactive effects among features. To train the model, we combine the data from multiple crops to create a single (ViST) model. A sensor network system is established for data collection on the farm where rice, soybean, and maize are cultivated. Experimental results show that the proposed ViST model has an excellent ubiquitous ability for crop growth prediction with multiple crops.
引用
收藏
页数:23
相关论文
共 62 条
[1]   Application of Artificial Neural Network for Predicting Maize Production in South Africa [J].
Adisa, Omolola M. ;
Botai, Joel O. ;
Adeola, Abiodun M. ;
Hassen, Abubeker ;
Botai, Christina M. ;
Darkey, Daniel ;
Tesfamariam, Eyob .
SUSTAINABILITY, 2019, 11 (04)
[2]  
Baltrusaitis T., 2018, The Handbook of Multimodal-Multisensor Interfaces: Signal Processing, Architectures, and Detection of Emotion and Cognition, V2, P17, DOI [10.1145/3107990.3107993, DOI 10.1145/3107990.3107993]
[3]   Multimodal Machine Learning: A Survey and Taxonomy [J].
Baltrusaitis, Tadas ;
Ahuja, Chaitanya ;
Morency, Louis-Philippe .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (02) :423-443
[4]   An overview of the crop model STICS [J].
Brisson, N ;
Gary, C ;
Justes, E ;
Roche, R ;
Mary, B ;
Ripoche, D ;
Zimmer, D ;
Sierra, J ;
Bertuzzi, P ;
Burger, P ;
Bussière, F ;
Cabidoche, YM ;
Cellier, P ;
Debaeke, P ;
Gaudillère, JP ;
Hénault, C ;
Maraux, F ;
Seguin, B ;
Sinoquet, H .
EUROPEAN JOURNAL OF AGRONOMY, 2003, 18 (3-4) :309-332
[5]   EFFECT OF CROP-ROTATION AND FERTILIZATION ON THE QUANTITATIVE RELATIONSHIP BETWEEN SPRING WHEAT YIELD AND MOISTURE USE IN SOUTHWESTERN SASKATCHEWAN [J].
CAMPBELL, CA ;
ZENTNER, RP ;
JOHNSON, PJ .
CANADIAN JOURNAL OF SOIL SCIENCE, 1988, 68 (01) :1-16
[6]   On the relation between NDVI, fractional vegetation cover, and leaf area index [J].
Carlson, TN ;
Ripley, DA .
REMOTE SENSING OF ENVIRONMENT, 1997, 62 (03) :241-252
[7]   HEU Emotion: a large-scale database for multimodal emotion recognition in the wild [J].
Chen, Jing ;
Wang, Chenhui ;
Wang, Kejun ;
Yin, Chaoqun ;
Zhao, Cong ;
Xu, Tao ;
Zhang, Xinyi ;
Huang, Ziqiang ;
Liu, Meichen ;
Yang, Tao .
NEURAL COMPUTING & APPLICATIONS, 2021, 33 (14) :8669-8685
[8]  
[陈仲新 Chen Zhongxin], 2016, [遥感学报, Journal of Remote Sensing], V20, P748
[9]   An end-to-end model for rice yield prediction using deep learning fusion [J].
Chu, Zheng ;
Yu, Jiong .
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2020, 174
[10]   Remote estimation of crop and grass chlorophyll and nitrogen content using red-edge bands on Sentinel-2 and-3 [J].
Clevers, J. G. P. W. ;
Gitelson, A. A. .
INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2013, 23 :344-351