Augmented Behavioral Annotation Tools, with Application to Multimodal Datasets and Models: A Systematic Review

Cited by: 8
Authors
Watson, Eleanor [1]
Viana, Thiago [1]
Zhang, Shujun [1]
Affiliations
[1] Univ Gloucestershire, Sch Comp & Engn, The Pk, Cheltenham GL50 2RH, England
Keywords
machine learning; annotation; behavior; foundation models; named entity recognition; software
DOI
10.3390/ai4010007
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Annotation tools are an essential component in the creation of datasets for machine learning. They have evolved greatly since the turn of the century, and now commonly include collaborative features to divide labor efficiently, as well as automation to amplify human effort. Recent developments in machine learning models, such as Transformers, allow for training on very large and sophisticated multimodal datasets and enable generalization across domains of knowledge. These models also herald an increasing emphasis on prompt engineering to provide qualitative fine-tuning of the model itself, adding a novel, emerging layer of direct machine learning annotation. These capabilities enable machine intelligence to recognize, predict, and emulate human behavior with much greater accuracy and nuance, the lack of which has contributed to algorithmic injustice in previous techniques. However, the scale and complexity of the training data required for multimodal models present engineering challenges. Best practices for annotating large multimodal models in the safest and most ethical, yet efficient, manner have not been established. This paper presents a systematic literature review of crowd-augmented and machine-learning-augmented behavioral annotation methods to distill practices that may have value in multimodal implementations, cross-correlated across disciplines. Research questions were defined to provide an overview of the past evolution of augmented behavioral annotation tools in relation to the present state of the art. (Contains five figures and four tables.)
Pages: 128-171
Page count: 44