FedAWR: An Interactive Federated Active Learning Framework for Air Writing Recognition

被引:9
作者
Kong, Xiangjie [1 ]
Zhang, Wenyi [1 ]
Qu, Youyang [2 ]
Yao, Xinwei [1 ]
Shen, Guojiang [1 ]
机构
[1] Zhejiang Univ Technol, Coll Comp Sci & Technol, Hangzhou 310023, Peoples R China
[2] Australia Commonwealth Sci & Ind Res Org CSIRO, Data61, Clayton, Vic 3169, Australia
基金
中国国家自然科学基金;
关键词
Handwriting recognition; Atmospheric modeling; Data models; Federated learning; Computational modeling; Training; Performance evaluation; Active learning; air-writing; federated learning; text recognition; INTERNET;
D O I
10.1109/TMC.2023.3320147
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The rapid development of technology such as virtual reality and augmented reality, coupled with the reduced direct contact due to the COVID-19 pandemic, has led to the emergence of a more advanced mode of interaction: air handwriting. This new form of human-computer interaction allows users to input text by writing in the air freely. However, deploying and applying existing air handwriting recognition systems in real-world scenarios still presents challenges, particularly in real-time performance, privacy protection, and label scarcity. To address these challenges, we propose a federated active learning framework called FedAWR for air handwriting recognition tasks. FedAWR utilizes distributed learning to train a shared global model in the cloud from multiple user devices at the network's edge, while keeping the user's handwritten data local to ensure privacy. In addition, FedAWR employs an interactive active learning strategy to collect user-provided annotations for iterative training during the online federated learning process, bootstrapping personalized models for each client. To further enhance interactivity and real-time performance, we designed a lightweight recognition model, which is integrated into FedAWR. Finally, extensive experiments were conducted on real-world air handwritten datasets to validate the superiority of FedAWR.
引用
收藏
页码:6423 / 6436
页数:14
相关论文
共 45 条
[1]   What Is Wrong With Scene Text Recognition Model Comparisons? Dataset and Model Analysis [J].
Baek, Jeonghun ;
Kim, Geewook ;
Lee, Junyeop ;
Park, Sungrae ;
Han, Dongyoon ;
Yun, Sangdoo ;
Oh, Seong Joon ;
Lee, Hwalsuk .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :4714-4722
[2]  
Bard GregoryV., 2007, Proceedings of the fifth Australasian symposium on ACSW frontiers, V68, P117
[3]  
Beutel DJ, 2022, Arxiv, DOI arXiv:2007.14390
[4]   Rosetta: Large Scale System for Text Detection and Recognition in Images [J].
Borisyuk, Fedor ;
Gordo, Albert ;
Sivakumar, Viswanath .
KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, :71-79
[5]  
Cao YT, 2023, Arxiv, DOI arXiv:2211.13579
[6]   Air-Writing Recognition-Part I: Modeling and Recognition of Characters, Words, and Connecting Motions [J].
Chen, Mingyu ;
AlRegib, Ghassan ;
Juang, Biing-Hwang .
IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2016, 46 (03) :403-413
[7]  
Gan J, 2018, IEEE INT CON MULTI
[8]   In-air handwritten Chinese text recognition with temporal convolutional recurrent network [J].
Gan, Ji ;
Wang, Weiqiang ;
Lu, Ke .
PATTERN RECOGNITION, 2020, 97
[9]   In-air handwritten English word recognition using attention recurrent translator [J].
Gan, Ji ;
Wang, Weiqiang .
NEURAL COMPUTING & APPLICATIONS, 2019, 31 (07) :3155-3172
[10]   AirText: One-Handed Text Entry in the Air for COTS Smartwatches [J].
Gao, Yi ;
Zeng, Siyu ;
Zhao, Ji ;
Liu, Wenxin ;
Dong, Wei .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2023, 22 (05) :2506-2519