TWLip: Exploring Through-Wall Word-Level Lip Reading Based on Coherent SISO Radar

被引：0

作者：

Zhu, Dongsheng ^{[1
]}

Han, Chong ^{[1
,2
]}

Guo, Jian ^{[1
,2
]}

Sun, Lijuan ^{[1
,2
]}

机构：

[1] Nanjing Univ Posts & Telecommun, Coll Comp, Nanjing 210003, Peoples R China

[2] Nanjing Univ Posts & Telecommun, Jiangsu High Technol Res Key Lab Wireless Sensor N, Nanjing 210023, Peoples R China

来源：

IEEE INTERNET OF THINGS JOURNAL | 2024年 / 11卷 / 19期

基金：

中国国家自然科学基金;

关键词：

Lips; Radar; Feature extraction; Radar signal processing; Target recognition; Radar detection; Speech recognition; Coherent radar; lip-reading recognition; neural networks; through-wall sensing;

D O I：

10.1109/JIOT.2024.3427329

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Recently emerged radio frequency-based lip-reading recognition technologies leverage their independence from lighting and penetration capabilities to expand the applications of lip-reading. Unlike visual-based lip-reading, this technology penetrates barriers, such as masks, glass, and wood to detect lip movements. However, the previous studies utilizing devices like millimeter-wave radar face limitations due to frequency and power consumption, which restrict the types of penetrable materials and application scenarios. Although low-frequency through-wall radar offers significant penetration capabilities, it has reduced sensitivity to small movements, posing a challenge for lip-reading recognition. Moreover, the high cost of radar equipment and the scarcity of commercially available devices hinder the technology's development. To address these challenges, we propose TWLip, a word-level lip-reading recognition system utilizing coherent single-input and single-output through-wall radar to detect tiny lip movements behind walls. We utilize I/Q 3-D curves derived from the radar signals corresponding to lip movements as the network input. These curves reflect the amplitude, frequency, and rotational characteristics of lip movements in the complex plane. Furthermore, we designed the IQResNet, built with 1-D preactivation residual bottleneck units, to extract and classify lip-movement features from I/Q 3-D curves. We propose a data-augmentation method for radar lip-reading to enhance model efficacy and generalizability. We created a through-wall radar lip-reading data set containing 20 words from eight volunteers, totaling 9583 samples. The TWLip demonstrated the ability to recognize these words through a 24 cm brick wall from two meters away with 88.51% accuracy, validating the algorithm's superiority through detailed comparative studies.

引用

页码：32310 / 32323

页数：14

共 8 条

[1] Collaborative Viseme Subword and End-to-End Modeling for Word-Level Lip Reading
Chen, Hang
Wang, Qing
Du, Jun
Wan, Gen-Shun
Xiong, Shi-Fu
Yin, Bao-Ci
Pan, Jia
Lee, Chin-Hui
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9358 - 9371
[2] Real-Time Through-Wall Multihuman Localization and Behavior Recognition Based on MIMO Radar
Wang, Changlong
Zhu, Dongsheng
Sun, Lijuan
Han, Chong
Guo, Jian
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[3] Indoor Target Localization Using Through-Wall Radar Based on the Time-Frequency Enhancement Algorithm
Peng, Yiqun
Ding, Minhao
Cao, Jiaxuan
Dongye, Guangxin
Ding, Yipeng
IEEE SENSORS JOURNAL, 2024, 24 (19) : 31357 - 31366
[4] Real-Time Through-Wall Multiperson 3-D Pose Estimation Based on MIMO Radar
Wang, Changlong
Zhu, Dongsheng
Sun, Lijuan
Han, Chong
Guo, Jian
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 11
[5] Through-Wall Movement Detection based on S-Band FMCW radar: An Experimental Assessment
Yarleque Medina, Manuel A.
Martinez Odiaga, Hansel J.
Alvarez Navarro, Sthefany
2018 IEEE MTT-S LATIN AMERICA MICROWAVE CONFERENCE (LAMC 2018), 2018,
[6] Automatic Detection of Word-Level Reading Errors in Non-native English Speech Based on ASR Output
Qin, Ying
Qian, Yao
Loukina, Anastassia
Lange, Patrick
Misra, Abhinav
Evanini, Keelan
Lee, Tan
2021 12TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2021,
[7] Advanced System Level Simulation of UWB Three-Dimensional Through-Wall Imaging Radar for Performance Limitation Prediction
Wang, Yazhou
Kuhn, Michael J.
Fathy, Aly E.
2010 IEEE MTT-S INTERNATIONAL MICROWAVE SYMPOSIUM DIGEST (MTT), 2010, : 165 - 168
[8] Anti-jamming property of Colpitts-based direct chaotic through-wall imaging radar
Liu, Li
Ma, Ruixia
Li, Jingxia
Zhang, Jianguo
Wang, Bingjie
JOURNAL OF ELECTROMAGNETIC WAVES AND APPLICATIONS, 2016, 30 (17) : 2268 - 2279

← 1 →