TWLip: Exploring Through-Wall Word-Level Lip Reading Based on Coherent SISO Radar

被引:0
作者
Zhu, Dongsheng [1 ]
Han, Chong [1 ,2 ]
Guo, Jian [1 ,2 ]
Sun, Lijuan [1 ,2 ]
机构
[1] Nanjing Univ Posts & Telecommun, Coll Comp, Nanjing 210003, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Jiangsu High Technol Res Key Lab Wireless Sensor N, Nanjing 210023, Peoples R China
来源
IEEE INTERNET OF THINGS JOURNAL | 2024年 / 11卷 / 19期
基金
中国国家自然科学基金;
关键词
Lips; Radar; Feature extraction; Radar signal processing; Target recognition; Radar detection; Speech recognition; Coherent radar; lip-reading recognition; neural networks; through-wall sensing;
D O I
10.1109/JIOT.2024.3427329
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently emerged radio frequency-based lip-reading recognition technologies leverage their independence from lighting and penetration capabilities to expand the applications of lip-reading. Unlike visual-based lip-reading, this technology penetrates barriers, such as masks, glass, and wood to detect lip movements. However, the previous studies utilizing devices like millimeter-wave radar face limitations due to frequency and power consumption, which restrict the types of penetrable materials and application scenarios. Although low-frequency through-wall radar offers significant penetration capabilities, it has reduced sensitivity to small movements, posing a challenge for lip-reading recognition. Moreover, the high cost of radar equipment and the scarcity of commercially available devices hinder the technology's development. To address these challenges, we propose TWLip, a word-level lip-reading recognition system utilizing coherent single-input and single-output through-wall radar to detect tiny lip movements behind walls. We utilize I/Q 3-D curves derived from the radar signals corresponding to lip movements as the network input. These curves reflect the amplitude, frequency, and rotational characteristics of lip movements in the complex plane. Furthermore, we designed the IQResNet, built with 1-D preactivation residual bottleneck units, to extract and classify lip-movement features from I/Q 3-D curves. We propose a data-augmentation method for radar lip-reading to enhance model efficacy and generalizability. We created a through-wall radar lip-reading data set containing 20 words from eight volunteers, totaling 9583 samples. The TWLip demonstrated the ability to recognize these words through a 24 cm brick wall from two meters away with 88.51% accuracy, validating the algorithm's superiority through detailed comparative studies.
引用
收藏
页码:32310 / 32323
页数:14
相关论文
共 8 条
  • [1] Collaborative Viseme Subword and End-to-End Modeling for Word-Level Lip Reading
    Chen, Hang
    Wang, Qing
    Du, Jun
    Wan, Gen-Shun
    Xiong, Shi-Fu
    Yin, Bao-Ci
    Pan, Jia
    Lee, Chin-Hui
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9358 - 9371
  • [2] Real-Time Through-Wall Multihuman Localization and Behavior Recognition Based on MIMO Radar
    Wang, Changlong
    Zhu, Dongsheng
    Sun, Lijuan
    Han, Chong
    Guo, Jian
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [3] Indoor Target Localization Using Through-Wall Radar Based on the Time-Frequency Enhancement Algorithm
    Peng, Yiqun
    Ding, Minhao
    Cao, Jiaxuan
    Dongye, Guangxin
    Ding, Yipeng
    IEEE SENSORS JOURNAL, 2024, 24 (19) : 31357 - 31366
  • [4] Real-Time Through-Wall Multiperson 3-D Pose Estimation Based on MIMO Radar
    Wang, Changlong
    Zhu, Dongsheng
    Sun, Lijuan
    Han, Chong
    Guo, Jian
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 11
  • [5] Through-Wall Movement Detection based on S-Band FMCW radar: An Experimental Assessment
    Yarleque Medina, Manuel A.
    Martinez Odiaga, Hansel J.
    Alvarez Navarro, Sthefany
    2018 IEEE MTT-S LATIN AMERICA MICROWAVE CONFERENCE (LAMC 2018), 2018,
  • [6] Automatic Detection of Word-Level Reading Errors in Non-native English Speech Based on ASR Output
    Qin, Ying
    Qian, Yao
    Loukina, Anastassia
    Lange, Patrick
    Misra, Abhinav
    Evanini, Keelan
    Lee, Tan
    2021 12TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2021,
  • [7] Advanced System Level Simulation of UWB Three-Dimensional Through-Wall Imaging Radar for Performance Limitation Prediction
    Wang, Yazhou
    Kuhn, Michael J.
    Fathy, Aly E.
    2010 IEEE MTT-S INTERNATIONAL MICROWAVE SYMPOSIUM DIGEST (MTT), 2010, : 165 - 168
  • [8] Anti-jamming property of Colpitts-based direct chaotic through-wall imaging radar
    Liu, Li
    Ma, Ruixia
    Li, Jingxia
    Zhang, Jianguo
    Wang, Bingjie
    JOURNAL OF ELECTROMAGNETIC WAVES AND APPLICATIONS, 2016, 30 (17) : 2268 - 2279