PaLmTac: A Vision-Based Tactile Sensor Leveraging Distributed-Modality Design and Modal-Matching Recognition for Soft Hand Perception

被引：6

作者：

Zhang, Shixin ^{[1
,2
]}

Yang, Yiyong ^{[1
]}

Shan, Jianhua ^{[3
]}

Sun, Fuchun ^{[4
]}

Xue, Hongxiang ^{[5
]}

Fang, Bin ^{[2
]}

机构：

[1] China Univ Geosci, Sch Engn & Technol, Beijing 100083, Peoples R China

[2] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China

[3] Anhui Univ Technol, Anhui Prov Key Lab Special Heavy Load Robot, Maanshan 243002, Peoples R China

[4] Tsinghua Univ, Inst Artificial Intelligence, Beijing Natl Res Ctr Informat Sci & Technol, Dept Comp Sci & Technol,State Key Lab Intelligent, Beijing 100084, Peoples R China

[5] Fudan Univ, Inst Engn & Appl Technol, Shanghai 200433, Peoples R China

来源：

IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING | 2024年 / 18卷 / 03期

基金：

中国国家自然科学基金;

关键词：

Level recognition mechanism; regional recognition mechanism; soft hand; vision-based tactile sensor; OBJECT DETECTION; PRESSURE;

D O I：

10.1109/JSTSP.2024.3386070

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This paper proposes a vision-based tactile sensor (VBTS) embedded into the soft hand palm, named PaLmTac. We adopt a distributed modality design instead of overlaying function layers. On the one hand, the problem of unrelated modality integration (texture and temperature) is solved. On the other hand, combining regional recognition can avoid mixed unrelated information. Herein, a Level-Regional Feature Extraction Network (LRFE-Net) is presented to match the modality design. We leverage feature mapping, regional convolution, and regional vectorization to construct the regional recognition mechanism, which can extract features in parallel and control fusion degrees. The level recognition mechanism balances the learning difficulty of each modality. Compared with the existing VBTSs, the PaLmTac optimizes unrelated modality integration and reduces fusion interference. This paper provides a novel idea of multimodal VBTS design and sensing mechanism, which is expected to be applied to human-computer interaction scenarios based on multimodal fusion.

引用

页码：288 / 298

页数：11