Saliency-guided stairs detection on wearable RGB-D devices for visually impaired persons with Swin-Transformer

Times cited: 1
Authors
Zheng, Zhuowen [1]
He, Jiahui [1]
Gu, Jia [2]
Chen, Zhen [3]
Qin, Wenjian [1]
Affiliations
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518055, Peoples R China
[2] Adv Technol Inst Suzhou, Suzhou 215123, Peoples R China
[3] Nanchang Hangkong Univ, Nanchang 330063, Peoples R China
Keywords
Stairs detection; Visually impaired persons; Wearable RGB-D devices; Object detection; Stereo
DOI
10.1016/j.patrec.2023.11.022
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Accurate stair detection is crucial for people with visual impairment, as it can reduce the risk of unforeseen falls on stairs. Wearable RGB-D technology can assist blind and visually impaired individuals. However, existing stair detection algorithms for RGB-D images struggle with variations in stair material, texture, lighting, and direction. In this study, we propose a saliency-guided stair detection method based on the Swin-Transformer to address these challenges. First, saliency detection on RGB-D images is used to learn spatial information for fast stair localization. Second, a Swin-Transformer that incorporates key depth features of the stairs is used to overcome deficiencies in orientation detection. To evaluate the proposed method, we collected 3,290 RGB-D images of indoor and outdoor staircases. Experiments on this dataset show that the method achieves high detection accuracy.
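The two-stage idea in the abstract (localize with saliency, then classify the localized RGB-D region) can be illustrated with a minimal sketch. This is not the authors' implementation: the saliency map is assumed to be already computed by some model, the Swin-Transformer stage is omitted, and the function names `saliency_bbox` and `crop_rgbd` as well as the threshold value are hypothetical choices made only for illustration.

```python
import numpy as np


def saliency_bbox(saliency, threshold=0.5):
    """Bounding box (top, left, bottom, right) of pixels with saliency >= threshold.

    `saliency` is assumed to be a 2-D array with values in [0, 1].
    Returns None when no pixel reaches the threshold.
    """
    mask = saliency >= threshold
    if not mask.any():
        return None
    rows = np.flatnonzero(mask.any(axis=1))  # row indices containing salient pixels
    cols = np.flatnonzero(mask.any(axis=0))  # column indices containing salient pixels
    # bottom/right are exclusive so the box can be used directly in slices
    return int(rows[0]), int(cols[0]), int(rows[-1]) + 1, int(cols[-1]) + 1


def crop_rgbd(rgb, depth, box):
    """Crop an RGB image (H, W, 3) and a depth map (H, W) to `box`.

    The result is a single (h, w, 4) array with depth as the fourth channel,
    which a downstream classifier (hypothetically, a Swin-Transformer) could consume.
    """
    top, left, bottom, right = box
    rgb_crop = rgb[top:bottom, left:right].astype(np.float32)
    depth_crop = depth[top:bottom, left:right, None].astype(np.float32)
    return np.concatenate([rgb_crop, depth_crop], axis=-1)
```

Cropping to the salient region before classification is one simple way to use saliency as guidance; the paper may fuse saliency and depth features differently.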
Pages: 47-53
Number of pages: 7