LIST: low illumination scene text detector with automatic feature enhancement

被引：5

作者：

Liu, Hang ^{[1
,2
]}

Yuan, Mengke ^{[3
,4
]}

Wang, Tong ^{[1
,2
]}

Ren, Peiran ^{[5
]}

Yan, Dong-Ming ^{[3
,4
]}

机构：

[1] Minist Educ, Engn Res Ctr Digitized Text & Apparel Technol, Shanghai 201620, Peoples R China

[2] Donghua Univ, Coll Informat Sci & Technol, Shanghai 201620, Peoples R China

[3] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing, Peoples R China

[4] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China

[5] Alibaba Grp, Hangzhou, Zhejiang, Peoples R China

来源：

VISUAL COMPUTER | 2022年 / 38卷 / 9-10期

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

Low illumination; Scene image text detection; Data synthesis; Feature enhancement;

D O I：

10.1007/s00371-022-02570-7

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Low illumination, under which discriminative clues are buried in the captured images, is an under-investigated but noteworthy issue in wild scene text detection. Existing deep learning approaches suffer from the scarcity of training data and illumination-sensitive feature representation. To address these issues, we propose a Low Illumination Scene Text (LIST) Detector training with authentic synthetic data and integrating dedicated feature enhancement modules. Specifically, we adopt a lightweight and non-reference low-light scene text image synthesis network to acquire adequate training data through pixel-wisely adjusting the dynamic range curve of normal-light images. Moreover, illumination invariant feature representation is learned through dual path feature extraction stem with intensity adjusted inputs, and feature fusion branch with automatically designed fusion cell. In the end, the enhanced feature is fed into a segmentation-based layer to localize arbitrary shape text instances. We construct a labeled real-world scene text image dataset called "DarkText" and conduct extensive experiments to validate the advantages of our proposed framework over state-of-the-art competitors.

引用

页码：3231 / 3242

页数：12

共 37 条

[1]

[Anonymous], 2017, ARXIV171202170

[2] Total-Text: A Comprehensive Dataset for Scene Text Detection and Recognition [J].

Ch'ng, Chee Kheng ;

Chan, Chee Seng .

2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, :935-942

[3]

Deng D, 2018, AAAI CONF ARTIF INTE, P6773

[4]

Epshtein B, 2010, PROC CVPR IEEE, P2963, DOI 10.1109/CVPR.2010.5540041

[5] Zero-Reference Deep Curve Estimation for Low-Light Image Enhancement [J].

Guo, Chunle ;

Li, Chongyi ;

Guo, Jichang ;

Loy, Chen Change ;

Hou, Junhui ;

Kwong, Sam ;

Cong, Runmin .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :1777-1786

[6] Efficient Contrast Enhancement Using Adaptive Gamma Correction With Weighting Distribution [J].

Huang, Shih-Chia ;

Cheng, Fan-Chieh ;

Chiu, Yi-Sheng .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (03) :1032-1041

[7] EnlightenGAN: Deep Light Enhancement Without Paired Supervision [J].

Jiang, Yifan ;

Gong, Xinyu ;

Liu, Ding ;

Cheng, Yu ;

Fang, Chen ;

Shen, Xiaohui ;

Yang, Jianchao ;

Zhou, Pan ;

Wang, Zhangyang .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :2340-2349

[8]

Karatzas D, 2015, PROC INT CONF DOC, P1156, DOI 10.1109/ICDAR.2015.7333942

[9] RETINEX THEORY OF COLOR-VISION [J].

LAND, EH .

SCIENTIFIC AMERICAN, 1977, 237 (06) :108-&

[10]

Liao MH, 2020, AAAI CONF ARTIF INTE, V34, P11474

← 1 2 3 4 →