ICDAR2017 Robust Reading Challenge on Multi-lingual Scene Text Detection and Script Identification - RRC-MLT

被引:302
作者
Nayef, Nibal [1 ]
Yin, Fei [2 ]
Bizid, Imen [2 ]
Choi, Hyunsoo [2 ,5 ]
Feng, Yuan [2 ]
Karatzas, Dimosthenis [2 ]
Luo, Zhenbo [2 ]
Pal, Umapada [2 ,6 ]
Rigaud, Christophe [2 ,5 ]
Chazalon, Joseph [3 ]
Khlif, Wafa [3 ]
Luqman, Muhammad Muzzamil [3 ]
Burie, Jean-Christophe [4 ]
Liu, Cheng-Lin [4 ]
Ogier, Jean-Marc [4 ]
机构
[1] Univ La Rochelle, L3i Lab, La Rochelle, France
[2] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing, Peoples R China
[3] Univ Autonoma Barcelona, Comp Vis Ctr, Barcelona, Spain
[4] Samsung R&D Inst China, Beijing, Peoples R China
[5] CD Digital Media & Commun R&D Ctr Samsung Elect, Seoul, South Korea
[6] Indian Stat Inst, CVPR Unit, Bengaluru, India
来源
2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1 | 2017年
基金
中国国家自然科学基金;
关键词
Scene Text Detection; Multi-lingual Text; Script Identification;
D O I
10.1109/ICDAR.2017.237
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text detection and recognition in a natural environment are key components of many applications, ranging from business card digitization to shop indexation in a street. This competition aims at assessing the ability of state-of-the-art methods to detect Multi-Lingual Text (MLT) in scene images, such as in contents gathered from the Internet media and in modern cities where multiple cultures live and communicate together. This competition is an extension of the Robust Reading Competition (RRC) which has been held since 2003 both in ICDAR and in an online context. The proposed competition is presented as a new challenge of the RRC. The dataset built for this challenge largely extends the previous RRC editions in many aspects: the multi-lingual text, the size of the dataset, the multi-oriented text, the wide variety of scenes. The dataset is comprised of 18,000 images which contain text belonging to 9 languages. The challenge is comprised of three tasks related to text detection and script classification. We have received a total of 16 participations from the research and industrial communities. This paper presents the dataset, the tasks and the findings of this RRC-MLT challenge.
引用
收藏
页码:1454 / 1459
页数:6
相关论文
共 10 条
[1]  
[Anonymous], 2017, ARXIV170308289
[2]  
[Anonymous], CORR
[3]   Improving patch-based scene text script identification with ensembles of conjoined networks [J].
Gomez, Lluis ;
Nicolaou, Anguelos ;
Karatzas, Dimosthenis .
PATTERN RECOGNITION, 2017, 67 :85-96
[4]   A fine-grained approach to scene text script identification [J].
Gomez, Lluis ;
Karatzas, Dimosthenis .
PROCEEDINGS OF 12TH IAPR WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, (DAS 2016), 2016, :192-197
[5]  
Karatzas D, 2015, PROC INT CONF DOC, P1156, DOI 10.1109/ICDAR.2015.7333942
[6]   ICDAR 2013 Robust Reading Competition [J].
Karatzas, Dimosthenis ;
Shafait, Faisal ;
Uchida, Seiichi ;
Iwamura, Masakazu ;
Gomez i Bigorda, Lluis ;
Robles Mestre, Sergi ;
Mas, Joan ;
Fernandez Mota, David ;
Almazan Almazan, Jon ;
Pere de las Heras, Lluis .
2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, :1484-1493
[7]  
Kumar D., 2013, MOCR 13
[8]  
Sharma N, 2015, PROC INT CONF DOC, P1196, DOI 10.1109/ICDAR.2015.7333950
[9]   Script identification in the wild via discriminative convolutional neural network [J].
Shi, Baoguang ;
Bai, Xiang ;
Yao, Cong .
PATTERN RECOGNITION, 2016, 52 :448-458
[10]   A Simple and Effective Solution for Script Identification in the Wild [J].
Singh, Ajeet Kumar ;
Mishra, Anand ;
Dabral, Pranav ;
Jawahar, C. V. .
PROCEEDINGS OF 12TH IAPR WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, (DAS 2016), 2016, :428-433