A language model using variable length tokens for open-vocabulary Hangul text recognition
Cited by: 1
Authors:
Ryu, SH [1]
Kim, JH [1]
Affiliation:
[1] Korea Adv Inst Sci & Technol, Div Comp Sci 373 1, Taejon 305701, South Korea
Keywords:
language model; character recognition; Hangul recognition; open-vocabulary; word recognition
DOI:
10.1016/j.patcog.2003.12.004
Chinese Library Classification:
TP18 [Artificial intelligence theory]
Subject classification codes:
081104; 0812; 0835; 1405
Abstract:
We propose a novel language model for Hangul text recognition. Without relying on prior linguistic knowledge in training, the proposed model learns variable-length Hangul character sequences, which serve as the elementary tokens of the Korean language, and their probabilities from the statistics of a raw text corpus. Experiments in handwritten Hangul recognition show that the proposed language model is effective in postprocessing recognition results. © 2003 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
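To make the idea of variable-length tokens concrete, the following is a minimal sketch and not the authors' method: the abstract does not describe the training procedure, so this example assumes a plain relative-frequency estimate over all substrings up to a fixed length, followed by a Viterbi search for the most probable split of a string into those tokens. The function names (`count_tokens`, `token_log_probs`, `best_segmentation`) and the `max_len` parameter are hypothetical and chosen only for illustration.

```python
import math
from collections import Counter


def count_tokens(corpus, max_len=3):
    """Count every substring of length 1..max_len in each line of the corpus.

    Assumption: candidate tokens are simply all short substrings; the paper's
    actual token-selection procedure is not given in the abstract.
    """
    counts = Counter()
    for line in corpus:
        for i in range(len(line)):
            for n in range(1, max_len + 1):
                if i + n <= len(line):
                    counts[line[i:i + n]] += 1
    return counts


def token_log_probs(counts):
    """Turn raw counts into log relative frequencies."""
    total = sum(counts.values())
    return {tok: math.log(c / total) for tok, c in counts.items()}


def best_segmentation(text, logp, max_len=3):
    """Viterbi search for the highest-scoring split of text into known tokens.

    Returns (tokens, log_score), or (None, -inf) if no split exists.
    """
    n = len(text)
    best = [-math.inf] * (n + 1)  # best[i]: best log score of text[:i]
    back = [0] * (n + 1)          # back[i]: start index of the last token
    best[0] = 0.0
    for end in range(1, n + 1):
        for start in range(max(0, end - max_len), end):
            tok = text[start:end]
            if tok in logp and best[start] + logp[tok] > best[end]:
                best[end] = best[start] + logp[tok]
                back[end] = start
    if best[n] == -math.inf:
        return None, -math.inf
    tokens = []
    end = n
    while end > 0:
        start = back[end]
        tokens.append(text[start:end])
        end = start
    return tokens[::-1], best[n]
```

In a postprocessing setting, a score of this kind could in principle be used to compare alternative candidate strings produced by a recognizer, preferring the candidate with the higher log score; how the paper combines language model and recognizer scores is not stated in the abstract.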