[1] Hewlett Packard Labs, Bangalore 560030, Karnataka, India
来源:
PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2
|
2004年
关键词:
D O I:
10.1109/ICPR.2004.1334196
中图分类号:
TP18 [人工智能理论];
学科分类号:
081104 ;
0812 ;
0835 ;
1405 ;
摘要:
In this paper Principal Component Analysis (PCA) is applied to the problem of Online Handwritten Character Recognition in the Tamil script. The input is a temporally ordered sequence of (x,y) pen coordinates corresponding to an isolated character obtained from a digitizer The input is converted into a feature vector of constant dimensions following smoothing and normalization. PCA is used to find the basis vectors of each class subspace and the orthogonal distance to the subspaces used for classification. Pre-clustering of the training data and modification of distance measure are explored to overcome some common problems in the traditional subspace method. In empirical evaluation, these PCA-based classification schemes are found to compare favorably with nearest neighbour classification.