期刊文献+

手写维文字符分割中的多信息融合路径寻优方法 被引量:2

A Path Optimization Method Based on Multiple Information Fusion for Handwritten Uyghur Character Segmentation
下载PDF
导出
摘要 针对维吾尔词书写粘连和手写笔画漂移等问题,提出一种基于多信息融合路径寻优的字符分割算法。利用笔画提取、切分和聚类,过分割单词图像得到主体和附加字段,通过字段模糊匹配获得鲁棒的字根序列描述,以抑制笔画漂移造成的干扰;由建立的匹配位置高斯模型来估算字段匹配信息,经对单字分类器输出进行置信度转换,从而得到字符识别信息,再运用数据统计获取单词语义信息;由构建的字符序列二阶Markov语言模型,基于Bayes准则,提出了单词后验概率的多信息加权融合计算方法,通过字段匹配及字根合并的路径寻优,可得到最佳字符分割结果。在手写维文样本库上的实验表明,所提算法能有效提升字符分割的准确率和稳定性。 Character segmentation is a key technique for Uyghur handwriting recognition, but cursive characters and the phenomenon of stroke drift make the segmentation difficult. A new character segmentation algorithm based on multiple information fusion is proposed to solve the problem. Strokes of a word are extracted, segmented and clustered to get two types of sections: main and affix. The robust oversegmentation primitive sequences are obtained using fuzzy section matching to reduce the interference from stroke drift. Then, the matching information is estimated by constructing a matching position Gaussian model. The recognition confidence is converted from character classifier outputs by confidence transformation, and the semantic information is obtained by word data statistics. A character sequences Markov model is presented and the formula to calculate the posterior probability of a word is derived based on the Bayes criterion. The optimal path and the optimal segmentation result are achieved by weighted multiple information fusion. Experiments show that the proposed algorithm can effectively improve the accuracy and stability of character segmentation.
出处 《西安交通大学学报》 EI CAS CSCD 北大核心 2013年第8期68-73,86,共7页 Journal of Xi'an Jiaotong University
基金 国家自然科学基金资助项目(60872141) 中央高校基本科研业务费专项资金资助项目(K50510010007) 华为科技基金资助项目(HITC2011023)
关键词 信息处理技术 手写文字识别 字符分割 维吾尔语 多信息融合 information processing technology handwriting recognition character segmentation Uyghur language multiple information fusion
  • 相关文献

参考文献3

二级参考文献20

  • 1Adnan Amin, and Jean F. Mari. Machine recognition and correction of printed Arabic text [J]. IEEE Transactions on Systems, Man and Cybernetics, 1989, 19(5):1300- 1306. 被引量:1
  • 2Katerin Romeo-Pakker, H. Miled, and Yves Lecourtier. A new approach for Latin / Arabic character segmentation [A]. Proceedings of the 3rd International Conference on Document Analysis and Recognition [C]. Montréal, Cana da, 1995, 874- 877. 被引量:1
  • 3H. Al-Muallim, and S. Yamaguchi. A method of recognition of Arabic cursive handwriting [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1987, 9(5):715- 722. 被引量:1
  • 4Anthony Cheung, Mohammed Bennamoun, and Neil W. Bergmann. An Arabic optical character recognition system using recognition-based segmentation [J]. Pattern Recognition, 2001, 34(2):215- 233. 被引量:1
  • 5Issam Bazzi, Richard Schwartz, and John Makhoul. An omnifont open-vocabulary OCR system for English and Arabic [J].IEEE Transactions on Pattern Analysis and Machine Intelligence, 1999, 21(6):495- 504. 被引量:1
  • 6Anniwear Ymin, and Yoshinao Aoki. On the segmentation of multi-font printed Uygur scripts [A]. Proceedings of the 13th International Conference on Pattern Recognition [C]. Vienna, Austria, 1996, 215- 219. 被引量:1
  • 7A. Zahour, B. Taconet, P. Mercy, and S. Ramdane. Arabic hand-written text-line extraction [A]. Proceedings of the 6th International Conference on Document Analysis and Recognition [C]. Seattle, USA, 2001, 281 - 285. 被引量:1
  • 8Gasser A. Auda, and Hazem Raafat. An automatic text reader using neural networks [A]. Proceedings of the Canadian Conference on Electrical and Computer Engineering [C]. Vancouver, BC Canada, 1993, 92- 95. 被引量:1
  • 9Ibrahim S. I. Abuhaiba, M. J. J. Holt, and S. Datta. Recognition of off-line cursive handwriting [J]. Computer Vision and Image Understanding, 1998, 71(1) :19- 38. 被引量:1
  • 10M.F. Bushofa, and M. Spann. Segmentation and recognition of Arabic characters by structural classification [J].Image and Vision Computing, 1997, 15(3): 167 - 179. 被引量:1

共引文献48

同被引文献33

  • 1Meisen Pan, Junbiao Yan, Zhenghong Xiao. An approach to tilt correction of vehicle license plate[A]. Proc. of 2007 International Conference on Mechatronics and Auto- mation[C]. 2007,271-275. 被引量:1
  • 2Singha C, Bhatiab N, Kaur A. Hough transform based fast skew detection and accurate skew correction methods [J]. Pattern Recognition, 2008,41 (12) : 3528-3546. 被引量:1
  • 3Yuan B,Tan C L. Convex hull based skew estimation[J]. Pattern Recognition, 2007,40(2) : 456-475. 被引量:1
  • 4Zhang X,Lin Z C,Sun F C,et al. Rectification of chinese characters as transform invariant low-rank textures[A]. Proc. of 2013 International Conference on Document Anal- ysis and Recognition (ICDAR) [C]. 2011,393-397. 被引量:1
  • 5ZHAO Yong-qiang,YANG Jing-xiang. Hyperspectral image denoising via sparse representation and low-rank con- straint[J]. IEEE Transactions on Geoscience & Remote Sensing, 2015,53(1) : 296-308. 被引量:1
  • 6YUAN Yan-Tang, HAO liang-Yuan, LUO qing-Li. Manifold based sparse representation for hyperspectral image classification [J]. IEEE Transactions on Geoscience & Remote Sensing, 2014,52(12) : 7606-7618. 被引量:1
  • 7Candes E,Braun N,Wakin M. et al. Macro[A]. Proc. of 2007 4th IEEE International Symposium on Biomedical Imaging[C]. 2007,976-979. 被引量:1
  • 8Liang X, Ren X, Ma Y, et al. Repairing sparse low-rank texture[A]. Proc. of European Conference on Computer Vision[C]. 2012,482-495. 被引量:1
  • 9Wright J,Ganesh A,Min K R,et al. Compressive principal component pursuit[J]. The New IMA Journal on Informa- tion and Inference,B013,2(1) :32-68. 被引量:1
  • 10彭义刚,索津莉,戴琼海,等从压缩传感到低秩矩阵恢复:理论与应用[J].自动化学报,2012,38(12):961-971. 被引量:1

引证文献2

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部