A Path Optimization Method Based on Multiple Information Fusion for Handwritten Uyghur Character Segmentation

(Original title: 手写维文字符分割中的多信息融合路径寻优方法)

Abstract: Character segmentation is a key step in handwritten Uyghur recognition, but connected (cursive) writing and stroke drift make it difficult. This paper proposes a character segmentation algorithm based on path optimization with multiple information fusion. The strokes of a word image are extracted, split, and clustered to over-segment the image into main sections and affix sections. Fuzzy section matching then yields a robust sequence of primitives, which suppresses the interference caused by stroke drift. Section matching information is estimated from a Gaussian model of matching positions; character recognition information is obtained by converting the outputs of a single-character classifier into confidences; and word-level semantic information is gathered from word statistics. Using a second-order Markov language model of character sequences and the Bayes criterion, a weighted multiple-information fusion formula for the posterior probability of a word is derived. Searching for the optimal path over section matchings and primitive merges gives the best character segmentation. Experiments on a handwritten Uyghur sample set show that the proposed algorithm effectively improves the accuracy and stability of character segmentation.
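The abstract gives no formulas, so the following is only a minimal sketch of the general idea: over-segmented primitives are merged into candidate characters, each candidate carries a matching score and a recognition confidence, and a language model scores the character sequence. The three terms are fused as a weighted sum of log-probabilities and the best path is found by dynamic programming. Everything named here (`best_segmentation`, `candidates`, `bigram`, `weights`, `max_merge`) is hypothetical, and a first-order transition is used for brevity where the paper describes a second-order Markov model.

```python
import math
from typing import Callable, Dict, List, Optional, Tuple

# Hypothetical interfaces (assumptions, not taken from the paper):
#   candidates(i, j) -> {label: (match_prob, recog_conf)} for primitives [i, j)
#   bigram(prev, cur) -> transition probability of the character sequence
Candidates = Callable[[int, int], Dict[str, Tuple[float, float]]]
Bigram = Callable[[str, str], float]
Path = List[Tuple[int, int, str]]

EPS = 1e-12  # floor that keeps log() finite for zero probabilities


def best_segmentation(
    n: int,
    candidates: Candidates,
    bigram: Bigram,
    weights: Tuple[float, float, float] = (1.0, 1.0, 1.0),
    max_merge: int = 3,
) -> Optional[Tuple[float, Path]]:
    """Return (score, path) for the best merge of n primitives, or None."""
    w_match, w_rec, w_lang = weights
    # best[j][label] = (score, (i, prev_label)): best path covering the first
    # j primitives whose last character is `label` spanning [i, j).
    best: List[Dict[str, Tuple[float, Optional[Tuple[int, str]]]]] = [
        {} for _ in range(n + 1)
    ]
    best[0]["<s>"] = (0.0, None)  # start-of-word marker

    for j in range(1, n + 1):
        for i in range(max(0, j - max_merge), j):
            for label, (p_match, p_rec) in candidates(i, j).items():
                # Weighted fusion of matching and recognition information.
                local = (w_match * math.log(max(p_match, EPS))
                         + w_rec * math.log(max(p_rec, EPS)))
                for prev, (score, _) in best[i].items():
                    total = (score + local
                             + w_lang * math.log(max(bigram(prev, label), EPS)))
                    if label not in best[j] or total > best[j][label][0]:
                        best[j][label] = (total, (i, prev))

    if not best[n]:
        return None  # no sequence of candidates covers all primitives

    # Backtrack from the best final label to recover the segmentation.
    label = max(best[n], key=lambda k: best[n][k][0])
    final_score = best[n][label][0]
    path: Path = []
    j = n
    while j > 0:
        _, (i, prev) = best[j][label]
        path.append((i, j, label))
        j, label = i, prev
    path.reverse()
    return final_score, path
```

Each tuple in the returned path is `(start, end, label)`: the primitives in `[start, end)` are merged into one character. In practice the weights would have to be tuned on held-out data; the abstract does not say how the authors chose theirs.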

Authors: Xu Yamei (许亚美), Lu Zhaoyang (卢朝阳), Li Jing (李静), Yao Chao (姚超)

Affiliation: State Key Laboratory of Integrated Services Networks, Xidian University (西安电子科技大学综合业务网理论及关键技术国家重点实验室)

Source: Journal of Xi'an Jiaotong University (《西安交通大学学报》), indexed in EI, CAS, CSCD, and the Peking University Core Journals list; 2013, No. 8, pp. 68-73, 86 (7 pages)

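Funding: National Natural Science Foundation of China (60872141); Fundamental Research Funds for the Central Universities (K50510010007); Huawei Science and Technology Fund (HITC2011023)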

Keywords: information processing technology; handwriting recognition; character segmentation; Uyghur language; multiple information fusion

Classification: TP391.4 [Automation and Computer Technology: Computer Application Technology]
