期刊文献+

中英文混排扭曲文本图像快速校正方法 被引量:1

A Fast Correcting Method for Warped Chinese and English Mixed Document Images
下载PDF
导出
摘要 针对OCR在识别文本图像时,由于扭曲造成的中英文混排文本图像识别率不理想的情况,提出一种快速扭曲校正方法。图像经过预处理后,首先利用形态学膨胀定位文本行,得到各文本行上下边界;分别对每个文本行参考垂直投影信息进行文字切分,获得字符包围盒;然后根据中英文的不同特点在每个文本行中逐个对字符位置进行校正,最终实现图像重构。实验结果表明,该方法校正速度快、精度高,对于中英文混排扭曲文档图像有较好地校正效果,校正后图像OCR识别率有明显提高。 Character recognition rate of OCR processing is not well for warped Chinese and English document image. To resolve this problem, a fast distortion correcting method is proposed in this paper. After the process of image preprocessing, the upper and lower boundary of each text line could be obtained by morphological dilation method. Then, the characters in each line are segmented one by one based on the vertical projection information. Every character can be described in a minimum bounding box. After that, the positions of the segmented characters are corrected according to the different structure characteristics between Chinese and English in each line. Finally, the image could be reconstructed. Experiments showed that this correction method could rectify the warped Chinese and English document image quickly and effectively. The OCR rate of the corrected images could be significantly improved.
出处 《图学学报》 CSCD 北大核心 2015年第6期920-925,共6页 Journal of Graphics
基金 国家自然科学基金资助项目(61371142)
关键词 中英文混排 扭曲文档图像 文本行提取 字符切分 mixture of chinese and english warped document images text line extraction character segmentation
  • 相关文献

参考文献13

  • 1Ghods A R, Mozaffari S, Ahmadpanahi F. Document image dewarping using kinect depth sensor [C]//Iranian Conference on Electrical Engineering (ICEE), 2013: 1-6. 被引量:1
  • 2Tong L J, Zhan G L, Peng Q Y, et al. Warped document image mosaicing method based on inflection point detection and registration [C]//International Conference on Multimedia Information Networking and Security (ICM1NES), 2012:306-310. 被引量:1
  • 3Meng G F, Pan C H, Xiang S M, et al. Metric rectification of curved document images [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2012, 34(4): 707-722. 被引量:1
  • 4Brown M S, Brent S W. Image restoration of arbitrarily warped documents [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2004, 26(10): 1295-1306. 被引量:1
  • 5Tang C Q, Dai X J. A rectification algorithm for distorted images from cone surface [C]//International Conference on Wireless Communications, Networking and Mobile Computing (WiCOM), 2010: 1-4. 被引量:1
  • 6杨玲,成运.应用经纬映射的鱼眼图像校正设计方法[J].工程图学学报,2010,31(6):19-22. 被引量:32
  • 7张伟业,赵群飞.读书机器人的版面分析及文字图像预处理算法[J].微型电脑应用,2011(1):58-61. 被引量:8
  • 8Liu H, Ding R W. Restoring chinese document images based on text boundary lines [C]//International Conference on Systems, Man and Cybernetics (ICSMC), 2009: 571-576. 被引量:1
  • 9Bukhari S S, Shafait F, Breuel T M. Coupled snakelets for curled text-line segmentation from warped document images [C]//International Journal on Document Analysis and Recognition(ICDAR), 2013: 748-752. 被引量:1
  • 10曾凡锋,王晓,吴飞飞.基于文本行重构的扭曲文档快速校正方法[J].计算机工程与设计,2014,35(2):573-577. 被引量:4

二级参考文献23

  • 1张伟业,赵群飞.读书机器人的版面分析及文字图像预处理算法[J].微型电脑应用,2011(1):58-61. 被引量:8
  • 2黄有度,苏化明.一种鱼眼图象到透视投影图象的变换模型[J].系统仿真学报,2005,17(1):29-32. 被引量:28
  • 3唐矫燕,赵群飞,杨汝清,吴心然.读书机器人机构设计[J].上海交通大学学报,2005,39(12):2025-2028. 被引量:12
  • 4张森,赵群飞,冶建科.一种数字图像几何畸变的自动校正方法[J].机电一体化,2007,13(3):60-64. 被引量:8
  • 5Brown M S, Seales W B. Image Restoration of Arbitrarily Warped Documents[J]. IEEE Transactions on Pattern Analysis and Machine/ntelligence, 2004, 26(10): 1295-1306. 被引量:1
  • 6Fu Bin, Wu Minghui, Li Rongfeng, et al. A Model-based Book Dewarping Method Using Text Line Detection[C]//Proc. of the 2nd International Workshop on Camera-based Document Analysis and Recognition. Curitiba, Brazil: [s. n.], 2007. 被引量:1
  • 7Zhang Zheng, Tan Chew Lira. Restoration of Images Scanned from Thick Bound Documents[C]//Proc. of 2001 International Conference on Image Processing. Thessaloniki, Greece: [s. n.], 2001. 被引量:1
  • 8Gatos B, Pratikakis I, Ntirogiannis K. Segmentation-based Recovery of Arbitrarily Warped Document Images[C]//Proc. of the 9th International Conference on Document Analysis and Recognition. Curifiba, Brazil:[s. n.], 2007. 被引量:1
  • 9Gatos B, Pratikakis I, Perantonis S J. Adaptive Degraded Document Image Binarization[J]. Pattern Recognition, 2006, 39(3): 317-327. 被引量:1
  • 10Ying X,Hu Z,Zha H.Fisheye lenses calibration using straight-line spherical perspective projection constraint[C] //ACCV,2006,Proc.of 7th Asian Conf.on Computer Vision.India:Hyderabad,ACCV,2006:591-600. 被引量:1

共引文献44

同被引文献1

引证文献1

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部