期刊文献+

基于最大梯度差的叠加文本定位 被引量:1

Graphics text detection with max gradient difference
下载PDF
导出
摘要 通过分析文本特征和背景,提出一种基于最大梯度差的叠加文本定位算法。首先获得横向和竖向两个方向的梯度图像,然后设定一个窗口扫描整个图像,分别计算窗口内的最大梯度差,得到两个方向的最大梯度差矩阵,然后分别通过自适应阈值算法找出疑似文本像素,再将两个方向的判决结果取交集,消除部分复杂背景造成的误判。接着利用数学形态学运算和先验知识剔除伪文本区。最后利用改进的穿越线算法精确定位文本。实验表明,本算法不仅对横向文本具有较高的查全率和较低的虚警率,并且对竖向文本也有较好的定位效果。 This paper proposed an algorithm with max gradient difference by analyzing the text feature. It firstly calculated the gradient of two direct, vertical and horizontal. And then it got the max gradient different matrix by calculating the max gradient difference in a window. Then, it took an adaptive threshold algorithm to determine the text pixels, and calculated the intersec- tion of two results in order to eliminate the influence of part of the complex background. It conducted mathematical morphology operation and prior knowledge to eliminate the false text area. Finally, it used the improved across-line algorithm for precise locating of text. Experiments show that this algorithm not only has higher recall ratio of transverse text, and also has good effect for vertical text.
出处 《计算机应用研究》 CSCD 北大核心 2014年第10期3173-3176,共4页 Application Research of Computers
基金 国家"863"计划资助项目(2011AA010603 2011AA010605)
关键词 最大梯度差 叠加文本 文本定位 穿越线算法 max gradient difference graphics text text detection across-line algorithm
  • 相关文献

参考文献9

  • 1ANTONACOPOULOS A, KARATZAS D.An anthropocentric approach to text extraction from WWW images[C]//Proc of the 4th IAPR Workshop on Document Analysis Systems.New York:ACM Press,2000:515-526. 被引量:1
  • 2ZHONG Yu, KARU K, JAIN A K.Locating text in complex color images[J].Pattern Recognition,1995,28(10):1523-1535. 被引量:1
  • 3LIU Chun-mei, WANG Chun-heng, DAI Ru-wei.Text detection in images based on unsupervised classification of edge-based features[C]//Proc of the 8th International Conference on Document Analysis and Recognition.[S.l.]:IEEE Press,2005:610-614. 被引量:1
  • 4YE Qi-xiang, HUANG Qing-ming, GAO Wen, et al.Fast and robust text detection in images and video frames[J].Image and Vision Computing,2005,23(6):565-576. 被引量:1
  • 5PHAN T Q, SHIVAKUMARA P, TAN C L.A Laplacian method for video text detection[C]//Proc of the 10th International Conference on Document Analysis and Recognition.[S.l.]:IEEE Press,2009:66-70. 被引量:1
  • 6SHIVAKUMARA P, PHAN T Q, TAN C L.A gradient difference based technique for video text detection[C]//Proc of the 10th International Conference on Document Analysis and Recognition.[S.l.]:IEEE Press,2009:156-160. 被引量:1
  • 7田破荒,彭天强,李弼程.基于文字穿越线和笔画连通性的视频文字提取方法[J].电子学报,2009,37(1):72-78. 被引量:10
  • 8SHIVAKUMARA P, PHAN T Q, TAN C L.A Laplacian approach to multi-oriented text detection in video[J].IEEE Trans on Pattern Analysis and Machine Intelligence,2011,33(2):412-419. 被引量:1
  • 9WONG E K, CHEN M.A new robust algorithm for video text extraction[J].Pattern Recognition,2003,36(6):1397-1406. 被引量:1

二级参考文献12

  • 1R Lienhart, A Wemicke. Localizing and segmenting text in images, videos [ J ]. IEEE Transactions on Circuits Syst Video Technol, 2002,12(4) :256 - 268. 被引量:1
  • 2Agnihotri L, Dimitrova N. Text detection for video analysis [ A]. IEEE Workshop on Content-Based Access of Image and Video Libraries [C ]. Fort Collins, CO, USA: IEEE Press, 1999.109 - 113. 被引量:1
  • 3K Jain, B Yu. Automatic text location in images and video frames[ J]. Pattern recognition, 1998,31(12) :2055 - 2076. 被引量:1
  • 4Wenge Mao,Fu-lai Chung,Lam, K K M, Wan-chi Sun.Hybrid Chinese/English text detection in images and video frames [ A]. Proceedings of 16th International Conference on Pattern Recognition, 2002 [C ]. Washington, DC, USA: IEEE Computer Society,Volume (3) ,Aug 2002. 1015 - 1018. 被引量:1
  • 5J Gllavata, R Ewerth, B Freisleben. A text detection, localization and segmentation system for OCR in images[A]. Proceedings of the 1EEE Sixth International Symposium on Multimedia Software Engineering[ C]. Washington, DC, USA :IEEE Computer Society,2004.310 - 317. 被引量:1
  • 6Michael R Lyu, Jiqiang Song, Min Cal. A comprehensive method for multilingual video text detection, localization, and extraction[J ]. IEEE Transaction on circuits and systems for video technology, 2005,15(2) :243 - 255. 被引量:1
  • 7D Chen,K Shearer,H Bourlard. Text enhancement with asymmelric filter for vdeo OCR[A]. In Proceedings of 11 th International Conference Image Analysis Processing [ C ]. Palermo, I taly: IEEE Press,2001,192 - 197. 被引量:1
  • 8T Sato, T Kanade, E K Hughes, M A Smith. Video OCR for digital news archive [ A ]. In Proceedings of IEEE Workshop Content-Based Access Image Video Database[ C]. Bombay, India: IEEE Press, 1998,52 - 60. 被引量:1
  • 9C Ding,X He,H Zha,M Gu,H Simon. A rnin-max cut algorithm for graph partitioning and data clustering [A]. In Proceedings of IEEE International Conference Data Mining [ C ]. San Jose,CA,USA:IEEE Press,2001,107 - 114. 被引量:1
  • 10S U Lee,S Y Chung,R H Park. A comparative performance study of several global thresholding techniques for segmentation[J]. Computer Vision, Graphics and Image Processing, 1990,52(2) : 171 - 190. 被引量:1

共引文献9

同被引文献7

引证文献1

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部