期刊文献+

Scene word recognition from pieces to whole 被引量:1

原文传递
导出
摘要 Convolutional neural networks (CNNs) have had great success with regard to the object classification problem. For character classification, we found that training and testing using accurately segmented character regions with CNNs resulted in higher accuracy than when roughly segmented regions were used. Therefore, we expect to extract complete character regions from seene images. Text in natural scene images has an obvious contrast with its attachments. Many methods attempt to extract characters through different segmentation techniques. However, for blurred, occluded, and complex background cases, those methods may result in adjoined or over segmented characters. In this paper, we propose a scene word recognition model that integrates words from small pieces to entire after-cluster-based segmentation. The segmented connected components are classified as four types: background, in dividual character proposals, adjoined characters, and stroke proposals. Individual character proposals are directly inputted to a CNN that is trained using accurately segmented character images. The sliding window strategy is applied to adjoined character regions. Stroke proposals are considered as fragments of entire characters whose locations are estimated by a stroke spatial distribution system. Then、the estimated characters from adjoined characters and stroke proposals are classified by a CNN that is trained on roughly segmented character images. Finally, a lexicondriven integration method is performed to obtain the final word recognition results. Compared to other word recognition methods, our method achieves a comparable performance on Street View Text and the ICDAR 2003 and ICDAR 2013 benchmark databases. Moreover, our method can deal with recognizing text images of occlusion and improperly segmented text images.
机构地区 SCST ISEE-AIT
出处 《Frontiers of Computer Science》 SCIE EI CSCD 2019年第2期292-301,共10页 中国计算机科学前沿(英文版)
基金 the National Natural Science Foundation of China (Grant No. 61703316).
  • 相关文献

参考文献1

二级参考文献98

  • 1Tsai S S, Chen H, Chen D, Schroth G, Grzeszczuk R, Girod B. Mobile Yingying ZHU et al. Scene text detection and recognition: recent advances and future trends visual search on printed documents using text and low bit-rate features. In: Proceedings of the 18th IEEE International Conference on Image Processing. 2011, 2601-2604. 被引量:1
  • 2Barber D B, Redding J D, McLain T W, Beard R W, Taylor CN. Vision-based target geo-location using a fixed-wing miniature air vehi?cle. Journal of Intelligent and Robotic Systems, 2006, 47(4): 361-382. 被引量:1
  • 3Kisacanin B, Pavlovic V, Huang T S. Real-time vision for human?computer interaction. Springer Science and Business Media, 2005. 被引量:1
  • 4DeSouza G N, Kak A C. Vision for mobile robot navigation: a sur?vey. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002,24(2): 237-267. 被引量:1
  • 5Ham Y K, Kang M S, Chung H K, Park R H, Park G T. Recognition of raised characters for automatic classification of rubber tires. Optical Engineering. 1995, 34(1): 102-109. 被引量:1
  • 6Yao C, Zhang X, Bai X, Liu W, Tu Z. Rotation-invariant features for multi-oriented text detection in natural images. PloS one, 2013, 8(8): e70173. 被引量:1
  • 7Yao C, Bai X, Shi B, Liu W. Strokelets: A learned multi-scale represen?tation for scene text recognition. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. 2014, 4042-4049. 被引量:1
  • 8Chen X, Yuille A L. Detecting and reading text in natural scenes. In: Proceedings of 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 2004, 2. 被引量:1
  • 9Epshtein B, Ofek E, Wexler Y. Detecting text in natural scenes with stroke width transform. In: Proceedings of 2010 IEEE Conference on Computer Vision and Pattern Recognition. 2010, 2963-2970. 被引量:1
  • 10Neumann L, Matas J. A method for text localization and recognition in real-world images. Lecture Notes in Computer Science, 2011, 6494, 770-783. 被引量:1

共引文献20

同被引文献2

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部