Scene word recognition from pieces to whole (Cited by: 1)

Abstract: Convolutional neural networks (CNNs) have had great success on the object classification problem. For character classification, we found that training and testing CNNs on accurately segmented character regions resulted in higher accuracy than using roughly segmented regions. Therefore, we aim to extract complete character regions from scene images. Text in natural scene images has an obvious contrast with its attachments, and many methods attempt to extract characters through different segmentation techniques. However, for blurred, occluded, and complex-background cases, those methods may produce adjoined or over-segmented characters. In this paper, we propose a scene word recognition model that, after cluster-based segmentation, integrates words from small pieces into wholes. The segmented connected components are classified into four types: background, individual character proposals, adjoined characters, and stroke proposals. Individual character proposals are fed directly to a CNN trained on accurately segmented character images. A sliding window strategy is applied to adjoined character regions. Stroke proposals are treated as fragments of entire characters whose locations are estimated by a stroke spatial distribution system. Then, the characters estimated from adjoined characters and stroke proposals are classified by a CNN trained on roughly segmented character images. Finally, a lexicon-driven integration method is performed to obtain the final word recognition results. Compared to other word recognition methods, our method achieves comparable performance on the Street View Text, ICDAR 2003, and ICDAR 2013 benchmark databases. Moreover, our method can recognize occluded and improperly segmented text images.
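The routing of the four component types described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the names `ComponentType`, `Component`, `sliding_windows`, and `route`, the `(x, y, width, height)` box layout, and the window width and step are all assumptions, and the two classifiers and the stroke-based location estimator are passed in as hypothetical callables.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Callable, List, Tuple

Box = Tuple[int, int, int, int]  # (x, y, width, height); assumed layout


class ComponentType(Enum):
    BACKGROUND = "background"
    INDIVIDUAL = "individual character proposal"
    ADJOINED = "adjoined characters"
    STROKE = "stroke proposal"


@dataclass
class Component:
    kind: ComponentType
    box: Box


def sliding_windows(box: Box, window_width: int, step: int) -> List[Box]:
    """Cut a wide region into fixed-width windows, moving left to right."""
    x, y, w, h = box
    if w <= window_width:
        return [box]
    return [(x + dx, y, window_width, h)
            for dx in range(0, w - window_width + 1, step)]


def route(components: List[Component],
          fine_cnn: Callable[[Box], str],
          rough_cnn: Callable[[Box], str],
          estimate_from_strokes: Callable[[Box], Box],
          window_width: int = 32,
          step: int = 8) -> List[Tuple[Box, str]]:
    """Send each connected component to the branch the abstract names for it."""
    results: List[Tuple[Box, str]] = []
    for comp in components:
        if comp.kind is ComponentType.BACKGROUND:
            continue  # background components are discarded
        if comp.kind is ComponentType.INDIVIDUAL:
            # classifier trained on accurately segmented characters
            results.append((comp.box, fine_cnn(comp.box)))
        elif comp.kind is ComponentType.ADJOINED:
            # sliding windows, then the classifier trained on rough segments
            for win in sliding_windows(comp.box, window_width, step):
                results.append((win, rough_cnn(win)))
        else:
            # stroke fragment: estimate the full character location first
            box = estimate_from_strokes(comp.box)
            results.append((box, rough_cnn(box)))
    return results
```

The returned list of (region, label) pairs would then be the input to the lexicon-driven integration step, which this sketch does not attempt to model.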

Authors: Anna ZHU, Seiichi UCHIDA

Affiliations: SCST ISEE-AIT

Source: Frontiers of Computer Science (SCIE, EI, CSCD), 2019, Issue 2, pp. 292-301 (10 pages). Chinese title: 中国计算机科学前沿（英文版）

Funding: National Natural Science Foundation of China (Grant No. 61703316).

Keywords: text recognition; convolutional neural networks; cluster-based segmentation; character integration

Classification code: TP [Automation and Computer Technology]
