Text, as one of the most influential inventions of humanity, has played an important role in human life, so far from ancient times. The rich and precise information embod- ied in text is very useful in a wide range of...Text, as one of the most influential inventions of humanity, has played an important role in human life, so far from ancient times. The rich and precise information embod- ied in text is very useful in a wide range of vision-based ap- plications, therefore text detection and recognition in natu- ral scenes have become important and active research topics in computer vision and document analysis. Especially in re- cent years, the community has seen a surge of research efforts and substantial progresses in these fields, though a variety of challenges (e.g. noise, blur, distortion, occlusion and varia- tion) still remain. The purposes of this survey are three-fold: 1) introduce up-to-date works, 2) identify state-of-the-art al- gorithms, and 3) predict potential research directions in the future. Moreover, this paper provides comprehensive links to publicly available resources, including benchmark datasets, source codes, and online demos. In summary, this literature review can serve as a good reference for researchers in the areas of scene text detection and recognition.展开更多
为了获取铭牌图像中的基本参数信息,提出一种基于深度学习的端到端文本识别模型TDRN(Text Detection and Recognition Network)。模型避免了图像裁剪和字符分割,将文本看作一个序列,使用BLSTM(Bidirectional Long Short-term Memory)来...为了获取铭牌图像中的基本参数信息,提出一种基于深度学习的端到端文本识别模型TDRN(Text Detection and Recognition Network)。模型避免了图像裁剪和字符分割,将文本看作一个序列,使用BLSTM(Bidirectional Long Short-term Memory)来获取上下文关系。同时,将文本检测和文本识别整合在同一个网络中共同训练,共享卷积层,以提高整体性能,在文本识别中还引入了注意力机制。模型在公共场景文本数据集SVT(Street View Text)上测试表现良好,F值为68. 69%,高于一般的端到端文本识别模型。与传统铭牌识别方法相比,TDRN准确率更高,鲁棒性更强,能适应复杂的电力场景变化。展开更多
文摘Text, as one of the most influential inventions of humanity, has played an important role in human life, so far from ancient times. The rich and precise information embod- ied in text is very useful in a wide range of vision-based ap- plications, therefore text detection and recognition in natu- ral scenes have become important and active research topics in computer vision and document analysis. Especially in re- cent years, the community has seen a surge of research efforts and substantial progresses in these fields, though a variety of challenges (e.g. noise, blur, distortion, occlusion and varia- tion) still remain. The purposes of this survey are three-fold: 1) introduce up-to-date works, 2) identify state-of-the-art al- gorithms, and 3) predict potential research directions in the future. Moreover, this paper provides comprehensive links to publicly available resources, including benchmark datasets, source codes, and online demos. In summary, this literature review can serve as a good reference for researchers in the areas of scene text detection and recognition.