Offline Urdu Nastaleeq text recognition has long been a serious problem due to its very cursive nature. In order to get rid of the character segmentation problems, many researchers are shifting focus towards segmentat...Offline Urdu Nastaleeq text recognition has long been a serious problem due to its very cursive nature. In order to get rid of the character segmentation problems, many researchers are shifting focus towards segmentation free ligature based recognition approaches. Majority of the prevalent ligature based recognition systems heavily rely on hand-engineered feature extraction techniques. However, such techniques are more error prone and may often lead to a loss of useful information that might hardly be captured later by any manual features. Most of the prevalent Urdu Nastaleeq test recognition was trained and tested on small sets. This paper proposes the use of stacked denoising autoencoder for automatic feature extraction directly from raw pixel values of ligature images. Such deep learning networks have not been applied for the recognition of Urdu text thus far. Different stacked denoising autoencoders have been trained on 178573 ligatures with 3732 classes from un-degraded(noise free) UPTI(Urdu Printed Text Image) data set. Subsequently, trained networks are validated and tested on degraded versions of UPTI data set. The experimental results demonstrate accuracies in range of 93% to 96% which are better than the existing Urdu OCR systems for such large dataset of ligatures.展开更多
针对脱机文字识别,提出了一种基于线性合成的双粒度递归神经网络(Recurrent neural net work,RNN)集成系统.首先,使用单词RNN对未知图像进行识别;然后,依据识别结果进行字符分割,使用字符RNN对分割后的字符进行识别,并利用查表法计算字...针对脱机文字识别,提出了一种基于线性合成的双粒度递归神经网络(Recurrent neural net work,RNN)集成系统.首先,使用单词RNN对未知图像进行识别;然后,依据识别结果进行字符分割,使用字符RNN对分割后的字符进行识别,并利用查表法计算字符的后验概率;最后,综合两个RNN的识别结果决定最终单词输出.在CAPTCHA识别和手写识别上的实验结果证明了该系统的有效性.展开更多
基金National Natural Science Foundation of China (Project No. 61273365)111 Project (No. B08004) are gratefully acknowledged
文摘Offline Urdu Nastaleeq text recognition has long been a serious problem due to its very cursive nature. In order to get rid of the character segmentation problems, many researchers are shifting focus towards segmentation free ligature based recognition approaches. Majority of the prevalent ligature based recognition systems heavily rely on hand-engineered feature extraction techniques. However, such techniques are more error prone and may often lead to a loss of useful information that might hardly be captured later by any manual features. Most of the prevalent Urdu Nastaleeq test recognition was trained and tested on small sets. This paper proposes the use of stacked denoising autoencoder for automatic feature extraction directly from raw pixel values of ligature images. Such deep learning networks have not been applied for the recognition of Urdu text thus far. Different stacked denoising autoencoders have been trained on 178573 ligatures with 3732 classes from un-degraded(noise free) UPTI(Urdu Printed Text Image) data set. Subsequently, trained networks are validated and tested on degraded versions of UPTI data set. The experimental results demonstrate accuracies in range of 93% to 96% which are better than the existing Urdu OCR systems for such large dataset of ligatures.
文摘针对脱机文字识别,提出了一种基于线性合成的双粒度递归神经网络(Recurrent neural net work,RNN)集成系统.首先,使用单词RNN对未知图像进行识别;然后,依据识别结果进行字符分割,使用字符RNN对分割后的字符进行识别,并利用查表法计算字符的后验概率;最后,综合两个RNN的识别结果决定最终单词输出.在CAPTCHA识别和手写识别上的实验结果证明了该系统的有效性.