摘要
本文从实际应用出发,针对定位格内粘连字串的特点提出了一种新的数字分割方法。该方法将数字粘连划分为两种方式:字符间通过过渡笔划形成的粘连,称为过渡粘连;通过共用笔划形成的粘连,称为共用粘连。对于第一种粘连,首先由上下轮廓差和结构点确定候选分割点,再依据数字的左右边缘差、纵向开口深度和结构点对结果进行修正;对于第二种粘连,则直接依据结构点进行分割。该算法具有分割成功率高,运算量小的特点,已应用于实际的银行支票自动处理系统中。
Handwriting numeral segment is a hard but important task in an OCR system. In many applications, numerals are filled in preprinted form frames, which make the segmentation problem easier. General segment methods are developed for unconstrained numeral strings, but they are complex and slow. In this paper, a certain segment method for handwriting numeral strings in form frames is proposed. Usually, there are two connected types for numerals in form frames: transition-connected type (connect by a long horizon stroke) and share-connected type ( connect by sharing one period stroke). For the former type, we firstly detect the candidate segment positions based on local contour features, and then select a better position according to a 2-categories classifier. For the share-connected type, we segment by analysis the contour features. Experiment results on 574 real-life bank check images demonstrate the efficient of our method. The recognition success rate is 82. 4%, and the segment time is 13. 8ms per check. This method has been used in Automatic Chinese Bankcheck Processing demo system.
出处
《模式识别与人工智能》
EI
CSCD
北大核心
2003年第3期342-346,共5页
Pattern Recognition and Artificial Intelligence