Abstract
Mature OCR technology now makes intelligent processing of many kinds of document images possible. Within this area, automatic recognition of electronically scanned form images is important for the efficient management of office form data. To cope with the various kinds of interference in low-quality scanned images, and to exploit the fixed structure of form images, this paper proposes an interactive form data extraction method based on template matching. Experimental results show that the method is well suited to batch processing of large numbers of forms and achieves high extraction accuracy.
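The abstract does not say how the matching is carried out, so the following is only a minimal sketch of the general idea of template matching on a fixed-layout form, not the authors' implementation. It assumes a sum-of-squared-differences score, a small grayscale image stored as nested lists, and a field that sits at a known offset from a matched anchor mark. The names `match_template`, `crop_field`, `page`, and `marker` are hypothetical, and the interactive step mentioned in the abstract is not modelled.

```python
from typing import List, Tuple

Image = List[List[int]]  # grayscale image as rows of pixel intensities


def match_template(image: Image, template: Image) -> Tuple[int, int]:
    """Return (row, col) of the window with the smallest sum of squared differences."""
    ih, iw = len(image), len(image[0])
    th, tw = len(template), len(template[0])
    best_pos, best_score = (0, 0), float("inf")
    for r in range(ih - th + 1):
        for c in range(iw - tw + 1):
            score = sum(
                (image[r + i][c + j] - template[i][j]) ** 2
                for i in range(th)
                for j in range(tw)
            )
            if score < best_score:
                best_pos, best_score = (r, c), score
    return best_pos


def crop_field(image: Image, anchor: Tuple[int, int],
               offset: Tuple[int, int], size: Tuple[int, int]) -> Image:
    """Cut out a field located at a fixed offset from the matched anchor."""
    top, left = anchor[0] + offset[0], anchor[1] + offset[1]
    height, width = size
    return [row[left:left + width] for row in image[top:top + height]]


# Hypothetical 6x8 "scanned form": an anchor mark near (1, 2), slightly
# degraded (8 instead of 9), and a 2x2 data field three columns to its right.
page: Image = [[0] * 8 for _ in range(6)]
page[1][2], page[2][3] = 8, 9          # noisy copy of the anchor mark
page[1][5], page[1][6] = 5, 6          # field contents
page[2][5], page[2][6] = 7, 8

marker: Image = [[9, 0], [0, 9]]       # clean anchor mark taken from the blank form

anchor = match_template(page, marker)
field = crop_field(page, anchor, offset=(0, 3), size=(2, 2))
print(anchor, field)
```

Because the form layout is fixed, a single matched anchor is enough in this sketch to locate every field by offset; a real scanned page would also need handling for skew, scale, and noise, which this example leaves out.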
Source
Electronic Technology (Shanghai) (《电子技术(上海)》)
2012, No. 10, pp. 7-10, 6 (5 pages in total)