摘要
本文依据待校对文本中的常见错误类型介绍了纠错知识库的构造方法以及基于该纠错知识库的自动纠错算法。该算法通过利用出错字串的特征 ,结合上下文启发信息 ,可有效地对文本中的别字、漏字、多字、易位、多字替换等错误提供纠错建议。
According to common error types in pre proofreading text,this paper introduce the method to structure correcting knowledge sets and a automatic correcting algorithm based on this correcting knowledge sets.The algorithm makes a full use of the characteristics of wrong strings and context heuristic information.It can provide correcting suggestions for such errors as ghost word,missed Chinese characters,superfluous Chinese characters,reversed Chinese characters and substituted Chinese characters etc.The method of sorting the correcting suggestions is also discussed.
出处
《中文信息学报》
CSCD
北大核心
2001年第5期33-39,共7页
Journal of Chinese Information Processing
基金
山西省自然科学基金 (9810 31)