摘要
针对数字编码的特点,本文提出了一种在不改变编码方案的情况下通过改进输入规则,结合语言模型,实现汉字数字编码的智能输入技术。文章首先讨论了怎样设计字词码本结构,使之能够满足灵活多样的输入方式,继而设计了一种动态自学习语言模型,重点分析了数据平滑算法在语言模型中的应用与改进,最后通过一个输入法示例程序,对改进前后不同情况下的输入效果进行了测试。实验表明,这种输入技术不但降低了输入法的平均码长,而且显著地提高了首字命中率。
An intelligent digital code-based input technique for Chinese characters, which features in improving the input rules without modifying the original coding scheme and combining the language model,is proposed. The paper disusses how to design the Chinese character and word code to meet the various input modes at first, then designs a dynamic self-study language model, and analyses the data smoothing algorithm in the lariguage model. The experimen- tal results regarding the input performance are given at last, by comparing the intelligent input method with the orginal method, showing that the proposed input technique can not only reduce the average input code length, but also improve the hit rate of the fn'st candidate character,
出处
《中文信息学报》
CSCD
北大核心
2006年第4期100-105,共6页
Journal of Chinese Information Processing
基金
江苏省高技术研究项目资助(BG2005020)
江苏省教育厅自然基金资助(04KKB320134)
关键词
计算机应用
中文信息处理
汉字输入
数字编码
智能输入
动态自学习语言模型
computer application
Chinese information processing
Chinese character input
digital code
intelligent Input
dynamic self-study language model