Abstract
This paper proposes a Chinese word segmentation method for the Geographic Information System (GIS) domain that gives priority to proper nouns, that is, domain-specific terms. It uses a dictionary mechanism that combines a domain dictionary, a general dictionary, and a synonym dictionary. Proper nouns are segmented first, and the resulting rough segmentation is then disambiguated with a Trigram model to obtain the final result. Experiments show that the algorithm achieves good speed and accuracy when segmenting professional literature.
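The two-stage idea in the abstract (cut domain terms first, then disambiguate the rough segmentation with a trigram model) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the toy lexicons `DOMAIN_DICT` and `GENERAL_DICT`, the three-sentence `CORPUS`, the add-one smoothing, and the function names. The synonym dictionary is left out because the abstract does not say how it is applied.

```python
import math
from collections import Counter

# Hypothetical toy lexicons; the paper's real dictionaries are not reproduced here.
DOMAIN_DICT = {"海拔", "地理信息系统"}
GENERAL_DICT = {"地图", "上", "上海", "数据"}
MAX_WORD_LEN = 6

# Hypothetical pre-segmented training sentences for the trigram model.
CORPUS = [["地图", "上", "海拔", "数据"], ["海拔", "数据"], ["上海", "地图"]]


def rough_segment(text):
    """Return every candidate segmentation, cutting domain terms first."""
    if not text:
        return [[]]
    # Priority step: the longest domain term starting here is cut without branching.
    for n in range(min(MAX_WORD_LEN, len(text)), 0, -1):
        if text[:n] in DOMAIN_DICT:
            return [[text[:n]] + rest for rest in rough_segment(text[n:])]
    # Otherwise branch over every general-dictionary word that matches here.
    words = [text[:n] for n in range(1, min(MAX_WORD_LEN, len(text)) + 1)
             if text[:n] in GENERAL_DICT]
    if not words:
        words = [text[0]]  # unknown character: fall back to a one-character token
    return [[w] + rest for w in words for rest in rough_segment(text[len(w):])]


def train_trigram(corpus):
    """Count trigrams and their two-word contexts over padded sentences."""
    tri, ctx, vocab = Counter(), Counter(), set()
    for sent in corpus:
        toks = ["<s>", "<s>"] + sent + ["</s>"]
        vocab.update(toks)
        for u, v, w in zip(toks, toks[1:], toks[2:]):
            tri[(u, v, w)] += 1
            ctx[(u, v)] += 1
    return tri, ctx, len(vocab)


def log_prob(words, model):
    """Add-one smoothed trigram log-probability of one candidate."""
    tri, ctx, vsize = model
    toks = ["<s>", "<s>"] + words + ["</s>"]
    return sum(math.log((tri[(u, v, w)] + 1) / (ctx[(u, v)] + vsize))
               for u, v, w in zip(toks, toks[1:], toks[2:]))


def segment(text, model):
    """Pick the rough-segmentation candidate with the highest trigram score."""
    return max(rough_segment(text), key=lambda cand: log_prob(cand, model))


model = train_trigram(CORPUS)
print(segment("地图上海拔数据", model))  # ['地图', '上', '海拔', '数据']
```

In this toy input the string "上海拔" is ambiguous between 上/海拔 and 上海/拔. The rough stage keeps both candidates, and the trigram score prefers the first because its word sequences occur in the sample corpus. The recursive enumeration is exponential in the worst case, so a practical system would more likely use a word lattice with dynamic programming.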
Source
《计算机应用》
CSCD
Peking University Core Journals (北大核心)
2010, No. 7, pp. 1941-1943 (3 pages)
Journal of Computer Applications