Abstract
Chinese text poses a particular problem for information processing: because of the way Chinese words are formed, they cannot be separated by spaces as English words can. Segmenting a continuous run of Chinese characters into individual words is therefore a central difficulty in Chinese information processing. This paper analyzes and studies Chinese word segmentation algorithms and the corresponding segmentation techniques, with the aim of helping more readers understand how these algorithms are applied in search engines.
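The point about missing space delimiters can be illustrated with a minimal sketch. The abstract does not say which segmentation algorithm the paper covers; the example below assumes forward maximum matching, a classic dictionary-based approach, purely for illustration. The function name `forward_max_match`, the `max_len` parameter, and the toy dictionary `VOCAB` are hypothetical and do not come from the paper.

```python
def forward_max_match(text, vocab, max_len=4):
    """Greedy forward maximum matching (illustrative sketch).

    At each position, take the longest substring (up to max_len
    characters) found in vocab; fall back to a single character
    when nothing in the dictionary matches.
    """
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, then shrink one character at a time.
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in vocab:
                tokens.append(piece)
                i += size
                break
    return tokens


# Hypothetical toy dictionary, assumed only for this example.
VOCAB = {"中文", "分词", "算法", "搜索", "引擎", "搜索引擎"}

sentence = "中文分词算法"

# Whitespace splitting works for English but leaves Chinese as one token.
print("word segmentation algorithm".split())
print(sentence.split())

# Dictionary-based matching recovers the individual words.
print(forward_max_match(sentence, VOCAB))
```

Greedy longest-match is simple and fast, but it depends entirely on dictionary coverage and can pick the wrong split for ambiguous strings, which is one reason segmentation is treated as a hard problem.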
Author
WANG Hong-hao (王洪浩), Shandong Zheyuan Information Technology Co., Ltd., Dongying 257200, China
Source
Management & Technology of SME (《中小企业管理与科技》)
2019, Issue 3, pp. 103-104 (2 pages)
Keywords
Chinese word segmentation algorithm
search engine
Chinese word segmentation technology