摘要
查询式是网络用户搜索时表达其信息需求的主要方式,系统提示的相关词则是用户改善查询的有效工具,该文以这二者为研究对象,从用户的使用行为入手对这二者的特征进行刻画和分析。首先使用日志挖掘的方法,对查询式进行总体的定量描述;进而通过定性分类将查询式中的高频词分为主体词和辅助词两大类,并比照问卷调查的研究结果,发现网络用户在搜索时大量地使用辅助词,主体词的内容相对集中,查询式的长度较短,结构相对简单。在对相关词的研究中,综合问卷调查和对比实验研究结果,发现被试者对搜索引擎提示的相关词认同程度高而应用程度低。该文为理解网络用户搜索时的语言使用提供了实证研究结果,并对搜索引擎索引的改善有一定的参考意义。
Query is Web user's primary method to express his/her information need in searching. Related term provided by systems is a useful tool to refine his/her query. The paper focuses on query and related term; describes and analyzes them from user's utilization behavior aspect. Log mining is used to give descriptive statistics on query words; qualitative categorization is then used to divide the query words into primary and auxiliary keywords. The result of qualitative analysis is compared with the result of a questionnaire survey. Important finding are as the following. Users use auxiIiary keywords greatly. The content of primary keyword is relatively concentrated. Query length is short and the query syntax is simple. From both the questionnaire and the controlled experiment results, we find that users have high recognition and low utilizations on related terms. The study provides empirical results to understand user's language utilization and also data for search engine to refine its index.
出处
《中文信息学报》
CSCD
北大核心
2009年第4期40-47,共8页
Journal of Chinese Information Processing
关键词
计算机应用
中文信息处理
中文搜索引擎
用户搜索行为
语言使用
日志挖掘
问卷调查
对比实验
computer application
Chinese information proeessing
Chinese Search Engines
Information Behavior
Language Utilization
Log Mining
Questionnaire Survey
Controlled Experiment