期刊文献+

中文搜索引擎查询与反馈词语特征研究 被引量:3

Characters of Query and Feedback in Chinese Search Engines
下载PDF
导出
摘要 查询式是网络用户搜索时表达其信息需求的主要方式,系统提示的相关词则是用户改善查询的有效工具,该文以这二者为研究对象,从用户的使用行为入手对这二者的特征进行刻画和分析。首先使用日志挖掘的方法,对查询式进行总体的定量描述;进而通过定性分类将查询式中的高频词分为主体词和辅助词两大类,并比照问卷调查的研究结果,发现网络用户在搜索时大量地使用辅助词,主体词的内容相对集中,查询式的长度较短,结构相对简单。在对相关词的研究中,综合问卷调查和对比实验研究结果,发现被试者对搜索引擎提示的相关词认同程度高而应用程度低。该文为理解网络用户搜索时的语言使用提供了实证研究结果,并对搜索引擎索引的改善有一定的参考意义。 Query is Web user's primary method to express his/her information need in searching. Related term provided by systems is a useful tool to refine his/her query. The paper focuses on query and related term; describes and analyzes them from user's utilization behavior aspect. Log mining is used to give descriptive statistics on query words; qualitative categorization is then used to divide the query words into primary and auxiliary keywords. The result of qualitative analysis is compared with the result of a questionnaire survey. Important finding are as the following. Users use auxiIiary keywords greatly. The content of primary keyword is relatively concentrated. Query length is short and the query syntax is simple. From both the questionnaire and the controlled experiment results, we find that users have high recognition and low utilizations on related terms. The study provides empirical results to understand user's language utilization and also data for search engine to refine its index.
作者 赖茂生 屈鹏
出处 《中文信息学报》 CSCD 北大核心 2009年第4期40-47,共8页 Journal of Chinese Information Processing
关键词 计算机应用 中文信息处理 中文搜索引擎 用户搜索行为 语言使用 日志挖掘 问卷调查 对比实验 computer application Chinese information proeessing Chinese Search Engines Information Behavior Language Utilization Log Mining Questionnaire Survey Controlled Experiment
  • 相关文献

参考文献15

  • 1王继民,彭波.搜索引擎用户访问量模型[J].计算机工程与应用,2004,40(25):9-11. 被引量:11
  • 2王继民,彭波.搜索引擎用户点击行为分析[J].情报学报,2006,25(2):154-162. 被引量:45
  • 3余慧佳,刘奕群,张敏,茹立云,马少平.基于大规模日志分析的搜索引擎用户行为分析[J].中文信息学报,2007,21(1):109-114. 被引量:117
  • 4Jansen B J,Spink A,Saracevic T.Real life,real users,and real needs:a study and analysis of user queries on the Web[J].Information Processing and Management,2000,36(2):207-227. 被引量:1
  • 5Spink A,Jansen B J,Wolfman D,et al.From e-sex to e-commerce:Web search changes[J].IEEE Computer,2002,35(3):133-135. 被引量:1
  • 6Jansen B J,Spink A.How are we searching the World Wide Web? A comparison of nine search engine transaction logs[J].Information Processing and Management,2006,42(1):248-263. 被引量:1
  • 7Bilal D,Kirby J.Differences and similarities in information seeking:children and adults as Web users[J].Information Processing and Management,2002,38(5):649-670. 被引量:1
  • 8Large A L,Beheshti J,Rahman T.Gender differences in collaborative Web searching behavior:an elementary school study[J].Information Processing and Management,2002,38(3):427-443. 被引量:1
  • 9Reih S Y.Investing Web searching behavior in home environments[C]//Humanizing Information Technology:from Ideas to BITS and back:proceedings of the 66th Annual Meeting of the American Society for Information Science and Technology,Long Beach,California,United States,October 19-22,2003.Medford (NJ):Information Today,Inc.,2003:255-264. 被引量:1
  • 10Whitmire E.Disciplinary differences and undergraduates' information seeking behavior[J].Journal of the American Society for Information Science and Technology,2002,53(8):631-638. 被引量:1

二级参考文献37

  • 1王建勇,单松巍,雷鸣,谢正茂,李晓明.Web search engine:characteristics of user behaviors and their implication[J].Science in China(Series F),2001,44(5):351-365. 被引量:4
  • 2.CNNIC(中国互联网络信息中心)[EB/OL].http://www.cnnic.net.cn/,. 被引量:6
  • 3Yinglian Xie,David O'Hallaron.Locality in Search Engine Queries and Its Implications for Caching[C].In :Proc IEEE Infocom,2002 被引量:1
  • 4A Spink,D Wolfram,B J Jansen et al.Searching the web:The public and their queries[J].Journal of the American Society for Information Science, 2001; 53 (2): 226~234 被引量:1
  • 5P Baldi,P Frasconi,P Smyth. Modeling the Intemet and the Web,probabilistic methods and algorithms[M]John Wiley,2003 被引量:1
  • 6.天网搜索引擎[EB/OL].http://e.pku.edu.cn,. 被引量:1
  • 7胡昌化 张军波.基于Matlab的系统分析与设计--小波分析[M].西安电子科技大学出版社,1999.. 被引量:1
  • 8王鹏 单保慈 曾振柄.多尺度网络时序数据挖掘,搜索引擎与Web挖掘进展[M].北京:高等教育出版社,2003.. 被引量:1
  • 9G E P Box,G M Jenkins,G C Reinsel.Time Series Analysis:Forecasting and Control[M].Prentice_hall,Inc, 1994 被引量:1
  • 10中国互联网络信息中心 (China Internet Network Information Center,CNNIC),http://www.cnnic.net.cn/ 被引量:1

共引文献174

同被引文献25

  • 1王继民,彭波,孟涛.基于搜索引擎日志发现相近Web查询[J].北京邮电大学学报,2005,28(z1):44-48. 被引量:4
  • 2王继民,陈翀,彭波.大规模中文搜索引擎的用户日志分析[J].华南理工大学学报(自然科学版),2004,32(z1):1-5. 被引量:24
  • 3余慧佳,刘奕群,张敏,茹立云,马少平.基于大规模日志分析的搜索引擎用户行为分析[J].中文信息学报,2007,21(1):109-114. 被引量:117
  • 4第29次中国互联网络发展状况统计报告[R].北京:中国互联网络信息中心(CNNIC),2012. 被引量:15
  • 5中国互联网络信息中心(CNNIC).2011年中国搜索引擎市场研究报告[R].2011-12. 被引量:1
  • 6Craig Silverstein, Monika Henzinger, Hannes Marais, et al. Analysis of a very large Web search engine query log [ J ]. SIGIR Forum. 1998,33 (I) :6-12. 被引量:1
  • 7Jaime Teevan, Eytan Adar, Rosie Jones, et al. History repeats itself: repeat queries in Yahoo' s logs[C]//Proceedings of the 29th annual in- ternational ACM SIGIR conference on Research and development in information retrieval ,2006:6 - 11. 被引量:1
  • 8Ricardo Baeza-Yates C H, Mendoza M. Query recommendation using query logs in search engines[ C ]//Trends in Database Technology-ED- BT 2004 Workshops,2005. 被引量:1
  • 9Xu J, Croft W B. Query expansion using local and global document analysis[ C]//Proceedings of the 19th International Conference on Research and Development in Information Retrieval, 1996:4- 11. 被引量:1
  • 10Downey D, Dumais S, Liebling D, et al. Understanding the relationship between searchers' queries and information goals [ C ]//CIKM' 08 Proceeding of the 17th ACM conference on Information and knowledge management ,2008. 被引量:1

引证文献3

二级引证文献30

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部