期刊文献+

一种基于聚类的文本迁移学习算法 被引量:1

Transfer Learning Algorithm for Text Classification Based on Clustering
下载PDF
导出
摘要 当现有训练数据过期,而新数据又非常少时,运用迁移学习能够有效提高分类器性能。本文提出一种基于聚类的文本迁移学习算法,给出了算法的主要思想及实现步骤。然后,在中文文本语料库上进行了实验,并与非迁移学习算法进行了比较。实验证明该方法能有效提高分类器性能。 Transfer learning can improve the performance of classifier effectively, when the training data are out of date, but the new data are very few. In this paper, we propose a transfer learning algorithm for text classification based on clustering. We describe the main idea and the step of the algorithm. Then have experiment on text corpus of Chinese, and compare the algorithm with transfer-unaware algorithm. The experiments demonstrate that this algorithm significantly outperforms the others.
出处 《计算机系统应用》 2010年第12期238-241,共4页 Computer Systems & Applications
基金 国家自然科学基金(60873100)
关键词 训练数据过期 新数据非常少 迁移学习 聚类 文本 training data are out of date new data are very few transfer learning clustering text
  • 相关文献

参考文献12

  • 1牛延莉,张化.文本自动分类研究进展[J].软件导刊,2008,7(4):24-26. 被引量:3
  • 2Sinno Jialin Pan, Yang Q. A Survey on Transfer Learning. IEEE TKDE, 2009. 被引量:1
  • 3Caruana R. Multitask learning. Machine Learning, 1997,28(1):41-75. 被引量:1
  • 4Dai WY, Yang Q, Xue GR, Yu Y. Boosting for transfer learning. Proceedings of the Twenty-Fourth International Conference on Machine Learning, 2007:193 -- 200. 被引量:1
  • 5Dai WY, Chen YQ, Xue GR, Yang Q, Yu Y. Translated learning:Transfer learning across different feature spaces. Advances in Neural Information Processing Systems 21, 2009. 被引量:1
  • 6Ling X, Xue GR, Dai WY, Jiang Y, Yang Q, Yu Y. Can Chinese Web Pages be Classified with English. Proceedings the Seventeenth International World Wide Web Conference (WWW 2008), Beijing, China, 2008:969--978. 被引量:1
  • 7Dai WY, Xue GR, Yang Q, Yu Y. Transfer naive bayes classifiers for text classification. Proceedings of the Twenty-Second National Conference on Artificial Intelligence, 540-- 545. 被引量:1
  • 8Do C, Ng A. Transfer learning for text classification. Advances in Neural Information Processing System 18, 2006:299--306. 被引量:1
  • 9吴启明,易云飞.文本聚类综述[J].河池学院学报,2008,28(2):86-91. 被引量:21
  • 10Salton G, Buckley C. Term Weighting Approaches in Automatic Text Retrieval. Information Processing and Management, 1998,24(5):513--523. 被引量:1

二级参考文献25

共引文献51

同被引文献10

引证文献1

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部