大数据环境下隐性语义索引的研究综述

A survey of implicit semantic indexing in big data environment

下载PDF

导出

摘要本文针对大数据环境下的多标签集成分类器的技术积累、基于Boos Texter算法的迭代次数优化方法、基准子集特征数量分类器算法构造的现状,根据文本分类存在问题进行分析,为满足今后分类器改进提出大数据环境下的隐性语义索引改进和不断改进策略。 In this paper,the present situation of the technology accumulation of multi label integrated classifier in large data environment,the optimization method of iterations based on Boos Texter algorithm and the algorithm of datum subset feature quantity classifier are constructed,and the problems in the text classification are analyzed,and the large data environment is put forward to meet the future classifier improvement.Recessive semantic index improvement and continuous improvement strategy.

作者李峤王茹娟 Li Jiao;Wang Rujuan(School of humanities,Northeast Normal University,Changchun Jilin,130117)

机构地区东北师范大学人文学院

出处《电子测试》 2018年第14期115-116,共2页 Electronic Test

基金吉林省教育厅"十三五"科学技术研究项目"大数据环境下语义索引技术的研究"(吉教科合字[2016]第159号)

关键词多标签集成分类器基准文本分类 multi label ensemble classifier benchmark text categorization

分类号 TP391.1 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献3

1龚静,黄欣阳.基于隐性语义索引的多标签文本分类集成方法[J].计算机工程与设计,2017,38(9):2556-2561. 被引量：6
2刘春蔚,邹自明,佟继周.基于LSI的日地空间领域科学数据语义检索模型[J].中国科学院大学学报（中英文）,2016,33(5):711-719. 被引量：4
3徐建锁,王正欧.基于LSI和自组织神经网络的高效文本聚类方法[J].天津大学学报（自然科学与工程技术版）,2004,37(11):1026-1030. 被引量：7

二级参考文献28

1Alahakoon D, Halgamuge S K.Dynamic self organizing maps with controlled growth for knowledge discovery[J].IEEE Trans on Neural Networks,2000,11(3):601-614. 被引量：1
2Dumais S T,Furnas G W,Landauer T K,et al.Using latent semantic analysis to improve information retrival[A].In:CHI'88 Proceedings[C].New York:ACM Press,1998. 被引量：1
3Deerwester S, Susan S T, Furnas S T,et al.Indexing by latent semantic[J].Journal of American Society for Information Science,1990,41(5):391-407. 被引量：1
4Kolda T G,O'Leary.Large latent semantic indexing via a semi discrete matrix decomposition[R].No.UM-CSD CS-TR-3713,Maryland:University of Maryland,1996. 被引量：1
5Furnas G W, Deerwester S, Dumais S T, et al.Information retrival using singular value decomposition model of latent semantic structure[A].In:Proceedings of SIGIR'88[C].New York:ACM Press,1988. 被引量：1
6Jones C B, Purves R, Ruas A, et al. Spatial information retrieval and geographical ontologies an overview of the SPIRIT project[ C ] //Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 2002: 387-388. 被引量：1
7Li W, Yang C, Nebcrt D, et al. Semantic-based web service discovery and chaining for building an Arctic spatial data infrastructure [ J ]. Computers & Geosciences, 2011, 37 (11): 1752-1762. 被引量：1
8Bhattacharjee S, Ghosh S K. Automatic resolution of semantic heterogeneity in GIS: an ontology based approach [ M ] // Advanced Computing, Networking and Informatics-Volume 1. Springer International Publishing, 2014: 585-591. 被引量：1
9Wu Z, Zeng W, Wu J, et al. Method for semantic service registration and query based on WordNet: U. S. Patent 8 671 103[P]. 2014-03-11. 被引量：1
10Pal D, Mitra M, Datta K. Improving query expansion using WordNet [ J ]. Journal of the Association for Information Science and Technology, 2014, 65 (12) : 2 469-2 478. 被引量：1

共引文献14

1孙辉,陈晓云,马志新.基于语句-词条矩阵的聚簇式动态增长聚类算法[J].清华大学学报（自然科学版）,2005,45(S1):1814-1817. 被引量：1
2王伟.文本自动聚类技术研究[J].情报杂志,2009,28(2):94-97. 被引量：6
3唐果,陈宏刚.基于BBS热点主题发现的文本聚类方法[J].计算机工程,2010,36(7):79-81. 被引量：14
4刘勘,周丽红,陈譞.基于关键词的科技文献聚类研究[J].图书情报工作,2012,56(4):6-11. 被引量：18
5常娥.基于LSI理论的文本自动聚类研究[J].图书情报工作,2012,56(11):89-92. 被引量：4
6吕庆莉.基于信息增益的中医体质多标记分类方法研究[J].中国中医药信息杂志,2019,26(6):97-100.
7文晓艺,郝程程.基于奇异值分解的新闻标题聚类研究[J].计算机技术与发展,2020,30(2):42-46. 被引量：3
8熊森林,邹自明,胡晓彦,纪珍.空间科学数据产品组织模型[J].农业大数据学报,2019,1(4):30-36. 被引量：4
9杨恒,颜宏文.基于DBM的电力投诉工单分类的应用研究[J].计算技术与自动化,2020,39(3):86-90. 被引量：3
10孙桂煌.基于大数据技术的中文多标签文本分类方法研究[J].齐齐哈尔大学学报（自然科学版）,2020,36(6):39-43. 被引量：2

1物联网产业视点[J].物联网技术,2018,8(10):1-2.
2陈姝,阳亮.世界地面无人系统发展现状[J].国外坦克,2018,0(9):12-22. 被引量：6
3曾润华,张树群.改进卷积神经网络的语音情感识别方法[J].应用科学学报,2018,36(5):837-844. 被引量：12
4刘中锋.基于局部学习的差分隐私集成特征选择算法[J].计算机技术与发展,2018,28(10):79-82. 被引量：3
5王喜,何福男,张书奎.交错立方体上限制容错单播算法的研究[J].西南师范大学学报（自然科学版）,2018,43(9):51-59. 被引量：2
6郭鑫.铝合金板焊接工艺试验验证浅析(上)[J].汽车工艺师,2018(10):46-49. 被引量：1
7汪在荣,张文凤.一种基于成对粒子的粒子群优化算法[J].四川师范大学学报（自然科学版）,2018,41(6):840-846. 被引量：3
8刘云,陈倩.无线传感网中分布式信号检测的多维特征值算法优化研究[J].计算机工程与科学,2018,40(9):1585-1590. 被引量：7
9刘黎志,邓介一,吴云韬.基于HBase的多分类逻辑回归算法研究[J].计算机应用研究,2018,35(10):3007-3010. 被引量：11
10邓寅,王园园.中国经济过度杠杆化分析:表现、逻辑与治理[J].当代财经,2018(9):15-25. 被引量：1

电子测试

2018年第14期

浏览历史

内容加载中请稍等...

大数据环境下隐性语义索引的研究综述

参考文献3

二级参考文献28

共引文献14

相关作者

相关机构

相关主题

浏览历史