摘要
对来源不同的地质对象进行关联匹配,并通过模型对其结构、属性及语义关系进行表示是后期语义查询及聚类等任务的重要支撑。文章针对地质调查空间实体与外部文本描述语义异构、表达差异等问题,提出了一种基于注意力机制的孪生网络地质调查空间实体与文本描述信息关联匹配模型。首先,将地质调查空间实体的属性信息转换成为文本段落,以句向量基本粒度对地质空间实体进行文本语义编码;接着将两类文本对象映射到统一向量空间中,并输入到孪生网络中进行特征学习,最后在构建真实数据集上进行模型性能的实验测评。结果显示,该模型能够较好表示地质调查空间实体句子语义信息,其识别F1值相比基准实验提高了8.4个百分点,优于选取的对比方法。
Association matching of geological objects with different sources and representation of their structures,attributes and semantic relationships by models is an important support for later tasks such as semantic query and clustering.In this paper,we propose a twin network geological survey spatial entities and text description information association matching model based on attention mechanism for the problems of semantic heterogeneity and expression differences between geological survey spatial entities and external text descriptions.First,the attribute information of geological survey spatial entities is converted into text paragraphs,and the text semantics of geological spatial entities is encoded with the basic granularity of sentence vectors;then the two types of text objects are mapped into a unified vector space and input to the twin network for feature learning,and finally the experimental evaluation of model performance is conducted on the constructed real dataset.The results demonstrate that the model can better represent the sentence semantic information of geological survey spatial entities,and its recognition F1 value is improved by 8.4 percentage points compared with the benchmark experiment,which is better than the selected comparison method.
作者
邱芹军
马凯
谢忠
陶留锋
黄波
QIU Qinjun;MA Kai;XIE Zhong;TAO Liufeng;HUANG Bo(School of Computer Sciences,China University of Geosciences,Wuhan 430074,China;Hubei Key Laboratory of Intelligent Geo-Information Processing,China University of Geosciences,Wuhan 430074,China;Hubei Key Laboratory of Intelligent Vision Based Monitoring for Hydroelectric Engineering,China Three Gorges University,Yichang 443002,China;College of Computer and Information Technology,China Three Gorges University,Yichang 443002,China;Wuhan Zondy Cyber Technology Ltd.,Co.,Wuhan 430074,China)
出处
《高校地质学报》
CAS
CSCD
北大核心
2023年第3期337-344,共8页
Geological Journal of China Universities
基金
国家重点研发计划项目(2022YFF0711601,2022YFB3904200)
国家自然科学基金原创性探索项目(42050101)
中国博士后科学基金(2021M702991)
支持企业技术创新发展项目任务书《自主可控的全国产化全空间GIS平台研发》和智能地学信息处理湖北省重点实验室开放研究课题(KLIGIP-2021A01)
湖北省自然科学基金(2022CFB640)联合资助。
关键词
地质调查实体
文本多语义表征
信息匹配
语义相似性
geological survey entities
textual semantic representation
information matching
semantic similarity