Abstract
Technology for integrating heterogeneous information sources provides a unified interface through which enterprise information systems and Internet-based applications can extract the information they need quickly and accurately, shielding users and applications from the heterogeneity of the underlying sources. The scope of this integration technology has expanded from traditional structured heterogeneous databases to the large volume of semi-structured Web pages and to unstructured information. This paper analyzes the technical approaches to integration, including data models, Web information, the description language XML, mainstream software development technologies, intelligent information search, query rewriting, and query analysis. It presents a system architecture for integrating semi-structured heterogeneous information sources and outlines the expected development trends of the field.
Source
《北京理工大学学报》
EI
CAS
CSCD
Peking University Core Journals (北大核心)
2002, Issue 5, pp. 533-536 (4 pages)
Transactions of Beijing Institute of Technology
Funding
Pre-research project of the General Armament Department (总装备部)
Keywords
heterogeneous information sources
information integration
semi-structured information
intelligent information search
query rewriting
query analysis
heterogeneous databases