将文本挖掘和共词分析方法相结合应用到专利文献的研究中去,以期通过对专利的内容分析更深层次地了解不同技术主题的研究现状及发展趋势。以射频识别(Radio Frequency Identification,RFID)技术领域为研究对象,对此领域专利的摘要进行...将文本挖掘和共词分析方法相结合应用到专利文献的研究中去,以期通过对专利的内容分析更深层次地了解不同技术主题的研究现状及发展趋势。以射频识别(Radio Frequency Identification,RFID)技术领域为研究对象,对此领域专利的摘要进行文本挖掘,从中提取能够反映此技术领域特征的关键词,根据关键词之间的共现关系,对其进行聚类分析,得到目前RFID领域的六个技术主题,并借助战略坐标图对这六个技术主题进行分析,探寻每个技术主题的发展趋势,为企业技术创新活动和产业发展战略的制定提供决策参考。展开更多
Engineering and research teams often develop new products and technologies by referring to inventions described in patent databases. Efficient patent analysis builds R&D knowledge, reduces new product development tim...Engineering and research teams often develop new products and technologies by referring to inventions described in patent databases. Efficient patent analysis builds R&D knowledge, reduces new product development time, increases market success, and reduces potential patent infringement. Thus, it is beneficial to automatically and systematically extract information from patent documents in order to improve knowledge sharing and collaboration among R&D team members. In this research, patents are summarized using a combined ontology based and TF-IDF concept clustering approach. The ontology captures the general knowledge and core meaning of patents in a given domain. Then, the proposed methodology extracts, clusters, and integrates the content of a patent to derive a summary and a cluster tree diagram of key terms. Patents from the International Patent Classification (IPC) codes B25C, B25D, B25F (categories for power hand tools) and B24B, C09G and H011 (categories for chemical mechanical polishing) are used as case studies to evaluate the compression ratio, retention ratio, and classification accuracy of the summarization results. The evaluation uses statistics to represent the summary generation and its compression ratio, the ontology based keyword extraction retention ratio, and the summary classification accuracy. The results show that the ontology based approach yields about the same compression ratio as previous non-ontology based research but yields on average an 11% improvement for the retention ratio and a 14% improvement for classification accuracy.展开更多
【目的】分析国内外专利网络研究进展,梳理研究现状、发现研究问题和研判研究趋势。【文献范围】分别以"Patent Network"和"专利网络"为主题在Web of Science核心集和CNKI核心期刊库中检索,通过去重、去除不相关文...【目的】分析国内外专利网络研究进展,梳理研究现状、发现研究问题和研判研究趋势。【文献范围】分别以"Patent Network"和"专利网络"为主题在Web of Science核心集和CNKI核心期刊库中检索,通过去重、去除不相关文献后,共检索到英文论文465篇,中文论文196篇,分析其中代表性论文106篇。【方法】首先,利用团渗透重叠社区发现算法对"专利网络"关键词共现网络进行主题挖掘,分析中英文热点研究主题;其次,对热点研究主题下的高被引论文进行述评。【结果】综合现有研究,专利网络构建方法主要有合作关系、引用关系、技术转移关系、技术相似关系等,主流研究方法有社会网络分析、复杂网络和文本挖掘等。【局限】仅对热点研究领域的高被引代表性论文进行分析,未能覆盖全部研究主题和文献。【结论】专利网络研究尚未形成系统性的理论和方法体系,新兴研究方法的应用仍处于探索阶段。专利网络分析需向中观层面深入,网络演化机制、模型和仿真实验研究还需进一步加强。专利网络语义化分析倾向越来越明显;基于多种关系的综合性专利网络构建和分析,获得越来越多的关注,未来有可能成为新兴研究方向。展开更多
文摘将文本挖掘和共词分析方法相结合应用到专利文献的研究中去,以期通过对专利的内容分析更深层次地了解不同技术主题的研究现状及发展趋势。以射频识别(Radio Frequency Identification,RFID)技术领域为研究对象,对此领域专利的摘要进行文本挖掘,从中提取能够反映此技术领域特征的关键词,根据关键词之间的共现关系,对其进行聚类分析,得到目前RFID领域的六个技术主题,并借助战略坐标图对这六个技术主题进行分析,探寻每个技术主题的发展趋势,为企业技术创新活动和产业发展战略的制定提供决策参考。
基金supported by National Science Council research grants
文摘Engineering and research teams often develop new products and technologies by referring to inventions described in patent databases. Efficient patent analysis builds R&D knowledge, reduces new product development time, increases market success, and reduces potential patent infringement. Thus, it is beneficial to automatically and systematically extract information from patent documents in order to improve knowledge sharing and collaboration among R&D team members. In this research, patents are summarized using a combined ontology based and TF-IDF concept clustering approach. The ontology captures the general knowledge and core meaning of patents in a given domain. Then, the proposed methodology extracts, clusters, and integrates the content of a patent to derive a summary and a cluster tree diagram of key terms. Patents from the International Patent Classification (IPC) codes B25C, B25D, B25F (categories for power hand tools) and B24B, C09G and H011 (categories for chemical mechanical polishing) are used as case studies to evaluate the compression ratio, retention ratio, and classification accuracy of the summarization results. The evaluation uses statistics to represent the summary generation and its compression ratio, the ontology based keyword extraction retention ratio, and the summary classification accuracy. The results show that the ontology based approach yields about the same compression ratio as previous non-ontology based research but yields on average an 11% improvement for the retention ratio and a 14% improvement for classification accuracy.
文摘【目的】分析国内外专利网络研究进展,梳理研究现状、发现研究问题和研判研究趋势。【文献范围】分别以"Patent Network"和"专利网络"为主题在Web of Science核心集和CNKI核心期刊库中检索,通过去重、去除不相关文献后,共检索到英文论文465篇,中文论文196篇,分析其中代表性论文106篇。【方法】首先,利用团渗透重叠社区发现算法对"专利网络"关键词共现网络进行主题挖掘,分析中英文热点研究主题;其次,对热点研究主题下的高被引论文进行述评。【结果】综合现有研究,专利网络构建方法主要有合作关系、引用关系、技术转移关系、技术相似关系等,主流研究方法有社会网络分析、复杂网络和文本挖掘等。【局限】仅对热点研究领域的高被引代表性论文进行分析,未能覆盖全部研究主题和文献。【结论】专利网络研究尚未形成系统性的理论和方法体系,新兴研究方法的应用仍处于探索阶段。专利网络分析需向中观层面深入,网络演化机制、模型和仿真实验研究还需进一步加强。专利网络语义化分析倾向越来越明显;基于多种关系的综合性专利网络构建和分析,获得越来越多的关注,未来有可能成为新兴研究方向。