期刊文献+

面向个人简历的事件抽取和检索框架 被引量:4

Framework of Vita Event Extraction and Retrieval
下载PDF
导出
摘要 个人简历(Curriculum Vitae,Vita)通常包含了丰富的数据,如个人信息、教育背景以及工作经历等。从大量的个人简历中抽取出有用的信息并提供检索服务,可以提供更加全面和完整的个人资料。个人简历中包含的信息可以看成是按时间排序的事件序列。进一步地,可以从不同的个人简历所包含的事件中挖掘出事件之间的关联关系。提出了一个从个人简历中提取并检索事件的框架,它可以自动地从互联网上搜索并下载个人简历文档,并从中提取出感兴趣的事件保存在数据库里,以进一步查询和检索事件。所完成的工作包括:(1)提出了一个事件表示模型,用于描述事件的基本属性及检索事件;(2)基于条件随机场提出了一个概率模型,用于从个人简历中自动提取事件;(3)通过挖掘事件属性之间的共现性,提出了基于事件的检索方法。 A curriculum vitae (henceforth referred to as a vita) usually contains a wealth of abundant data such as per- sonal information, educational background,publications and work experience. It is significant to search, extract and ex- plore the data from these vita documents which may provide a more comprehensive and integral personal profile. This personal profile can be viewed as a series of events. Moreover, we can take advantage of events from different individual's vita to explore and establish relationships between these events and the people involved. In this paper, we presented a framework extracting and explorating vita event, which can retrieve vita documents from the Internet, extract events from these documents and save the events to a database for further exploration. More concretely, the work introduced in this paper includes: (1) an event presentation model which characterizes the basic attributes of events and is utilized for event exploration; (2) a probabilistic model for extracting events from vita documents automatically; (3) an event explo- ration approach by exploiting the co-occurrence of the event attributes on the basis of the event presentation model and the event extraction approach.
出处 《计算机科学》 CSCD 北大核心 2012年第7期154-160,174,共8页 Computer Science
基金 国家自然科学基金(61040006) 湖北省自然科学基金(2010CDZ027) 湖北省教育厅科技项目(B20101909)资助
关键词 条件随机场 事件检索 事件抽取 事件表示 Conditional random fields,Event retrieval, Event extraction, Event presentation
  • 相关文献

参考文献30

  • 1Appan P, Sundaram H. Networked Multimedia Event Explora- tion[C]//Proceedings of the 12th Annual ACM International Conference on Multimedia. ACM Press, 2004 .. 40-47. 被引量:1
  • 2Arasu A, Garcia-Molina H. Extracting Structured Data from Web Pages[C]//Proceedings of 2003 ACM SIGMOD Interna- tional Conference on Management of Data. ACM Press, 2003: 337-348. 被引量:1
  • 3Berger A L, Pietra D V J, Pietra D, et al. A Maximum Entropy Approach to Natural Language Processing[J]-]. Computational Linguistics, 1996,22 (1) .. 39-71. 被引量:1
  • 4Burges C J C. A Tutorial on Support Vector Machines for Pat- tern RecognitionJ-]. Data Mining and Knowledge Discovery, 1998,2, (2) : 121-167. 被引量:1
  • 5Butter D, Liu L, Pu C. A Fully Automated Object Extraction System for the World Wide Web[C]//Proceedings of the 21 th International Conference on Distributed Computing Systems(IC- DC 2001). IEEE Computer Society, 2001:361-370. 被引量:1
  • 6Cai D, Yu S, Wen J R, et al. Blocked-based Web Search[C] // Proceedings of the 27th Annual International ACM SIGIR Con- ference on Research and Development in Information Retrieval, ACM Press, 2004 : 456-463. 被引量:1
  • 7Chieu H L, Lee Y K. Query based Event Extraction Along a Timeline[C]] // Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Infor- mation Retrieval. ACM Press, 2004:425-432. 被引量:1
  • 8Fung G P C, Yu J X, Yu P S, et al. Parameter Free Bursty E- vents Detection in Text Streams I-C]//Proceedings of the 31th International Conference on Very Large Data tMses(VLDB). VLDB Endowment, 2005 .. 181-192. 被引量:1
  • 9Fung G P C, YuJ X, Liu H, et al. Time-dependent Event Hierar- chy Construction[C]//Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining(SIGKDD). ACM Press, 2007 : 300-309. 被引量:1
  • 10Ghahramani Z, Jordan M I. Factorial Hidden Markov Models [J]. J. Machine Learning, 1997,29 (2/3) : 245-273. 被引量:1

同被引文献50

  • 1黄发良,钟智.用于分类的支持向量机[J].广西师范学院学报(自然科学版),2004,21(3):75-78. 被引量:14
  • 2李丽双,黄德根,陈春荣,杨元生.SVM与规则相结合的中文地名自动识别[J].中文信息学报,2006,20(5):51-57. 被引量:32
  • 3https://www,1dc.uperm.edu/collaborations/past-projects/ace[EB]. 被引量:1
  • 4Bao Jiana, Li Tingyu, Yao Tianfang. Event Information Extraction Approach based on ComplexChinese Texts [ C ] //IEEE Computer So- ciety. 445 Hoes Lane- P.O.Box 1331, Piscataway, NJ 08855- 1331, United States: IEEE Computer Society, 2012: 61-64. 被引量:1
  • 5Zhang Xiuhong, Gong Zhe. Information extraction based on event driven from template web pages [ C ]//Springer Verlag. Tiergarten- strasse 17, Heidelberg, D- 69121, Germany: Springer Verlag, 2013, 211 LNEE: 515-523. 被引量:1
  • 6Jiang Bo, Zhu Mengxia, Wang Jiale. Ontology - based information extraction of crop diseases on Chinese web pages [J]. Academy Pub- lisher, 2013, 8 (1): 85-90. 被引量:1
  • 7Bao Jiana, Li Tingyu, Yao Tianfang. Event Information Extraction Approach based on Complex Chinese Texts [ C ] //IEEE Computer Society. 445 Hoes Lane - P.O.Box 1331, Piscataway, NJ 08855 - 1331, United States: IEEE Computer Society, 2012:61-64. 被引量:1
  • 8Li Cunhua, Hu Yun, Zlaong Zhaoman. An Event Ontology Construc- tion Approach To Web Crime Mining [ C] //IEEE Computer Society. 445 Hoes Lane - P. O. Box 1331, Piscataway, NJ 08855 - 1331, U- nited States: IEEE Computer Society, 2010, (5) : 2441 - 2445. 被引量:1
  • 9Ding Xiaoshan, Li Fang, Zhang Dongmo. Causal Relation Recogni- tion between Sentence - based Events [ C ] //IEEE Computer Society. 445 Hoes Lane - P. O. Box 1331, Piscataway, NJ 08855 - 1331, U- nited States: IEEE Computer Society, 2011: 1688- 1693. 被引量:1
  • 10Fu Jianfeng, Liu Zongtian, Zhang Zhaoman, Shah Jianfang. Chi- nese event extraction based on feature weighting [ J]. Asian Network for Scientific Information, 2010, 9 (1): 184- 187. 被引量:1

引证文献4

二级引证文献37

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部