摘要
文本的自然标注是作者基于非NLP目的对文本的句间关系、句内关系、词间关系等不同层面的语义信息进行的标注,其在句法、语义、语用、NLP以及揭示自然语言符号系统的运作规律等方面都一定的功用。在《阿Q正传》的自然标注资源中,汉字性质的自然标注有2 517次,标点符号性质的自然标注2 709次,句内关系的标注占比最高(69.88%),其次是句间关系(19.94%)和词间关系(10.17%)标注。在标注频率上,平均每7.37个汉字有一次汉字性质的自然标注,平均每4.06个字符有一次综合(汉字和标点符号)性质的自然标注。
The natural annotation of a text is to annotate semantic information at different levels for the purpose of NON-NLP,such as sentence-to-sentence relationship,intra-sentence relationship,and inter-word relationship.It has certain functions in terms of syntax,semantics,pragmatics,NLP,and the revealing the law of the operation of natural language symbology.This article uses The True Story of Ah Q as an example to examine the natural annotation resources of the story text detailedly.The natural annotation of the Chinese characters is found to be 2517 times,and the natural annotation of punctuation is 2709 times.The inter-sentence relationship annotation has the highest proportion(69.88%),followed by the intra-sentence relationship annotation(19.94%)and the inter-word relationship annotation(10.17%).On the annotating frequency,every 7.37 Chinese characters have a natural annotation on average.And every 4.06 characters have a comprehensive natural annotation(Chinese character and punctuation).
作者
邱庆山
郑莹
QIU Qingshan;ZHENG Ying(School of Chinese Language and Literature,Hubei University,Wuhan,Hubei 430062)
出处
《玉溪师范学院学报》
2019年第4期70-75,共6页
Journal of Yuxi Normal University
基金
2018年国家社科基金项目“基于认知组合性词义观的汉语词类体系重建与标注实践研究”(批准号:18BYY181)阶段性研究成果