摘要
口语语料库的建设是口语研究的基础工作,该文选择具有代表性的交谈式谈话节目《锵锵三人行》和对谈式谈话节目《鲁豫有约》作为语料,建立了一个小型的谈话节目语料库,并构建了包含五大类16小类的会话结构标注体系,对语料进行了会话结构的标注。统计得到打断结构309例,插入结构141例,重复结构111例,问答结构653/589例,阻碍—修正结构51/21例,反映了会话结构在数量上的不均衡分布,节目的形式、性质以及交际任务是会话结构分布的主要影响因素。会话结构组合具有模式性,该文使用Trigram方法对其组合情况进行了分析,发现语料中的高频组合是问答毗邻对,此外有大量的非毗邻性组合。会话结构组合模式不但反映出谈话节目的风格特点,还有助于分析会话中的功能性模块、会话策略的形成,进而更加深入地了解会话的运作机制。
The construction of a speech corpus is the foundation of research on oral languages.In this paper,a smallscale corpus is constructed based on the representative talk shows,QiangqiangSanrenxing and LuYuYouyue.An annotation system constituted by 5primary categories and 16 subtypes is developed to annotate the conversational structures.According to the statistics of conversational structures,there are 309 interrupted structures,141 inserted structures,111 repetitive structures,653/589 question and answer structures,51/21obstruction-correction structures,which reflect the unbalanced distribution of the number of conversational structures.The form,nature and communicative tasks of the talk shows are the main influencing factors of the distribution of the conversational structure.In addition,conversational structures show certain patterns,and therefore trigram analysis is carried out to explore the combinations.It is found that the highest frequency combination in the corpus is the question-answer adjacency pair,in addition to a large number of contingency combinations.The combination patterns of conversation structures not only reflect the style of the talk shows,but also help to analyze the functional modules in the conversation,the formation of conversation strategies,and thus help us more deeply understand the operational mechanisms of the conversation.
出处
《中文信息学报》
CSCD
北大核心
2016年第6期140-146,共7页
Journal of Chinese Information Processing
基金
香港教育大学中国语言学系资助(2015-16-CHL-06)
关键词
谈话节目
会话结构
组合模式
talk shows
conversational structures
combination patterns