This paper discusses the Chinese Learner English Corpus and presents the preliminary results of itsanalysis.The last 20 years have witnessed the revival of corpus linguistics,which is characterized by theempirical app...This paper discusses the Chinese Learner English Corpus and presents the preliminary results of itsanalysis.The last 20 years have witnessed the revival of corpus linguistics,which is characterized by theempirical approaches for quantitative and qualitative analysis of large collections of natural texts in authenticuse.As a performance-based approach,corpus linguistic studies have attracted the attention of a growingnumber of applied linguists.Over recent years learner English corpora have been widely reported,becausethe analysis of the learners’ language in real use may shed light on the nature of inter-language.Languagelearning is viewed as a process in which the learners actively engage themselves in observation,hypothesisformulation about the target language rules,and experimentation of such hypothesis in their attemptedcommunication in the target language.This paper discusses the principles of CLEC development,principlesof tagging,and measures taken to ensure the consistency in machine-aided human tagging.Based on theresults of statistical analysis of CLEC and a contrastive analysis against native English speaker corpora,errors of Chinese learners of English,over-use and under-use of collocates,and their possible causes arediscussed.Implications of the research findings in English language teaching and learning and in data-driven instruction are also presented.展开更多
文摘This paper discusses the Chinese Learner English Corpus and presents the preliminary results of itsanalysis.The last 20 years have witnessed the revival of corpus linguistics,which is characterized by theempirical approaches for quantitative and qualitative analysis of large collections of natural texts in authenticuse.As a performance-based approach,corpus linguistic studies have attracted the attention of a growingnumber of applied linguists.Over recent years learner English corpora have been widely reported,becausethe analysis of the learners’ language in real use may shed light on the nature of inter-language.Languagelearning is viewed as a process in which the learners actively engage themselves in observation,hypothesisformulation about the target language rules,and experimentation of such hypothesis in their attemptedcommunication in the target language.This paper discusses the principles of CLEC development,principlesof tagging,and measures taken to ensure the consistency in machine-aided human tagging.Based on theresults of statistical analysis of CLEC and a contrastive analysis against native English speaker corpora,errors of Chinese learners of English,over-use and under-use of collocates,and their possible causes arediscussed.Implications of the research findings in English language teaching and learning and in data-driven instruction are also presented.