期刊文献+

基于词典和统计相结合的维吾尔语拼写检查方法 被引量:2

Spelling Check Method of Uyghur Languages Based on Dictionary and Statistics
下载PDF
导出
摘要 该文通过研究国内外相关的拼写错误查错和纠错方法的理论,再结合维吾尔语自身的特点,提出了基于词典和统计相结合的维吾尔语拼写查错方法。首先,提出基于词典的方法进行词库和词干提取的拼写检查;其次,提出基于N元语法的词缀连接有效性判断模型,对未登录词提出基于N元语法的拼写检查模型;最后,结合以上几种方法各自的优点提出基于混合策略的拼写检查方法,该方法在准确性和检查结果可靠性等方面得到了较显著的提高。 In this paper, we present spelling check method of Uyghur Languages based on a combination of dictiona- ry and statistics. Firstly, we describe a stemming method based on the dictionary. Secondly, we proposed N-gram- based model to judge the suffix to a stem, detecting the misspelling unknown words at the same time. Finally, we present a spelling check method based a hybrid strategy of combining different methods. This method achieves im- provements in accuracy, reliability, and so on.
出处 《中文信息学报》 CSCD 北大核心 2014年第2期66-71,共6页 Journal of Chinese Information Processing
基金 国家重点基础研究计划973(2011211B07) 国家自然科学基金(61262060 61063043) 国家自然科学基金(61063026) 国家创新基金(10C26226505485) 国家社会科学基金重点项目(10AYY006)
关键词 维吾尔语 拼写检查 词典 N元语法 Uyghur Language spelling check dictionary N-gram
  • 相关文献

参考文献17

  • 1Kukich K. Techniques for automatically correcting words in text[C]. Proceedings of the ACM Computing Surveys, 1992,24(2) ,377-439. 被引量:1
  • 2Boswell D. Language Models for Spelling Correction [C]. Proceedings of the CSE 256, 2004. 被引量:1
  • 3Rickard J C. Domeij Viggo Kann Ola Knutsson. A Swedish Grammar Checker[R]: Association for Com- putational Linguistics, 2000. 被引量:1
  • 4Dhanabalan T, Parthasarathi R, Geetha T V. TamilSpell Checker[C]. Proceedings of the Sixth Tamil In- ternet 2003 Conference, Chennai, Tamilnadu, India, 2003:22-24. 被引量:1
  • 5Hamrouni B M. Logic compression of dictionaries for multilingual spelling eheckers[C]//Proeeedings of the 15th Conference on Computational Linguistics, K yoto, Japan, 1994: 5-9. 被引量:1
  • 6Menno van Zaanen, Gerhard van Huyssteen. Impro- ving a Spelling Checker for Afrikaans[C]//Proceed- ings of the Language and Computers, Publisher Rodo- pi, ISSN 0921-5034, 2003,47(1): 143-156. 被引量:1
  • 7Arif Billah A1-Mahmud Abdullah, Rahman A. A Ge- neric Spell Checker Engine for South Asian Languages [J]. IASTED 2003, 2003:3-5. 被引量:1
  • 8Dembitz S, Knezevic P, Sokele M. Developing a Spell Checker as an Expert System[J]. Journal of Compu- ting and Information Technology-CIT 11, 2004: 285- 291. 被引量:1
  • 9施得胜,等.基于统计的中文错字侦测法[J].电脑与通讯.1992,8:19. 被引量:1
  • 10张仰森,丁冰青.基于二元接续关系检查的字词级自动查错方法[J].中文信息学报,2001,15(3):36-43. 被引量:29

二级参考文献33

  • 1古丽拉.阿东别克,米吉提.阿布力米提.维吾尔语词切分方法初探[J].中文信息学报,2004,18(6):61-65. 被引量:39
  • 2Gulila·Adongbieke. The Research of Proofreading for the Uighur Character [A],The 2001 IEEE International Conference on System, Man and Cybernetics (SMC2001)[C], 2001.10.7 - . 10.10, Tucson, Arizona ,U.S.A,P874- 876. 被引量:1
  • 3CHRISTOPHER D,MANNING,HINRICH SCHUTZE.统计自然语言处理基础[M].苑春法译.北京:电子工业出版社,2005:143-163. 被引量:5
  • 4Gengqing Wu,Fang Zheng.A Method to Build a Super Small but Practically Accurate Language Model for Handheld Devices[J].J.Computer Science & Technology,2003,18 (6):747-755. 被引量:1
  • 5Fang Zheng,Zhanjiang Song,Pascale Fung,et al.Mandarin Pronunciation Modeling Based on CASS Corpus[J].J.Computer Science & Technology,May 2002,17(3):249 -263. 被引量:1
  • 6R.Rosenfeld,et al.Error Analysis and Disfluency Modeling in the Switchboard Domain[A].In:proceedings of the 4th International Conference on Speech and Language Processing (ICSLP)[C].Philadelphia,PA,USA,1996. 被引量:1
  • 7Rukmini M.Iyer,Mari Ostendorf,Modeling long distance dependence in language:topic mixtures versus dynamic cache models[J].IEEE Transactions on Speech and Audio Processing,Volume 7 Issue 1,Jan 1999.Page(s):30 -39. 被引量:1
  • 8Daniel Gildea,Thomas Hofmann.Topic Based Language Models Using EM[A].In:proceedings of 6th European Conference on Speech Communication and Technology (Eurospeech'99)[C].1999,pages 2167 -2170. 被引量:1
  • 9R.Rosenfeld.A Maximum Entropy Approach to Adaptive Statistical Language Model[J].Computer Speech & Language,1996,10:187-228. 被引量:1
  • 10S.M.Katz.Estimation of Probabilities from Sparse Data for the Language Model Component of a Speech Recognizer[J].IEEE Transaction on Acoustic,Speech and Signal Processing,1987,35(3):400 -401. 被引量:1

共引文献85

同被引文献13

引证文献2

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部