期刊文献+

家庭服务机器人语音指令深层信息识别 被引量:2

Deep Information Recognition of Speech Instruction of Family Service Robot
下载PDF
导出
摘要 针对机器人缺乏对语音指令的深层识别能力,提出一种基于DNN/GMM的深层信息识别方法.首先,在全息地图环境数据基础上,运用双隐层深度神经网络构建家庭环境神经网络模型;其次,基于该模型对语音指令进行目标对象深层信息识别,并据此提取相关预备指令;再次,基于语音指令高斯混合模型识别语音指令类型;最后,依据不同指令类型选定最优服务对象深层信息识别方法,之后提取服务指令.在一般家庭中构建实验环境,结果验证了该方法的正确性和有效性,且使得机器人依据指令的深层信息能够更加准确地理解并执行指令. According to robot lacks of ability of recognizing deep information of speech instructions, this paper proposes a deep infor- mation recognition method based on DNN/GMM. First of all, on the basis of the environmental data of holographic map, to build neu- ral network model of family environment by improved double hidden layer deep neural network; Secondly, mining deep information of target object of speech instruction based on this model, and then extracting related prepare instruction;Thirdly, to recognize the type of speech instruction based on Gaussian Mixture Model;Finally,to choose the best method of service object recognition by different in- struction type, and then extract service instruction. To build experimental environment in the general family, the experiment results show that the accuracy and validity of this method,and also enables the robot more accurately understand and executive instructions.
出处 《小型微型计算机系统》 CSCD 北大核心 2015年第6期1347-1352,共6页 Journal of Chinese Computer Systems
基金 国家自然科学基金项目(61305113)资助
关键词 全息地图 神经网络模型 高斯混合模型 预备指令 服务指令 holographic map NNM GMM prepare instruction service instruction
  • 相关文献

参考文献2

二级参考文献19

  • 1Reynolds D A. Speaker Verification Using Adapted Gaussian Mixture Models. Digital Signal Processing, 2000, 10:19 -41 被引量:1
  • 2Reynolds D A, Rose R C. Robust Text-Independent Speaker Identification Using Gaus,sian Mixture Speaker Models. IEEE Trans onSpeech Audio Process, 1995, 3: 72- 83 被引量:1
  • 3Reynolds D A. An Overview of Automatic Speaker RecognitionTechnology. In: Proc of the International Conference on Acoustics,Speech and Signal Processing (ICASSP'02). Orlando, FL, USA,2002, Ⅳ : 4072 - 4075 被引量:1
  • 4NIST. The 2001 NIST Speaker Recognition Evaluation Plan.http://www, nist. gov/speech/tests/spk/. The Official Website for the NIST Speaker Recognition Evaluations. 被引量:1
  • 5Martin A, Przybocki M. The NIST 1999 Speaker Recognition Evaluation - an Overview. Digital Signal Processing, 2000, 10:10 -18 被引量:1
  • 6Reynolds D A. Speaker Identification and Verification Using Gaussian Mixture Speaker Models. Speech Communication, 1995, 17:91 - 108 被引量:1
  • 7Reynolds D A. Comparison of Background Normalization Methods for Text-Independent Speaker Verification. In: Proc of European Conference on Speech Communication and Technology (EU-RCSPEECH). Rhodos, Greece, 1997, Ⅱ: 963-966 被引量:1
  • 8Reynolds D A. Experimental Evaluation of Features for Robust Speaker Identification. IEEE Trans on Speech Audio Process,1994, 2:639-643 被引量:1
  • 9Deller J R, Proakisa J G, Hansenm J H L. Discrete-Time Processing of Speech Signals. New York: Macmillan Publishing Company,1993 被引量:1
  • 10Chang E, Shi Y, Zhou J, Huang C. Speech Lab in a Box: A Man- darin Speech Toolbox to Jumpstart Speech Related Research. In: Proc of European Conference on Speech Communication and Technology (EURCSPEECH). Aalborg, Denmark, 2001, 192 - 199 被引量:1

共引文献16

同被引文献13

引证文献2

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部