确定部分观测隐状态的k步记忆模型研究

Research on k-step memory model inference of partially observable hidden state

导出

摘要提出一种顺序联想记忆网络模型以解决隐状态的定位问题.该模型利用活性衰退、突触势能以及同步激活实现k步记忆,根据k步顺序历史确定隐状态.该模型是一种分布式计算模型,并行机制使得隐状态的定位不会因为存储知识的增多而效率下降,具有真正的在线计算能力.最后对记忆的迭代算法进行改进,实验结果表明改进后的模型具有更好的容错能力. A sequence associative memory network was proposed in this thesis to resolve the hidden state problem.This model utilized the cell activity decay,pre-synaptic potentials and cell-synfire mechanisms to realize the k-step memory,identify the hidden state according to k-step sequence history.This model was a distributed computing model.The efficiency of identifing the hidden state didn′t drop along with the increase of knowledge storage by use of its parallel mechanism.It possessed the real on-line computing capability.Finally,the improved memory iteration algorithm was presented. The experiment results show that the improved algorithm has better fault-tolerant ability.

作者王作为梁晓丹张汝波

机构地区天津工业大学计算机科学与软件学院大连民族学院机电信息工程学院

出处《华中科技大学学报（自然科学版）》 EI CAS CSCD 北大核心 2013年第S1期356-359,共4页 Journal of Huazhong University of Science and Technology(Natural Science Edition)

基金国家高技术研究发展计划资助项目(2009AA04Z215) 国家自然科学基金资助项目(60975071 61173032)

关键词部分观测隐状态确定隐状态活性衰减前突触势能联想记忆 partially observable hidden state identify the hidden state activity decay pre-synaptic potentials associative memory

分类号 TP183 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献8

1D. L. Wang,B. Yuwono.Incremental learning of complex temporal patterns[].IEEE Transactions on Neural Networks.1996 被引量：1
2Ray S R,Kargupta H.A temporal sequence processor based on the biological reaction-diffusion process[].Complex Systems.1996 被引量：1
3McCallum A K.Reinforcement Learning with Selective Perception and Hidden State[]..1996 被引量：1
4Timmer S,,Riedmiller M.Reinforcement learning with history lists[]..2009 被引量：1
5Oladell M,Huber M.Symbol generation and feature selection for reinforcement learning agents using affordances and U-Trees[].SystemsMan and Cybernetics (SMC).2012 被引量：1
6Bakker B.Reinforcement learning with long shortterm memory[].Advances in Neural Information Processing Systems.2002 被引量：1
7刘娟..基于时空信息与认知模型的移动机器人导航机制研究[D].中南大学,2003:
8王作为..具有认知能力的智能机器人行为学习方法研究[D].哈尔滨工程大学,2010:

1路红,费树岷,郑建勇,张涛.Multi-object tracking based on behaviour and partial observation[J].Journal of Southeast University(English Edition),2008,24(4):468-472.
2Tim Ellis.利用部分观测跟踪被遮挡的目标(英文)[J].自动化学报,2003,29(3):370-380. 被引量：2
3张东升.了解企业存储中各个术语[J].网络与信息,2010,24(9):26-27.
4杨自厚.神经网络技术及其在钢铁工业中的应用第1讲人工神经网络的基本概念[J].冶金自动化,1996,20(4):48-51. 被引量：8
5李杰,李驿华,齐怡.面向对象数据模型在故障诊断专家系统中应用[J].电子测量技术,2003,26(3):13-14.
6张立柱,韩忠东.采用页式存储的知识库建立方法的研究[J].山东矿业学院学报,1999,18(3):113-116.
7饶东宁,蒋志华,姜云飞.在部分观测环境下的不确定动作模型学习[J].软件学报,2014,25(1):51-63. 被引量：2
8刘栋梁,高国安.基于RDBMS的知识表示及其实现[J].信息与控制,2000,29(1):82-85. 被引量：5
9于文江.摄影人应该了解的图像存储知识[J].旅游纵览,2006,0(10):42-42.
10喇元,田立斌,陈婷,王珏.基于贝叶斯控制图的变压器故障诊断决策[J].数据采集与处理,2012,27(S2):379-382. 被引量：1

华中科技大学学报（自然科学版）

2013年第S1期

浏览历史

内容加载中请稍等...

确定部分观测隐状态的k步记忆模型研究

参考文献8

相关作者

相关机构

相关主题

浏览历史