期刊文献+

基于Q学习算法的X光主动视觉安检方法 被引量:4

X-ray security inspection method using active vision based on Q-learning algorithm
下载PDF
导出
摘要 针对主动视觉安检方法检测性能不高和检测速度慢的问题,基于Q学习(QL)算法提出了采用状态回溯的启发式Q学习(HASB-QL)算法进行最佳视角估计。该算法引入代价函数和启发函数,提高了学习效率,加快了Q学习收敛。首先,对通过安检扫描仪获取的X光图像进行单视角检测;然后,对姿势作出估计并通过在状态回溯过程中比较重复动作的选择策略获取最佳旋转角度,再次进行单视角检测,直到检测到危险品;此外,在检测过程中多于一个视角时,建立几何约束以消除误报。对GDXray数据集中的手枪和剃刀刀片的X光图像进行实验,实验结果表明,相比于以Q学习为基础的主动视觉算法,改进的主动视觉算法检测手枪所得精确率和召回率之间的加权平均值F1值提高了9. 60%,检测速度提高了12. 45%;检测剃刀刀片所得的F1值提高了2. 51%,速度提高了17. 39%。所提算法提高了危险品检测的性能和速度。 In order to solve the problems of poor detection performance and slow detection speed of the active vision security inspection method,a Heuristically Accelerated State Backtracking Q-Learning(HASB-QL)algorithm based on Q-Learning(QL)algorithm was proposed to estimate the next-best-view.The cost function and heuristic function were introduced by the proposed algorithm to improve the learning efficiency and speed up the convergence of QL.Firstly,the single view detection of X-ray images obtained by security scanner was performed.Secondly,the pose was estimated and the best rotation angle was obtained by comparing the selection strategy of repeated action in the state backtracking process,and then the single view detection was performed again until the threat object was detected.Moreover,the geometric constraint was established to eliminate the false alarms when the number of views was more than one in detection process.The X-ray images of handguns and razor blades in GDXray data set were used for the experiment.The experimental results show that,compared with active vision algorithm based on QL,the weighted average value of F1between the precision and recall of detecting the handguns by the improved active vision algorithm is increased by9.60%and the detection speed is increased by12.45%,while the F1of detecting razor blades is increased by2.51%and the detection speed is increased by17.39%.The proposed algorithm can improve the performance and speed of threat object detection.
作者 丁静文 陈树越 陆贵荣 DING Jingwen;CHEN Shuyue;LU Guirong(School of Information Science & Engineering, Changzhou University, Changzhou Jiangsu 213164, China)
出处 《计算机应用》 CSCD 北大核心 2018年第12期3414-3418,共5页 journal of Computer Applications
基金 国家自然科学基金资助项目(51176016) 常州市工程技术研究中心项目(CM20179060)~~
关键词 X光图像 安检 主动视觉 状态回溯 启发函数 Q学习 X-ray image security inspection active vision state backtracking heuristic function Q-Learning (QL)
  • 相关文献

参考文献5

二级参考文献27

  • 1魏英姿 ,赵明扬 .强化学习算法中启发式回报函数的设计及其收敛性分析[J].计算机科学,2005,32(3):190-193. 被引量:13
  • 2沈晶,顾国昌,刘海波.分层强化学习研究综述[J].模式识别与人工智能,2005,18(5):574-581. 被引量:7
  • 3Sanner S, Boutilier C. Approximate linear programming for first order mdps [C] //The 21st Conf on Uncertainty in Artificial Intelligence. Amsterdam, Netherland: North Holland Publishing Company, 2005 被引量:1
  • 4Dabney W, Govern A M. Utile distinctions for reinforcement learning [C]//The 20th Int Joint Artificial Intelligence. Singapore: World Scientific P Company, 2007 relational Conf on ublishing 被引量:1
  • 5Croonenborghs T, Ramon J, Blockeel H, et al. Online learning and exploiting relational models in reinforcement learning [C] //The 20th Int Joint Conf on Artificial Intelligence. Singapore: World Scientific Publishing Company, 2007 被引量:1
  • 6Barto A, Mahadevan S. Recent advances in hierarchical reinforcement learning [J]. Discrete Event Dynamic Systems: Theory and Applications, 2003, 13(4):41-77 被引量:1
  • 7Tadepalli P, Givan R, Driessens K. Relational reinforcement learning: An overview [C]//ICML-04 Workshop on Relational RL. Boston: PWS Publishing Company, 2004 被引量:1
  • 8Sanner S. Simultaneous learning of structure and value in relational reinforcement learning [C]//ICML'05 Workshop on Rich Representations for Reinforcement Learning. San Francisco: Morgan Kaufmann, 2005 被引量:1
  • 9Landwehr N, Kersting K, de raedt L. nFOIL: Integrating naive Bayes and FOIL [J]. Journal of Machine Learning Research, 2007, 8(5): 481-507 被引量:1
  • 10Driessens K, Ramon J, Gartner T. Graph kernels and Gaussican processes for relational reinforcement learning [M]. Amsterdam: Kluwer Academic Publishers, 2006 被引量:1

共引文献45

同被引文献39

引证文献4

二级引证文献12

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部