Funding: Partially supported by the National Natural Science Foundation of China (61661025, 61661026) and the Hundred Youth Talents Training Program Foundation of Lanzhou Jiaotong University (152022).
Abstract: A network selection optimization algorithm based on the Markov decision process (MDP) is proposed so that mobile terminals can always connect to the best wireless network in a heterogeneous network environment. Considering the different types of service requirements, the MDP model and its reward function are constructed from the quality-of-service (QoS) attribute parameters of the mobile users, and the network attribute weights are calculated with the analytic hierarchy process (AHP). The network handoff decision condition is designed according to the different types of user services and the time-varying characteristics of the network, and the MDP model is solved with a combined genetic algorithm and simulated annealing (GA-SA) method, so that users can seamlessly switch to the network with the best long-term expected reward. Simulation results show that the proposed algorithm converges well and guarantees that users with different service types obtain satisfactory expected total rewards while requiring few network handoffs.
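As a rough illustration of the AHP weighting step mentioned above, the following sketch derives QoS attribute weights from a pairwise-comparison matrix via its principal eigenvector. The attribute names and comparison values are hypothetical and are not taken from the paper; the resulting weights would then enter the MDP reward function in some paper-specific way not reproduced here.

```python
import numpy as np

def ahp_weights(pairwise: np.ndarray) -> np.ndarray:
    """Derive attribute weights from an AHP pairwise-comparison matrix
    via its principal right eigenvector, normalized to sum to one."""
    eigvals, eigvecs = np.linalg.eig(pairwise)
    principal = np.argmax(eigvals.real)
    w = np.abs(eigvecs[:, principal].real)
    return w / w.sum()

# Hypothetical comparisons over four QoS attributes, e.g.
# (bandwidth, delay, jitter, packet loss) for a streaming-type service.
A = np.array([
    [1.0,   3.0,   5.0,   7.0],
    [1/3.0, 1.0,   3.0,   5.0],
    [1/5.0, 1/3.0, 1.0,   3.0],
    [1/7.0, 1/5.0, 1/3.0, 1.0],
])
print(ahp_weights(A))  # weights that would score candidate networks in the MDP reward
```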
Funding: This work is supported by the National Key R&D Program of China under Grant No. 2020AAA0103801. Quanlin Li is supported by the National Natural Science Foundation of China under Grant Nos. 71671158 and 71932002 and by the Beijing Social Science Foundation Research Base Project under Grant No. 19JDGLA004. Xiaole Wu is supported by the National Natural Science Foundation of China under Grant No. 72025102.
Abstract: In this paper, we provide a new theoretical framework of pyramid Markov processes to solve some open and fundamental problems of blockchain selfish mining in a rigorous mathematical setting. We first describe a more general model of blockchain selfish mining with both a two-block leading competitive criterion and a new economic incentive mechanism. We then establish a pyramid Markov process and show that it is irreducible and positive recurrent, and that its stationary probability vector is matrix-geometric with an explicitly representable rate matrix. We also use the stationary probability vector to study the influence of orphan blocks on the waste of computing resources. Next, we set up a pyramid Markov reward process to investigate the long-run average mining profits of the honest and dishonest mining pools, respectively. As a by-product, we build one-dimensional Markov reward processes and provide new interpretations of the Markov chain and the revenue analysis reported in the seminal work by Eyal and Sirer (2014). The pyramid Markov (reward) processes open up a new avenue in the study of blockchain selfish mining, and we hope that the methodology and results developed in this paper shed light on this problem and enable a series of promising further studies.
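The abstract states that the stationary probability vector is matrix-geometric with an explicitly representable rate matrix. The paper's pyramid process and its generator blocks are not reproduced here; purely as a generic illustration of how such a solution is computed, the sketch below iterates the classical fixed-point equation for the rate matrix R of a level-independent quasi-birth-and-death (QBD) process, where A0, A1, A2 are assumed generator blocks for upward, local, and downward transitions.

```python
import numpy as np

def qbd_rate_matrix(A0, A1, A2, tol=1e-12, max_iter=100_000):
    """Minimal nonnegative solution R of A0 + R A1 + R^2 A2 = 0 for a
    level-independent QBD, via the classical fixed-point iteration
    R <- -(A0 + R^2 A2) A1^{-1} (Neuts' matrix-geometric method)."""
    A1_inv = np.linalg.inv(A1)
    R = np.zeros_like(np.asarray(A0, dtype=float))
    for _ in range(max_iter):
        R_next = -(A0 + R @ R @ A2) @ A1_inv
        if np.max(np.abs(R_next - R)) < tol:
            return R_next
        R = R_next
    raise RuntimeError("rate-matrix iteration did not converge")

# Given R, the stationary sub-vectors of the repeating levels satisfy
# pi_k = pi_1 @ np.linalg.matrix_power(R, k - 1), the matrix-geometric form.
```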
Funding: Project (No. 90820306) supported by the National Natural Science Foundation of China.
Abstract: Interest in inverse reinforcement learning (IRL), the problem of recovering the reward function underlying a Markov decision process (MDP) given the dynamics of the system and the behavior of an expert, has recently increased. This paper deals with an incremental approach to online IRL. First, the convergence property of the incremental method for the IRL problem was investigated, and bounds on both the number of mistakes made during learning and the regret were established with a detailed proof. Then an online algorithm based on incremental error correction was derived for the IRL problem. The key idea is to add an increment to the current reward estimate each time an action mismatch occurs, which drives the estimate toward a target optimal value. The proposed method was tested in a driving simulation experiment and was able to efficiently recover an adequate reward function.
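The abstract describes the error-correcting update only at a high level. The sketch below is a hypothetical, perceptron-style rendering of that step, not the paper's exact rule: the feature map `features`, the action-value function `q_value`, and the step size `eta` are all placeholders. Whenever the action that is greedy under the current reward estimate disagrees with the expert's action, the estimate is nudged toward the expert's feature direction.

```python
import numpy as np

def incremental_irl_step(w, state, expert_action, actions, features, q_value, eta=0.1):
    """One error-correcting update of a linear reward estimate w.

    features(state, action) -> np.ndarray : reward feature vector (placeholder)
    q_value(w, state, action) -> float    : action value under reward estimate w (placeholder)
    """
    greedy = max(actions, key=lambda a: q_value(w, state, a))
    if greedy != expert_action:  # action mismatch: nudge w toward the expert's choice
        w = w + eta * (features(state, expert_action) - features(state, greedy))
    return w
```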
Abstract: In this work, a numerical estimate of the stability index is made for a controlled consumption-investment process under the discounted-reward optimization criterion. Using explicit formulas for the optimal stationary policies and for the value functions, the stability index is calculated explicitly, and its asymptotic behavior as the discount coefficient approaches 1 is investigated with statistical techniques applied to numerical experiments. The results obtained define the conditions under which an approximate optimal stationary policy can be used to control the original process.
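The paper relies on explicit formulas specific to its consumption-investment model, which are not reproduced in the abstract. Purely to illustrate what a stability-index estimate measures, the sketch below uses a synthetic finite MDP (all data randomly generated, with no connection to the paper's model) and reports the loss incurred when the true model is controlled with the policy that is optimal for a perturbed approximating model, for discount coefficients approaching 1.

```python
import numpy as np

def value_iteration(P, r, beta, tol=1e-10):
    """Discounted value iteration for a finite MDP; P[a] is the |S|x|S|
    transition matrix of action a and r[a] the corresponding reward vector."""
    n_a, n_s = r.shape
    V = np.zeros(n_s)
    while True:
        Q = np.array([r[a] + beta * P[a] @ V for a in range(n_a)])
        V_new = Q.max(axis=0)
        if np.max(np.abs(V_new - V)) < tol:
            return V_new, Q.argmax(axis=0)
        V = V_new

def policy_value(P, r, beta, policy):
    """Exact discounted value of a stationary deterministic policy."""
    n_s = len(policy)
    P_pi = np.array([P[policy[s], s] for s in range(n_s)])
    r_pi = np.array([r[policy[s], s] for s in range(n_s)])
    return np.linalg.solve(np.eye(n_s) - beta * P_pi, r_pi)

# Toy "true" model (P, r) and a perturbed approximating model P_hat.
rng = np.random.default_rng(0)
P = rng.dirichlet(np.ones(4), size=(2, 4))                     # 2 actions, 4 states
P_hat = 0.9 * P + 0.1 * rng.dirichlet(np.ones(4), size=(2, 4))
r = rng.uniform(size=(2, 4))

for beta in (0.90, 0.99, 0.999):
    V_star, _ = value_iteration(P, r, beta)          # optimum of the true model
    _, pol_hat = value_iteration(P_hat, r, beta)     # policy optimal for the approximation
    stability_index = np.max(V_star - policy_value(P, r, beta, pol_hat))
    print(f"beta = {beta}: stability index ~ {stability_index:.4f}")
```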
Abstract: This paper considers the variance optimization problem for the average reward in a continuous-time Markov decision process (MDP). It is assumed that the state space is countable and the action space is a Borel measurable space. The main purpose of this paper is to find the policy with the minimal variance within the space of deterministic stationary policies. Unlike in the traditional Markov decision process, the cost function under the variance criterion is affected by future actions. To this end, we convert the variance minimization problem into a standard MDP by introducing a concept called the pseudo-variance. Further, by giving a policy iteration algorithm for the pseudo-variance optimization problem, the optimal policy of the original variance optimization problem is derived, and a sufficient condition for the variance-optimal policy is given. Finally, an example is used to illustrate the conclusions of this paper.
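The abstract does not spell out the pseudo-variance construction or the policy iteration details, and the paper works in continuous time with a Borel action space. As a loose, discrete-time, finite-state analogue of the idea for illustration only, the sketch below fixes the current policy's mean reward, treats the squared deviation from that fixed reference as a standard average-cost criterion, performs a policy-improvement step, and repeats; every modeling choice here is an assumption, not the paper's construction.

```python
import numpy as np

def gain_and_bias(P_pi, r_pi):
    """Average reward (gain) and bias of a stationary policy in a unichain
    finite MDP: solve g + h = r_pi + P_pi h with the normalization h[0] = 0."""
    n = len(r_pi)
    A = np.zeros((n + 1, n + 1))
    A[:n, 0] = 1.0               # gain column
    A[:n, 1:] = np.eye(n) - P_pi
    A[n, 1] = 1.0                # normalization: h[0] = 0
    b = np.append(r_pi, 0.0)
    x = np.linalg.lstsq(A, b, rcond=None)[0]
    return x[0], x[1:]

def pseudo_variance_policy_iteration(P, r, max_iters=50):
    """Fix the current policy's mean reward eta, minimize the average of the
    pseudo-variance cost (r - eta)^2 via one policy-improvement step, repeat."""
    n_a, n_s = r.shape
    policy = np.zeros(n_s, dtype=int)
    for _ in range(max_iters):
        P_pi = np.array([P[policy[s], s] for s in range(n_s)])
        r_pi = np.array([r[policy[s], s] for s in range(n_s)])
        eta, _ = gain_and_bias(P_pi, r_pi)           # reference mean reward
        c = (r - eta) ** 2                           # pseudo-variance cost
        c_pi = np.array([c[policy[s], s] for s in range(n_s)])
        _, h = gain_and_bias(P_pi, -c_pi)            # bias for the (negated) cost
        Q = np.array([-c[a] + P[a] @ h for a in range(n_a)])
        new_policy = Q.argmax(axis=0)                # improvement step
        if np.array_equal(new_policy, policy):
            break
        policy = new_policy
    return policy
```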