期刊文献+
共找到20篇文章
< 1 >
每页显示 20 50 100
FORWARD-BACKWARD STOCHASTIC DIFFERENTIAL EQUATIONS, LINEAR QUADRATIC STOCHASTIC OPTIMAL CONTROL AND NONZERO SUM DIFFERENTIAL GAMES 被引量:13
1
作者 WUZhen 《Journal of Systems Science & Complexity》 SCIE EI CSCD 2005年第2期179-192,共14页
In this paper, we use the solutions of forward-backward stochastic differential equations to get the explicit form of the optimal control for linear quadratic stochastic optimal control problem and the open-loop Nash ... In this paper, we use the solutions of forward-backward stochastic differential equations to get the explicit form of the optimal control for linear quadratic stochastic optimal control problem and the open-loop Nash equilibrium point for nonzero sum differential games problem. We also discuss the solvability of the generalized Riccati equation system and give the linear feedback regulator for the optimal control problem using the solution of this kind of Riccati equation system. 展开更多
关键词 stochastic differential equations stochastic optimal control riccatiequation nonzero sum stochastic differential game
原文传递
Stationary Almost Markov ε-Equilibria for Discounted Stochastic Games with Borel Spaces and Unbounded Payoffs
2
作者 WU Yiting ZHANG Junyu HUANG Song 《Journal of Systems Science & Complexity》 SCIE EI CSCD 2024年第4期1672-1684,共13页
This paper is concerned with nonzero-sum discrete-time stochastic games in Borel state and action spaces under the expected discounted payoff criterion.The payoff function can be unbounded.The transition probability i... This paper is concerned with nonzero-sum discrete-time stochastic games in Borel state and action spaces under the expected discounted payoff criterion.The payoff function can be unbounded.The transition probability is a convex combination of finite probability measures that are dominated by a probability measure on the state space and depend on the state variable.Under suitable conditions,the authors establish the existence of stationary almost Markov ε-equilibria and give an approximation method via some stochastic games with bounded payoffs.Finally,a production game is introduced to illustrate the applications of the main result,which generalizes the bounded payoff case. 展开更多
关键词 Almost Markovε-equilibrium Borel state space expected discounted payoff criterion nonzero-sum stochastic games unbounded payoffs
原文传递
A Type of General Forward-Backward Stochastic Differential Equations and Applications 被引量:4
3
作者 Li CHEN Zhen WU 《Chinese Annals of Mathematics,Series B》 SCIE CSCD 2011年第2期279-292,共14页
The authors discuss one type of general forward-backward stochastic differential equations (FBSDEs) with Ito's stochastic delayed equations as the forward equations and anticipated backward stochastic differential... The authors discuss one type of general forward-backward stochastic differential equations (FBSDEs) with Ito's stochastic delayed equations as the forward equations and anticipated backward stochastic differential equations as the backward equations.The existence and uniqueness results of the general FBSDEs are obtained.In the framework of the general FBSDEs in this paper,the explicit form of the optimal control for linear-quadratic stochastic optimal control problem with delay and the Nash equilibrium point for nonzero sum differential games problem with delay are obtained. 展开更多
关键词 Stochastic delayed differential equations Anticipated backward stochastic differential equations Forward-backward stochastic differential equations Linear-quadratic stochastic optimal control with delay nonzero sum stochastic differential game with delay
原文传递
Behavior Prediction of Untrusted Relays Based on Nonzero-Sum Game
4
作者 付晓梅 吴晓 汪清 《Transactions of Tianjin University》 EI CAS 2015年第4期371-376,共6页
To keep the secrecy performance from being badly influenced by untrusted relay(UR), a multi-UR network through amplify-and-forward(AF) cooperative scheme is put forward, which takes relay weight and harmful factor int... To keep the secrecy performance from being badly influenced by untrusted relay(UR), a multi-UR network through amplify-and-forward(AF) cooperative scheme is put forward, which takes relay weight and harmful factor into account. A nonzero-sum game is established to capture the interaction among URs and detection strategies. Secrecy capacity is investigated as game payoff to indicate the untrusted behaviors of the relays. The maximum probabilities of the behaviors of relay and the optimal system detection strategy can be obtained by using the proposed algorithm. 展开更多
关键词 physical layer security COOPERATIVE communication untrusted RELAY SECRECY capacity nonzero-sum GAME
下载PDF
A MAXIMUM PRINCIPLE APPROACH TO STOCHASTIC H_2/H_∞ CONTROL WITH RANDOM JUMPS
5
作者 张启侠 孙启良 《Acta Mathematica Scientia》 SCIE CSCD 2015年第2期348-358,共11页
A necessary maximum principle is given for nonzero-sum stochastic Oltterential games with random jumps. The result is applied to solve the H2/H∞ control problem of stochastic systems with random jumps. A necessary an... A necessary maximum principle is given for nonzero-sum stochastic Oltterential games with random jumps. The result is applied to solve the H2/H∞ control problem of stochastic systems with random jumps. A necessary and sufficient condition for the existence of a unique solution to the H2/H∞ control problem is derived. The resulting solution is given by the solution of an uncontrolled forward backward stochastic differential equation with random jumps. 展开更多
关键词 nonzero-sum stochastic differential games maximum principle Poisson process stochastic H2/H∞ control forward backward stochastic differential equations
下载PDF
Nonzero-sum games for continuous-time Markov chains with unbounded transition and average payoff rates 被引量:2
6
作者 ZHANG WenZhao GUO XianPing 《Science China Mathematics》 SCIE 2012年第11期2405-2416,共12页
This paper attempts to study two-person nonzero-sum games for denumerable continuous-time Markov chains determined by transition rates,with an expected average criterion.The transition rates are allowed to be unbounde... This paper attempts to study two-person nonzero-sum games for denumerable continuous-time Markov chains determined by transition rates,with an expected average criterion.The transition rates are allowed to be unbounded,and the payoff functions may be unbounded from above and from below.We give suitable conditions under which the existence of a Nash equilibrium is ensured.More precisely,using the socalled "vanishing discount" approach,a Nash equilibrium for the average criterion is obtained as a limit point of a sequence of equilibrium strategies for the discounted criterion as the discount factors tend to zero.Our results are illustrated with a birth-and-death game. 展开更多
关键词 nonzero-sum game expected average criterion Nash equilibrium unbounded transition rates unbounded payoff function
原文传递
倒向重随机系统的非零和微分对策问题 被引量:1
7
作者 许洁 蔺瑞强 《通化师范学院学报》 2022年第12期13-18,共6页
文章探讨了倒向重随机系统的非零和微分对策问题,其系统状态是由一类倒向重随机微分方程刻画.在适当的假设条件下,引入相应的代价泛函,利用经典的凸变分技术和对偶方法给出纳什均衡点存在的必要条件.
关键词 倒向重随机微分方程 微分对策 纳什均衡点 非零和
下载PDF
Optimal synchronization control formulti-agent systems with input saturation:a nonzero-sum game 被引量:1
8
作者 Hongyang LI Qinglai WEI 《Frontiers of Information Technology & Electronic Engineering》 SCIE EI CSCD 2022年第7期1010-1019,共10页
This paper presents a novel optimal synchronization control method for multi-agent systems with input saturation.The multi-agent game theory is introduced to transform the optimal synchronization control problem into ... This paper presents a novel optimal synchronization control method for multi-agent systems with input saturation.The multi-agent game theory is introduced to transform the optimal synchronization control problem into a multi-agent nonzero-sum game.Then,the Nash equilibrium can be achieved by solving the coupled Hamilton–Jacobi–Bellman(HJB)equations with nonquadratic input energy terms.A novel off-policy reinforcement learning method is presented to obtain the Nash equilibrium solution without the system models,and the critic neural networks(NNs)and actor NNs are introduced to implement the presented method.Theoretical analysis is provided,which shows that the iterative control laws converge to the Nash equilibrium.Simulation results show the good performance of the presented method. 展开更多
关键词 Optimal synchronization control Multi-agent systems nonzero-sum game Adaptive dynamic programming Input saturation Off-policy reinforcement learning Policy iteration
原文传递
Three-dimensional nonlinear H_2/H_∞ guidance law based upon approach of solving the state feedback Nash balance point
9
作者 桑保华 姜长生 《Journal of Harbin Institute of Technology(New Series)》 EI CAS 2010年第3期383-388,共6页
Based upon the theory of the nonlinear quadric two-person nonzero-sum differential game,the fact that the time-limited mixed H2/H∞ control problem can be turned into the problem of solving the state feedback Nash bal... Based upon the theory of the nonlinear quadric two-person nonzero-sum differential game,the fact that the time-limited mixed H2/H∞ control problem can be turned into the problem of solving the state feedback Nash balance point is mentioned. Upon this,a theorem about the solution of the state feedback control is given,the Lyapunov stabilization of the nonlinear system under this control is proved,too. At the same time,this solution is used to design the nonlinear H2/H∞ guidance law of the relative motion between the missile and the target in three-dimensional(3D) space. By solving two coupled Hamilton-Jacobi partial differential inequalities(HJPDI),a control with more robust stabilities and more robust performances is obtained. With different H∞ performance indexes,the correlative weighting factors of the control are analytically designed. At last,simulations under different robust performance indexes and under different initial conditions and under the cases of intercepting different maneuvering targets are carried out. All results indicate that the designed law is valid. 展开更多
关键词 nonlinear system mixed H2/H∞ control state feedback Nash balance point two-person nonzero-sum differential game three-dimensional guidance law
下载PDF
BACKWARD LINEAR-QUADRATIC STOCHASTIC OPTIMAL CONTROL AND NONZERO-SUM DIFFERENTIAL GAME PROBLEM WITH RANDOM JUMPS
10
作者 Detao ZHANG 《Journal of Systems Science & Complexity》 SCIE EI CSCD 2011年第4期647-662,共16页
This paper studies the existence and uniqueness of solutions of fully coupled forward-backward stochastic differential equations with Brownian motion and random jumps.The result is applied to solve a linear-quadratic ... This paper studies the existence and uniqueness of solutions of fully coupled forward-backward stochastic differential equations with Brownian motion and random jumps.The result is applied to solve a linear-quadratic optimal control and a nonzero-sum differential game of backward stochastic differential equations.The optimal control and Nash equilibrium point are explicitly derived. Also the solvability of a kind Riccati equations is discussed.All these results develop those of Lim, Zhou(2001) and Yu,Ji(2008). 展开更多
关键词 Backward stochastic differential equations nonzero-sum differential game optimal con-trol poisson processes Riccati equation.
原文传递
N人线性-非二次微分对策
11
作者 汪非 潘立平 《复旦学报(自然科学版)》 CAS CSCD 北大核心 2014年第5期567-583,590,共18页
研究受多个局中人(人数可任意)影响的线性时变常微分系统和非二次目标泛函组构成的非零和微分对策问题.给出了两个拟Riccati偏微分方程组——(80)和(23),以及相应于方程组(23)的非线性积分方程组族(43).运用方程组(80)的regular解推导... 研究受多个局中人(人数可任意)影响的线性时变常微分系统和非二次目标泛函组构成的非零和微分对策问题.给出了两个拟Riccati偏微分方程组——(80)和(23),以及相应于方程组(23)的非线性积分方程组族(43).运用方程组(80)的regular解推导出了闭环Nash均衡策略;利用Pontryagin最大值原理和方程组(23)的normal解得出了开环Nash均衡策略的闭环表示;还揭示了(43)之解族与(23)之解两者的双向联系. 展开更多
关键词 多人微分对策 非零和 线性-非二次 闭环Nash均衡策略 开环Nash均衡策略的闭环表示
原文传递
非零和微分博弈系统的事件触发最优跟踪控制
12
作者 石义博 王朝立 《控制理论与应用》 EI CAS CSCD 北大核心 2023年第2期220-230,共11页
近年来,对于具有未知动态的非零和微分博弈系统的跟踪问题,已经得到了讨论,然而这些方法是时间触发的,在传输带宽和计算资源有限的环境下并不适用.针对具有未知动态的连续时间非线性非零和微分博弈系统,本文提出了一种基于积分强化学习... 近年来,对于具有未知动态的非零和微分博弈系统的跟踪问题,已经得到了讨论,然而这些方法是时间触发的,在传输带宽和计算资源有限的环境下并不适用.针对具有未知动态的连续时间非线性非零和微分博弈系统,本文提出了一种基于积分强化学习的事件触发自适应动态规划方法.该策略受梯度下降法和经验重放技术的启发,利用历史和当前数据更新神经网络权值.该方法提高了神经网络权值的收敛速度,消除了一般文献设计中常用的初始容许控制假设.同时,该算法提出了一种易于在线检查的持续激励条件(通常称为PE),避免了传统的不容易检查的持续激励条件.基于李亚普诺夫理论,证明了跟踪误差和评价神经网络估计误差的一致最终有界性.最后,通过一个数值仿真实例验证了该方法的可行性. 展开更多
关键词 非零和博弈 积分强化学习 最优跟踪控制 神经网络 事件触发
下载PDF
部分可观测带跳倒向随机系统的非零和微分博弈及其应用
13
作者 陈晓兰 王凯凯 朱庆峰 《工程数学学报》 CSCD 北大核心 2023年第5期738-750,共13页
微分博弈是研究两个或多个局中人的控制作用同时施加于一个由微分方程描述的动态系统时实现各自最优目标的博弈过程的理论,因其有趣的数学性质和经济领域的应用价值得到了广泛的关注。研究了一类部分可观测带跳倒向随机系统的非零和微... 微分博弈是研究两个或多个局中人的控制作用同时施加于一个由微分方程描述的动态系统时实现各自最优目标的博弈过程的理论,因其有趣的数学性质和经济领域的应用价值得到了广泛的关注。研究了一类部分可观测带跳倒向随机系统的非零和微分博弈问题,其中博弈系统涉及跳过程,且每个参与者拥有不同的观测方程。对于这种部分可观测的随机微分博弈问题,在控制域为凸的条件下,采用凸变分和对偶技术,建立了博弈纳什均衡点的最大值原理;在适当的凹凸性假设下,证明了必要性最优条件也是充分性最优条件,得到了验证定理。应用上述最大值原理,研究了部分可观测带跳倒向随机系统的线性二次(Linear Quadratic,LQ)博弈问题,得到了LQ博弈问题的唯一最优控制,其中状态方程和伴随方程构成了一类带跳的正倒向随机微分方程。由于LQ模型通常被用于描述许多金融和经济现象,期望上述的部分可观测带跳倒向随机系统的LQ博弈结果能在这些领域得到广泛应用。 展开更多
关键词 倒向随机微分方程 泊松过程 非零和随机微分博弈 最大值原理 纳什均衡点
下载PDF
部分可观测信息下的线性二次非零和随机微分对策 被引量:3
14
作者 王光臣 《山东大学学报(理学版)》 CAS CSCD 北大核心 2007年第6期12-15,共4页
结合正倒向随机微分方程理论和滤波技术,给出了一类部分可观测信息下线性二次非零和随机微分对策问题的纳什均衡点.
关键词 正倒向随机微分方程 非零和微分对策 纳什均衡点 滤波
下载PDF
模型未知非零和博弈问题的策略迭代算法 被引量:3
15
作者 杨明 罗艳红 王义贺 《东北大学学报(自然科学版)》 EI CAS CSCD 北大核心 2015年第3期318-321,326,共5页
提出了一种在线积分策略迭代算法,用来求解内部非线性动力模型未知的双人非零和博弈问题.通过在控制策略和干扰策略中引入探测信号,从而避开了系统的模型信息,得到了一个求解非零和博弈的无模型的近似动态规划算法.该算法同步更新值函... 提出了一种在线积分策略迭代算法,用来求解内部非线性动力模型未知的双人非零和博弈问题.通过在控制策略和干扰策略中引入探测信号,从而避开了系统的模型信息,得到了一个求解非零和博弈的无模型的近似动态规划算法.该算法同步更新值函数、控制策略、扰动策略,并且最终得到收敛的策略权值.在算法实现过程中,使用4个神经网络分别近似两个值函数、控制策略和扰动策略,使用最小二乘法估计神经网络的未知参数.最后仿真结果验证了算法的有效性. 展开更多
关键词 自适应动态规划 非零和博弈 策略迭代 神经网络 最优控制
下载PDF
基于均衡理念的流域污染物排放许可交易 被引量:2
16
作者 付意成 阮本清 臧文斌 《重庆大学学报(自然科学版)》 EI CAS CSCD 北大核心 2012年第9期114-120,共7页
污染物排放许可交易是一种实现流域污染物治理与水质改善均衡发展的有效经济手段。在概述国内外流域污染物排放交易研究特点的基础上,给出以治污成本最小化、低水位水质风险最小化为目标函数的污染物排放交易研究框架。在综合遗传算法... 污染物排放许可交易是一种实现流域污染物治理与水质改善均衡发展的有效经济手段。在概述国内外流域污染物排放交易研究特点的基础上,给出以治污成本最小化、低水位水质风险最小化为目标函数的污染物排放交易研究框架。在综合遗传算法改进序列(NSGA-Ⅱ)、Young交易理论(YBT)、污染物初始排放许可分配(IDPA)模型求解适用范围的基础上,构建涵盖流域治污层面相关要素的污染物排放许可交易框架。利用非零和博弈模型,以前述理论框架为基础,构建以流域均衡发展、治污成本最小化为目的的污染物排放交易模型。以典型流域为例,给出各方能够接受的最优均衡化结果,验证了模型的适用性。结合应用中出现的问题,给出模型完善建议和使用前景预测。 展开更多
关键词 均衡 流域污染物排放许可交易 多目标优化模型 非零和博弈
下载PDF
Ornstein-Uhlenbeck模型下保险与再保险的均衡保险投资分析
17
作者 江五元 杨招军 《应用数学学报》 CSCD 北大核心 2022年第6期905-920,共16页
本文构建保险公司和再保险公司的比例再保险与投资组合微分博奔模型,研究两公司基于加权终期财富效用最大化的均衡决策问题.假设保险公司的资本盈余过程为复合Poisson风险跳过程,为降低赔付风险,保险公司可以向再保险公司购买比例再保... 本文构建保险公司和再保险公司的比例再保险与投资组合微分博奔模型,研究两公司基于加权终期财富效用最大化的均衡决策问题.假设保险公司的资本盈余过程为复合Poisson风险跳过程,为降低赔付风险,保险公司可以向再保险公司购买比例再保险。两个公司都可以投资于风险资产满足Ornstein-Uhlenbeck随机模型的金融市场,优化资本管理.在保险公司和再保险公司的绝对风险厌恶指数不随财富数量而变化的假设下,利用博弈论和动态规划原理,得到了两公司的Nash均衡比例再保险和投资组合策略的解析表达式.给出了均衡保险与投资存在的必要条件,对均衡条件下的再保险供需关系进行了分析。 展开更多
关键词 Ornstein-Uhlenbeck模型 复合Poisson风险过程 非零和博弈 再保险与投资
原文传递
有限时间随机奇异系统的非零和博弈 被引量:1
18
作者 周海英 张成科 朱怀念 《广东工业大学学报》 CAS 2014年第2期32-35,共4页
讨论了噪声依赖于状态和控制的It型随机奇异系统的有限时间非零和博弈问题.在两人博弈的特殊情形-单人博弈,即随机奇异系统最优控制的基础上,把单人博弈的相关结果推广到两人非零和博弈,得到了有限时间随机奇异系统非零和博弈问题均... 讨论了噪声依赖于状态和控制的It型随机奇异系统的有限时间非零和博弈问题.在两人博弈的特殊情形-单人博弈,即随机奇异系统最优控制的基础上,把单人博弈的相关结果推广到两人非零和博弈,得到了有限时间随机奇异系统非零和博弈问题均衡解存在的充分条件等价于其相应耦合Riccati微分方程存在解. 展开更多
关键词 随机奇异系统 非零和博弈 耦合Riccati微分方程
下载PDF
一种随机微分对策的Nash平衡 被引量:1
19
作者 张卓奎 陈慧婵 《西安电子科技大学学报》 EI CAS CSCD 北大核心 2000年第5期635-637,共3页
利用动态规划原理和值函数的概念 ,在It^o微分的意义下讨论了IHRS线性二次型随机控制问题的最优控制率 ,研究了具有参数的动态系统和具有参数的价值函数的IHRS线性二次型两人非零和随机微分对策 。
关键词 随机微分对策 随机控制 NASH平衡 动态规划
下载PDF
带有无界赔付函数的非零和随机对策折扣模型
20
作者 杨洁 郭先平 《中山大学学报(自然科学版)》 CAS CSCD 北大核心 2008年第5期23-27,36,共6页
讨论了赔付函数可能既无上界又无下界的离散时间可数状态非零和随机对策的折扣模型。在零和随机对策中常用的"漂移"和"连续-紧"性条件下,用Fan's不动点定理证明了Nash平衡点的存在性。
关键词 非零和随机对策 期望折扣赔付准则 NASH平衡点 可数状态空间
下载PDF
上一页 1 下一页 到第
使用帮助 返回顶部