In this paper, we use the solutions of forward-backward stochastic differential equations to get the explicit form of the optimal control for linear quadratic stochastic optimal control problem and the open-loop Nash ...In this paper, we use the solutions of forward-backward stochastic differential equations to get the explicit form of the optimal control for linear quadratic stochastic optimal control problem and the open-loop Nash equilibrium point for nonzero sum differential games problem. We also discuss the solvability of the generalized Riccati equation system and give the linear feedback regulator for the optimal control problem using the solution of this kind of Riccati equation system.展开更多
This paper is concerned with nonzero-sum discrete-time stochastic games in Borel state and action spaces under the expected discounted payoff criterion.The payoff function can be unbounded.The transition probability i...This paper is concerned with nonzero-sum discrete-time stochastic games in Borel state and action spaces under the expected discounted payoff criterion.The payoff function can be unbounded.The transition probability is a convex combination of finite probability measures that are dominated by a probability measure on the state space and depend on the state variable.Under suitable conditions,the authors establish the existence of stationary almost Markov ε-equilibria and give an approximation method via some stochastic games with bounded payoffs.Finally,a production game is introduced to illustrate the applications of the main result,which generalizes the bounded payoff case.展开更多
The authors discuss one type of general forward-backward stochastic differential equations (FBSDEs) with Ito's stochastic delayed equations as the forward equations and anticipated backward stochastic differential...The authors discuss one type of general forward-backward stochastic differential equations (FBSDEs) with Ito's stochastic delayed equations as the forward equations and anticipated backward stochastic differential equations as the backward equations.The existence and uniqueness results of the general FBSDEs are obtained.In the framework of the general FBSDEs in this paper,the explicit form of the optimal control for linear-quadratic stochastic optimal control problem with delay and the Nash equilibrium point for nonzero sum differential games problem with delay are obtained.展开更多
To keep the secrecy performance from being badly influenced by untrusted relay(UR), a multi-UR network through amplify-and-forward(AF) cooperative scheme is put forward, which takes relay weight and harmful factor int...To keep the secrecy performance from being badly influenced by untrusted relay(UR), a multi-UR network through amplify-and-forward(AF) cooperative scheme is put forward, which takes relay weight and harmful factor into account. A nonzero-sum game is established to capture the interaction among URs and detection strategies. Secrecy capacity is investigated as game payoff to indicate the untrusted behaviors of the relays. The maximum probabilities of the behaviors of relay and the optimal system detection strategy can be obtained by using the proposed algorithm.展开更多
A necessary maximum principle is given for nonzero-sum stochastic Oltterential games with random jumps. The result is applied to solve the H2/H∞ control problem of stochastic systems with random jumps. A necessary an...A necessary maximum principle is given for nonzero-sum stochastic Oltterential games with random jumps. The result is applied to solve the H2/H∞ control problem of stochastic systems with random jumps. A necessary and sufficient condition for the existence of a unique solution to the H2/H∞ control problem is derived. The resulting solution is given by the solution of an uncontrolled forward backward stochastic differential equation with random jumps.展开更多
This paper attempts to study two-person nonzero-sum games for denumerable continuous-time Markov chains determined by transition rates,with an expected average criterion.The transition rates are allowed to be unbounde...This paper attempts to study two-person nonzero-sum games for denumerable continuous-time Markov chains determined by transition rates,with an expected average criterion.The transition rates are allowed to be unbounded,and the payoff functions may be unbounded from above and from below.We give suitable conditions under which the existence of a Nash equilibrium is ensured.More precisely,using the socalled "vanishing discount" approach,a Nash equilibrium for the average criterion is obtained as a limit point of a sequence of equilibrium strategies for the discounted criterion as the discount factors tend to zero.Our results are illustrated with a birth-and-death game.展开更多
This paper presents a novel optimal synchronization control method for multi-agent systems with input saturation.The multi-agent game theory is introduced to transform the optimal synchronization control problem into ...This paper presents a novel optimal synchronization control method for multi-agent systems with input saturation.The multi-agent game theory is introduced to transform the optimal synchronization control problem into a multi-agent nonzero-sum game.Then,the Nash equilibrium can be achieved by solving the coupled Hamilton–Jacobi–Bellman(HJB)equations with nonquadratic input energy terms.A novel off-policy reinforcement learning method is presented to obtain the Nash equilibrium solution without the system models,and the critic neural networks(NNs)and actor NNs are introduced to implement the presented method.Theoretical analysis is provided,which shows that the iterative control laws converge to the Nash equilibrium.Simulation results show the good performance of the presented method.展开更多
Based upon the theory of the nonlinear quadric two-person nonzero-sum differential game,the fact that the time-limited mixed H2/H∞ control problem can be turned into the problem of solving the state feedback Nash bal...Based upon the theory of the nonlinear quadric two-person nonzero-sum differential game,the fact that the time-limited mixed H2/H∞ control problem can be turned into the problem of solving the state feedback Nash balance point is mentioned. Upon this,a theorem about the solution of the state feedback control is given,the Lyapunov stabilization of the nonlinear system under this control is proved,too. At the same time,this solution is used to design the nonlinear H2/H∞ guidance law of the relative motion between the missile and the target in three-dimensional(3D) space. By solving two coupled Hamilton-Jacobi partial differential inequalities(HJPDI),a control with more robust stabilities and more robust performances is obtained. With different H∞ performance indexes,the correlative weighting factors of the control are analytically designed. At last,simulations under different robust performance indexes and under different initial conditions and under the cases of intercepting different maneuvering targets are carried out. All results indicate that the designed law is valid.展开更多
This paper studies the existence and uniqueness of solutions of fully coupled forward-backward stochastic differential equations with Brownian motion and random jumps.The result is applied to solve a linear-quadratic ...This paper studies the existence and uniqueness of solutions of fully coupled forward-backward stochastic differential equations with Brownian motion and random jumps.The result is applied to solve a linear-quadratic optimal control and a nonzero-sum differential game of backward stochastic differential equations.The optimal control and Nash equilibrium point are explicitly derived. Also the solvability of a kind Riccati equations is discussed.All these results develop those of Lim, Zhou(2001) and Yu,Ji(2008).展开更多
基金This work is supported by the National Natural Science Foundation (Grant No.10371067)the Youth Teacher Foundation of Fok Ying Tung Education Foundation, the Excellent Young Teachers Program and the Doctoral Program Foundation of MOE and Shandong Province, China.
文摘In this paper, we use the solutions of forward-backward stochastic differential equations to get the explicit form of the optimal control for linear quadratic stochastic optimal control problem and the open-loop Nash equilibrium point for nonzero sum differential games problem. We also discuss the solvability of the generalized Riccati equation system and give the linear feedback regulator for the optimal control problem using the solution of this kind of Riccati equation system.
基金supported by the National Key Research and Development Program of China under Grant No.2022YFA1004600the National Natural Science Foundation of China under Grant No.11931018+1 种基金the Guangdong Basic and Applied Basic Research Foundation under Grant No.2021A1515010057the Guangdong Province Key Laboratory of Computational Science at the Sun Yat-sen University under Grant No.2020B1212060032。
文摘This paper is concerned with nonzero-sum discrete-time stochastic games in Borel state and action spaces under the expected discounted payoff criterion.The payoff function can be unbounded.The transition probability is a convex combination of finite probability measures that are dominated by a probability measure on the state space and depend on the state variable.Under suitable conditions,the authors establish the existence of stationary almost Markov ε-equilibria and give an approximation method via some stochastic games with bounded payoffs.Finally,a production game is introduced to illustrate the applications of the main result,which generalizes the bounded payoff case.
基金Project supported by the 973 National Basic Research Program of China (No. 2007CB814904)the National Natural Science Foundations of China (No. 10921101)+2 种基金the Shandong Provincial Natural Science Foundation of China (No. 2008BS01024)the Science Fund for Distinguished Young Scholars of Shandong Province (No. JQ200801)the Shandong University Science Fund for Distinguished Young Scholars(No. 2009JQ004)
文摘The authors discuss one type of general forward-backward stochastic differential equations (FBSDEs) with Ito's stochastic delayed equations as the forward equations and anticipated backward stochastic differential equations as the backward equations.The existence and uniqueness results of the general FBSDEs are obtained.In the framework of the general FBSDEs in this paper,the explicit form of the optimal control for linear-quadratic stochastic optimal control problem with delay and the Nash equilibrium point for nonzero sum differential games problem with delay are obtained.
基金Supported by the National Natural Science Foundation of China(No.61101223)
文摘To keep the secrecy performance from being badly influenced by untrusted relay(UR), a multi-UR network through amplify-and-forward(AF) cooperative scheme is put forward, which takes relay weight and harmful factor into account. A nonzero-sum game is established to capture the interaction among URs and detection strategies. Secrecy capacity is investigated as game payoff to indicate the untrusted behaviors of the relays. The maximum probabilities of the behaviors of relay and the optimal system detection strategy can be obtained by using the proposed algorithm.
基金supported by the Doctoral foundation of University of Jinan(XBS1213)the National Natural Science Foundation of China(11101242)
文摘A necessary maximum principle is given for nonzero-sum stochastic Oltterential games with random jumps. The result is applied to solve the H2/H∞ control problem of stochastic systems with random jumps. A necessary and sufficient condition for the existence of a unique solution to the H2/H∞ control problem is derived. The resulting solution is given by the solution of an uncontrolled forward backward stochastic differential equation with random jumps.
基金supported by National Science Foundation for Distinguished Young Scholars of China (Grant No. 10925107)Guangdong Province Universities and Colleges Pearl River Scholar Funded Scheme (2011)
文摘This paper attempts to study two-person nonzero-sum games for denumerable continuous-time Markov chains determined by transition rates,with an expected average criterion.The transition rates are allowed to be unbounded,and the payoff functions may be unbounded from above and from below.We give suitable conditions under which the existence of a Nash equilibrium is ensured.More precisely,using the socalled "vanishing discount" approach,a Nash equilibrium for the average criterion is obtained as a limit point of a sequence of equilibrium strategies for the discounted criterion as the discount factors tend to zero.Our results are illustrated with a birth-and-death game.
基金Project supported by the National Key R&D Program of China(No.2018YFB1702300)the National Natural Science Foundation of China(Nos.61722312 and 61533017)。
文摘This paper presents a novel optimal synchronization control method for multi-agent systems with input saturation.The multi-agent game theory is introduced to transform the optimal synchronization control problem into a multi-agent nonzero-sum game.Then,the Nash equilibrium can be achieved by solving the coupled Hamilton–Jacobi–Bellman(HJB)equations with nonquadratic input energy terms.A novel off-policy reinforcement learning method is presented to obtain the Nash equilibrium solution without the system models,and the critic neural networks(NNs)and actor NNs are introduced to implement the presented method.Theoretical analysis is provided,which shows that the iterative control laws converge to the Nash equilibrium.Simulation results show the good performance of the presented method.
基金Sponsored by the National Natural Science Foundation of China (Grant No.90716028)
文摘Based upon the theory of the nonlinear quadric two-person nonzero-sum differential game,the fact that the time-limited mixed H2/H∞ control problem can be turned into the problem of solving the state feedback Nash balance point is mentioned. Upon this,a theorem about the solution of the state feedback control is given,the Lyapunov stabilization of the nonlinear system under this control is proved,too. At the same time,this solution is used to design the nonlinear H2/H∞ guidance law of the relative motion between the missile and the target in three-dimensional(3D) space. By solving two coupled Hamilton-Jacobi partial differential inequalities(HJPDI),a control with more robust stabilities and more robust performances is obtained. With different H∞ performance indexes,the correlative weighting factors of the control are analytically designed. At last,simulations under different robust performance indexes and under different initial conditions and under the cases of intercepting different maneuvering targets are carried out. All results indicate that the designed law is valid.
基金supported by National Natural Science Foundation of China(10671112)National Basic Research Program of China(973 Program)(2007CB814904)the Natural Science Foundation of Shandong Province(Z2006A01)
文摘This paper studies the existence and uniqueness of solutions of fully coupled forward-backward stochastic differential equations with Brownian motion and random jumps.The result is applied to solve a linear-quadratic optimal control and a nonzero-sum differential game of backward stochastic differential equations.The optimal control and Nash equilibrium point are explicitly derived. Also the solvability of a kind Riccati equations is discussed.All these results develop those of Lim, Zhou(2001) and Yu,Ji(2008).
基金Supported by the National Defense Basic Research Program(JCKY2019413D001)the Natural Science Foundation(6217023627,62003214,62173054)the Shanghai Natural Science Foundation(19ZR1436000)。