24 articles found
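Non-cooperative game equilibrium analysis of iron and steel producers in the electricity-carbon market based on mean field games Cited by: 3
1
Authors: 孙啸天, 杨争林, 任涵钰, 谢海鹏, 张润凡, 郑亚先, 别朝红 《电网技术》 EI CSCD PKU Core 2023, No. 8, pp. 3058-3068, 11 pages
As electricity market reform and the construction of the national carbon market advance, energy-intensive, high-emission enterprises on the load side must participate in market-based operation. The strategic behavior of the many such enterprises in the electricity-carbon market significantly affects their own operation, electricity prices, and carbon allowance prices. Taking iron and steel producers as an example, this paper analyzes their impact on the electricity-carbon market. First, carbon emission and electricity consumption models are built from the production process of iron and steel producers. Second, based on trading models of the electricity price and the carbon allowance market, an operation and trading decision model for iron and steel producers participating in the electricity-carbon market is built, together with a market game equilibrium model. In addition, to address the practical difficulty of computing the equilibrium when the number of strategic agents is large, an equilibrium-solving method based on mean field game theory is proposed, and the existence and uniqueness of the mean field equilibrium, as well as its equivalence to the Nash equilibrium of the original game, are proved. An accelerated algorithm for computing the equilibrium is designed using a Mann-type iteration scheme. Finally, case simulations demonstrate the effectiveness of the proposed model and the computational efficiency of the algorithm.
Keywords: iron and steel producers; electricity market; carbon market; equilibrium analysis; mean field game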
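The abstract above mentions a Mann-type iteration for computing the mean field equilibrium but gives no formulas. The following is only a minimal sketch of the general technique, not the authors' algorithm: it assumes the equilibrium can be written as a scalar fixed point m = T(m), where T maps an assumed mean field m to the average best response. The function names, the toy map `toy_best_response`, and the step size `alpha` are all hypothetical.

```python
from typing import Callable


def mann_iteration(
    best_response: Callable[[float], float],
    m0: float,
    alpha: float = 0.3,
    tol: float = 1e-10,
    max_iter: int = 1000,
) -> float:
    """Mann iteration m_{k+1} = (1 - alpha) * m_k + alpha * T(m_k).

    `best_response` plays the role of T; `alpha` is a constant
    averaging weight in (0, 1]. Both are illustrative assumptions.
    """
    m = m0
    for _ in range(max_iter):
        m_next = (1.0 - alpha) * m + alpha * best_response(m)
        if abs(m_next - m) < tol:
            return m_next
        m = m_next
    raise RuntimeError("Mann iteration did not converge")


def toy_best_response(m: float) -> float:
    # Hypothetical aggregate best response: agents cut back when the
    # mean field m is high. The slope -2 makes the plain iteration
    # m <- T(m) diverge, which is why averaging is needed here.
    return 10.0 - 2.0 * m


m_star = mann_iteration(toy_best_response, m0=0.0)
print(m_star)  # approximate fixed point of the toy map
```

With `alpha = 0.3` the averaged update for this toy map becomes m ← 0.1·m + 3, a contraction whose fixed point is 10/3. Setting `alpha = 1` recovers the undamped iteration, which diverges for the same map.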
Mean field game of optimal relative investment with jump risk
2
Authors: Lijun Bo, Shihua Wang, Xiang Yu 《Science China Mathematics》 SCIE CSCD 2024, No. 5, pp. 1159-1188, 30 pages
In this paper, we study the n-player game and the mean field game under the constant relative risk aversion relative performance on terminal wealth, in which the interaction occurs by peer competition. In the model with n agents, the price dynamics of underlying risky assets depend on a common noise and contagious jump risk modeled by a multi-dimensional nonlinear Hawkes process. With a continuum of agents, we formulate the mean field game problem and characterize a deterministic mean field equilibrium in an analytical form under some conditions, allowing us to investigate some impacts of model parameters in the limiting model and discuss some financial implications. Moreover, based on the mean field equilibrium, we construct an approximate Nash equilibrium for the n-player game when n is sufficiently large. The explicit order of the approximation error is also derived.
Keywords: relative performance; contagious jump risk; mean field game with jumps; mean field equilibrium; approximate Nash equilibrium
Mean field and n-player games in Ito-diffusion markets under forward performance criteria
3
Author: Thaleia Zariphopoulou 《Probability, Uncertainty and Quantitative Risk》 2024, No. 2, pp. 123-148, 26 pages
We study n-player games of portfolio choice in general common Ito-diffusion markets under relative performance criteria and time monotone forward utilities. We also consider their continuum limit, which gives rise to a forward mean field game with unbounded controls in both the drift and volatility terms. Furthermore, we allow for general (time monotone) preferences, thus departing from the homothetic case, the only case so far analyzed. We produce explicit solutions for the optimal policies, the optimal wealth processes and the game values, and also provide representative examples for both the finite and the mean field game.
Keywords: mean field game; n-player game; Ito-diffusion; Forward performance criteria
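Multi-player non-zero-sum game and mean field game of optimal investment and risk control strategies Cited by: 2
4
Authors: 莫仕茵, 朱怀念 《广东工业大学学报》 CAS 2023, No. 5, pp. 123-132, 10 pages
Financial markets contain a large number of institutional investors, whose pursuit of high returns and high wealth makes market competition increasingly fierce. In such a competitive environment, institutional investors seek not only to maximize their own wealth but also care about the wealth gap relative to their competitors. This paper studies investment and risk control under strategic interaction among multiple institutional investors. Each investor is assumed to be able to invest wealth in the financial market to grow it, while transferring part of the risk faced to other financial institutions, for example by purchasing insurance. Market competition is captured by relative performance, defined as the difference between an investor's own wealth and the market average wealth, and each investor aims to maximize the expected utility of relative performance at the terminal time. A multi-player investment and risk control game model is built in a non-zero-sum game framework. Taking the CARA utility function as an example, stochastic differential game theory and mean field game theory are used to derive the optimal investment and risk control strategies at Nash equilibrium, and a sensitivity analysis of the parameters is carried out. The study finds that competition leads to a rise in risky investment and a weakening of risk control, which increases systemic risk in the financial market; that the risk preferences of an institutional investor and of its competitors, as well as the degree of market competition, all affect the equilibrium investment and risk control strategies; and that surplus volatility moves the risk control strategy in the same direction, but this effect is pronounced only when volatility is mild, and once volatility exceeds a certain level it has little effect on the risk control strategy. The study offers useful guidance for institutional investors choosing investment and risk control strategies.
Keywords: investment and risk control; non-zero-sum game; mean field game; Nash equilibrium; dynamic programming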
ε-Nash mean-field games for stochastic linear-quadratic systems with delay and applications
5
Authors: Heping Ma, Yu Shi, Ruijing Li, Weifeng Wang 《Probability, Uncertainty and Quantitative Risk》 2024, No. 3, pp. 389-404, 16 pages
In this paper, we focus on mean-field linear-quadratic games for stochastic large-population systems with time delays. The ε-Nash equilibrium for decentralized strategies in linear-quadratic games is derived via the consistency condition. By means of variational analysis, the system of consistency conditions can be expressed by forward-backward stochastic differential equations. Numerical examples illustrate the sensitivity of solutions of advanced backward stochastic differential equations to time delays, the effect of the population's collective behaviors, and the consistency of mean-field estimates.
Keywords: mean-field game; Linear-quadratic problem; Time delay; Large-population; ε-Nash equilibrium
Incentive Feedback Stackelberg Strategy in Mean-Field Type Stochastic Difference Games
6
Authors: GAO Wenhui, LIN Yaning, ZHANG Weihai 《Journal of Systems Science & Complexity》 SCIE EI CSCD 2024, No. 4, pp. 1425-1445, 21 pages
This paper designs an incentive Stackelberg strategy for discrete-time stochastic systems with mean-field terms. Sufficient conditions for the existence of such a design are suggested. Moreover, the incentive strategy is obtained in a feedback form including the deviation of the state and its mathematical expectation. Also, the stability analysis is involved. It is found that the stability can be guaranteed by the follower. In addition, the specific algorithm is proposed and its effectiveness is checked by two examples.
Keywords: Discrete-time mean-field stochastic systems; incentive strategy; Stackelberg game; team-optimal solution
A Mean-Field Game for a Forward-Backward Stochastic System With Partial Observation and Common Noise
7
Authors: Pengyan Huang, Guangchen Wang, Shujun Wang, Hua Xiao 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2024, No. 3, pp. 746-759, 14 pages
This paper considers a linear-quadratic (LQ) mean-field game governed by a forward-backward stochastic system with partial observation and common noise, where a coupling structure enters state equations, cost functionals and observation equations. Firstly, to reduce the complexity of solving the mean-field game, a limiting control problem is introduced. By virtue of the decomposition approach, an admissible control set is proposed. Applying a filter technique and dimensional-expansion technique, a decentralized control strategy and a consistency condition system are derived, and the related solvability is also addressed. Secondly, we discuss an approximate Nash equilibrium property of the decentralized control strategy. Finally, we work out a financial problem with some numerical simulations.
Keywords: Decentralized control strategy; ϵ-Nash equilibrium; forward-backward stochastic system; mean-field game; partial observation
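Group pursuit-evasion differential games
8
Authors: 高红伟, 孟斌斌, 刘剑, 戴照鹏 《运筹学学报(中英文)》 CSCD PKU Core 2024, No. 3, pp. 46-62, 17 pages
Following the thread of differential games and the classical pursuit-evasion problem, this paper reviews the historical development of group pursuit-evasion differential games. For large-scale group pursuit-evasion problems, it explains the application prospects of reinforcement learning techniques from the perspective of mean field games. It puts forward the idea of exploring solutions to inverse pursuit-evasion differential games, which could apply to similar scenarios such as unmanned underwater vehicles, ground robots, and aerial drone swarms. Unlike other survey articles, the authors pay particular attention to the representative academic schools of Russia and the Soviet Union in the history of this field.
Keywords: pursuit-evasion differential game; swarm intelligence game; mean field game; inverse game; reinforcement learning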
Optimal trajectory and downlink power control for multi-type UAV aerial base stations Cited by: 6
9
Authors: Lixin LI, Yan SUN, Qianqian CHENG, Dawei WANG, Wensheng LIN, Wei CHEN 《Chinese Journal of Aeronautics》 SCIE EI CAS CSCD 2021, No. 9, pp. 11-23, 13 pages
Unmanned Aerial Vehicles (UAVs) enabled Aerial Base Stations (UABSs) have been studied widely in future communications. However, there are a series of challenges such as interference management, trajectory design and resource allocation in the scenarios of multi-UAV networks. Besides, different performances among UABSs increase complexity and bring many challenges. In this paper, the joint downlink transmission power control and trajectory design problem in multi-type UABSs communication network is investigated. In order to satisfy the signal to interference plus noise power ratio of users, each UABS needs to adjust its position and transmission power. Based on the interactions among multiple communication links, a non-cooperative Mean-Field-Type Game (MFTG) is proposed to model the joint optimization problem. Then, a Nash equilibrium solution is obtained in two steps: first, the users in the given area are clustered to get the initial deployment of the UABSs; second, the Mean-Field Q (MFQ)-learning algorithm is proposed to solve the discrete MFTG problem. Finally, the effectiveness of the approach is verified through the simulations, which simplifies the solution process and effectively reduces the energy consumption of each UABS.
Keywords: Mean-Field-Type Game (MFTG); Power control; Q-learning; Trajectory; Unmanned Aerial Vehicle (UAV)
Data Security Defense and Algorithm for Edge Computing Based on Mean Field Game Cited by: 5
10
Authors: Chengshan Qian, Xue Li, Ning Sun, Yuqing Tian 《Journal of Cyber Security》 2020, No. 2, pp. 97-106, 10 pages
With the development of the Internet of Things, the number of edge devices is increasing. Cyber security issues in edge computing have also emerged and caused great concern. We propose a defense strategy based on a mean field game to solve the security issues of edge user data during edge computing. Firstly, an individual cost function is formulated to build an edge user data security defense model. Secondly, we study the ε-Nash equilibrium of the individual cost function with finite players and prove the existence of the optimal defense strategy. Finally, by analyzing the stability of edge user data loss, we show that the proposed defense strategy is effective.
Keywords: Edge computing; mean field game; cyber security
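A downlink transmit power control algorithm for dense UAV networks Cited by: 5
11
Authors: 王庆, 黄勇, 常晶 《电子测量技术》 PKU Core 2021, No. 13, pp. 59-67, 9 pages
To solve the downlink power control problem in dense unmanned aerial vehicle (UAV) networks, reduce mutual interference among UAVs, and improve system energy efficiency, a downlink transmit power control algorithm for dense UAV networks is proposed. First, the power control problem of the UAV network is modeled as a mean field game with a high-dimensional system state, which reduces mutual interference among UAVs. Second, the mean field game model is converted into a Markov decision process to obtain the equilibrium solution under dense UAV deployment. In addition, a mean field game algorithm based on deep reinforcement learning is proposed, which uses a deep neural network to obtain the optimal power control policy of the system and thereby maximize system energy efficiency. Finally, the proposed method is compared with three other algorithms through simulation. The experimental results show that the proposed method enables UAVs to interact effectively with the environment, effectively reduces mutual interference among UAVs, and improves network communication performance; compared with the other three methods, it also converges faster and achieves higher energy efficiency, showing good convergence and reliability.
Keywords: UAV; power control; energy efficiency; mean field game theory; deep reinforcement learning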
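A survey of multi-agent reinforcement learning methods from a population perspective Cited by: 1
12
Authors: 项凤涛, 罗俊仁, 谷学强, 苏炯铭, 张万鹏 《智能科学与技术学报》 CSCD 2023, No. 3, pp. 313-329, 17 pages
Multi-agent systems are a frontier research concept in distributed artificial intelligence. Traditional multi-agent reinforcement learning methods focus mainly on topics such as the emergence of group behavior, multi-agent cooperation and coordination, communication among agents, and opponent modeling and prediction, but they still face challenges such as partial observability of the environment, non-stationary opponent strategies, high-dimensional decision spaces, and hard-to-interpret credit assignment. How to design multi-agent reinforcement learning methods that handle large numbers of agents and adapt to many different application scenarios is a frontier topic in the field. This paper first briefly reviews research progress in multi-agent reinforcement learning. It then gives a comprehensive overview of multi-agent learning methods of many kinds and paradigms from the two perspectives of scalability and population adaptation, systematically organizing four classes of scalable learning methods (set permutation invariance, attention mechanisms, graph and network theory, and mean field theory) and four classes of population-adaptive reinforcement learning methods (transfer learning, curriculum learning, meta-learning, and meta-games), and presents typical application scenarios. Finally, it outlines frontier research directions in five areas: benchmark platform development, bi-level optimization architectures, adversarial policy learning, human-machine collaborative value alignment, and adaptive game decision loops. The study may serve as a reference for research on frontier key problems in multi-agent reinforcement learning in multimodal environments.
Keywords: distributed intelligence; mean field theory; graph neural network; meta-learning; meta-game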
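Traffic flow analysis based on mean field game theory
13
Authors: 程榕, 付博洋, 魏雅薇 《南开大学学报(自然科学版)》 CAS CSCD PKU Core 2023, No. 5, pp. 77-84, 8 pages
Mean field game theory is applied to combine the gas-like macroscopic models and particle-like microscopic models of traditional traffic dynamics, and traffic flow is studied by defining the concept of road capacity flux. Taking into account microscopic dynamic constraints such as acceleration and braking, several traffic situations are abstracted into mathematical models, and the forward-backward partial differential equation systems corresponding to the associated mean field games are given. These systems are simulated numerically by constructing an essentially non-oscillatory (ENO) scheme, and the influence of the capacity flux on traffic flow is analyzed.
Keywords: traffic flow model; mean field game; forward-backward partial differential equation system; essentially non-oscillatory scheme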
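An Ad hoc network routing protocol based on mean field equilibrium Cited by: 3
14
Authors: 张旭, 钱志鸿, 刘影, 王雪 《哈尔滨工程大学学报》 EI CAS CSCD PKU Core 2014, No. 4, pp. 504-509, 6 pages
To simplify the complex computation that can arise when using the Markov perfect equilibrium approach, this paper proposes, based on game theory, a mean field equilibrium routing protocol for wireless ad hoc networks (mean field equilibrium AODV, MFEA). In this approach each node analyzes its own optimal strategy using the information of all other nodes, without needing to know the information of each individual player, and its performance approaches that of the Markov equilibrium more closely when the number of players is sufficiently large. Simulation experiments show that the proposed MFEA routing protocol outperforms the AODV (Ad hoc on-demand distance vector routing) protocol in packet delivery ratio, delay, and normalized overhead, and still performs well in node-dense wireless ad hoc networks.
Keywords: mean field equilibrium; wireless ad hoc network; game theory; convergence analysis; routing protocol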
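Edge caching and deletion allocation in ultra-dense networks based on mean field games Cited by: 2
15
Authors: 王孟哲, 滕颖蕾, 宋梅, 韩丹涛, 张勇 《北京邮电大学学报》 EI CAS CSCD PKU Core 2020, No. 2, pp. 29-39, 11 pages
In ultra-dense networks, the huge number of devices makes cache allocation algorithms extremely complex, and frequently caching and deleting the same content makes the system unstable. To address this, a distributed cache allocation algorithm based on mean field games (MFG) and a distributed deletion allocation algorithm based on the Lyapunov drift-plus-penalty (DPP) method are proposed. The MFG method makes the complexity of the cache allocation algorithm independent of the number of base stations. The DPP method decouples the time-correlated deletion allocation problem into per-time-slot problems, and solving them yields a deletion allocation policy that balances system stability and network cost optimization. Simulation results show that the MFG method makes the optimal network control policy converge quickly and, in ultra-dense scenarios, achieves a network cost clearly lower than that of the baseline cache allocation method; the Lyapunov DPP method achieves stable network caching and deletion while also optimizing network cost.
Keywords: ultra-dense network; mean field; mean field game; Lyapunov theory; drift-plus-penalty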
Delay-Optimal Random Access in Large-Scale Energy Harvesting IoT Networks Based on Mean Field Game Cited by: 1
16
Authors: Dezhi Wang, Wei Wang, Zhaoyang Zhang, Aiping Huang 《China Communications》 SCIE CSCD 2022, No. 4, pp. 121-136, 16 pages
With energy harvesting capability, the Internet of things (IoT) devices transmit data depending on their available energy, which leads to a more complicated coupling and brings new technical challenges to delay optimization. In this paper, we study the delay-optimal random access (RA) in large-scale energy harvesting IoT networks. We model a two-dimensional Markov decision process (MDP) to address the coupling between the data and energy queues, and adopt the mean field game (MFG) theory to reveal the coupling among the devices by utilizing the large-scale property. Specifically, to obtain the optimal access strategy for each device, we derive the Hamilton-Jacobi-Bellman (HJB) equation, which requires the statistical information of other devices. Moreover, to model the evolution of the states distribution in the system, we derive the Fokker-Planck-Kolmogorov (FPK) equation based on the access strategy of devices. By solving the two coupled equations, we obtain the delay-optimal random access solution in an iterative manner with the Lax-Friedrichs method. Finally, the simulation results show that the proposed scheme achieves significant performance gain compared with the conventional schemes.
Keywords: wireless communications; energy harvesting; random access; mean field game
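The abstract above names the Lax-Friedrichs method without stating the discretized equations. As a rough illustration of the scheme itself, and not of the paper's coupled HJB-FPK system, the sketch below applies one Lax-Friedrichs update to the toy transport equation ρ_t + (vρ)_x = 0 with constant velocity on a periodic grid. The grid size, velocity, CFL number, and all function and variable names are assumptions made for this example only.

```python
from typing import List


def lax_friedrichs_step(
    rho: List[float], velocity: float, dx: float, dt: float
) -> List[float]:
    """One Lax-Friedrichs step for rho_t + (velocity * rho)_x = 0.

    Uses periodic boundaries. Each new value is the average of the two
    neighbours minus a centred difference of the flux velocity * rho.
    """
    n = len(rho)
    flux = [velocity * r for r in rho]
    new_rho = []
    for i in range(n):
        left, right = (i - 1) % n, (i + 1) % n
        average = 0.5 * (rho[left] + rho[right])
        new_rho.append(average - dt / (2.0 * dx) * (flux[right] - flux[left]))
    return new_rho


# Toy setup (all values hypothetical): a block of mass on a unit interval.
n_cells = 50
dx = 1.0 / n_cells
velocity = 1.0
dt = 0.5 * dx / abs(velocity)  # CFL number 0.5, within the stability limit of 1
rho = [1.0 if 10 <= i < 20 else 0.0 for i in range(n_cells)]

mass_before = sum(rho) * dx
for _ in range(40):
    rho = lax_friedrichs_step(rho, velocity, dx, dt)
mass_after = sum(rho) * dx
print(mass_before, mass_after)
```

On a periodic grid the scheme conserves total mass, and under the CFL condition it keeps a nonnegative density nonnegative, which are the properties one typically wants when the transported quantity is a state distribution. The scheme is first-order and visibly diffusive, so the block smears out as it moves.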
A Trading Execution Model Based on Mean Field Games and Optimal Control
17
Authors: Lorella Fatone, Francesca Mariani, Maria Cristina Recchioni, Francesco Zirilli 《Applied Mathematics》 2014, No. 19, pp. 3091-3116, 26 pages
We present a trading execution model that describes the behaviour of a big trader and of a multitude of retail traders operating on the shares of a risky asset. The retail traders are modeled as a population of “conservative” investors that: 1) behave in a similar way, 2) try to avoid abrupt changes in their trading strategies, 3) want to limit the risk due to the fact of having open positions on the asset shares, 4) in the long run want to have a given position on the asset shares. The big trader wants to maximize the revenue resulting from the action of buying or selling a (large) block of asset shares in a given time interval. The behaviour of the retail traders and of the big trader is modeled using respectively a mean field game model and an optimal control problem. These models are coupled by the asset share price dynamic equation. The trading execution strategy adopted by the retail traders is obtained solving the mean field game model. This strategy is used to formulate the optimal control problem that determines the behaviour of the big trader. The previous mathematical models are solved using the dynamic programming principle. In some special cases explicit solutions of the previous models are found. An extensive numerical study of the trading execution model proposed is presented. The interested reader is referred to the website: http://www.econ.univpm.it/recchioni/finance/w19 to find material including animations, an interactive application and an app that helps the understanding of the paper. A general reference to the work of the authors and of their coauthors in mathematical finance is the website: http://www.econ.univpm.it/recchioni/finance.
Keywords: Trading execution; mean field game; Optimal control
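Optimal switching strategy for COVID-19 prevention and control intensity based on mean field games Cited by: 1
18
Authors: 薄立军, 张婷婷, 徐明雯婵 《应用概率统计》 CSCD PKU Core 2021, No. 3, pp. 274-290, 17 pages
Using public data on the number of COVID-19 infections in each US state as an example, this paper proposes a multi-region stochastic dynamic contagion model for the COVID-19 pandemic period. To address the question of when to "open" or "restrict" economic and social activity, the paper establishes a multi-region Nash equilibrium strategy for optimal switching of prevention and control, based on expected utility maximization with mean field interaction. It then considers the model of the number of infections in a representative region and the corresponding optimal switching problem when the number of regions is infinite, and proves that this is the limit of the Nash equilibrium in the finite-region case. Finally, by comparing and analyzing the optimal switching boundaries in different states, we give concrete recommendations on when and how to adjust the intensity of prevention and control.
Keywords: COVID-19; epidemic model; optimal switching; Nash equilibrium; mean field game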
Mean field games of controls: Propagation of monotonicities
19
Authors: Chenchen Mou, Jianfeng Zhang 《Probability, Uncertainty and Quantitative Risk》 2022, No. 3, pp. 247-274, 28 pages
The theory of Mean Field Game of Controls considers a class of mean field games where the interaction is through the joint distribution of the state and control. It is well known that, for standard mean field games, certain monotonicity conditions are crucial to guarantee the uniqueness of mean field equilibria and then the global wellposedness for master equations. In the literature the monotonicity condition could be the Lasry–Lions monotonicity, the displacement monotonicity, or the anti-monotonicity conditions. In this paper, we investigate these three types of monotonicity conditions for Mean Field Games of Controls and show their propagation along the solutions to the master equations with common noises. In particular, we extend the displacement monotonicity to semi-monotonicity, whose propagation result is new even for standard mean field games. This is the first step towards the global wellposedness theory for master equations of Mean Field Games of Controls.
Keywords: mean field game of controls; Master equation; Lasry-Lions monotonicity; Displacement semi-monotonicity; Anti-monotonicity
Backward-forward linear-quadratic mean-field games with major and minor agents Cited by: 1
20
Authors: Jianhui Huang, Shujun Wang, Zhen Wu 《Probability, Uncertainty and Quantitative Risk》 2016, No. 1, pp. 278-304, 27 pages
This paper studies the backward-forward linear-quadratic-Gaussian (LQG) games with major and minor agents (players). The state of major agent follows a linear backward stochastic differential equation (BSDE) and the states of minor agents are governed by linear forward stochastic differential equations (SDEs). The major agent is dominating as its state enters those of minor agents. On the other hand, all minor agents are individually negligible but their state-average affects the cost functional of major agent. The mean-field game in such backward-major and forward-minor setup is formulated to analyze the decentralized strategies. We first derive the consistency condition via auxiliary mean-field SDEs and a 3×2 mixed backward-forward stochastic differential equation (BFSDE) system. Next, we discuss the wellposedness of such BFSDE system by virtue of the monotonicity method. Consequently, we obtain the decentralized strategies for major and minor agents which are proved to satisfy the ε-Nash equilibrium property.
Keywords: Backward-forward stochastic differential equation (BFSDE); Consistency condition; ε-Nash equilibrium; Large-population system; Major-minor agent; mean-field game