Funding: National Natural Science Foundation of China (No. 62063006); Guangxi Science and Technology Major Program (No. 2022AA05002); Key Laboratory of AI and Information Processing (Hechi University), Education Department of Guangxi Zhuang Autonomous Region (No. 2022GXZDSY003); Central Leading Local Science and Technology Development Fund Project of Wuzhou (No. 202201001).
Abstract: By integrating deep neural networks with reinforcement learning, the Double Deep Q Network (DDQN) algorithm overcomes the limitations of Q-learning in handling continuous spaces and is widely applied to the path planning of mobile robots. However, the traditional DDQN algorithm suffers from sparse rewards and inefficient use of high-quality data. To address these problems, an improved DDQN algorithm based on average Q-value estimation and reward redistribution is proposed. First, to improve the precision of the target Q-value, the average of multiple previously learned Q-values from the target Q network replaces the single Q-value from the current target Q network. Next, a reward redistribution mechanism is designed to overcome the sparse reward problem by adjusting the final reward of each action using the episode (round) reward obtained from trajectory information. Additionally, a reward-prioritized experience selection method is introduced, which ranks experience samples by reward value so that high-quality data are used frequently. Finally, simulation experiments are conducted to verify the effectiveness of the proposed algorithm in a fixed-position scenario and in random environments. The experimental results show that, compared with the traditional DDQN algorithm, the proposed algorithm achieves a shorter average running time, a higher average return, and fewer average steps. Its performance improves by 11.43% in the fixed scenario and by 8.33% in random environments. It not only plans economical and safe paths but also significantly improves the efficiency and generalization of path planning, making it suitable for widespread application in autonomous navigation and industrial automation.
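The three mechanisms named in this abstract can be illustrated with a minimal sketch. The abstract gives no formulas, so everything below is an assumption made for illustration: the function names, the choice to average the bootstrap value over `K` stored target-network snapshots, the even spreading of a fraction `beta` of the episode return over all steps, and the `top_fraction` rule for reward-ranked sampling are all hypothetical and may differ from the authors' method.

```python
import random
from typing import Callable, List, Sequence, Tuple

# A Q-function maps a state to one Q-value per action (stand-in for a network).
QFunction = Callable[[Sequence[float]], List[float]]
# A transition is (state, action, reward, next_state, done).
Transition = Tuple[Sequence[float], int, float, Sequence[float], bool]


def averaged_ddqn_target(reward: float, next_state: Sequence[float], done: bool,
                         online_q: QFunction, target_snapshots: List[QFunction],
                         gamma: float = 0.99) -> float:
    """Double-DQN target whose bootstrap value is averaged over K target snapshots."""
    if done:
        return reward
    # Double DQN: the online network picks the greedy next action.
    q_online = online_q(next_state)
    best_action = max(range(len(q_online)), key=q_online.__getitem__)
    # Assumed averaging rule: mean value of that action over the stored snapshots.
    avg_q = sum(q(next_state)[best_action] for q in target_snapshots) / len(target_snapshots)
    return reward + gamma * avg_q


def redistribute_rewards(rewards: List[float], beta: float = 0.1) -> List[float]:
    """Assumed rule: add an equal share of beta * (episode return) to every step."""
    bonus = beta * sum(rewards) / len(rewards)
    return [r + bonus for r in rewards]


def select_by_reward(buffer: List[Transition], batch_size: int,
                     top_fraction: float = 0.5, rng=random) -> List[Transition]:
    """Assumed rule: fill part of the batch with the highest-reward samples,
    and draw the remainder uniformly from the rest of the buffer."""
    ranked = sorted(buffer, key=lambda t: t[2], reverse=True)
    n_top = min(len(ranked), int(batch_size * top_fraction))
    rest = ranked[n_top:]
    n_rest = min(len(rest), batch_size - n_top)
    return ranked[:n_top] + rng.sample(rest, n_rest)
```

In a training loop under these assumptions, `redistribute_rewards` would be applied to an episode's rewards before its transitions enter the buffer, `select_by_reward` would build each minibatch, and `averaged_ddqn_target` would supply the regression target for the online network.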
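Abstract: Neoclassical economics and behavioral economics disagree sharply in explaining the disparity between willingness to pay (WTP) and willingness to accept (WTA). Internationally, many scholars have also carried out extensive experimental research on the inconsistency between WTP and WTA. The disparity attracts this attention for two reasons. First, it bears on the plausibility of the preference assumptions in neoclassical economic theory. Second, WTP and WTA are two important valuation instruments, and the formulation and evaluation of many public economic policies depend on them directly. Such policies often require a careful choice between WTP and WTA, particularly in health care, environmental issues, the provision of public goods, and the pricing of non-renewable resources. Correctly distinguishing where each of the two valuation instruments applies is a basic condition for public economic policy-making. In view of this, the paper revisits the research debate, systematically reviews the corresponding findings, and on that basis proposes criteria for choosing the economic valuation method (WTP or WTA) in public policy.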
Abstract: In this paper, a class of strongly nonlinear generalized Riemann-Hilbert problems for second-order elliptic systems is studied. By means of the theory of integral equations and an explicit form of the solution, the problem is reduced to a nonlinear boundary value problem for two holomorphic functions. Then, using an approximation by solvable perturbed problems together with suitable a priori estimates, we prove that the problems possess a solution in the Hardy class, and that the solution $w(z)$ belongs to $W^{1}_{2}(\,)\cap W^{2}_{p}(G)$, $p>2$.
Funding: the National Natural Science Foundation of China.
Abstract: Suppose that $B$ is a sufficiently large positive constant, $\varepsilon$ is a sufficiently small positive constant, and $X$ and $N$ are sufficiently large. The main results are: i) for the positive integers $n$ with $X<n\le 2X$, except for $O(X\log^{-B}X)$ values, the interval $(n,\,n+n^{1/14+\varepsilon})$ contains a prime number; ii) if $A=N^{1/2+\varepsilon}$, then the even numbers in the interval $(N,\,N+A)$, except for $O(A\log^{-B}N)$ values, are all Goldbach numbers.
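For readers unfamiliar with the terms, a Goldbach number is an even number that can be written as the sum of two primes. The sketch below simply checks the two properties in the abstract by brute force on small ranges. It says nothing about the asymptotic statements themselves, and the helper names (`prime_sieve`, `non_goldbach_evens`, `interval_has_prime`) are hypothetical and introduced only for illustration.

```python
import math
from typing import List


def prime_sieve(limit: int) -> List[bool]:
    """Sieve of Eratosthenes: flags[m] is True exactly when m is prime (0 <= m <= limit)."""
    flags = [True] * (limit + 1)
    flags[0] = False
    if limit >= 1:
        flags[1] = False
    p = 2
    while p * p <= limit:
        if flags[p]:
            for m in range(p * p, limit + 1, p):
                flags[m] = False
        p += 1
    return flags


def non_goldbach_evens(n_low: int, length: int, flags: List[bool]) -> List[int]:
    """Even m in the open interval (n_low, n_low + length) that are NOT a sum of two primes."""
    exceptions = []
    for m in range(n_low + 1, n_low + length):
        if m % 2 == 0 and not any(flags[p] and flags[m - p]
                                  for p in range(2, m // 2 + 1)):
            exceptions.append(m)
    return exceptions


def interval_has_prime(n: int, length: float, flags: List[bool]) -> bool:
    """True if the open interval (n, n + length) contains a prime."""
    return any(flags[m] for m in range(n + 1, math.ceil(n + length)))
```

The sieve limit must be at least as large as the right end of every interval examined, since both helpers index directly into `flags`.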