期刊文献+
共找到2,044篇文章
< 1 2 103 >
每页显示 20 50 100
JMPR农药残留急性膳食摄入量计算方法 被引量:35
1
作者 高仁君 王蔚 +1 位作者 陈隆智 张文吉 《中国农学通报》 CSCD 2006年第4期101-105,共5页
农药残留急性膳食风险评估直到最近才引起大家的关注。目前,粮农组织和世界卫生组织农药残留联席会议(JMPR)研究国际范围农药急性膳食风险评估;美国、英国、荷兰、澳大利亚和新西兰也开始进行国家农药急性膳食风险评估。农药急性膳食风... 农药残留急性膳食风险评估直到最近才引起大家的关注。目前,粮农组织和世界卫生组织农药残留联席会议(JMPR)研究国际范围农药急性膳食风险评估;美国、英国、荷兰、澳大利亚和新西兰也开始进行国家农药急性膳食风险评估。农药急性膳食风险是急性或短期接触毒性与农药急性膳食摄入量的函数。急性膳食接触量评估常用的方法有:定点或确定性方法和概率模型法。在确定性方法中选取食物的大部分人群消耗量和高残留量来计算膳食摄入量,为了解决混合样品中食品个体之间的残留差异,在计算中引入了变异因子。JMPR根据具体情况分为情形1、情形2a、情形2b、情形3来计算农药急性膳食摄入量。中国应该尽快建立健全膳食结构和农产品性状数据库,建立健全市场中农产品的农药残留数据库,并在高毒和中等毒性农药登记前应采用JMPR的确定性方法,进行急性膳食风险评估,提高农药膳食摄入的安全性。 展开更多
关键词 农药残留 急性膳食风险评估 急性膳食摄入量 确定性方法
下载PDF
Quantum secure direct communication with cluster states 被引量:20
2
作者 CAO WeiFeng YANG YuGuang WEN QiaoYan 《Science China(Physics,Mechanics & Astronomy)》 SCIE EI CAS 2010年第7期1271-1275,共5页
A quantum secure direct communication protocol with cluster states is proposed.Compared with the deterministic secure quantum communication protocol with the cluster state proposed by Yuan and Song(Int.J.Quant.Inform.... A quantum secure direct communication protocol with cluster states is proposed.Compared with the deterministic secure quantum communication protocol with the cluster state proposed by Yuan and Song(Int.J.Quant.Inform.,2009,7:689),this protocol can achieve higher intrinsic efficiency by using two-step transmission.The implementation of this protocol is also discussed. 展开更多
关键词 quantum secure direct communication deterministic secure quantum communication cluster state
原文传递
Deterministic secure quantum communication over a collective-noise channel 被引量:20
3
作者 GU Bin PEI ShiXin +1 位作者 SONG Biao ZHONG Kun 《Science China(Physics,Mechanics & Astronomy)》 SCIE EI CAS 2009年第12期1913-1918,共6页
We present two deterministic secure quantum communication schemes over a collective-noise. One is used to complete the secure quantum communication against a collective-rotation noise and the other is used against a c... We present two deterministic secure quantum communication schemes over a collective-noise. One is used to complete the secure quantum communication against a collective-rotation noise and the other is used against a collective-dephasing noise. The two parties of quantum communication can exploit the correlation of their subsystems to check eavesdropping efficiently. Although the sender should prepare a sequence of three-photon entangled states for accomplishing secure communication against a collective noise,the two parties need only single-photon measurements,rather than Bell-state measurements,which will make our schemes convenient in practical application. 展开更多
关键词 deterministic SECURE QUANTUM COMMUNICATION QUANTUM SECURE direct COMMUNICATION collective-noise CHANNEL SINGLE-PHOTON measurements
原文传递
Deep reinforcement learning for dynamic computation offloading and resource allocation in cache-assisted mobile edge computing systems 被引量:19
4
作者 Samrat Nath Jingxian Wu 《Intelligent and Converged Networks》 2020年第2期181-198,共18页
Mobile Edge Computing(MEC)is one of the most promising techniques for next-generation wireless communication systems.In this paper,we study the problem of dynamic caching,computation offloading,and resource allocation... Mobile Edge Computing(MEC)is one of the most promising techniques for next-generation wireless communication systems.In this paper,we study the problem of dynamic caching,computation offloading,and resource allocation in cache-assisted multi-user MEC systems with stochastic task arrivals.There are multiple computationally intensive tasks in the system,and each Mobile User(MU)needs to execute a task either locally or remotely in one or more MEC servers by offloading the task data.Popular tasks can be cached in MEC servers to avoid duplicates in offloading.The cached contents can be either obtained through user offloading,fetched from a remote cloud,or fetched from another MEC server.The objective is to minimize the long-term average of a cost function,which is defined as a weighted sum of energy consumption,delay,and cache contents’fetching costs.The weighting coefficients associated with the different metrics in the objective function can be adjusted to balance the tradeoff among them.The optimum design is performed with respect to four decision parameters:whether to cache a given task,whether to offload a given uncached task,how much transmission power should be used during offloading,and how much MEC resources to be allocated for executing a task.We propose to solve the problems by developing a dynamic scheduling policy based on Deep Reinforcement Learning(DRL)with the Deep Deterministic Policy Gradient(DDPG)method.A new decentralized DDPG algorithm is developed to obtain the optimum designs for multi-cell MEC systems by leveraging on the cooperations among neighboring MEC servers.Simulation results demonstrate that the proposed algorithm outperforms other existing strategies,such as Deep Q-Network(DQN). 展开更多
关键词 Mobile Edge Computing(MEC) caching computation offloading resource allocation Deep Reinforcement Learning(DRL) Deep deterministic Policy Gradient(DDPG) multi-cell
原文传递
Relevant experience learning:A deep reinforcement learning method for UAV autonomous motion planning in complex unknown environments 被引量:16
5
作者 Zijian HU Xiaoguang GAO +2 位作者 Kaifang WAN Yiwei ZHAI Qianglong WANG 《Chinese Journal of Aeronautics》 SCIE EI CAS CSCD 2021年第12期187-204,共18页
Unmanned Aerial Vehicles(UAVs)play a vital role in military warfare.In a variety of battlefield mission scenarios,UAVs are required to safely fly to designated locations without human intervention.Therefore,finding a ... Unmanned Aerial Vehicles(UAVs)play a vital role in military warfare.In a variety of battlefield mission scenarios,UAVs are required to safely fly to designated locations without human intervention.Therefore,finding a suitable method to solve the UAV Autonomous Motion Planning(AMP)problem can improve the success rate of UAV missions to a certain extent.In recent years,many studies have used Deep Reinforcement Learning(DRL)methods to address the AMP problem and have achieved good results.From the perspective of sampling,this paper designs a sampling method with double-screening,combines it with the Deep Deterministic Policy Gradient(DDPG)algorithm,and proposes the Relevant Experience Learning-DDPG(REL-DDPG)algorithm.The REL-DDPG algorithm uses a Prioritized Experience Replay(PER)mechanism to break the correlation of continuous experiences in the experience pool,finds the experiences most similar to the current state to learn according to the theory in human education,and expands the influence of the learning process on action selection at the current state.All experiments are applied in a complex unknown simulation environment constructed based on the parameters of a real UAV.The training experiments show that REL-DDPG improves the convergence speed and the convergence result compared to the state-of-the-art DDPG algorithm,while the testing experiments show the applicability of the algorithm and investigate the performance under different parameter conditions. 展开更多
关键词 Autonomous Motion Planning(AMP) Deep deterministic Policy Gradient(DDPG) Deep Reinforcement Learning(DRL) Sampling method UAV
原文传递
NewIP:开拓未来数据网络的新连接和新能力 被引量:16
6
作者 郑秀丽 蒋胜 王闯 《电信科学》 2019年第9期2-11,共10页
互联网应用深刻地影响着人们的工作与生活,纷至沓来的新应用对数据网络提出了新的挑战。基于未来应用对数据网络提出的需求,剖析了数据网络需要基于IP进行继承式发展,提出了一种新型的网络协议体系——NewIP,并介绍了5种关键使能技术,... 互联网应用深刻地影响着人们的工作与生活,纷至沓来的新应用对数据网络提出了新的挑战。基于未来应用对数据网络提出的需求,剖析了数据网络需要基于IP进行继承式发展,提出了一种新型的网络协议体系——NewIP,并介绍了5种关键使能技术,包括确定性IP、内生安全、面向万网互联的新寻址与控制机制、用户可定义、新传输层等。 展开更多
关键词 NewIP 数据网络 确定性 异构 万网互联 内生安全 高吞吐
下载PDF
基于多分区操作系统的多核确定性调度方法设计 被引量:14
7
作者 刘鸽 叶宏 +2 位作者 李运喜 胡宁 何翔 《航空计算技术》 2016年第1期99-102,共4页
为提高多分区操作系统环境下多核处理器的应用带来的操作系统任务调度的确定性,研究了多分环境下多核操作系统结构,从处理器核心和分区的资源分配和分区调度表的时间同步角度出发,采用了静态配置、处理器绑定和时间窗口调度方法,实现了... 为提高多分区操作系统环境下多核处理器的应用带来的操作系统任务调度的确定性,研究了多分环境下多核操作系统结构,从处理器核心和分区的资源分配和分区调度表的时间同步角度出发,采用了静态配置、处理器绑定和时间窗口调度方法,实现了处理器资源的确定性分配,解决了调度表之间的时间同步问题。测试结果表明,方法有效提高了分区环境下多核操作系统任务调度的确定性。 展开更多
关键词 分区操作系统 多核处理器 确定性 调度
下载PDF
Deep reinforcement learning and its application in autonomous fitting optimization for attack areas of UCAVs 被引量:12
8
作者 LI Yue QIU Xiaohui +1 位作者 LIU Xiaodong XIA Qunli 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2020年第4期734-742,共9页
The ever-changing battlefield environment requires the use of robust and adaptive technologies integrated into a reliable platform. Unmanned combat aerial vehicles(UCAVs) aim to integrate such advanced technologies wh... The ever-changing battlefield environment requires the use of robust and adaptive technologies integrated into a reliable platform. Unmanned combat aerial vehicles(UCAVs) aim to integrate such advanced technologies while increasing the tactical capabilities of combat aircraft. As a research object, common UCAV uses the neural network fitting strategy to obtain values of attack areas. However, this simple strategy cannot cope with complex environmental changes and autonomously optimize decision-making problems. To solve the problem, this paper proposes a new deep deterministic policy gradient(DDPG) strategy based on deep reinforcement learning for the attack area fitting of UCAVs in the future battlefield. Simulation results show that the autonomy and environmental adaptability of UCAVs in the future battlefield will be improved based on the new DDPG algorithm and the training process converges quickly. We can obtain the optimal values of attack areas in real time during the whole flight with the well-trained deep network. 展开更多
关键词 attack area neural network deep deterministic policy gradient(DDPG) unmanned combat aerial vehicle(UCAV)
下载PDF
高码率Turbo码中确定性交织器的设计 被引量:7
9
作者 史治平 靳蕃 《西南交通大学学报》 EI CSCD 北大核心 2002年第5期544-547,共4页
交织器设计在Turbo码理论中具有重要地位 ,高码率Turbo码对交织器具有特殊要求。通过对影响交织器设计的 5个重要因素分析 ,研究了如何设计理想的Turbo码交织器 ,并讨论了 3种常见确定性交织器的映射函数以及在高码率Turbo码中的奇偶性... 交织器设计在Turbo码理论中具有重要地位 ,高码率Turbo码对交织器具有特殊要求。通过对影响交织器设计的 5个重要因素分析 ,研究了如何设计理想的Turbo码交织器 ,并讨论了 3种常见确定性交织器的映射函数以及在高码率Turbo码中的奇偶性设计。计算机模拟结果显示通过映射函数得到的 3种奇偶交织器实现简单 。 展开更多
关键词 随机性 奇偶性 误比特率 映射函数 高码率Turbo码 确定性交织器 设计
下载PDF
An efficient deterministic secure quantum communication scheme based on cluster states and identity authentication 被引量:10
10
作者 刘文杰 陈汉武 +3 位作者 马廷淮 李志强 刘志昊 胡文博 《Chinese Physics B》 SCIE EI CAS CSCD 2009年第10期4105-4109,共5页
A novel efficient deterministic secure quantum communication scheme based on four-qubit cluster states and single-photon identity authentication is proposed. In this scheme, the two authenticated users can transmit tw... A novel efficient deterministic secure quantum communication scheme based on four-qubit cluster states and single-photon identity authentication is proposed. In this scheme, the two authenticated users can transmit two bits of classical information per cluster state, and its efficiency of the quantum communication is 1/3, which is approximately 1.67 times that of the previous protocol presented by Wang et al [Chin. Phys. Lett. 23 (2006) 2658]. Security analysis shows the present scheme is secure against intercept-resend attack and the impersonator's attack. Furthermore, it is more economic with present-day techniques and easily processed by a one-way quantum computer. 展开更多
关键词 deterministic secure quantum communication cluster state identity authentication
下载PDF
设定地震——概率地震危险性评估和确定性危险性评估的连接 被引量:10
11
作者 陶夏新 陶正如 师黎静 《地震工程与工程振动》 CSCD 北大核心 2014年第4期101-109,共9页
在简要回顾地震危险性评估概率方法和确定性方法的发展与协调的基础上,本文强调设定地震在防震减灾工作中的重要性、在估计大地震近场地震动中建立震源模型的必要性,指出概率方法中不确定性校正对地震危险性的影响,结合实例提出了一种... 在简要回顾地震危险性评估概率方法和确定性方法的发展与协调的基础上,本文强调设定地震在防震减灾工作中的重要性、在估计大地震近场地震动中建立震源模型的必要性,指出概率方法中不确定性校正对地震危险性的影响,结合实例提出了一种确定设定地震参数的方法,并展示了应用于一个区域地震区划的试验结果。 展开更多
关键词 设定地震 地震危险性 评估 概率的 确定性的
下载PDF
Point mass filter based matching algorithm in gravity aided underwater navigation 被引量:8
12
作者 HAN Yurong WANG Bo +1 位作者 DENG Zhihong FU Mengyin 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2018年第1期152-159,共8页
Gravity-aided inertial navigation is a hot issue in the applications of underwater autonomous vehicle(UAV). Since the matching process is conducted with a gravity anomaly database tabulated in the form of a digital mo... Gravity-aided inertial navigation is a hot issue in the applications of underwater autonomous vehicle(UAV). Since the matching process is conducted with a gravity anomaly database tabulated in the form of a digital model and the resolution is 2’ × 2’,a filter model based on vehicle position is derived and the particularity of inertial navigation system(INS) output is employed to estimate a parameter in the system model. Meanwhile, the matching algorithm based on point mass filter(PMF) is applied and several optimal selection strategies are discussed. It is obtained that the point mass filter algorithm based on the deterministic resampling method has better practicability. The reliability and the accuracy of the algorithm are verified via simulation tests. 展开更多
关键词 gravity-aided inertial navigation system(INS) navigation point mass filter(PMF) deterministic resampling
下载PDF
RECONFIGURABLE PRODUCTION LINE MODELING AND SCHEDULING USING PETRI NETS AND GENETIC ALGORITHM 被引量:8
13
作者 XIE Nan LI Aiping 《Chinese Journal of Mechanical Engineering》 SCIE EI CAS CSCD 2006年第3期362-367,共6页
In response to the production capacity and functionality variations, a genetic algorithm (GA) embedded with deterministic timed Petri nets(DTPN) for reconfigurable production line(RPL) is proposed to solve its s... In response to the production capacity and functionality variations, a genetic algorithm (GA) embedded with deterministic timed Petri nets(DTPN) for reconfigurable production line(RPL) is proposed to solve its scheduling problem. The basic DTPN modules are presented to model the corresponding variable structures in RPL, and then the scheduling model of the whole RPL is constructed. And in the scheduling algorithm, firing sequences of the Petri nets model are used as chromosomes, thus the selection, crossover, and mutation operator do not deal with the elements in the problem space, but the elements of Petri nets model. Accordingly, all the algorithms for GA operations embedded with Petri nets model are proposed. Moreover, the new weighted single-objective optimization based on reconfiguration cost and E/T is used. The results of a DC motor RPL scheduling suggest that the presented DTPN-GA scheduling algorithm has a significant impact on RPL scheduling, and provide obvious improvements over the conventional scheduling method in practice that meets duedate, minimizes reconfiguration cost, and enhances cost effectivity. 展开更多
关键词 Reconfigurable production line deterministic timed Petri nets (DTPN) Modeling Scheduling Genetic algorithm(GA)
下载PDF
On the Decadal and Interdecadal Variability in the Pacific Ocean 被引量:7
14
作者 杨海军 张琼 《Advances in Atmospheric Sciences》 SCIE CAS CSCD 2003年第2期173-184,共12页
The Pacific decadal and interdecadal oscillation (PDO) has been extensively explored in recent decades because of its profound impact on global climate systems. It is a long-lived ENSO-like pattern of Pacific climate ... The Pacific decadal and interdecadal oscillation (PDO) has been extensively explored in recent decades because of its profound impact on global climate systems. It is a long-lived ENSO-like pattern of Pacific climate variability with a period of 10-30 years. The general picture is that the anomalously warm (cool) SSTs in the central North Pacific are always accompanied by the anomalously cool (warm) SSTs along the west coast of America and in the central east tropical Pacific with comparable amplitude. In general, there are two classes of opinions on the origin of this low-frequency climate variability, one thinking that it results from deterministically coupled modes of the Pacific ocean-atmosphere system, and the other, from stochastic atmospheric forcing. The deterministic origin emphasizes that the internal physical processes in an air-sea system can provide a positive feedback mechanism to amplify an initial perturbation, and a negative feedback mechanism to reverse the phase of oscillation. The dynamic evolution of ocean circulation determines the timescale of the oscillation. The stochastic origin, however, emphasizes that because the atmospheric activities can be thought as having no preferred timescale and are associated with an essentially white noise spectrum, tne ocean response can manifest a red peak in a certain low frequency range with a decadal to interdecadal timescale. In this paper, the authors try to systematically understand the state of the art of observational, theoretical and numerical studies on the PDO and hope to provide a useful background reference for current research. 展开更多
关键词 climate system Pacific decadal oscillation deterministic origin stochastic origin
下载PDF
面向空间应用的时间触发以太网 被引量:9
15
作者 邱爱华 张涛 顾逸东 《国防科技大学学报》 EI CAS CSCD 北大核心 2014年第5期117-123,共7页
将时间触发协议增加到采用星型拓扑结构的以太网业务上,且保持完全兼容IEEE802.3标准,可解决空间有效载荷使用标准以太网通信遇到的实时性、确定性和可靠性问题。时间触发以太网基于全局基准时间,实时数据传输遵循严格的时序,支持时间... 将时间触发协议增加到采用星型拓扑结构的以太网业务上,且保持完全兼容IEEE802.3标准,可解决空间有效载荷使用标准以太网通信遇到的实时性、确定性和可靠性问题。时间触发以太网基于全局基准时间,实时数据传输遵循严格的时序,支持时间和事件触发的两种通信过程。时间触发通道适合同步或周期性实时消息的传输,而事件触发通道适用于偶发或非周期性消息的传输。设计了空间应用以太网的拓扑结构和协议栈,介绍了系统通信的过程,进行了时间触发协议的研究,并对同步性能和网络性能进行了仿真验证。该以太网在保证可靠性、安全性的同时,增强了网络的灵活性,提高了网络利用率。 展开更多
关键词 时间触发 以太网 空间应用 实时性 确定性
下载PDF
Deterministic quantum communication using the symmetric W state 被引量:9
16
作者 TSAI Chia-Wei HWANG Tzonelih 《Science China(Physics,Mechanics & Astronomy)》 SCIE EI CAS 2013年第10期1903-1908,共6页
This study proposes a new coding function for the symmetric W state. Based on the new coding function, a theoretical protocol of deterministic quanama communication (DQC) is proposed. The sender can use the proposed... This study proposes a new coding function for the symmetric W state. Based on the new coding function, a theoretical protocol of deterministic quanama communication (DQC) is proposed. The sender can use the proposed coding function to encode his/her message, and the receiver can perform the imperfect Bell measurement to obtain the sender's message. In comparison to the existing DQC protocols that also use the W class state, the proposed protocol is more efficient and also more practical within today's technology. Moreover, the security of this protocol is analyzed to show that any eavesdropper will be detected with a very high probability under both the ideal and the noisy quantum channel. 展开更多
关键词 deterministic quantum communication symmetric W state imperfect Bell measurement
原文传递
航天器可应用实时以太网分析 被引量:9
17
作者 邱爱华 张涛 顾逸东 《空间科学学报》 CAS CSCD 北大核心 2015年第3期368-380,共13页
在航天器上应用以太网的目的是借助以太网的灵活性,获得方便的通信接入及数据传输的高带宽,并且能适应控制领域通信的实时性和确定性.实时以太网的确定性、通信调度机制和传输特征是其满足航天器网络通信特性需求的主要判定依据.通过分... 在航天器上应用以太网的目的是借助以太网的灵活性,获得方便的通信接入及数据传输的高带宽,并且能适应控制领域通信的实时性和确定性.实时以太网的确定性、通信调度机制和传输特征是其满足航天器网络通信特性需求的主要判定依据.通过分析航天器数据通信特点,对比实时以太网通信调度策略,得出应用于航天器的实时以太网应具备时间同步、强实时性、确定性、高带宽、多数据类型、双通道冗余及兼容标准以太网等特征. 展开更多
关键词 实时以太网 航天器 确定性 可靠性 通信调度
下载PDF
从需求和要求两角度分析5G工业应用 被引量:8
18
作者 刘丹 魏剑嵬 +1 位作者 赵艳领 王麟琨 《自动化仪表》 CAS 2020年第10期1-6,共6页
当前,新一代5G移动通信技术与工业制造业正在快速、融合发展。首先,从需求和要求出发,一方面说明工业制造业数字化、网络化、智能化发展对于5G的迫切需求,另一方面阐述工业自动控制和信息处理对5G及其他网络通信技术还具有不同等级的实... 当前,新一代5G移动通信技术与工业制造业正在快速、融合发展。首先,从需求和要求出发,一方面说明工业制造业数字化、网络化、智能化发展对于5G的迫切需求,另一方面阐述工业自动控制和信息处理对5G及其他网络通信技术还具有不同等级的实时性、确定性、可靠性、可用性、安全性等特殊要求。然后,基于5G技术特点及组网关键技术,分析5G对于工业应用的天然适应性,并介绍几种典型的5G工业应用场景及其对5G网络服务质量的指标要求。最后,提出当前要实现5G大规模工业应用还需考虑及解决的迫切问题。 展开更多
关键词 实时性 确定性 可靠性 可用性 安全性 软件定义网络 网络功能虚拟化 移动边缘计算 网络切片
下载PDF
Approximating Nash Equilibrium in Day-ahead Electricity Market Bidding with Multi-agent Deep Reinforcement Learning 被引量:8
19
作者 Yan Du Fangxing Li +1 位作者 Helia Zandi Yaosuo Xue 《Journal of Modern Power Systems and Clean Energy》 SCIE EI CSCD 2021年第3期534-544,共11页
In this paper,a day-ahead electricity market bidding problem with multiple strategic generation company(GEN-CO)bidders is studied.The problem is formulated as a Markov game model,where GENCO bidders interact with each... In this paper,a day-ahead electricity market bidding problem with multiple strategic generation company(GEN-CO)bidders is studied.The problem is formulated as a Markov game model,where GENCO bidders interact with each other to develop their optimal day-ahead bidding strategies.Considering unobservable information in the problem,a model-free and data-driven approach,known as multi-agent deep deterministic policy gradient(MADDPG),is applied for approximating the Nash equilibrium(NE)in the above Markov game.The MAD-DPG algorithm has the advantage of generalization due to the automatic feature extraction ability of the deep neural networks.The algorithm is tested on an IEEE 30-bus system with three competitive GENCO bidders in both an uncongested case and a congested case.Comparisons with a truthful bidding strategy and state-of-the-art deep reinforcement learning methods including deep Q network and deep deterministic policy gradient(DDPG)demonstrate that the applied MADDPG algorithm can find a superior bidding strategy for all the market participants with increased profit gains.In addition,the comparison with a conventional-model-based method shows that the MADDPG algorithm has higher computational efficiency,which is feasible for real-world applications. 展开更多
关键词 Bidding strategy day-ahead electricity market deep reinforcement learning Markov game multi-agent deterministic policy gradient(MADDPG) Nash equilibrium(NE)
原文传递
Path Following Control for UAV Using Deep Reinforcement Learning Approach 被引量:8
20
作者 Yintao Zhang Youmin Zhang Ziquan Yu 《Guidance, Navigation and Control》 2021年第1期91-108,共18页
Unmanned aerial vehicles(UAVs)have been extensively used in civil and industrial applications due to the rapid development of the guidance,navigation and control(GNC)technologies.Especially,using deep reinforcement le... Unmanned aerial vehicles(UAVs)have been extensively used in civil and industrial applications due to the rapid development of the guidance,navigation and control(GNC)technologies.Especially,using deep reinforcement learning methods for motion control acquires a major progress recently,since deep Q-learning algorithm has been successfully applied to the continuous action domain problem.This paper proposes an improved deep deterministic policy gradient(DDPG)algorithm for path following control problem of UAV.A speci-c reward function is designed for minimizing the cross-track error of the path following problem.In the training phase,a double experience replay bu®er(DERB)is used to increase the learning e±ciency and accelerate the convergence speed.First,the model of UAV path following problem has been established.After that,the framework of DDPG algorithm is constructed.Then the state space,action space and reward function of the UAV path following algorithm are designed.DERB is proposed to accelerate the training phase.Finally,simulation results are carried out to show the e®ectiveness of the proposed DERB–DDPG method. 展开更多
关键词 Path following deep deterministic policy gradient double experience replay bu®er
原文传递
上一页 1 2 103 下一页 到第
使用帮助 返回顶部