期刊文献+
共找到12篇文章
< 1 >
每页显示 20 50 100
2022年深度学习技术主要发展动向分析 被引量:3
1
作者 王亚珅 胡武陵 +1 位作者 朱小伶 葛悦涛 《无人系统技术》 2023年第1期104-113,共10页
深度学习正逐渐成为新一代人工智能最核心的技术之一。对2022年深度学习热门领域的主要发展动向进行了综合评述。首先,介绍小数据小样本深度学习研究领域的最新进展;其次,探讨量子计算与深度学习的融合路径;然后,概述强化学习对通用智... 深度学习正逐渐成为新一代人工智能最核心的技术之一。对2022年深度学习热门领域的主要发展动向进行了综合评述。首先,介绍小数据小样本深度学习研究领域的最新进展;其次,探讨量子计算与深度学习的融合路径;然后,概述强化学习对通用智能的推动作用;最后,盘点深度学习在多模态学习方向的进展。综述表明,面向小数据、小样本的深度学习技术正在引领深度学习向自监督方向不断迈进,深度学习与其他先进计算范式(例如量子计算等)深入融合趋势愈发明显,强化学习在一定程度上具备解决复杂问题的通用智能,多模态深度学习技术已迎来关键性突破。 展开更多
关键词 深度学习 小数据 量子计算 预训练 强化学习 多模态学习
下载PDF
Empirical Analysis of Decision Making of an AI Agent on IBM’s 5Q Quantum Computer 被引量:3
2
作者 Wei Hu 《Natural Science》 2018年第1期45-58,共14页
A recent work has shown that using an ion trap quantum processor can speed up the decision making of a reinforcement learning agent. Its quantum advantage is observed when the external environment changes, and then ag... A recent work has shown that using an ion trap quantum processor can speed up the decision making of a reinforcement learning agent. Its quantum advantage is observed when the external environment changes, and then agent needs to relearn again. One character of this quantum hardware system discovered in this study is that it tends to overestimate the values used to determine the actions the agent will take. IBM’s five qubit superconducting quantum processor is a popular quantum platform. The aims of our study are twofold. First we want to identify the hardware characteristic features of IBM’s 5Q quantum computer when running this learning agent, compared with the ion trap processor. Second, through careful analysis, we observe that the quantum circuit employed in the ion trap processor for this agent could be simplified. Furthermore, when tested on IBM’s 5Q quantum processor, our simplified circuit demonstrates its enhanced performance over the original circuit on one of the hard learning tasks investigated in the previous work. We also use IBM’s quantum simulator when a good baseline is needed to compare the performances. As more and more quantum hardware devices are moving out of the laboratory and becoming generally available to public use, our work emphasizes the fact that the features and constraints of the quantum hardware could take a toll on the performance of quantum algorithms. 展开更多
关键词 quantum COMPUTATION quantum Machine learning quantum reinforcement learning quantum Circuit
下载PDF
Reinforcement Learning with Deep Quantum Neural Networks
3
作者 Wei Hu James Hu 《Journal of Quantum Information Science》 2019年第1期1-14,共14页
The advantage of quantum computers over classical computers fuels the recent trend of developing machine learning algorithms on quantum computers, which can potentially lead to breakthroughs and new learning models in... The advantage of quantum computers over classical computers fuels the recent trend of developing machine learning algorithms on quantum computers, which can potentially lead to breakthroughs and new learning models in this area. The aim of our study is to explore deep quantum reinforcement learning (RL) on photonic quantum computers, which can process information stored in the quantum states of light. These quantum computers can naturally represent continuous variables, making them an ideal platform to create quantum versions of neural networks. Using quantum photonic circuits, we implement Q learning and actor-critic algorithms with multilayer quantum neural networks and test them in the grid world environment. Our experiments show that 1) these quantum algorithms can solve the RL problem and 2) compared to one layer, using three layer quantum networks improves the learning of both algorithms in terms of rewards collected. In summary, our findings suggest that having more layers in deep quantum RL can enhance the learning outcome. 展开更多
关键词 Continuous-Variable quantum COMPUTERS quantum Machine learning quantum reinforcement learning DEEP learning Q learning Actor-Critic Grid World Environment
下载PDF
Distributional Reinforcement Learning with Quantum Neural Networks
4
作者 Wei Hu James Hu 《Intelligent Control and Automation》 2019年第2期63-78,共16页
Traditional reinforcement learning (RL) uses the return, also known as the expected value of cumulative random rewards, for training an agent to learn an optimal policy. However, recent research indicates that learnin... Traditional reinforcement learning (RL) uses the return, also known as the expected value of cumulative random rewards, for training an agent to learn an optimal policy. However, recent research indicates that learning the distribution over returns has distinct advantages over learning their expected value as seen in different RL tasks. The shift from using the expectation of returns in traditional RL to the distribution over returns in distributional RL has provided new insights into the dynamics of RL. This paper builds on our recent work investigating the quantum approach towards RL. Our work implements the quantile regression (QR) distributional Q learning with a quantum neural network. This quantum network is evaluated in a grid world environment with a different number of quantiles, illustrating its detailed influence on the learning of the algorithm. It is also compared to the standard quantum Q learning in a Markov Decision Process (MDP) chain, which demonstrates that the quantum QR distributional Q learning can explore the environment more efficiently than the standard quantum Q learning. Efficient exploration and balancing of exploitation and exploration are major challenges in RL. Previous work has shown that more informative actions can be taken with a distributional perspective. Our findings suggest another cause for its success: the enhanced performance of distributional RL can be partially attributed to its superior ability to efficiently explore the environment. 展开更多
关键词 Continuous-Variable quantum Computers quantum reinforcement learning Distributional reinforcement learning QUANTILE Regression Distributional Q learning Grid World ENVIRONMENT MDP Chain ENVIRONMENT
下载PDF
Q Learning with Quantum Neural Networks
5
作者 Wei Hu James Hu 《Natural Science》 2019年第1期31-39,共9页
Applying quantum computing techniques to machine learning has attracted widespread attention recently and quantum machine learning has become a hot research topic. There are three major categories of machine learning:... Applying quantum computing techniques to machine learning has attracted widespread attention recently and quantum machine learning has become a hot research topic. There are three major categories of machine learning: supervised, unsupervised, and reinforcement learning (RL). However, quantum RL has made the least progress when compared to the other two areas. In this study, we implement the well-known RL algorithm Q learning with a quantum neural network and evaluate it in the grid world environment. RL is learning through interactions with the environment, with the aim of discovering a strategy to maximize the expected cumulative rewards. Problems in RL bring in unique challenges to the study with their sequential nature of learning, potentially long delayed reward signals, and large or infinite size of state and action spaces. This study extends our previous work on solving the contextual bandit problem using a quantum neural network, where the reward signals are immediate after each action. 展开更多
关键词 Continuous-Variable quantum COMPUTERS quantum Machine learning quantum reinforcement learning Q learning GRID World Environment
下载PDF
Practical Meta-Reinforcement Learning of Evolutionary Strategy with Quantum Neural Networks for Stock Trading
6
作者 Erik Sorensen Wei Hu 《Journal of Quantum Information Science》 2020年第3期43-71,共29页
We show the practicality of two existing meta-learning algorithms Model-</span></span><span><span><span> </span></span></span><span><span><span><spa... We show the practicality of two existing meta-learning algorithms Model-</span></span><span><span><span> </span></span></span><span><span><span><span style="font-family:Verdana;">Agnostic Meta-Learning and Fast Context Adaptation Via Meta-learning using an evolutionary strategy for parameter optimization, as well as propose two novel quantum adaptations of those algorithms using continuous quantum neural networks, for learning to trade portfolios of stocks on the stock market. The goal of meta-learning is to train a model on a variety of tasks, such that it can solve new learning tasks using only a small number of training samples. In our classical approach, we trained our meta-learning models on a variety of portfolios that contained 5 randomly sampled Consumer Cyclical stocks from a pool of 60. In our quantum approach, we trained our </span><span style="font-family:Verdana;">quantum meta-learning models on a simulated quantum computer with</span><span style="font-family:Verdana;"> portfolios containing 2 randomly sampled Consumer Cyclical stocks. Our findings suggest that both classical models could learn a new portfolio with 0.01% of the number of training samples to learn the original portfolios and can achieve a comparable performance within 0.1% Return on Investment of the Buy and Hold strategy. We also show that our much smaller quantum meta-learned models with only 60 model parameters and 25 training epochs </span><span style="font-family:Verdana;">have a similar learning pattern to our much larger classical meta-learned</span><span style="font-family:Verdana;"> models that have over 250,000 model parameters and 2500 training epochs. Given these findings</span></span></span></span><span style="font-family:Verdana;"><span style="font-family:Verdana;"><span style="font-family:Verdana;">,</span></span></span><span style="font-family:Verdana;"><span style="font-family:Verdana;"><span style="font-family:Verdana;"> we also discuss the benefits of scaling up our experiments from a simulated quantum computer to a 展开更多
关键词 reinforcement learning Deep learning META-learning Evolutionary Strategy quantum Computing quantum Machine learning Stock Market Algorithmic Trading
下载PDF
Training a Quantum Neural Network to Solve the Contextual Multi-Armed Bandit Problem
7
作者 Wei Hu James Hu 《Natural Science》 2019年第1期17-27,共11页
Artificial intelligence has permeated all aspects of our lives today. However, to make AI behave like real AI, the critical bottleneck lies in the speed of computing. Quantum computers employ the peculiar and unique p... Artificial intelligence has permeated all aspects of our lives today. However, to make AI behave like real AI, the critical bottleneck lies in the speed of computing. Quantum computers employ the peculiar and unique properties of quantum states such as superposition, entanglement, and interference to process information in ways that classical computers cannot. As a new paradigm of computation, quantum computers are capable of performing tasks intractable for classical processors, thus providing a quantum leap in AI research and making the development of real AI a possibility. In this regard, quantum machine learning not only enhances the classical machine learning approach but more importantly it provides an avenue to explore new machine learning models that have no classical counterparts. The qubit-based quantum computers cannot naturally represent the continuous variables commonly used in machine learning, since the measurement outputs of qubit-based circuits are generally discrete. Therefore, a continuous-variable (CV) quantum architecture based on a photonic quantum computing model is selected for our study. In this work, we employ machine learning and optimization to create photonic quantum circuits that can solve the contextual multi-armed bandit problem, a problem in the domain of reinforcement learning, which demonstrates that quantum reinforcement learning algorithms can be learned by a quantum device. 展开更多
关键词 Continuous-Variable quantum COMPUTERS quantum Machine learning quantum reinforcement learning CONTEXTUAL Multi-Armed BANDIT PROBLEM
下载PDF
Quantum Multiple Q-Learning 被引量:1
8
作者 Michael Ganger Wei Hu 《International Journal of Intelligence Science》 2019年第1期1-22,共22页
In this paper, a collection of value-based quantum reinforcement learning algorithms are introduced which use Grover’s algorithm to update the policy, which is stored as a superposition of qubits associated with each... In this paper, a collection of value-based quantum reinforcement learning algorithms are introduced which use Grover’s algorithm to update the policy, which is stored as a superposition of qubits associated with each possible action, and their parameters are explored. These algorithms may be grouped in two classes, one class which uses value functions (V(s)) and new class which uses action value functions (Q(s,a)). The new (Q(s,a))-based quantum algorithms are found to converge faster than V(s)-based algorithms, and in general the quantum algorithms are found to converge in fewer iterations than their classical counterparts, netting larger returns during training. This is due to fact that the (Q(s,a)) algorithms are more precise than those based on V(s), meaning that updates are incorporated into the value function more efficiently. This effect is also enhanced by the observation that the Q(s,a)-based algorithms may be trained with higher learning rates. These algorithms are then extended by adding multiple value functions, which are observed to allow larger learning rates and have improved convergence properties in environments with stochastic rewards, the latter of which is further improved by the probabilistic nature of the quantum algorithms. Finally, the quantum algorithms were found to use less CPU time than their classical counterparts overall, meaning that their benefits may be realized even without a full quantum computer. 展开更多
关键词 quantum COMPUTING reinforcement learning Q-learning
下载PDF
基于情景记忆的量子深度强化学习
9
作者 朱献超 侯晓凯 +1 位作者 吴绍君 祝峰 《电子科技大学学报》 EI CAS CSCD 北大核心 2022年第2期170-175,共6页
作为量子机器学习的一个新兴子领域,量子深度强化学习旨在利用量子神经网络构建一个量子智能体,使其通过与环境进行不断交互习得一个最优策略,以达到期望累积回报最大化。然而,现有量子深度强化学习方法在训练过程中需要与经典环境进行... 作为量子机器学习的一个新兴子领域,量子深度强化学习旨在利用量子神经网络构建一个量子智能体,使其通过与环境进行不断交互习得一个最优策略,以达到期望累积回报最大化。然而,现有量子深度强化学习方法在训练过程中需要与经典环境进行大量交互,从而导致大量多次调用量子线路。为此,该文提出了一种基于情景记忆的量子深度强化学习模型,称为量子情景记忆深度Q网络,该模型利用情景记忆来加速量子智能体的训练过程。具体来说,该模型将历史上出现的拥有高奖励值的经验记录到情景记忆中,使得在当前环境的状态与情景记忆中的某状态相似时,量子智能体可以根据该历史状态快速地获得想要的动作,从而减少了算法优化的迭代次数。在5个经典的雅达利游戏上的数值模拟表明,该文提出的方法可以显著地减少训练量子智能体的迭代次数,进而可以获得比其他量子深度强化学习方法更高的分数。 展开更多
关键词 深度强化学习 情景记忆 量子计算 量子强化学习
下载PDF
量子强化学习技术及研究进展
10
作者 韦云凯 王志宏 冷甦鹏 《广州大学学报(自然科学版)》 CAS 2021年第1期56-68,共13页
近年来,强化学习理论和算法研究迅速发展,并且在竞争博弈、智能控制、分析预测、优化调度等领域得到广泛应用.但是,传统强化学习算法学习效率低、系统开销大,尤其是面对复杂任务时这种情况更为严重.结合量子计算特性,可实现对强化学习... 近年来,强化学习理论和算法研究迅速发展,并且在竞争博弈、智能控制、分析预测、优化调度等领域得到广泛应用.但是,传统强化学习算法学习效率低、系统开销大,尤其是面对复杂任务时这种情况更为严重.结合量子计算特性,可实现对强化学习算法的加速,由此提出的量子强化学习技术,对强化学习技术的发展赋予了全新的动力与广阔的前景,引发了日益广泛的关注.文章对量子强化学习技术及其研究进展进行了介绍、分析与展望.首先,分别对量子计算和强化学习的基本概念和原理进行了介绍.在此基础上,介绍了量子强化学习的基本思想与机制,并从两方面分析介绍了量子强化学习的研究与进展:①传统计算环境下,将量子特性融入到强化学习以提高算法效率;②量子计算环境下,将经典环境量子化之后,智能体同环境进行量子化交互的强化学习技术.最后,对量子强化学习的应用前景进行了展望. 展开更多
关键词 量子计算 强化学习 量子强化学习 机器学习 人工智能
下载PDF
量子侧抑制强化竞争算法研究
11
作者 侯旋 《电子设计工程》 2015年第6期37-40,共4页
深入研究了量子竞争网络的原理,基于量子竞争算法,通过引入神经元侧抑制功能与强化学习机制,建立了量子侧抑制强化竞争网络模型,提出了量子侧抑制强化竞争算法。量子侧抑制强化竞争网络竞争层神经元所具有的侧抑制功能提高了网络精度,... 深入研究了量子竞争网络的原理,基于量子竞争算法,通过引入神经元侧抑制功能与强化学习机制,建立了量子侧抑制强化竞争网络模型,提出了量子侧抑制强化竞争算法。量子侧抑制强化竞争网络竞争层神经元所具有的侧抑制功能提高了网络精度,其强化学习机制加速了网络学习。通过对具有四类样本的数据集进行模式识别实验,量子侧抑制强化竞争算法的平均正确率为98.89%,量子竞争算法的平均正确率为95.56%。结果表明量子侧抑制强化竞争算法相对量子竞争算法具有更高的模式识别能力。 展开更多
关键词 量子神经网络 侧抑制联结 强化学习机制 量子竞争 模式识别
下载PDF
虚拟资源分配优化量子学习算法仿真研究 被引量:3
12
作者 叶青 方子叶 《计算机仿真》 北大核心 2020年第6期288-292,共5页
针对单一优化目标的基于用户需求或服务质量的虚拟资源分配问题,采用协同决策方法,将用户满意、效能最优和服务质量多目标协同集成,以虚拟资源计算服务租用收益和用户满意度效用最大化为优化目标,构建虚拟资源分配集成优化模型。提出改... 针对单一优化目标的基于用户需求或服务质量的虚拟资源分配问题,采用协同决策方法,将用户满意、效能最优和服务质量多目标协同集成,以虚拟资源计算服务租用收益和用户满意度效用最大化为优化目标,构建虚拟资源分配集成优化模型。提出改进的量子粒子群算法,设计种群的个体学习权重因子,增加个体寻优的学习激励策略,设置粒子搜索的学习阈值,解决算法陷入局部最优解的问题。通过种群搜索学习求解,得出满足用户需求、效能和服务质量的虚拟资源作业分配的最优服务收益选择方案。仿真结果表明,改进算法具有较好的全局搜索效率和性能。 展开更多
关键词 量子粒子群算法 强化学习 虚拟资源 效用分配 全局优化
下载PDF
上一页 1 下一页 到第
使用帮助 返回顶部