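Abstract: Classical feature point extraction algorithms traverse the entire image to locate feature points, which is computationally expensive and cannot meet the requirements of real-time applications. A fast sparse feature point extraction algorithm is proposed. The method first uses the Laplacian of Gaussian (LoG) operator to extract image gradients and applies a threshold to obtain a sparse edge matrix of the image. It then runs an improved Features from Accelerated Segment Test (FAST) detection algorithm on this sparse matrix, which removes the time-consuming feature point extraction step of traditional matching algorithms and makes real-time image matching possible. To reduce mismatched pairs, a perceptual hash algorithm is used to purify the matched pairs, and two constraints based on affine invariance are established to further verify the homography matrix, improving registration accuracy. Experimental results show that the algorithm improves both feature point extraction speed and registration accuracy.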
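The first stage described above (LoG thresholding to get a sparse set of edge pixels, then a FAST segment test on those pixels only) can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not say how FAST was improved, so the standard segment test is used here, and the function names, `sigma`, `log_thresh`, `t`, and `n` are all assumed values chosen for the example.

```python
import numpy as np

# Offsets (row, col) of the 16-pixel circle of radius 3 used by the FAST test.
CIRCLE = [(-3, 0), (-3, 1), (-2, 2), (-1, 3), (0, 3), (1, 3), (2, 2), (3, 1),
          (3, 0), (3, -1), (2, -2), (1, -3), (0, -3), (-1, -3), (-2, -2), (-3, -1)]


def log_kernel(sigma):
    """Sampled Laplacian-of-Gaussian kernel, shifted so it sums to zero."""
    radius = int(np.ceil(3 * sigma))
    y, x = np.mgrid[-radius:radius + 1, -radius:radius + 1]
    r2 = x ** 2 + y ** 2
    k = (r2 - 2 * sigma ** 2) / sigma ** 4 * np.exp(-r2 / (2 * sigma ** 2))
    return k - k.mean()


def log_response(img, sigma=1.0):
    """Filter the image with the LoG kernel (edge-replicated borders)."""
    k = log_kernel(sigma)
    r = k.shape[0] // 2
    h, w = img.shape
    padded = np.pad(img.astype(float), r, mode="edge")
    out = np.zeros((h, w), dtype=float)
    # The kernel is symmetric, so correlation and convolution coincide.
    for dy in range(k.shape[0]):
        for dx in range(k.shape[1]):
            out += k[dy, dx] * padded[dy:dy + h, dx:dx + w]
    return out


def sparse_edge_mask(img, sigma=1.0, log_thresh=10.0):
    """Boolean mask of pixels whose absolute LoG response exceeds the threshold."""
    return np.abs(log_response(img, sigma)) > log_thresh


def is_fast_corner(img, r, c, t=20, n=9):
    """Standard segment test: n contiguous circle pixels all brighter or all darker."""
    p = int(img[r, c])
    ring = [int(img[r + dr, c + dc]) for dr, dc in CIRCLE]
    for sign in (1, -1):  # +1: brighter than centre, -1: darker than centre
        flags = [sign * (v - p) > t for v in ring]
        run = 0
        for f in flags + flags:  # doubled list handles wrap-around on the circle
            run = run + 1 if f else 0
            if run >= n:
                return True
    return False


def sparse_fast(img, sigma=1.0, log_thresh=10.0, t=20, n=9):
    """Run the segment test only on LoG edge candidates instead of every pixel."""
    mask = sparse_edge_mask(img, sigma, log_thresh)
    h, w = img.shape
    corners = []
    for r, c in zip(*np.nonzero(mask)):
        if 3 <= r < h - 3 and 3 <= c < w - 3 and is_fast_corner(img, r, c, t, n):
            corners.append((int(r), int(c)))
    return corners, int(mask.sum())


if __name__ == "__main__":
    demo = np.zeros((32, 32), dtype=np.uint8)
    demo[10:20, 10:20] = 200  # bright square on a dark background
    found, n_candidates = sparse_fast(demo)
    print(len(found), "corners from", n_candidates, "candidates of", demo.size, "pixels")
```

The saving comes from the candidate set: flat regions give a zero LoG response and are never passed to the segment test, so the per-pixel circle comparison runs only near edges.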
Funding: Basic and Advanced Research Projects of CSTC, Grant/Award Number: cstc2019jcyj-zdxmX0008; Science and Technology Research Program of Chongqing Municipal Education Commission, Grant/Award Numbers: KJQN202100634, KJZDK201900605; National Natural Science Foundation of China, Grant/Award Number: 62006065.
Abstract: Scene perception and trajectory forecasting are two fundamental challenges that are crucial to a safe and reliable autonomous driving (AD) system. However, most proposed methods address only one of these two challenges with a single model. To tackle this dilemma, this paper proposes spatio-temporal semantics and interaction graph aggregation for multi-agent perception and trajectory forecasting (ST-SIGMA), an efficient end-to-end method that jointly and accurately perceives the AD environment and forecasts the trajectories of the surrounding traffic agents within a unified framework. ST-SIGMA adopts a trident encoder-decoder architecture to learn scene semantics and agent interaction information on bird’s-eye view (BEV) maps simultaneously. Specifically, an iterative aggregation network is first employed as the scene semantic encoder (SSE) to learn diverse scene information. To preserve dynamic interactions of traffic agents, ST-SIGMA further exploits a spatio-temporal graph network as the graph interaction encoder. Meanwhile, a simple yet efficient feature fusion method is designed to fuse semantic and interaction features into a unified feature space, which serves as the input to a novel hierarchical aggregation decoder for downstream prediction tasks. Extensive experiments on the nuScenes data set demonstrate that the proposed ST-SIGMA achieves significant improvements over state-of-the-art (SOTA) methods in both scene perception and trajectory forecasting. The proposed approach therefore outperforms SOTA methods in terms of model generalisation and robustness and is more feasible for deployment in real-world AD scenarios.