Segmentation of moving objects in a video sequence is a basic task for application of computer vision. However, shadows extracted along with the objects can result in large errors in object localization and recognitio...Segmentation of moving objects in a video sequence is a basic task for application of computer vision. However, shadows extracted along with the objects can result in large errors in object localization and recognition. In this paper, we propose a method of moving shadow detection based on edge information, which can effectively detect the cast shadow of a moving vehicle in a traffic scene. Having confirmed shadows existing in a figure, we execute the shadow removal algorithm proposed in this paper to segment the shadow from the foreground. The shadow eliminating algorithm removes the boundary of the cast shadow and preserves object edges firstly; secondly, it reconstructs coarse object shapes based on the edge information of objects; and finally, it extracts the cast shadow by subtracting the moving object from the change detection mask and performs further processing. The proposed method has been further tested on images taken under different shadow orientations, vehicle colors and vehicle sizes, and the results have revealed that shadows can be successfully eliminated and thus good video segmentation can be obtained.展开更多
For some important object recognition applications such as intelligent robots and unmanned driving, images are collected on a consecutive basis and associated among themselves, besides, the scenes have steady prior fe...For some important object recognition applications such as intelligent robots and unmanned driving, images are collected on a consecutive basis and associated among themselves, besides, the scenes have steady prior features. Yet existing technologies do not take full advantage of this information. In order to take object recognition further than existing algorithms in the above application, an object recognition method that fuses temporal sequence with scene priori information is proposed. This method first employs YOLOv3 as the basic algorithm to recognize objects in single-frame images, then the DeepSort algorithm to establish association among potential objects recognized in images of different moments, and finally the confidence fusion method and temporal boundary processing method designed herein to fuse, at the decision level, temporal sequence information with scene priori information. Experiments using public datasets and self-built industrial scene datasets show that due to the expansion of information sources, the quality of single-frame images has less impact on the recognition results, whereby the object recognition is greatly improved. It is presented herein as a widely applicable framework for the fusion of information under multiple classes. All the object recognition algorithms that output object class, location information and recognition confidence at the same time can be integrated into this information fusion framework to improve performance.展开更多
利用遥感影像识别土地利用类型及监测其变化情况在城市规划和土地利用优化等领域发挥着重要作用。当前,相关数据集存在样本量少、类别划分不合理、数据不开源等局限,难以满足样本驱动的深度学习遥感信息提取范式的需求。本文构建了一个...利用遥感影像识别土地利用类型及监测其变化情况在城市规划和土地利用优化等领域发挥着重要作用。当前,相关数据集存在样本量少、类别划分不合理、数据不开源等局限,难以满足样本驱动的深度学习遥感信息提取范式的需求。本文构建了一个面向深度学习的大规模场景分类与变化检测数据集MtSCCD (Multi-temporal Scene Classification and Change Detection)。该数据集包括MtSCCD_LUSC (MtSCCD Land Use Scene Classification)和MtSCCD_LUCD (MtSCCD Land Use Change Detection)两个子数据集,分别用于土地利用场景分类与变化检测任务。该数据集具有以下特点:(1) MtSCCD是目前规模最大的公开的土地利用类型识别与检测数据集,包含10种土地利用类型共65548幅图像,并且样本覆盖中国5个城市的中心区域;(2)由于MtSCCD数据集根据城市划分训练集、验证集以及测试集,对于新增的城市土地利用数据,可以根据需求划分为训练集与验证集或测试集,因此可扩展性较高;(3) MtSCCD数据集中测试集与训练集的样本来自不同的城市,因此符合实际业务需求,且能够验证模型的泛化性能。基于MtSCCD_LUSC和MtSCCD_LUCD两个子数据集,本文评估了多个深度学习网络的分类与变化检测效果,为后续的相关研究提供了参考。展开更多
基金The work was supported by the National Natural Science Foundation of PRC (No.60574033)the National Key Fundamental Research & Development Programs(973)of PRC (No.2001CB309403)
文摘Segmentation of moving objects in a video sequence is a basic task for application of computer vision. However, shadows extracted along with the objects can result in large errors in object localization and recognition. In this paper, we propose a method of moving shadow detection based on edge information, which can effectively detect the cast shadow of a moving vehicle in a traffic scene. Having confirmed shadows existing in a figure, we execute the shadow removal algorithm proposed in this paper to segment the shadow from the foreground. The shadow eliminating algorithm removes the boundary of the cast shadow and preserves object edges firstly; secondly, it reconstructs coarse object shapes based on the edge information of objects; and finally, it extracts the cast shadow by subtracting the moving object from the change detection mask and performs further processing. The proposed method has been further tested on images taken under different shadow orientations, vehicle colors and vehicle sizes, and the results have revealed that shadows can be successfully eliminated and thus good video segmentation can be obtained.
文摘For some important object recognition applications such as intelligent robots and unmanned driving, images are collected on a consecutive basis and associated among themselves, besides, the scenes have steady prior features. Yet existing technologies do not take full advantage of this information. In order to take object recognition further than existing algorithms in the above application, an object recognition method that fuses temporal sequence with scene priori information is proposed. This method first employs YOLOv3 as the basic algorithm to recognize objects in single-frame images, then the DeepSort algorithm to establish association among potential objects recognized in images of different moments, and finally the confidence fusion method and temporal boundary processing method designed herein to fuse, at the decision level, temporal sequence information with scene priori information. Experiments using public datasets and self-built industrial scene datasets show that due to the expansion of information sources, the quality of single-frame images has less impact on the recognition results, whereby the object recognition is greatly improved. It is presented herein as a widely applicable framework for the fusion of information under multiple classes. All the object recognition algorithms that output object class, location information and recognition confidence at the same time can be integrated into this information fusion framework to improve performance.
文摘利用遥感影像识别土地利用类型及监测其变化情况在城市规划和土地利用优化等领域发挥着重要作用。当前,相关数据集存在样本量少、类别划分不合理、数据不开源等局限,难以满足样本驱动的深度学习遥感信息提取范式的需求。本文构建了一个面向深度学习的大规模场景分类与变化检测数据集MtSCCD (Multi-temporal Scene Classification and Change Detection)。该数据集包括MtSCCD_LUSC (MtSCCD Land Use Scene Classification)和MtSCCD_LUCD (MtSCCD Land Use Change Detection)两个子数据集,分别用于土地利用场景分类与变化检测任务。该数据集具有以下特点:(1) MtSCCD是目前规模最大的公开的土地利用类型识别与检测数据集,包含10种土地利用类型共65548幅图像,并且样本覆盖中国5个城市的中心区域;(2)由于MtSCCD数据集根据城市划分训练集、验证集以及测试集,对于新增的城市土地利用数据,可以根据需求划分为训练集与验证集或测试集,因此可扩展性较高;(3) MtSCCD数据集中测试集与训练集的样本来自不同的城市,因此符合实际业务需求,且能够验证模型的泛化性能。基于MtSCCD_LUSC和MtSCCD_LUCD两个子数据集,本文评估了多个深度学习网络的分类与变化检测效果,为后续的相关研究提供了参考。