This paper presents a novel formulation for detecting objects with articulated rigid bodies from highresolution monitoring images, particularly engineering vehicles. There are many pixels in high-resolution monitoring...This paper presents a novel formulation for detecting objects with articulated rigid bodies from highresolution monitoring images, particularly engineering vehicles. There are many pixels in high-resolution monitoring images, and most of them represent the background. Our method first detects ob ject patches from monitoring images using a coarse detection process. In this phase, we build a descriptor based on histograms of oriented gradient, which contain color frequency information. Then we use a linear support vector machine to rapidly detect many image patches that may contain ob ject parts, with a low false negative rate and a high false positive rate. In the second phase, we apply a refinement classification to determine the patches that actually contain ob jects. In this stage, we increase the size of the image patches so that they include the complete ob ject using models of the ob ject parts.Then an accelerated and improved salient mask is used to improve the performance of the dense scale-invariant feature transform descriptor. The detection process returns the absolute position of positive ob jects in the original images. We have applied our methods to three datasets to demonstrate their effectiveness.展开更多
基金supported by the China Knowledge Centre for Engineering Sciences and Technology(No.CKCEST-2014-1-2)the Zhejiang Provincial Natural Science Foundation of China(No.LY14F020027)the National Natural Science Foundation of China(No.61272304)
文摘This paper presents a novel formulation for detecting objects with articulated rigid bodies from highresolution monitoring images, particularly engineering vehicles. There are many pixels in high-resolution monitoring images, and most of them represent the background. Our method first detects ob ject patches from monitoring images using a coarse detection process. In this phase, we build a descriptor based on histograms of oriented gradient, which contain color frequency information. Then we use a linear support vector machine to rapidly detect many image patches that may contain ob ject parts, with a low false negative rate and a high false positive rate. In the second phase, we apply a refinement classification to determine the patches that actually contain ob jects. In this stage, we increase the size of the image patches so that they include the complete ob ject using models of the ob ject parts.Then an accelerated and improved salient mask is used to improve the performance of the dense scale-invariant feature transform descriptor. The detection process returns the absolute position of positive ob jects in the original images. We have applied our methods to three datasets to demonstrate their effectiveness.