ETRI Knowledge Sharing Platform : Performance Enhancement of YOLOv3 by Adding Prediction Layers with Spatial Pyramid Pooling for Vehicle Detection

Titles

논문 검색
Type		SCI
Year	~	Keyword

List

Conference Paper Performance Enhancement of YOLOv3 by Adding Prediction Layers with Spatial Pyramid Pooling for Vehicle Detection

Cited 56 time in scopus

Citation: International Conference on Advanced Video and Signal-based Surveillance (AVSS) 2018, pp.411-416

Abstract: In recent years, vision-based object detection methods using convolutional neural network (CNN) have been very successful. However, the object detection method using the CNN feature has a disadvantage that lots of feature maps should be generated in order to be robust against the scale change and the occlusion of the object. Also, simply raising a large number of feature maps does not improve performance. We propose a multi-scale vehicle detection with spatial pyramid pooling method which is robust to the scale change of the vehicle and the occlusion by improving the conventional YOLOv3 algorithm. The proposed method was evaluated through the UA-DETRAC benchmark and obtain the state-of-the-art mAP, which is better than those of the DPM, ACF, R-CNN, CompACT, NANO, SA-FRCNN, and Faster-RCNN2.

KSP Keywords: CNN feature, Convolution neural network(CNN), Detection Method, Feature map, Multi-scale, Pooling method, R-CNN, Spatial Pyramid Pooling, Vehicle detection, neural network(NN), performance enhancement

218 Gajeong-ro, Yuseong-gu, Daejeon, 34129, KOREA, Contact: sh.kim@etri.re.kr

Please refrain from automatic collection of e-mail addresses posted on this homepage.