FPGA-Based Vehicle Detection and Tracking Accelerator

A convolutional neural network-based multiobject detection and tracking algorithm can be applied to vehicle detection and traffic flow statistics, thus enabling smart transportation. Aiming at the problems of the high computational complexity of multiobject detection and tracking algorithms, a large...

Full description

Bibliographic Details
Published in:Sensors
Main Authors: Jiaqi Zhai, Bin Li, Shunsen Lv, Qinglei Zhou
Format: Article
Language:English
Published: MDPI AG 2023-02-01
Subjects:
Online Access:https://www.mdpi.com/1424-8220/23/4/2208
_version_ 1850095005458235392
author Jiaqi Zhai
Bin Li
Shunsen Lv
Qinglei Zhou
author_facet Jiaqi Zhai
Bin Li
Shunsen Lv
Qinglei Zhou
author_sort Jiaqi Zhai
collection DOAJ
container_title Sensors
description A convolutional neural network-based multiobject detection and tracking algorithm can be applied to vehicle detection and traffic flow statistics, thus enabling smart transportation. Aiming at the problems of the high computational complexity of multiobject detection and tracking algorithms, a large number of model parameters, and difficulty in achieving high throughput with a low power consumption in edge devices, we design and implement a low-power, low-latency, high-precision, and configurable vehicle detector based on a field programmable gate array (FPGA) with YOLOv3 (You-Only-Look-Once-version3), YOLOv3-tiny CNNs (Convolutional Neural Networks), and the Deepsort algorithm. First, we use a dynamic threshold structured pruning method based on a scaling factor to significantly compress the detection model size on the premise that the accuracy does not decrease. Second, a dynamic 16-bit fixed-point quantization algorithm is used to quantify the network parameters to reduce the memory occupation of the network model. Furthermore, we generate a reidentification (RE-ID) dataset from the UA-DETRAC dataset and train the appearance feature extraction network on the Deepsort algorithm to improve the vehicles’ tracking performance. Finally, we implement hardware optimization techniques such as memory interlayer multiplexing, parameter rearrangement, ping-pong buffering, multichannel transfer, pipelining, Im2col+GEMM, and Winograd algorithms to improve resource utilization and computational efficiency. The experimental results demonstrate that the compressed YOLOv3 and YOLOv3-tiny network models decrease in size by 85.7% and 98.2%, respectively. The dual-module parallel acceleration meets the demand of the 6-way parallel video stream vehicle detection with the peak throughput at 168.72 fps.
format Article
id doaj-art-d1f3499cdb0d4d4dbc55dc2dccf47a35
institution Directory of Open Access Journals
issn 1424-8220
language English
publishDate 2023-02-01
publisher MDPI AG
record_format Article
spelling doaj-art-d1f3499cdb0d4d4dbc55dc2dccf47a352025-08-20T00:07:21ZengMDPI AGSensors1424-82202023-02-01234220810.3390/s23042208FPGA-Based Vehicle Detection and Tracking AcceleratorJiaqi Zhai0Bin Li1Shunsen Lv2Qinglei Zhou3School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, ChinaSchool of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, ChinaSchool of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, ChinaSchool of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, ChinaA convolutional neural network-based multiobject detection and tracking algorithm can be applied to vehicle detection and traffic flow statistics, thus enabling smart transportation. Aiming at the problems of the high computational complexity of multiobject detection and tracking algorithms, a large number of model parameters, and difficulty in achieving high throughput with a low power consumption in edge devices, we design and implement a low-power, low-latency, high-precision, and configurable vehicle detector based on a field programmable gate array (FPGA) with YOLOv3 (You-Only-Look-Once-version3), YOLOv3-tiny CNNs (Convolutional Neural Networks), and the Deepsort algorithm. First, we use a dynamic threshold structured pruning method based on a scaling factor to significantly compress the detection model size on the premise that the accuracy does not decrease. Second, a dynamic 16-bit fixed-point quantization algorithm is used to quantify the network parameters to reduce the memory occupation of the network model. Furthermore, we generate a reidentification (RE-ID) dataset from the UA-DETRAC dataset and train the appearance feature extraction network on the Deepsort algorithm to improve the vehicles’ tracking performance. Finally, we implement hardware optimization techniques such as memory interlayer multiplexing, parameter rearrangement, ping-pong buffering, multichannel transfer, pipelining, Im2col+GEMM, and Winograd algorithms to improve resource utilization and computational efficiency. The experimental results demonstrate that the compressed YOLOv3 and YOLOv3-tiny network models decrease in size by 85.7% and 98.2%, respectively. The dual-module parallel acceleration meets the demand of the 6-way parallel video stream vehicle detection with the peak throughput at 168.72 fps.https://www.mdpi.com/1424-8220/23/4/2208FPGAvehicle detectionaccelerator architectureYOLODeepSort
spellingShingle Jiaqi Zhai
Bin Li
Shunsen Lv
Qinglei Zhou
FPGA-Based Vehicle Detection and Tracking Accelerator
FPGA
vehicle detection
accelerator architecture
YOLO
DeepSort
title FPGA-Based Vehicle Detection and Tracking Accelerator
title_full FPGA-Based Vehicle Detection and Tracking Accelerator
title_fullStr FPGA-Based Vehicle Detection and Tracking Accelerator
title_full_unstemmed FPGA-Based Vehicle Detection and Tracking Accelerator
title_short FPGA-Based Vehicle Detection and Tracking Accelerator
title_sort fpga based vehicle detection and tracking accelerator
topic FPGA
vehicle detection
accelerator architecture
YOLO
DeepSort
url https://www.mdpi.com/1424-8220/23/4/2208
work_keys_str_mv AT jiaqizhai fpgabasedvehicledetectionandtrackingaccelerator
AT binli fpgabasedvehicledetectionandtrackingaccelerator
AT shunsenlv fpgabasedvehicledetectionandtrackingaccelerator
AT qingleizhou fpgabasedvehicledetectionandtrackingaccelerator