FPGA-Based Vehicle Detection and Tracking Accelerator
A convolutional neural network-based multiobject detection and tracking algorithm can be applied to vehicle detection and traffic flow statistics, thus enabling smart transportation. Aiming at the problems of the high computational complexity of multiobject detection and tracking algorithms, a large...
| Published in: | Sensors |
|---|---|
| Main Authors: | , , , |
| Format: | Article |
| Language: | English |
| Published: |
MDPI AG
2023-02-01
|
| Subjects: | |
| Online Access: | https://www.mdpi.com/1424-8220/23/4/2208 |
| _version_ | 1850095005458235392 |
|---|---|
| author | Jiaqi Zhai Bin Li Shunsen Lv Qinglei Zhou |
| author_facet | Jiaqi Zhai Bin Li Shunsen Lv Qinglei Zhou |
| author_sort | Jiaqi Zhai |
| collection | DOAJ |
| container_title | Sensors |
| description | A convolutional neural network-based multiobject detection and tracking algorithm can be applied to vehicle detection and traffic flow statistics, thus enabling smart transportation. Aiming at the problems of the high computational complexity of multiobject detection and tracking algorithms, a large number of model parameters, and difficulty in achieving high throughput with a low power consumption in edge devices, we design and implement a low-power, low-latency, high-precision, and configurable vehicle detector based on a field programmable gate array (FPGA) with YOLOv3 (You-Only-Look-Once-version3), YOLOv3-tiny CNNs (Convolutional Neural Networks), and the Deepsort algorithm. First, we use a dynamic threshold structured pruning method based on a scaling factor to significantly compress the detection model size on the premise that the accuracy does not decrease. Second, a dynamic 16-bit fixed-point quantization algorithm is used to quantify the network parameters to reduce the memory occupation of the network model. Furthermore, we generate a reidentification (RE-ID) dataset from the UA-DETRAC dataset and train the appearance feature extraction network on the Deepsort algorithm to improve the vehicles’ tracking performance. Finally, we implement hardware optimization techniques such as memory interlayer multiplexing, parameter rearrangement, ping-pong buffering, multichannel transfer, pipelining, Im2col+GEMM, and Winograd algorithms to improve resource utilization and computational efficiency. The experimental results demonstrate that the compressed YOLOv3 and YOLOv3-tiny network models decrease in size by 85.7% and 98.2%, respectively. The dual-module parallel acceleration meets the demand of the 6-way parallel video stream vehicle detection with the peak throughput at 168.72 fps. |
| format | Article |
| id | doaj-art-d1f3499cdb0d4d4dbc55dc2dccf47a35 |
| institution | Directory of Open Access Journals |
| issn | 1424-8220 |
| language | English |
| publishDate | 2023-02-01 |
| publisher | MDPI AG |
| record_format | Article |
| spelling | doaj-art-d1f3499cdb0d4d4dbc55dc2dccf47a352025-08-20T00:07:21ZengMDPI AGSensors1424-82202023-02-01234220810.3390/s23042208FPGA-Based Vehicle Detection and Tracking AcceleratorJiaqi Zhai0Bin Li1Shunsen Lv2Qinglei Zhou3School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, ChinaSchool of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, ChinaSchool of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, ChinaSchool of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, ChinaA convolutional neural network-based multiobject detection and tracking algorithm can be applied to vehicle detection and traffic flow statistics, thus enabling smart transportation. Aiming at the problems of the high computational complexity of multiobject detection and tracking algorithms, a large number of model parameters, and difficulty in achieving high throughput with a low power consumption in edge devices, we design and implement a low-power, low-latency, high-precision, and configurable vehicle detector based on a field programmable gate array (FPGA) with YOLOv3 (You-Only-Look-Once-version3), YOLOv3-tiny CNNs (Convolutional Neural Networks), and the Deepsort algorithm. First, we use a dynamic threshold structured pruning method based on a scaling factor to significantly compress the detection model size on the premise that the accuracy does not decrease. Second, a dynamic 16-bit fixed-point quantization algorithm is used to quantify the network parameters to reduce the memory occupation of the network model. Furthermore, we generate a reidentification (RE-ID) dataset from the UA-DETRAC dataset and train the appearance feature extraction network on the Deepsort algorithm to improve the vehicles’ tracking performance. Finally, we implement hardware optimization techniques such as memory interlayer multiplexing, parameter rearrangement, ping-pong buffering, multichannel transfer, pipelining, Im2col+GEMM, and Winograd algorithms to improve resource utilization and computational efficiency. The experimental results demonstrate that the compressed YOLOv3 and YOLOv3-tiny network models decrease in size by 85.7% and 98.2%, respectively. The dual-module parallel acceleration meets the demand of the 6-way parallel video stream vehicle detection with the peak throughput at 168.72 fps.https://www.mdpi.com/1424-8220/23/4/2208FPGAvehicle detectionaccelerator architectureYOLODeepSort |
| spellingShingle | Jiaqi Zhai Bin Li Shunsen Lv Qinglei Zhou FPGA-Based Vehicle Detection and Tracking Accelerator FPGA vehicle detection accelerator architecture YOLO DeepSort |
| title | FPGA-Based Vehicle Detection and Tracking Accelerator |
| title_full | FPGA-Based Vehicle Detection and Tracking Accelerator |
| title_fullStr | FPGA-Based Vehicle Detection and Tracking Accelerator |
| title_full_unstemmed | FPGA-Based Vehicle Detection and Tracking Accelerator |
| title_short | FPGA-Based Vehicle Detection and Tracking Accelerator |
| title_sort | fpga based vehicle detection and tracking accelerator |
| topic | FPGA vehicle detection accelerator architecture YOLO DeepSort |
| url | https://www.mdpi.com/1424-8220/23/4/2208 |
| work_keys_str_mv | AT jiaqizhai fpgabasedvehicledetectionandtrackingaccelerator AT binli fpgabasedvehicledetectionandtrackingaccelerator AT shunsenlv fpgabasedvehicledetectionandtrackingaccelerator AT qingleizhou fpgabasedvehicledetectionandtrackingaccelerator |
