Journal of System Simulation ›› 2025, Vol. 37 ›› Issue (2): 551-562.doi: 10.16182/j.issn1004731x.joss.23-1564
• Papers • Previous Articles
Guo Yecai1,2, Sun Jingdong1, Saha Amitave1
Received:
2023-12-25
Revised:
2024-01-25
Online:
2025-02-14
Published:
2025-02-10
CLC Number:
Guo Yecai, Sun Jingdong, Saha Amitave. Improved Target Detection Algorithm for Aerial Images Based on YOLOv5[J]. Journal of System Simulation, 2025, 37(2): 551-562.
Table 6
Detection accuracy of each model in the VisDrone-2019 dataset
Class | SSD | Faster RCNN | YOLOv7 | YOLOv8n | YOLOv5s | FSD-YOLOv5s |
---|---|---|---|---|---|---|
pedestrian | 22.4 | 8.3 | 40.0 | 37.3 | 40.1 | 46.0 |
people | 10.3 | 3.4 | 27.8 | 29.3 | 32.6 | 36.9 |
bicycle | 5.30 | 4.60 | 8.96 | 9.20 | 12.00 | 11.10 |
car | 58.0 | 41.2 | 74.1 | 76.8 | 74.2 | 78.6 |
van | 30.3 | 30.7 | 42.3 | 40.8 | 36.8 | 39.2 |
truck | 29.7 | 38.7 | 43.4 | 32.7 | 29.7 | 30.2 |
tricycle | 11.5 | 18.5 | 20.0 | 25.1 | 20.9 | 20.7 |
awning-tricycle | 4.5 | 10.1 | 13.0 | 12.5 | 11.5 | 11.5 |
bus | 46.5 | 52.4 | 55.9 | 50.6 | 43.2 | 47.0 |
motor | 19.5 | 10.0 | 36.8 | 39.4 | 38.8 | 42.2 |
mAP | 23.8 | 21.8 | 36.2 | 35.4 | 33.9 | 36.3 |
FPS | 67 | 13 | 56 | 62 | 57 | 48 |
1 | Song Gang, Du Hongwei, Zhang Xinyue, et al. Small Object Detection in Unmanned Aerial Vehicle Images Using Multi-scale Hybrid Attention[J]. Engineering Applications of Artificial Intelligence, 2024, 128: 107455. |
2 | Du Dawei, Qi Yuankai, Yu Hongyang, et al. The Unmanned Aerial Vehicle Benchmark: Object Detection and Tracking[C]//Computer Vision-ECCV 2018. Cham: Springer International Publishing, 2018: 375-391. |
3 | Gu Jingjing, Su Tao, Wang Qiuhong, et al. Multiple Moving Targets Surveillance Based on a Cooperative Network for Multi-UAV[J]. IEEE Communications Magazine, 2018, 56(4): 82-89. |
4 | Kussul Nataliia, Lavreniuk Mykola, Skakun S, et al. Deep Learning Classification of Land Cover and Crop Types Using Remote Sensing Data[J]. IEEE Geoscience and Remote Sensing Letters, 2017, 14(5): 778-782. |
5 | Sadgrove Edmund J, Falzon Greg, Miron David, et al. Real-time Object Detection in Agricultural/Remote Environments Using the Multiple-expert Colour Feature Extreme Learning Machine (MEC-ELM)[J]. Computers in Industry, 2018, 98: 183-191. |
6 | Girshick R, Donahue J, Darrell T, et al. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation[C]//2014 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2014: 580-587. |
7 | Girshick R. Fast R-CNN[C]//2015 IEEE International Conference on Computer Vision (ICCV). Piscataway: IEEE, 2015: 1440-1448. |
8 | Ren Shaoqing, He Kaiming, Girshick R, et al. Faster R-CNN: Towards Real-time Object Detection with Region Proposal Networks[C]//Proceedings of the 28th International Conference on Neural Information Processing Systems. Cambridge: MIT Press, 2015: 91-99. |
9 | Liu Wei, Anguelov D, Erhan D, et al. SSD: Single Shot MultiBox Detector[C]//Computer Vision-ECCV 2016. Cham: Springer International Publishing, 2016: 21-37. |
10 | Lin T Y, Goyal P, Girshick R, et al. Focal Loss for Dense Object Detection[C]//2017 IEEE International Conference on Computer Vision (ICCV). Piscataway: IEEE, 2017: 2999-3007. |
11 | Redmon J, Divvala S, Girshick R, et al. You Only Look Once: Unified, Real-time Object Detection[C]//2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2016: 779-788. |
12 | Redmon J, Farhadi A. YOLO9000: Better, Faster, Stronger[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2017: 6517-6525. |
13 | Redmon J, Farhadi A. YOLOv3: An Incremental Improvement[EB/OL]. (2018-04-08) [2023-07-07]. . |
14 | Bochkovskiy Alexey, Yao Wang Chien, Liao Hongyuan. YOLOv4: Optimal Speed and Accuracy of Object Detection[EB/OL]. (2020-04-23) [2023-07-07]. . |
15 | Xiao Hanguang, Li Yuewei, Xiu Yu, et al. Development of Outdoor Swimmers Detection System with Small Object Detection Method Based on Deep Learning[J]. Multimedia Systems, 2023, 29(1): 323-332. |
16 | Onur Can Koyun, Reyhan Kevser Keser, İbrahim Batuhan Akkaya, et al. Focus-and-detect: A Small Object Detection Framework for Aerial Images[J]. Signal Processing: Image Communication, 2022, 104: 116675. |
17 | Xue Zhenyang, Lin Haifeng, Wang Fang. A Small Target Forest Fire Detection Model Based on YOLOv5 Improvement[J]. Forests, 2022, 13(8): 1332. |
18 | Zhang Yifan, Ren Weiqiang, Zhang Zhang, et al. Focal and Efficient IoU Loss for Accurate Bounding Box Regression[J]. Neurocomputing, 2022, 506(C): 146-157. |
19 | Sunkara R, Luo Tie. No More Strided Convolutions or Pooling: A New CNN Building Block for Low-resolution Images and Small Objects[C]//Joint European Conference on Machine Learning and Knowledge Discovery in Databases. Cham: Springer Nature Switzerland, 2023: 443-459. |
20 | Huang Gao, Liu Zhuang, Laurens Van Der Maaten, et al. Densely Connected Convolutional Networks[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2017: 2261-2269. |
21 | Du Dawei, Zhu Pengfei, Wen Longyin, et al. VisDrone-DET2019: The Vision Meets Drone Object Detection in Image Challenge Results[C]//2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW). Piscataway: IEEE, 2019: 213-226. |
22 | He Kaiming, Zhang Xiangyu, Ren Shaoqing, et al. Deep Residual Learning for Image Recognition[C]//2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2016: 770-778. |
23 | Newell A, Yang Kaiyu, Deng Jia. Stacked Hourglass Networks for Human Pose Estimation[C]//Computer Vision-ECCV 2016. Cham: Springer International Publishing, 2016: 483-499. |
24 | Xie Saining, Girshick R, Dollár Piotr, et al. Aggregated Residual Transformations for Deep Neural Networks[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2017: 5987-5995. |
25 | Lin T Y, Dollár Piotr, Girshick R, et al. Feature Pyramid Networks for Object Detection[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2017: 936-944. |
26 | Khan Habib, Hussain T, Samee Ullah Khan, et al. Deep Multi-scale Pyramidal Features Network for Supervised Video Summarization[J]. Expert Systems with Applications, 2024, 237, Part C: 121288. |
27 | He Kaiming, Gkioxari G, Dollár Piotr, et al. Mask R-CNN[C]//2017 IEEE International Conference on Computer Vision (ICCV). Piscataway: IEEE, 2017: 2980-2988. |
28 | Cai Zhaowei, Vasconcelos N. Cascade R-CNN: Delving into High Quality Object Detection[C]//2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2018: 6154-6162. |
29 | Wang C Y, Bochkovskiy A, Liao Hongyuan. YOLOv7: Trainable Bag-of-freebies Sets New State-of-the-art for Real-time Object Detectors[C]//2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2023: 7464-7475. |
30 | Tian Zhi, Shen Chunhua, Chen Hao, et al. FCOS: Fully Convolutional One-stage Object Detection[C]//2019 IEEE/CVF International Conference on Computer Vision (ICCV). Piscataway: IEEE, 2019: 9626-9635. |
31 | Duan Kaiwen, Bai Song, Xie Lingxi, et al. CenterNet: Keypoint Triplets for Object Detection[C]//2019 IEEE/CVF International Conference on Computer Vision (ICCV). Piscataway: IEEE, 2019: 6568-6577. |
32 | Bodla N, Singh B, Chellappa R, et al. Soft-NMS-improving Object Detection with One Line of Code[C]//2017 IEEE International Conference on Computer Vision (ICCV). Piscataway: IEEE, 2017: 5562-5570. |
33 | Neubeck A, Van Gool L. Efficient Non-maximum Suppression[C]//18th International Conference on Pattern Recognition (ICPR'06). Piscataway: IEEE, 2006: 850-855. |
34 | Dai Xiyang, Chen Yinpeng, Xiao Bin, et al. Dynamic Head: Unifying Object Detection Heads with Attentions[C]//2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2021: 7369-7378. |
35 | Liang Tingting, Chu Xiaojie, Liu Yudong, et al. CBNet: A Composite Backbone Network Architecture for Object Detection[J]. IEEE Transactions on Image Processing, 2022, 31: 6893-6906. |
36 | Lin T Y, Maire M, Belongie S, et al. Microsoft COCO: Common Objects in Context[C]//Computer Vision-ECCV 2014. Cham: Springer International Publishing, 2014: 740-755. |
37 | Carion N, Massa F, Synnaeve G, et al. End-to-end Object Detection with Transformers[C]//Computer Vision- ECCV 2020. Cham: Springer International Publishing, 2020: 213-229. |
38 | Liu Shu, Qi Lu, Qin Haifang, et al. Path Aggregation Network for Instance Segmentation[C]//2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2018: 8759-8768. |
[1] | Xu Zhongkai, Liu Yanling, Sheng Xiaojuan, Wang Chao, Ke Wenjun. Automatic Detection Algorithm for Typical Defects of Substation Based on Improved YOLOv5 [J]. Journal of System Simulation, 2024, 36(11): 2604-2615. |
[2] | Fu Qiang, Teng Xianyun, Ji Yuanfa, Ren Fenghua. SLAM Dynamic Algorithm Based on Improved Feature Description [J]. Journal of System Simulation, 2024, 36(11): 2712-2721. |
[3] | Su Tong, Wang Ying, Deng Qiyang, Li Zhaobin. Improved Foggy Pedestrian and Vehicle Detection Algorithm Based on YOLOv5 [J]. Journal of System Simulation, 2024, 36(10): 2413-2422. |
Viewed | ||||||
Full text |
|
|||||
Abstract |
|
|||||