To address the degradation in recognition accuracy caused by false and missed detections of small target traffic signs, this study proposes a small traffic sign recognition algorithm based on CGT-YOLO. First, a context-aware enhancement module (CAM) is introduced to replace the spatial pyramid pooling fast (SPPF) module in the YOLOv5s network. By employing parallel dilated convolutions with different dilation rates, the CAM enhances the multiscale feature representation and contextual information of small traffic signs without reducing spatial resolution. Second, a global attention mechanism (GAM) is inserted after the concatenation operation in the backbone network of YOLOv5s. The GAM takes the features enhanced by the CAM and strengthens global interaction between the channel and spatial dimensions through 3D permutation, a multi-layer perceptron, and convolutional spatial attention, thereby highlighting the features of small traffic signs and mitigating the negative effects of complex backgrounds and long distances. Finally, a task-specific context (TSC) decoupled head is used to separate the features for the classification and localization tasks. Through its semantic context encoder (SCE) and detail preservation encoder (DPE) modules, the head generates semantically rich low-resolution feature maps for classification and high-resolution feature maps containing boundary information for localization, respectively. This disentangles classification and localization at the feature source, resolving the feature conflict between the two tasks for small target traffic signs. Experimental results on a dataset constructed by integrating TT100K and CCTSDB show that the improved model performs better across all metrics: the missed detection rate and false detection rate are reduced by 12.1 and 11.6 percentage points, respectively, while mAP(0.50:0.95) increases by 0.0260.
Compared to models such as YOLOv8s, NanoDet-Plus, and RT-DETR-Nano, CGT-YOLO demonstrates superior performance across multiple metrics. While maintaining a high inference speed (72.5 FPS), it effectively reduces false and missed detections, significantly improving the detection accuracy and robustness of small target traffic signs in complex scenarios.
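The claim that parallel dilated convolutions widen context "without reducing spatial resolution" can be checked with standard convolution arithmetic. The sketch below is illustrative only and is not the authors' implementation: the 3×3 kernel, the dilation rates (1, 3, 5), the padding rule (padding equal to the dilation rate), and the 20×20 feature map are all assumptions, since the abstract does not state them.

```python
def effective_kernel_size(kernel_size: int, dilation: int) -> int:
    """Span covered by a dilated kernel: d * (k - 1) + 1."""
    return dilation * (kernel_size - 1) + 1


def conv_output_size(size: int, kernel_size: int, dilation: int,
                     padding: int, stride: int = 1) -> int:
    """Standard output-size formula for a (dilated) convolution."""
    k_eff = effective_kernel_size(kernel_size, dilation)
    return (size + 2 * padding - k_eff) // stride + 1


# Assumed values, chosen only for illustration (not given in the abstract).
ASSUMED_KERNEL = 3
ASSUMED_DILATIONS = (1, 3, 5)
ASSUMED_FEATURE_MAP = 20


def branch_summary(size: int = ASSUMED_FEATURE_MAP,
                   dilations: tuple = ASSUMED_DILATIONS) -> list:
    """Describe each parallel branch: context span and output size.

    With a 3x3 kernel and stride 1, setting padding = dilation keeps the
    output the same size as the input while the span grows with the rate.
    """
    return [
        {
            "dilation": d,
            "span": effective_kernel_size(ASSUMED_KERNEL, d),
            "out": conv_output_size(size, ASSUMED_KERNEL, d, padding=d),
        }
        for d in dilations
    ]


if __name__ == "__main__":
    for branch in branch_summary():
        print(branch)
```

Under these assumptions every branch returns a map of the same size as its input, so the branch outputs can be fused directly, while the span each output pixel sees grows from 3 to 7 to 11 pixels. Pooling-based context aggregation, by contrast, mixes neighboring responses at a fixed window size.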