DualStream-AttnXGS: An Attention-Enhanced Dual-Stream Model Based on Human Keypoint Recognition for Driver Distraction Detection

Zhuo He; Chengming Chen; Xiaoyi Zhou

doi:10.3390/app152412974

Abstract

Driver distraction remains one of the leading causes of traffic accidents. Although deep learning approaches such as Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), and Transformers have been widely applied to distracted driving detection, their performance is often limited by poor real-time efficiency and high false detection rates. To address these challenges, this paper proposes an efficient dual-stream neural architecture, termed DualStream-AttnXGS, which jointly leverages visual and pose information to improve distraction recognition accuracy. In the RGB stream, an enhanced EfficientNetB0 backbone integrates Ghost Convolution and Coordinate Attention modules to strengthen feature representation while keeping computation lightweight. A compound loss function combining Center Loss and Focal Loss is further introduced to promote inter-class separability and stabilize training. In parallel, the keypoint stream extracts human skeletal features using YOLOv8-Pose; these features are then classified by a compact ensemble model based on XGBoost v2.1.4 and Gradient Boosting. Finally, a Softmax-based probabilistic fusion strategy integrates the outputs of both streams for the final prediction. The proposed model achieved 99.59% accuracy on the SFD3 dataset and 99.12% accuracy on the AUCD2 dataset, indicating that the dual-stream architecture is more effective than single-stream models because it leverages complementary visual and pose information.
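
The abstract does not spell out the fusion rule, so the following is only a minimal sketch of one plausible reading of "Softmax-based probabilistic fusion": each stream's class scores are converted to probabilities with a softmax, and the two distributions are combined by a weighted average. The function names, the `rgb_weight` parameter, and the example scores are hypothetical and are not taken from the paper.

```python
import math


def softmax(scores):
    """Convert raw class scores to probabilities (numerically stable)."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_streams(rgb_scores, pose_scores, rgb_weight=0.5):
    """Weighted average of the two streams' softmax probabilities.

    Assumption: rgb_weight is an illustrative hyperparameter; the paper's
    actual weighting scheme is not given in the abstract.
    Returns (fused_probabilities, predicted_class_index).
    """
    if len(rgb_scores) != len(pose_scores):
        raise ValueError("both streams must score the same set of classes")
    if not 0.0 <= rgb_weight <= 1.0:
        raise ValueError("rgb_weight must lie in [0, 1]")

    p_rgb = softmax(rgb_scores)
    p_pose = softmax(pose_scores)
    fused = [
        rgb_weight * a + (1.0 - rgb_weight) * b
        for a, b in zip(p_rgb, p_pose)
    ]
    predicted = max(range(len(fused)), key=fused.__getitem__)
    return fused, predicted


# Hypothetical scores for three classes from each stream.
rgb_scores = [2.0, 1.0, 0.1]
pose_scores = [0.5, 2.5, 0.3]
fused, predicted = fuse_streams(rgb_scores, pose_scores)
```

Because each input is a probability distribution and the weights sum to one, the fused vector is also a valid distribution. In this example the streams disagree, and the more confident pose stream determines the fused prediction at equal weights.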

Publication Info

Year: 2025
Type: article
Volume: 15
Issue: 24
Pages: 12974-12974
Citations: 0
Access: Closed

Cite This

APA Style

Zhuo He, Chengming Chen, Xiaoyi Zhou (2025). DualStream-AttnXGS: An Attention-Enhanced Dual-Stream Model Based on Human Keypoint Recognition for Driver Distraction Detection. Applied Sciences, 15(24), 12974-12974. https://doi.org/10.3390/app152412974
