Quantitative Identification of Driver Distraction: A Weakly Supervised Contrastive Learning Approach
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS(2024)
Nanyang Technol Univ | Polytech Hauts Defrance
Abstract
Accurate recognition of driver distraction is significant for the design of human-machine cooperation driving systems. Existing studies mainly focus on classifying varied distracted driving behaviors, which depend heavily on the scale and quality of datasets and only detect the discrete distraction categories. Therefore, most data-driven approaches have limited capability of recognizing unseen driving activities and cannot provide a reasonable solution for downstream applications. To address these challenges, this paper develops a vision Transformer-enabled weakly supervised contrastive (W-SupCon) learning framework, in which distracted behaviors are quantified by calculating their distances from the normal driving representation set. The Gaussian mixed model (GMM) is employed for the representation clustering, which centralizes the distribution of the normal driving representation set to better identify distracted behaviors. A novel driver behavior dataset and the other three ones are employed for the evaluation, experimental results demonstrate that our proposed approach has more accurate and robust performance than existing methods in the recognition of unknown driver activities. Furthermore, the rationality of distraction levels for different driving behaviors is evaluated through driver skeleton poses. The constructed dataset and demo videos are available at https://yanghh.io/Driver-Distraction-Quantification .
MoreTranslated text
Key words
Behavioral sciences,Vehicles,Feature extraction,Transformers,Training,Decoding,Support vector machines,Driver distraction quantification,weakly supervised contrastive learning,representation clustering
求助PDF
上传PDF
View via Publisher
AI Read Science
AI Summary
AI Summary is the key point extracted automatically understanding the full text of the paper, including the background, methods, results, conclusions, icons and other key content, so that you can get the outline of the paper at a glance.
Example
Background
Key content
Introduction
Methods
Results
Related work
Fund
Key content
- Pretraining has recently greatly promoted the development of natural language processing (NLP)
- We show that M6 outperforms the baselines in multimodal downstream tasks, and the large M6 with 10 parameters can reach a better performance
- We propose a method called M6 that is able to process information of multiple modalities and perform both single-modal and cross-modal understanding and generation
- The model is scaled to large model with 10 billion parameters with sophisticated deployment, and the 10 -parameter M6-large is the largest pretrained model in Chinese
- Experimental results show that our proposed M6 outperforms the baseline in a number of downstream tasks concerning both single modality and multiple modalities We will continue the pretraining of extremely large models by increasing data to explore the limit of its performance
Upload PDF to Generate Summary
Must-Reading Tree
Example

Generate MRT to find the research sequence of this paper
Related Papers
Vision-Language Models Can Identify Distracted Driver Behavior from Naturalistic Videos
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS 2024
被引用3
Highly Discriminative Driver Distraction Detection Method Based on Swin Transformer
VEHICLES 2024
被引用0
Recent Advances in Reinforcement Learning-Based Autonomous Driving Behavior Planning: A Survey
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES 2024
被引用1
Transportation Research Part F Traffic Psychology and Behaviour 2024
被引用0
A Lightweight Intelligent Laryngeal Cancer Detection System for Rural Areas
AMERICAN JOURNAL OF OTOLARYNGOLOGY 2024
被引用0
A Robust Operators’ Cognitive Workload Recognition Method Based on Denoising Masked Autoencoder
KNOWLEDGE-BASED SYSTEMS 2024
被引用1
IEEE Trans Intell Transp Syst 2024
被引用0
Data Disclaimer
The page data are from open Internet sources, cooperative publishers and automatic analysis results through AI technology. We do not make any commitments and guarantees for the validity, accuracy, correctness, reliability, completeness and timeliness of the page data. If you have any questions, please contact us by email: report@aminer.cn
Chat Paper