Home // MMEDIA 2022, The Fourteenth International Conference on Advances in Multimedia // View article


Comparison of Two Approaches for Human Tense Situation Analysis in Car Cabin

Authors:
Quentin Portes
José Mendes-Carvalho
Julien Pinquier
Frédéric Lerasle

Keywords: Human interactions, multimodality, data fusion, audio and video features, end-to-end

Abstract:
The use of audio, video and text modalities to simultaneously analyze human interactions is a recent trend in the field of deep learning. The multimodality tends to create computationally expensive models. Our in-vehicle specific context requires recording a database to validate our approach. Twenty-two participants playing three different scenarios ("curious", "argued refusal" and "not argued refusal") of interactions between a driver and a passenger were recorded. We propose two different models to identify tense situations in a car cabin. One is based on an end-to-end approach and the other one is a hybrid model using handcrafted features for audio and video modalities. We obtain similar results (around 81% of balanced accuracy) with the two architectures but we highlight their complementary. We also provide details regarding the benefits of combining different sensor channels.

Pages: 1 to 8

Copyright: Copyright (c) IARIA, 2022

Publication date: April 24, 2022

Published in: conference

ISSN: 2308-4448

ISBN: 978-1-61208-942-3

Location: Barcelona, Spain

Dates: from April 24, 2022 to April 28, 2022