International Journal On Advances in Telecommunications, volume 6, numbers 3 and 4, 2013


High Quality Video Conferencing: Region of Interest Encoding and Joint Video/Audio Analysis

Authors:
Christopher Bulla
Christian Feldmann
Magnus Schäfer
Florian Heese
Thomas Schlien
Martin Schink

Keywords: object detection; object tracking; region of interest coding; beamforming; scene composition; video-conferencing

Abstract:
In this paper, we present a high quality video conferencing system that was developed in the collaborative project “Connected Visual Reality (CoVR) – High Quality Visual Communication in Heterogeneous Networks” and designed to reduce bitrate while preserving constant visual quality. We exploit the fact that the main focus in a typical video conference lies on the participants: bitrate is saved in less interesting parts of the video, and we introduce a scene composition concept that is based solely on the detected regions of interest. The region of interest encoding and the scene composition are supported by a joint video and audio analysis. On the video analysis side, we use a Viola-Jones face detector to detect the regions of interest and a MeanShift tracker to track them. The audio analysis feeds the participant information from the video analysis into a beamforming algorithm and creates an activity index for each participant. To represent the detected regions of interest for the encoder, we use a quality map on the level of macro-blocks, which allows the encoder to choose its quantization parameter individually for each macro-block. Finally, the proposed scene composition omits the background and shows only the most active participants of the conference, so visual quantization artifacts introduced by the encoder become irrelevant. Experiments on recorded conference sequences demonstrate that the proposed system achieves bitrate savings of up to 50%.
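The macro-block quality map mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `build_qp_map`, the 16-pixel macro-block size, and the two quantization parameter values are assumptions chosen for illustration only. The sketch assigns a lower quantization parameter (finer quantization) to every macro-block that overlaps a detected region of interest and a higher one to the rest of the frame.

```python
MB_SIZE = 16  # assumed macro-block edge length in pixels


def build_qp_map(frame_w, frame_h, rois, qp_roi=26, qp_background=38):
    """Return a rows x cols list of lists with one QP value per macro-block.

    rois is a list of (x, y, w, h) pixel rectangles, for example the output
    of a face detector. The QP defaults are hypothetical, not from the paper.
    """
    # Number of macro-blocks needed to cover the frame (rounded up).
    cols = (frame_w + MB_SIZE - 1) // MB_SIZE
    rows = (frame_h + MB_SIZE - 1) // MB_SIZE
    qp_map = [[qp_background] * cols for _ in range(rows)]

    for x, y, w, h in rois:
        if w <= 0 or h <= 0:
            continue  # skip degenerate rectangles
        # Range of macro-blocks touched by the rectangle, clipped to the frame.
        c0 = max(x // MB_SIZE, 0)
        r0 = max(y // MB_SIZE, 0)
        c1 = min((x + w - 1) // MB_SIZE, cols - 1)
        r1 = min((y + h - 1) // MB_SIZE, rows - 1)
        for r in range(r0, r1 + 1):
            for c in range(c0, c1 + 1):
                qp_map[r][c] = qp_roi
    return qp_map
```

For a 64x48 frame with one rectangle at `(20, 10, 20, 20)`, the map has 3 rows and 4 columns, and the four macro-blocks covering the rectangle receive the lower value. How such a map is passed to an encoder depends on the encoder and is not shown here.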

Pages: 153 to 163

Copyright: Copyright (c) to authors, 2013. Used with permission.

Publication date: December 31, 2013

Published in: journal

ISSN: 1942-2601