A Review of Audio Features and Statistical Models Exploited for Voice Pattern Design

Duong, Ngoc Q. K.; Duong, Hien-Thanh

Home // PATTERNS 2015, The Seventh International Conferences on Pervasive Patterns and Applications // View article

A Review of Audio Features and Statistical Models Exploited for Voice Pattern Design

Authors:
Ngoc Q. K. Duong
Hien-Thanh Duong

Keywords: Voice pattern; audio identification and synchronization; spectral features; statistical models.

Abstract:
Audio fingerprinting, also named as audio hashing, has been well-known as a powerful technique to perform audio identification and synchronization. It basically involves two major steps: fingerprint (voice pattern) design and matching search. While the first step concerns the derivation of a robust and compact audio signature, the second step usually requires knowledge about database and quick-search algorithms. Though this technique offers a wide range of real-world applications, to the best of the authors' knowledge, a comprehensive survey of existing algorithms appeared more than eight years ago. Thus, in this paper, we present a more up-to-date review and, for emphasizing on the audio signal processing aspect, we focus our state-of-the-art survey on the fingerprint design step for which various audio features and their tractable statistical models are discussed.

Pages: 32 to 37

Copyright: Copyright (c) IARIA, 2015

Publication date: March 22, 2015

Published in: conference

ISSN: 2308-3557

ISBN: 978-1-61208-393-3

Location: Nice, France

Dates: from March 22, 2015 to March 27, 2015