ICDT 2013, The Eighth International Conference on Digital Telecommunications


Investigating Image Processing Based Aligner for Large Texts

Authors:
Andi Buzo
Horia Cucu
Corneliu Burileanu

Keywords: Lightly Supervised Acoustic Modeling; Text Alignment; Under-resourced Languages; Image Processing

Abstract:
Speech annotation is a costly and time-consuming process because it requires high accuracy. Lightly supervised acoustic modeling addresses this problem by making use of approximate transcriptions of speech recordings. For under-resourced languages, the speech recordings are not transcribed entirely and the accuracy of the transcriptions is poor. In this case, an additional segmentation step is necessary. We propose a segmentation method that uses image processing techniques to locate a text island within a larger text. We also investigate the effect of several tuning parameters on the method's accuracy.

Pages: 50 to 54

Copyright: Copyright (c) IARIA, 2013

Publication date: April 21, 2013

Published in: conference

ISSN: 2308-3964

ISBN: 978-1-61208-262-2

Location: Venice, Italy

Dates: from April 21, 2013 to April 26, 2013