Home // INFOCOMP 2013, The Third International Conference on Advanced Communications and Computation // View article
Rapid Prototyping of a Croatian Large Vocabulary Continuous Speech Recognition System
Authors:
Dario Bajo
Danijel Turković
Šandor Dembitz
Keywords: automatic speech recognition; continuous speech; large-scale n-gram model; large vocabulary.
Abstract:
The Croatian language, like many minority languages used by less than 0.1% of the world population, is in need of mature automatic speech recognition (ASR) systems for applications such as transcription of speech recordings, voice control, an aid to impaired people, etc. This paper describes a short-term research and development project aimed to produce an applicable Croatian large vocabulary continuous speech recognition system from scratch. The open-source CMU Sphinx toolkit was our platform choice. For the purpose of acoustic model training, we made a speech training set of several hundred utterances, containing words carefully chosen according to their phonetic properties. Language models were derived from the Croatian large-scale n-gram system, which ensures the system’s applicability. During the project, we succeeded in developing an ASR system able to recognize freely chosen utterances composed of 15,000 most frequently used Croatian words reasonably well.
Pages: 13 to 18
Copyright: Copyright (c) IARIA, 2013
Publication date: November 17, 2013
Published in: conference
ISSN: 2308-3484
ISBN: 978-1-61208-310-0
Location: Lisbon, Portugal
Dates: from November 17, 2013 to November 21, 2013