Discriminative training of gender-dependent acoustic models

Vaněk, Jan; Psutka, Josef V.; Zelinka, Jan; Pražák, Aleš; Psutka, Josef

Title:	Discriminative training of gender-dependent acoustic models
Other Titles:	Diskriminativní modelování gender-dependent akustických modelů
Authors:	Vaněk, Jan Psutka, Josef V. Zelinka, Jan Pražák, Aleš Psutka, Josef
Citation:	VANĚK, Jan; PSUTKA, Josef V.; ZELINKA, Jan; PRAŽÁK, Aleš; PSUTKA, Josef. Discriminative training of gender-dependent acoustic models. In: Text, speech and dialogue. Berlin: Springer, 2009, p. 331-338. (Lecture notes in computer science; 5729). ISBN 978-3-642-04207-2.
Issue Date:	2009
Publisher:	Springer
Document type:	článek article
URI:	http://hdl.handle.net/11025/16959 http://www.kky.zcu.cz/cs/publications/VanekJan_2009_Discriminative
ISBN:	978-3-642-04207-2
Keywords:	akustické modelování;HMM;diskriminativní modelování;genderově závislé modely
Keywords in different language:	acoustics modeling;HMM;discriminative training;gender-dependent models
Abstract:	Hlavním cílem tohoto článku je prozkoumat metody modelování gender-dependent akustických modelů, které budou brát v úvahu nedokonalou funkci detektoru řečníka. Takovéto metody se uplatní zejména v úlohách automatického rozpoznávání řeči v reálném čase, kde detekovaná informace o řečníkovy bývá často zpožděná nebo chybná. Cílem je minimalizovat vliv této chybovosti na výsledek rozpoznávání. Článek rovněž popisuje automatický algoritmus pro tvorbu těchto modelů, kterým lze standardní gender-dependent modely ještě vylepšit. Idea tohoto přístupu spočívá v tom, že někteří muži mají hlas akusticky spíše se podobající hlasu ženskému a naopak. Tento algoritmus dokáže rovněž zkorigovat případné chyby v úvodní informaci o pohlaví řečníků.
Abstract in different language:	The main goal of this paper is to explore the methods of gender-dependent acoustic modeling that would take the possibly of imperfect function of a gender detector into consideration. Such methods will be beneficial in real-time recognition tasks (eg. real-time subtitling of meetings) when the automatic gender detection is delayed or incorrect. The goal is to minimize an impact to the correct function of the recognizer. The paper also describes a technique of unsupervised splitting of training data, which can improve gender-dependent acoustic models trained on the basis of manual markers (male/female). The idea of this approach is grounded on the fact that a significant amount of "masculine" female and "feminine" male voices occurring in training corpora and also on frequent errors in manual markers.
Rights:	© Jan Vaněk - Josef V. Psutka - Jan Zelinka - Aleš Pražák - Josef Psutka
Appears in Collections:	Články / Articles (KIV) Články / Articles (KKY)

Files in This Item:

File	Description	Size	Format
VanekJan_2009_Discriminative.pdf	Plný text	109,9 kB	Adobe PDF	View/Open

Show full item record

Please use this identifier to cite or link to this item: http://hdl.handle.net/11025/16959

search

navigation