Diarization of the Language Consulting Center Telephone Calls

Zajíc, Zbyněk; Psutka, Josef; Zajícová, Lucie; Müller, Luděk; Salajka, Petr

Full metadata record

DC field	Value	Language
dc.contributor.author	Zajíc, Zbyněk
dc.contributor.author	Psutka, Josef
dc.contributor.author	Zajícová, Lucie
dc.contributor.author	Müller, Luděk
dc.contributor.author	Salajka, Petr
dc.date.accessioned	2020-03-09T11:00:22Z	-
dc.date.available	2020-03-09T11:00:22Z	-
dc.date.issued	2019
dc.identifier.citation	ZAJÍC, Z., PSUTKA, J., ZAJÍCOVÁ, L., MÜLLER, L., SALAJKA, P. Diarization of the Language Consulting Center Telephone Calls. In: Speech and Computer, 21st International Conference, SPECOM 2019, Istanbul, Turkey, August 20-25, 2019, Proceedings. Cham: Springer, 2019. pp. 549-558. ISBN 978-3-030-26060-6, ISSN 0302-9743.	en
dc.identifier.isbn	978-3-030-26060-6
dc.identifier.issn	0302-9743
dc.identifier.uri	2-s2.0-85071471006
dc.identifier.uri	http://hdl.handle.net/11025/36622
dc.description.abstract	In this paper, we describe the diarization of the Language Consulting Center archive being built within the project "Zpřístupnění dotazů jazykové poradny v lingvisticky strukturované databázi" (Access to a Linguistically Structured Database of Enquiries from the Language Consulting Center). One part of this archive is recorded only in mono, so our task is to separate the data by means of diarization. Our approach uses information about the identity of the language counsellor obtained from the transcript of their introduction at the beginning of each call. Because our data are unique, we also report the results of the available Kaldi diarization system for comparison.	en
dc.format	10 pp.	en
dc.format.mimetype	application/pdf
dc.language.iso	en	en
dc.publisher	Springer	en
dc.relation.ispartofseries	Speech and Computer, 21st International Conference, SPECOM 2019, Istanbul, Turkey, August 20-25, 2019, Proceedings	en
dc.rights	Full text is not accessible.	en
dc.rights	© Springer	en
dc.subject	Diarization, x-vector, automatic speech recognition, Gaussian mixture modelling	en
dc.title	Diarization of the Language Consulting Center Telephone Calls	en
dc.title.alternative	Diarizace telefonních hovorů Jazykové poradny Ústavu pro jazyk český	cs
dc.type	conference paper	en
dc.type	conferenceObject	en
dc.rights.access	closedAccess	en
dc.type.version	publishedVersion	en
dc.description.abstract-translated	In this paper, we describe the diarization of archive data from the project “Access to a Linguistically Structured Database of Enquiries from the Language Consulting Center”. The project aims to improve access to the large Czech-language archives, consisting mainly of telephone conversations, that the Language Consulting Center collects continuously. One part of the archive contains mono recordings, in which the speech of the client and the language counsellor is mixed in one channel. In our proposed approach to diarization, we use information about the identity of the language counsellor obtained from the text transcription of the beginning of the conversation. For the initial stage of diarization, we adopted our system based on clustering x-vectors. A resegmentation step then refines the speaker-change boundaries using a pre-trained Gaussian mixture model of the counsellor. Because our data are unique, we compare our results with the Kaldi diarization as the baseline system.	en
dc.subject.translated	Diarization, x-vector, Automatic speech recognition, GMM	en
dc.identifier.doi	10.1007/978-3-030-26061-3_56
dc.type.status	Peer-reviewed	en
dc.identifier.obd	43927394
dc.project.ID	DG16P02B009/Zpřístupnění dotazů jazykové poradny v lingvisticky strukturované databázi	cs
Appears in collections:	Konferenční příspěvky / Conference Papers (KKY) OBD

Files attached to this record:

File	Size	Format
Zajic2019_Chapter_DiarizationOfTheLanguageConsul.pdf	420.62 kB	Adobe PDF


Use this identifier to cite or link to this record: http://hdl.handle.net/11025/36622

All records in DSpace are protected by copyright; all rights reserved.
