Title: | Evaluation of Wav2Vec Speech Recognition for Speakers with Cognitive Disorders |
Authors: | Švec, Jan Polák, Filip Bartoš, Aleš Zapletalová, Michaela Víta, Martin |
Citation: | ŠVEC, J. POLÁK, F. BARTOŠ, A. ZAPLETALOVÁ, M. VÍTA, M. Evaluation of Wav2Vec Speech Recognition for Speakers with Cognitive Disorders. In Text, Speech, and Dialogue 25th International Conference, TSD 2022, Brno, Czech Republic, September 6–9, 2022, Proceedings. Cham: Springer International Publishing, 2022. s. 501-512. ISBN: 978-3-031-16269-5 , ISSN: 0302-9743 |
Issue Date: | 2022 |
Publisher: | Springer International Publishing |
Document type: | konferenční příspěvek ConferenceObject |
URI: | 2-s2.0-85139028569 http://hdl.handle.net/11025/50929 |
ISBN: | 978-3-031-16269-5 |
ISSN: | 0302-9743 |
Keywords in different language: | Spoken dialog systems;Degenerative diseases;Dementia;Tests |
Abstract in different language: | In this paper, we present a spoken dialog system used for collecting data for future research in the field of dementia prediction from speech. The dialog system was used to collect the speech data of patients with mild cognitive deficits. The core task solved by the dialog system was the spoken description of the vivid shore picture for one minute. The patients also performed other simple speech-based tasks. All utterances were recorded and manually transcribed to obtain a ground-truth reference. We describe the architecture of the dialog system as well as the results of the first speech recognition experiments. The zero-shot Wav2Vec 2.0 speech recognizer was used and the recognition accuracy on word- and character-level was evaluated. |
Rights: | Plný text je přístupný v rámci univerzity přihlášeným uživatelům. © Springer Nature Switzerland AG |
Appears in Collections: | Konferenční příspěvky / Conference papers (NTIS) Konferenční příspěvky / Conference Papers (KKY) OBD |
Files in This Item:
File | Size | Format | |
---|---|---|---|
Svec_Polak_Bartos_Zapletalova_Vita_Evaluation_of_Wav2Vec_Speech_Recognition_TSD_2022.pdf | 1,32 MB | Adobe PDF | View/Open Request a copy |
Please use this identifier to cite or link to this item:
http://hdl.handle.net/11025/50929
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.