Konferenční příspěvky / Conference Papers (KKY)

Collection records (sorted by submission date in descending order): 61 to 80 of 198
Pražák, Aleš; Loose, Zdeněk; Psutka, Josef; Radová, Vlasta; Psutka, Josef; Švec, Jan
Live TV Subtitling Through Respeaking

In this paper, we describe our solution for live TV subtitling. The subtitling system uses the respeaking concept with respeakers closely tied with the automatic speech recognition system. The ASR is specially tailored to the live subtitling task by using respeaker-specific acoustic mod...

Chýlek, Adam; Švec, Jan; Šmídl, Luboš
Initial Experiments on Question Answering from the Intrinsic Structure of Oral History Archives

Large audio archives with spoken content are natural candidates for question answering systems. Oral history archives generally contain many facts and stories that would be otherwise hard to obtain without listening to hours of recordings. We strive for making the archive more accessibl...

Volín, Jan; Řezáčková, Markéta; Matoušek, Jindřich
Human and Transformer-Based Prosodic Phrasing in Two Speech Genres

The chief objective of the study was to observe phrasing behaviour of transformer-based neural networks from the linguistic point of view. The transformer-based architecture mapped prosodic phrasing in isolated sentences read out on request, but was commanded to predict prosodic phrases in...

Bouček, Zdeněk; Neduchal, Petr; Flídr, Miroslav
DronePort: Smart Drone Battery Management System

This paper describes DronePort, a drone management system for long-term missions. First, the issue of long-term missions and possible approaches are outlined. Further, the individual components of the proposed system, both hardware and software, are introduced. The Dron...

Psutka, Josef; Pražák, Aleš; Vaněk, Jan
Recognition of Heavily Accented and Emotional Speech of English and Czech Holocaust Survivors Using Various DNN Architectures

The MALACH project [6] verified the possibility of using automatic speech recognition (ASR) methods to search for information in large multilingual archives of Holocaust testimonies. After the end of the MALACH project, in which we participated, we continued to work on the completion...

Zajíc, Zbyněk; Kunešová, Marie; Müller, Luděk
Applying EEND Diarization to Telephone Recordings from a Call Center

In this paper, we focus on the issue of speaker diarization of data from a real call center. We have previously proposed a specialized solution to the problem, which employed additional knowledge about the identities of the phone operators (in our case, the language counselors...

Bureš, Lukáš; Müller, Luděk
Semantic Segmentation in the Task of Long-Term Visual Localization

This paper discusses the problem of long-term visual localization using the Aachen Day-Night dataset. Our experiments confirmed that carefully fine-tuning the parameters of the Hierarchical Localization method can enhance visual localization accuracy. Next, our...

Matoušek, Jakub; Duník, Jindřich; Brandner, Marek; Elvira, Viktor
Comparison of Discrete and Continuous State Estimation with Focus on Active Flux Scheme

This paper deals with the state estimation of non-linear stochastic dynamic systems, both continuous and discrete in time, with an emphasis on a numerical solution to the Bayesian relations by the point-mass filters. The filters for discrete-discrete and continuous-discrete state-space models...

Radtke, Susanne; Ajgl, Jiří; Straka, Ondřej; Hanebeck, Uwe D.
Learning and Exploiting Partial Knowledge in Distributed Estimation

In distributed estimation, several sensor nodes provide estimates of the same underlying dynamic process. These estimates are correlated but due to local processing, the correlations are only partially known or even unknown. For a consistent fusion of the local estimates, the correlation...

Vašíček, Vojtěch; Liška, Jindřich; Strnad, Jaromír; Jakl, Jan
Experimental Validation of the Blade Excitation in a Shaft Vibration Signals

Monitoring blade vibration is an important part of maintaining a rotating machine and ensuring its proper operation. Signal analysis of blade vibration plays an important role in detecting changes in the blade structure. Early detection can avoid unnecessary losses. This is...

Hanzlíček, Zdeněk; Vít, Jakub; Řezáčková, Markéta
Speakers Talking Foreign Languages in a Multi-lingual TTS System

This paper presents experiments with a multi-lingual multi-speaker TTS synthesis system jointly trained on English, German, Russian, and Czech speech data. The experimental LSTM-based TTS system with a trainable neural vocoder utilizes the International Phonetic Alphabet (IPA) which allows a stra...

Vraštil, Michal; Matoušek, Jindřich
On Comparison of XGBoost and Convolutional Neural Networks for Glottal Closure Instant Detection

In this paper, we progress further in the development of an automatic GCI detection model. In previous papers, we compared XGBoost with other supervised learning models as well as with a deep one-dimensional convolutional neural network. Here we aimed to compare a deep one-dimensional...

Tihelka, Daniel; Matoušek, Jindřich; Tihelková, Alice
How Much End-to-End is Tacotron 2 End-to-End TTS System

In recent years, the concept of end-to-end text-to-speech synthesis has begun to attract the attention of researchers. The motivation is simple: replacing the individual modules that TTS was traditionally built on with a powerful deep neural network simplifies the architecture of the entir...

Psutka, Josef; Švec, Jan; Pražák, Aleš
CNN-TDNN-Based Architecture for Speech Recognition Using Grapheme Models in Bilingual Czech-Slovak Task

The Czech and Slovak languages are very similar, not only in writing but also in phonetic form. This work aims to find a suitable combination of these two languages that yields better recognition results. We would like to demonstrate this contribution on the MALACH project. The Mal...

Švec, Jan; Lehečka, Jan; Šmídl, Luboš; Ircing, Pavel
Transformer-Based Automatic Punctuation Prediction and Word Casing Reconstruction of the ASR Output

The paper proposes a module for automatic punctuation prediction and casing reconstruction based on transformer architectures (BERT/T5) that constitute the current state of the art in many similar NLP tasks. The main motivation for our work was to increase the readability of the ASR o...

Řezáčková, Markéta; Švec, Jan; Tihelka, Daniel
T5G2P: Using Text-to-Text Transfer Transformer for Grapheme-to-Phoneme Conversion

Despite the increasing popularity of end-to-end text-to-speech (TTS) systems, the correct grapheme-to-phoneme (G2P) module is still a crucial part of those relying on a phonetic input. In this paper, we, therefore, introduce a T5G2P model, a Text-to-Text Transfer Transformer (T5) neural netw...

Vašíček, Vojtěch; Liška, Jindřich; Strnad, Jaromír; Jakl, Jan
Identification of dynamic behavior of steam turbine blades using rotor vibration measurement

Ensuring the reliability of a steam turbine is a fundamental task for its proper operation. Monitoring systems are traditionally used for this purpose. Early detection of an initial mechanical change can avoid time and financial losses. The last-stage blades of a low-pressure turbine can be...

Picek, Lukáš; Říha, Antonín; Zita, Aleš
Coral Reef annotation, localisation and pixel-wise classification using Mask R-CNN and Bag of Tricks

This article describes an automatic system for detection, classification and segmentation of individual coral substrates in underwater images. The proposed system achieved the best performance in both tasks of the second edition of the ImageCLEFcoral competition. Specifically, mean average precision...

Čech, Martin; Goubej, Martin; Sobota, Jaroslav; Visioli, Antonio
Model-based system engineering in control education using HIL simulators

Nowadays, model-based and knowledge-based system engineering also bring completely new demands to the master's degree teaching process and programs. Specifically, it is necessary to establish gluing technologies between individual master's degree courses while the full STEM education scope is covered. Since...

Zita, Aleš; Picek, Lukáš; Říha, Antonín
Sketch2Code: Automatic hand-drawn UI elements detection with Faster R-CNN

Transcribing hand drawings of User Interface (UI) elements into computer code is a tedious and repetitive task. Therefore, a need arose to create a system capable of automating such a process. This paper describes a deep learning-based method for hand-drawn user interface elements...
