Detekce slov s nepravidelnou výslovností v českém textu

Lehečka, Jan

Full metadata record

DC pole	Hodnota	Jazyk
dc.contributor.advisor	Hoidekr, Jan
dc.contributor.author	Lehečka, Jan
dc.contributor.referee	Švec, Jan
dc.date.accepted	2012-06-20
dc.date.accessioned	2013-06-19T06:29:02Z
dc.date.available	2011-09-19	cs
dc.date.available	2013-06-19T06:29:02Z
dc.date.issued	2012
dc.date.submitted	2012-05-18
dc.identifier	47846
dc.identifier.uri	http://hdl.handle.net/11025/2648
dc.description.abstract	Cílem této diplomové práce je navrhnout a implementovat systém, který automaticky hledá a označuje slova s nepravidelnou výslovností v českých textech. Nepravidelná výslovnost slova je taková výslovnost, která nelze odvodit pomocí pravidel české fonetické transkripce. Pro řešení je použit klasifikátor, který roztřídí všechna slova do dvou tříd, a to do třídy slov s pravidelnou výslovností a třídy slov s nepravidelnou výslovností. Natrénovaný klasifikátor zohledňuje i slovník výjimek zabudovaný v existujícím fonetickém transkriberu. Výsledky této práce ukazují, že nejlepší klasifikace slov je dosaženo při použití klasifikátoru podle k-nejbližšího souseda. Dalšími zkoumanými klasifikátory v této práci byly neuronové sítě, lineární SVC a rozhodovací stromy.	cs
dc.format	60 s.	cs
dc.format.mimetype	application/pdf
dc.language.iso	cs	cs
dc.publisher	Západočeská univerzita v Plzni	cs
dc.rights	Plný text práce je přístupný bez omezení.	cs
dc.subject	nepravidelná výslovnost	cs
dc.subject	fonetická transkripce	cs
dc.subject	automatická detekce jazyka	cs
dc.subject	jazykový model	cs
dc.subject	klasifikace	cs
dc.subject	lineární systém rovnic	cs
dc.subject	klasifikátor podle k-nejbližšího souseda	cs
dc.subject	neuronové sítě	cs
dc.title	Detekce slov s nepravidelnou výslovností v českém textu	cs
dc.title.alternative	Detection of words with irregular pronunciation in Czech text	en
dc.type	diplomová práce	cs
dc.thesis.degree-name	Ing.	cs
dc.thesis.degree-level	Navazující	cs
dc.thesis.degree-grantor	Západočeská univerzita v Plzni. Fakulta aplikovaných věd	cs
dc.description.department	Katedra kybernetiky	cs
dc.thesis.degree-program	Aplikované vědy a informatika	cs
dc.description.result	Obhájeno	cs
dc.rights.access	openAccess	en
dc.description.abstract-translated	The goal of this work is proposal and implementation of a system, which is able to find and mark words with irregular pronunciation in Czech texts. Irregular pronunciation of word is such pronunciation, that can not be derived by using rules of Czech phonetic transcription. To solve the problem, a classifier separating words into two classes is used. In the first target class, there are words with regular pronunciation, and the second class contains only words with irregular pronunciation. Trained classifier takes also a vocabulary of exceptions built in existing phonetic transcriber into consideration. The result of this work shows that the best classification is achieved when using k-nearest neighbor classifier. Other investigated classifiers in this work were neural networks, linear SVC and decision trees.	en
dc.subject.translated	irregular pronunciation	en
dc.subject.translated	phonetic transcription	en
dc.subject.translated	automatic language detection	en
dc.subject.translated	language model	en
dc.subject.translated	classification	en
dc.subject.translated	linear system of equations	en
dc.subject.translated	k-nearest neighbor classifier	en
dc.subject.translated	neural networks	en
Vyskytuje se v kolekcích:	Diplomové práce / Theses (KKY)

Soubory připojené k záznamu:

Soubor	Popis	Velikost	Formát
dp_lehecka.pdf	Plný text práce	1,3 MB	Adobe PDF	Zobrazit/otevřít
lehecka-v.pdf	Posudek vedoucího práce	1,75 MB	Adobe PDF	Zobrazit/otevřít
lehecka-o.pdf	Posudek oponenta práce	1,82 MB	Adobe PDF	Zobrazit/otevřít
lehecka-p.pdf	Průběh obhajoby práce	1,48 MB	Adobe PDF	Zobrazit/otevřít

Zobrazit minimální záznam Zobrazit statistiky

Použijte tento identifikátor k citaci nebo jako odkaz na tento záznam: http://hdl.handle.net/11025/2648

Všechny záznamy v DSpace jsou chráněny autorskými právy, všechna práva vyhrazena.

hledání

navigace