Full metadata record
DC field | Value | Language
dc.contributor.author | Brychcín, Tomáš | -
dc.date.accessioned | 2016-06-21T06:45:43Z | -
dc.date.available | 2016-06-21T06:45:43Z | -
dc.date.issued | 2012 | -
dc.identifier.uri | http://www.kiv.zcu.cz/publications/ | -
dc.identifier.uri | http://hdl.handle.net/11025/21549 | -
dc.format | 10 s. | cs
dc.format.mimetype | application/pdf | -
dc.language.iso | en | en
dc.publisher | University of West Bohemia in Pilsen | en
dc.rights | © University of West Bohemia in Pilsen | en
dc.subject | jazykový model | cs
dc.subject | n-gram | cs
dc.title | Unsupervised methods for language modeling: technical report no. DCSE/TR-2012-03 | en
dc.type | zpráva | cs
dc.type | report | en
dc.rights.access | openAccess | en
dc.type.version | publishedVersion | en
dc.description.abstract-translated | Language models are crucial for many tasks in NLP, and n-grams are the best way to build them. Huge effort is being invested in improving n-gram language models. By introducing external information (morphology, syntax, partitioning into documents, etc.) into the models, a significant improvement can be achieved. The models can, however, be improved with no external information, and smoothing is an excellent example of such an improvement. The thesis summarizes the state-of-the-art approaches to unsupervised language modeling with emphasis on inflectional languages, which are particularly hard to model. It focuses on methods that can discover hidden patterns already present in the training corpora. These patterns can be very useful for enhancing the performance of language modeling; moreover, they do not require additional information sources. | en
dc.subject.translated | language model | en
dc.subject.translated | n-gram | en
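
The abstract above refers to n-gram language models and to smoothing as a way to improve them without external information. As a generic illustration only (not the method described in the report), the following sketch shows a bigram model with add-one (Laplace) smoothing; the class name, training data, and start/end tokens are all illustrative assumptions.

```python
from collections import defaultdict


class BigramLM:
    """Minimal bigram language model with add-one (Laplace) smoothing.

    A generic sketch of the n-gram + smoothing idea mentioned in the
    abstract; it does not reproduce the report's own approach.
    """

    def __init__(self):
        self.bigram_counts = defaultdict(lambda: defaultdict(int))
        self.context_counts = defaultdict(int)
        self.vocab = set()

    def train(self, sentences):
        for sentence in sentences:
            tokens = ["<s>"] + sentence + ["</s>"]
            for prev, curr in zip(tokens, tokens[1:]):
                self.bigram_counts[prev][curr] += 1
                self.context_counts[prev] += 1
                self.vocab.update((prev, curr))

    def prob(self, prev, curr):
        # Add-one smoothing: every bigram gets a pseudo-count of 1,
        # so unseen continuations still receive non-zero probability.
        v = len(self.vocab)
        return (self.bigram_counts[prev][curr] + 1) / (self.context_counts[prev] + v)


if __name__ == "__main__":
    lm = BigramLM()
    lm.train([["the", "cat", "sat"], ["the", "dog", "sat"]])
    print(lm.prob("the", "cat"))   # seen bigram
    print(lm.prob("the", "bird"))  # unseen bigram, still > 0 thanks to smoothing
```
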
Appears in Collections: Zprávy / Reports (KIV)

Files in This Item:
File | Description | Size | Format
Brychcin.pdf | Full text | 425,44 kB | Adobe PDF


Please use this identifier to cite or link to this item: http://hdl.handle.net/11025/21549

All items in DSpace are protected by copyright, with all rights reserved.