Evaluating full lemmatization of Croatian texts
The chapter presents the implementation and evaluation of a module for full lemmatization of Croatian texts. The module implements several lemmatization procedures, all of them based on merging outputs of the previously developed stochastic morphosyntactic tagger CroTag and the infectional lexicon o...
Permalink: | http://skupni.nsk.hr/Record/ffzg.KOHA-OAI-FFZG:312303 |
---|---|
Matična publikacija: |
Technologies for the Processing and Retrieval of Semi-Structured Documents: Experience from the CADIAL Project Language and Technology |
Glavni autori: | Agić, Željko (-), Tadić, Marko (Author), Dovedan Han, Zdravko |
Vrsta građe: | Članak |
Jezik: | eng |
Online pristup: |
http://langtech.jrc.ec.europa.eu/Documents/2009_Cadial-Book_TOC.pdf |