Evaluating full lemmatization of Croatian texts

The chapter presents the implementation and evaluation of a module for full lemmatization of Croatian texts. The module implements several lemmatization procedures, all of them based on merging outputs of the previously developed stochastic morphosyntactic tagger CroTag and the inflectional lexicon o...

Permalink: http://skupni.nsk.hr/Record/ffzg.KOHA-OAI-FFZG:312303
Parent publication: Technologies for the Processing and Retrieval of Semi-Structured Documents: Experience from the CADIAL Project
Language and Technology
Main authors: Agić, Željko (-), Tadić, Marko (Author), Dovedan Han, Zdravko
Material type: Article
Language: English
Online access: http://langtech.jrc.ec.europa.eu/Documents/2009_Cadial-Book_TOC.pdf