Multi-word term extraction from comparable corpora by combining contextual and constituent clues

In this paper we present an approach to automatically extract and align multi-word terms from an English-Slovene comparable health corpus. First, the terms are extracted from the corpus for each language separately using a list of user-adjustable morphosyntactic patterns and a term weighting measure...

Permalink: http://skupni.nsk.hr/Record/ffzg.KOHA-OAI-FFZG:318229
Parent publication: Proceedings of the Workshop on Building and Using Comparable Corpora (BUCC’12)
Istanbul : 2012
Main authors: Ljubešić, Nikola, computer scientist (-), Vintar, Špela (Author), Fišer, Darja
Material type: Article
Language: eng