Multi-word term extraction from comparable corpora by combining contextual and constituent clues
In this paper we present an approach to automatically extract and align multi-word terms from an English-Slovene comparable health corpus. First, the terms are extracted from the corpus for each language separately using a list of user-adjustable morphosyntactic patterns and a term weighting measure...
Permalink: | http://skupni.nsk.hr/Record/ffzg.KOHA-OAI-FFZG:318229 |
---|---|
Matična publikacija: |
Proceedings of the Workshop on Building and Using Comparable Corpora (BUCC’12) Istanbul : 2012 |
Glavni autori: | Ljubešić, Nikola, informatičar (-), Vintar, Špela (Author), Fišer, Darja |
Vrsta građe: | Članak |
Jezik: | eng |