Finding Multiword Term Candidates in Croatian
The paper presents research on the statistical processing of a corpus of Croatian texts, with the primary aim of finding statistically significant co-occurrences of n-grams of tokens (digrams, trigrams and tetragrams). The collocations found with this method present the list of candi...
Permalink: | http://skupni.nsk.hr/Record/ffzg.KOHA-OAI-FFZG:314161 |
---|---|
Parent publication: | Proceedings of Information Extraction for Slavic Languages 2003 Workshop (IESL2003). Sofia: BAS, 2003 |
Main authors: | Tadić, Marko (-), Šojat, Krešimir (Author) |
Material type: | Article |
Language: | English |