Finding Multiword Term Candidates in Croatian

The paper presents the research in the field of statistical processing of a corpus of texts in Croatian with the primary aim of finding statistically significant co-occurrences of n-grams of tokens (digrams , trigrams and tetragrams). The collocations found with this method present the list of candi...

Full description

Permalink: http://skupni.nsk.hr/Record/ffzg.KOHA-OAI-FFZG:314161
Matična publikacija: Proceedings of Information Extraction for Slavic Languages 2003 Workshop (IESL2003)
Sofija : BAS, 2003
Glavni autori: Tadić, Marko (-), Šojat, Krešimir (Author)
Vrsta građe: Članak
Jezik: eng