Vocabulary size prediction of Croatian texts
The preliminary research of the vocabulary size of the Croatian lexical corpora shows that the distribution of types is regular and that deviations of the calculated values are within theoretically acceptable limit. The research also brought us to conclusion that Zipf's Law in Croatian language...
Permalink: | http://skupni.nsk.hr/Record/ffzg.KOHA-OAI-FFZG:314266/Details |
---|---|
Matična publikacija: |
Proceedings of the 25th International Conference on Information Technology Interfaces Zagreb : SRCE, 2003 |
Glavni autori: | Tuđman, Miroslav (-), Mikelić, Nives (Author), Boras, Damir |
Vrsta građe: | Članak |
Jezik: | eng |
LEADER | 01778naa a2200241uu 4500 | ||
---|---|---|---|
008 | 131111s2003 xx 1 eng|d | ||
035 | |a (CROSBI)132968 | ||
040 | |a HR-ZaFF |b hrv |c HR-ZaFF |e ppiak | ||
100 | 1 | |a Tuđman, Miroslav | |
245 | 1 | 0 | |a Vocabulary size prediction of Croatian texts / |c Tuđman, Miroslav ; Mikelić, Nives ; Boras, Damir. |
246 | 3 | |i Naslov na engleskom: |a Vocabulary size prediction of Croatian texts | |
300 | |a 223-228 |f str. | ||
520 | |a The preliminary research of the vocabulary size of the Croatian lexical corpora shows that the distribution of types is regular and that deviations of the calculated values are within theoretically acceptable limit. The research also brought us to conclusion that Zipf's Law in Croatian language is not applicable because the lexical density is different, i.e. the proportion of types and tokens in different languages is different and the parameters of that proportion need to be calculated for every language separately. | ||
536 | |a Projekt MZOS |f 0130443 | ||
546 | |a ENG | ||
690 | |a 5.04 | ||
693 | |a Lexical items, vocabulary size, Zipf law, lexical density, token, type, Croatian text corpus. |l hrv |2 crosbi | ||
693 | |a Lexical items, vocabulary size, Zipf law, lexical density, token, type, Croatian text corpus |l eng |2 crosbi | ||
700 | 1 | |a Mikelić, Nives |4 aut | |
700 | 1 | |a Boras, Damir |4 aut | |
773 | 0 | |a International Conference on Information Technology Interfaces (25 ; 2003) (18.-19.06.2003 ; Cavtat, Hrvatska) |t Proceedings of the 25th International Conference on Information Technology Interfaces |d Zagreb : SRCE, 2003 |n Budin, Leo ; Lužar-Stiffler, Vesna ; Bekić, Zoran ; Hljuz Dobrić, Vesna |g str. 223-228 | |
942 | |c RZB |u 1 |v Recenzija |z Znanstveni - Predavanje - CijeliRad | ||
999 | |c 314266 |d 314264 |