Croatian Text Summarizer (CROSUM)
The paper describes automatic summarization of the scientific papers in Croatian language. The goal of the CROSUM is to generate extracts with high percent of extract-worthiness and about the same size as the author's abstract. This preliminary research shows that extracts generated using the l...
Permalink: | http://skupni.nsk.hr/Record/ffzg.KOHA-OAI-FFZG:314700/Details |
---|---|
Matična publikacija: |
Proceedings of the 27th International Conference on Information Technology Interfaces (ITI) Zagreb : Srce University Computing Centre, 2005 |
Glavni autori: | Lauc, Tomislava (-), Mikelić, Nives (Author), Boras, Damir |
Vrsta građe: | Članak |
Jezik: | eng |
LEADER | 01928naa a2200253uu 4500 | ||
---|---|---|---|
008 | 131111s2005 xx 1 eng|d | ||
035 | |a (CROSBI)205814 | ||
040 | |a HR-ZaFF |b hrv |c HR-ZaFF |e ppiak | ||
100 | 1 | |a Lauc, Tomislava | |
245 | 1 | 0 | |a Croatian Text Summarizer (CROSUM) / |c Lauc, Tomislava ; Mikelić, Nives ; Boras, Damir. |
246 | 3 | |i Naslov na engleskom: |a Croatian Text Summarizer (CROSUM) | |
300 | |a 651-657 |f str. | ||
520 | |a The paper describes automatic summarization of the scientific papers in Croatian language. The goal of the CROSUM is to generate extracts with high percent of extract-worthiness and about the same size as the author's abstract. This preliminary research shows that extracts generated using the lemmatized wordforms dictionary are not quite different from extracts that are given on the base of the non-lemmatized wordforms dictionary. The research brought us to conclusion that we should develop a technique for identifying cue phrases from training corpus or some linguistic technique in order to improve the text summarization for Croatian language. | ||
536 | |a Projekt MZOS |f 0130443 | ||
536 | |a Projekt MZOS |f 0130464 | ||
546 | |a ENG | ||
690 | |a 5.04 | ||
693 | |a Text summarizer, Croatian language, extract, inflected language, tfidf words |l hrv |2 crosbi | ||
693 | |a Text summarizer, Croatian language, extract, inflected language, tfidf words |l eng |2 crosbi | ||
700 | 1 | |a Mikelić, Nives |4 aut | |
700 | 1 | |a Boras, Damir |4 aut | |
773 | 0 | |a International Conference on Information Technology Interfaces (27 ; 2005) (20.-23.06. 2005 ; Cavtat, Hrvatska) |t Proceedings of the 27th International Conference on Information Technology Interfaces (ITI) |d Zagreb : Srce University Computing Centre, 2005 |n Budin, Leo ; Lužar-Stiffler, Vesna ; Bekić, Zoran ; Hljuz Dobrić, Vesna (ur.). |g str. 651-657 | |
942 | |c RZB |u 1 |v Recenzija |z Znanstveni - Predavanje - CijeliRad | ||
999 | |c 314700 |d 314698 |