Croatian web text summarizer (CroWebSum)

The paper describes automatic summarization of newspaper texts in Croatian language. The goal of the CroWebSum is to generate high-quality extracts that are both coherent and keep relevant information from the original text. The preliminary evaluation shows that extracts in the size of 10 % of the o...

Full description

Permalink: http://skupni.nsk.hr/Record/ffzg.KOHA-OAI-FFZG:317016/Details
Matična publikacija: Proceedings of the ITI 2010 32nd International Conference on INFORMATION TECHNOLOGY INTERFACES
Zagreb : University Computing Centre, University of Zagreb, 2010
Glavni autori: Mikelić Preradović, Nives (-), Boras, Damir (Author), Ljubešić, Nikola, informatičar
Vrsta građe: Članak
Jezik: eng
LEADER 02177naa a2200253uu 4500
008 131111s2010 xx 1 eng|d
035 |a (CROSBI)508501 
040 |a HR-ZaFF  |b hrv  |c HR-ZaFF  |e ppiak 
100 1 |9 449  |a Mikelić Preradović, Nives 
245 1 0 |a Croatian web text summarizer (CroWebSum) /  |c Mikelić Preradović, Nives ; Ljubešić, Nikola ; Boras, Damir. 
246 3 |i Naslov na engleskom:  |a Croatian web text summarizer (CroWebSum) 
300 |a 109-114  |f str. 
520 |a The paper describes automatic summarization of newspaper texts in Croatian language. The goal of the CroWebSum is to generate high-quality extracts that are both coherent and keep relevant information from the original text. The preliminary evaluation shows that extracts in the size of 10 % of the original text have good coherence, while the extract in the size of 5 % of the original text still conveys the most relevant information. Also, while cutting down news to SMS size (maximum 160 characters), CroWebSum performed quite well. The research brought us to conclusion that we should develop a technique that uses context vectors to calculate the semantic similarity between the terms in the document as well as pronoun resolution algorithm in order to improve the text summarization for Croatian language. 
536 |a Projekt MZOS  |f 130-1301679-1380 
536 |a Projekt MZOS  |f 130-1301799-1999 
546 |a ENG 
690 |a 5.04 
693 |a Newspaper text summarizer, SweSum, Croatian language, extract, inflected language  |l hrv  |2 crosbi 
693 |a Newspaper text summarizer, SweSum, Croatian language, extract, inflected language  |l eng  |2 crosbi 
773 0 |a ITI 2010 32nd International Conference on INFORMATION TECHNOLOGY INTERFACES (21.-24.06.2010. ; Cavtat, Hrvatska)  |t Proceedings of the ITI 2010 32nd International Conference on INFORMATION TECHNOLOGY INTERFACES  |d Zagreb : University Computing Centre, University of Zagreb, 2010  |n Luzar-Stiffler, V.  |z 978-1-4244-5732-8  |g str. 109-114 
700 1 |9 418  |a Boras, Damir  |4 aut 
700 1 |9 445  |a Ljubešić, Nikola,   |c informatičar  |4 aut 
942 |c RZB  |u 2  |v Recenzija  |z Znanstveni - Predavanje - CijeliRad  |t 1.08 
999 |c 317016  |d 317014