A Generic Method for Multi Word Extraction from Wikipedia

This paper presents a generic method for multiword expression extraction from Wikipedia. The method uses the properties of this specific encyclopedic genre in its HTML format and relies on the intention of article authors to link to other articles. The relevant links were processed b...
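
As a rough illustration of the idea in the abstract (link text chosen by article authors as a source of multiword candidates), the sketch below collects the anchor text of article-to-article links from an HTML fragment and keeps only the texts that consist of two or more words. It is not the authors' implementation: the paper applies local regular grammars in NooJ, whereas this sketch swaps in a plain Python regular expression. The `/wiki/` link prefix, the namespace filter, and the names `LinkTextCollector`, `MWE_PATTERN`, and `extract_mwe_candidates` are assumptions made for the example.

```python
from html.parser import HTMLParser
import re


class LinkTextCollector(HTMLParser):
    """Collect the visible text of <a> elements that point to other articles."""

    def __init__(self):
        super().__init__()
        self._in_link = False
        self._buf = []
        self.link_texts = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href") or ""
            # Assumption: article links start with /wiki/ and carry no
            # namespace prefix (a colon marks files, categories, and so on).
            if href.startswith("/wiki/") and ":" not in href:
                self._in_link = True
                self._buf = []

    def handle_data(self, data):
        if self._in_link:
            self._buf.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._in_link:
            self._in_link = False
            # Normalize internal whitespace to single spaces.
            text = " ".join("".join(self._buf).split())
            if text:
                self.link_texts.append(text)


# Hypothetical stand-in for a local grammar: two or more word tokens
# separated by single spaces (\w is Unicode-aware, so č, ž, š are covered).
MWE_PATTERN = re.compile(r"^\w+(?: \w+)+$")


def extract_mwe_candidates(html):
    """Return unique multiword link texts in order of first appearance."""
    parser = LinkTextCollector()
    parser.feed(html)
    parser.close()
    candidates = []
    for text in parser.link_texts:
        if MWE_PATTERN.match(text) and text not in candidates:
            candidates.append(text)
    return candidates


sample = (
    '<p><a href="/wiki/Hrvatski_jezik">hrvatski jezik</a> je '
    '<a href="/wiki/Jezik">jezik</a> iz '
    '<a href="/wiki/Datoteka:X.png">slika datoteke</a> i '
    '<a href="https://example.org/">vanjska poveznica</a>.</p>'
)
print(extract_mwe_candidates(sample))  # prints ['hrvatski jezik']
```

In the sample, the single-word link, the file-namespace link, and the external link are all discarded, so only one candidate remains. A real pipeline would need richer patterns than this single expression to approximate what a set of local grammars can express.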

Permalink: http://skupni.nsk.hr/Record/ffzg.KOHA-OAI-FFZG:315627/Details
Parent publication: Proceedings of the 30th International Conference on Information Technology Interfaces
Zagreb : SRCE University Computer Centre, University of Zagreb, 2008
Main authors: Bekavac, Božo (-), Tadić, Marko (Author)
Material type: Article
Language: eng
LEADER 01881naa a2200277uu 4500
008 131111s2008 xx 1 eng|d
035 |a (CROSBI)348724 
040 |a HR-ZaFF  |b hrv  |c HR-ZaFF  |e ppiak 
100 1 |9 835  |a Bekavac, Božo 
245 1 2 |a A Generic Method for Multi Word Extraction from Wikipedia /  |c Bekavac, Božo ; Tadić, Marko. 
246 3 |i Title in English:  |a A Generic Method for Multi Word Extraction from Wikipedia 
300 |a 663-667  |f pp. 
363 |i 2008 
520 |a This paper presents a generic method for multiword expression extraction from Wikipedia. The method uses the properties of this specific encyclopedic genre in its HTML format and relies on the intention of article authors to link to other articles. The relevant links were processed by applying local regular grammars within the NooJ development environment. We tested the method on the Croatian version of Wikipedia and present the results obtained. 
536 |a MZOS project  |f 036-1300646-1986 
536 |a MZOS project  |f 130-1300646-0645 
536 |a MZOS project  |f 130-1300646-1002 
546 |a ENG 
690 |a 5.04 
690 |a 6.03 
693 |a multi word expressions, multi word extraction, Croatian, Wikipedia  |l hrv  |2 crosbi 
693 |a multi word expressions, multi word extraction, Croatian, Wikipedia  |l eng  |2 crosbi 
773 0 |a 30th International Conference on Information Technology Interfaces (ITI 2008) (23-26.06.2008. ; Cavtat / Dubrovnik, Croatia)  |t Proceedings of the 30th International Conference on Information Technology Interfaces  |d Zagreb : SRCE University Computer Centre, University of Zagreb, 2008  |n Lužar-Stiffler, Vesna ; Hljuz Dobrić, Vesna ; Bekić, Zoran  |x 1330-1012  |z 978-953-7138-12-7  |g pp. 663-667 
700 1 |9 888  |a Tadić, Marko  |4 aut 
942 |c RZB  |u 2  |v Peer review  |z Scientific - Lecture - Full paper  |t 1.08 
999 |c 315627  |d 315625