A Generic Method for Multi Word Extraction from Wikipedia
This paper presents a generic method for extracting multiword expressions from Wikipedia. The method uses the properties of this specific encyclopedic genre in its HTML format, and it relies on the intention of article authors to link to other articles. The relevant links were processed b...
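The link-based idea in the abstract can be illustrated with a minimal sketch. This is not the authors' pipeline, whose processing steps are cut off in the record above. It only assumes that article links in Wikipedia HTML have `href` values of the form `/wiki/Title`, that namespace pages (such as `File:` or `Category:`) contain a colon, and that a link text of two or more tokens is a multiword candidate. The names `LinkTextCollector` and `extract_multiword_candidates` are hypothetical.

```python
from html.parser import HTMLParser


class LinkTextCollector(HTMLParser):
    """Collect the anchor texts of internal article links (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self._in_link = False
        self._buffer = []
        self.candidates = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href") or ""
        # Assumption: article links look like /wiki/Title; a colon marks
        # namespace pages (File:, Category:, ...), which are skipped.
        if href.startswith("/wiki/") and ":" not in href:
            self._in_link = True
            self._buffer = []

    def handle_data(self, data):
        # Accumulate text only while inside a qualifying link.
        if self._in_link:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._in_link:
            # Normalize whitespace in the collected anchor text.
            text = " ".join("".join(self._buffer).split())
            # Assumption: two or more tokens make a multiword candidate.
            if len(text.split()) >= 2:
                self.candidates.append(text)
            self._in_link = False


def extract_multiword_candidates(html):
    """Return multiword anchor texts found in an HTML string."""
    parser = LinkTextCollector()
    parser.feed(html)
    parser.close()
    return parser.candidates
```

Calling `extract_multiword_candidates` on the HTML of a saved article page returns the raw candidate list; any filtering or ranking beyond the token count is left out here because the record does not describe it.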
Permalink: | http://skupni.nsk.hr/Record/ffzg.KOHA-OAI-FFZG:315627 |
---|---|
Parent publication: | Proceedings of the 30th International Conference on Information Technology Interfaces. Zagreb: SRCE University Computer Centre, University of Zagreb, 2008 |
Main authors: | Bekavac, Božo (-), Tadić, Marko (Author) |
Material type: | Article |
Language: | eng |