Information Retrieval Meets Human Language Technology

The paper tries to highlight several points where informational retrieval could benefit from human language technology. Although there is an increasing quantity of multimedia content on the web, the majority of information is still coded in natural languages(s). Since the global trend in web languag...

Full description

Permalink: http://skupni.nsk.hr/Record/ffzg.KOHA-OAI-FFZG:310895/Details
Matična publikacija: Quest for Information, CUC2000 Proceedings
Pale, P., Milinović, M.
Glavni autor: Tadić, Marko (-)
Vrsta građe: Članak
Jezik: eng
LEADER 02245naa a2200229uu 4500
008 131111s2000 xx eng|d
020 |a 9536802-01-5 
035 |a (CROSBI)101848 
040 |a HR-ZaFF  |b hrv  |c HR-ZaFF  |e ppiak 
100 1 |a Tadić, Marko 
245 1 0 |a Information Retrieval Meets Human Language Technology /  |c Tadić, Marko. 
246 3 |i Naslov na engleskom:  |a Information Retrieval Meets Human Language Technology 
300 |a 0-0  |f str. 
520 |a The paper tries to highlight several points where informational retrieval could benefit from human language technology. Although there is an increasing quantity of multimedia content on the web, the majority of information is still coded in natural languages(s). Since the global trend in web language usage is coding in native language, it leads to the situation in which web pages in English do not represent the majority of web-text anymore. To retrieve information from web pages requires several tools, which have to deal with particular human language. These tools can be borrowed from the field of human language technology (HLT) and applied to web-text. Some possible areas of HLT which could be used in information retrieval from web-text are: 1) morphological processing: should be able to cope with different word-forms in particular language in order to make language specific full-text search available ; 2) named entities recognition: should give the possibility to get information on concepts which have fixed names/ titles/formulas (such as personal/institutional/geographical names, temporal expressions etc.) ; 3) semantic thesauruses: should give the ability to retrieve information on the basis of language synonymy/proximity. The situation in the field of HLT for Croatian is discussed at the end of article. 
536 |a Projekt MZOS  |f 130718 
546 |a ENG 
690 |a 6.03 
693 |a human language technology, information retrieval, Croatian language  |l hrv  |2 crosbi 
693 |a human language technology, information retrieval, Croatian language  |l eng  |2 crosbi 
773 0 |t Quest for Information, CUC2000 Proceedings  |d Zagreb : CARNet, 2000  |n Pale, P., Milinović, M.  |z 953-6802-01-5  |g str. 0-0 
942 |c POG  |t 1.16.1  |u 1  |z Znanstveni 
999 |c 310895  |d 310893