Extracting terminology by language independent methods

Automatic extraction of corpus-based terminology can help in building terminology lists which represent valuable resource for the research, education and practical implementation. Specific terminology lists represent an intermediate step between the free text and the controlled vocabulary. Such list...

Full description

Permalink: http://skupni.nsk.hr/Record/ffzg.KOHA-OAI-FFZG:335511/Details
Matična publikacija: Translation studies and translation practice : proceedings of the 2nd International TRANSLATA Conference 2014. Part 1.
New York ; Frankfurt am Main : Peter Lang, 2017.
Glavni autori: Seljan, Sanja (-), Stančić, Hrvoje (Author), Dunđer, Ivan
Vrsta građe: Članak
Jezik: eng
LEADER 02733naa a22002777i 4500
003 HR-ZaFF
005 20181230133553.0
008 150109s2017 au 1 eng|d
999 |c 335511  |d 335508 
035 |a (CROSBI)704002 
040 |a HR-ZaFF  |b hrv  |c HR-ZaFF  |e ppiak 
100 1 |a Seljan, Sanja  |9 430 
245 1 0 |a Extracting terminology by language independent methods /  |c Sanja Seljan ; Ivan Dunđer, Hrvoje Stančić. 
246 3 |i Naslov na engleskom:  |a Extracting terminology by language independent methods 
300 |a 141-147  |f str. 
500 |a MZOS i potpora Sveučilišta 
520 |a Automatic extraction of corpus-based terminology can help in building terminology lists which represent valuable resource for the research, education and practical implementation. Specific terminology lists represent an intermediate step between the free text and the controlled vocabulary. Such lists can be used in information retrieval, in document indexing, in machine learning, in education, or extended to cross-language information access. Terminology extraction could be performed on monolingual or bilingual/ multilingual texts by . various terminology extraction methods relying on statistical or language approaches, or on hybrid model. Evaluation of extracted terminology candidates requires considerable human expertise in evaluation and final compilation. The paper presents automatic extraction process from monolingual text performed by three language independent tools, but relying on different principles. The research is conducted on the specific text domain relating to medical documentation consisting of reports, approvals and decisions on chemical and pharmaceutical documentation and instructions of use. After the digitization process and use of OCR techniques, the automatic extraction process is performed by use of three language independent tools. Results are compared and evaluated. Results are discussed in the frame of possible integration into more complex information system. 
536 |a Projekt MZOS  |f 130-1300646-0909 
546 |a ENG 
690 |a 5.04 
693 |a automatic terminology extraction, statistical tools, language independent methods, evaluation, indexing  |l hrv  |2 crosbi 
693 |a automatic terminology extraction, statistical tools, language independent methods, evaluation, indexing  |l eng  |2 crosbi 
700 1 |a Stančić, Hrvoje  |4 aut  |9 431 
700 1 |9 1065  |a Dunđer, Ivan  |4 aut 
773 0 |a International Translata Conference (2 ; 2014. ; Innsbruck, Austrija)  |t Translation studies and translation practice : proceedings of the 2nd International TRANSLATA Conference 2014. Part 1.  |d New York ; Frankfurt am Main : Peter Lang, 2017.  |z 978363168097 
942 |c RZB  |u 2  |v Recenzija  |t 1.08