Extracting terminology by language independent methods
Automatic extraction of corpus-based terminology can help in building terminology lists which represent valuable resource for the research, education and practical implementation. Specific terminology lists represent an intermediate step between the free text and the controlled vocabulary. Such list...
Permalink: | http://skupni.nsk.hr/Record/ffzg.KOHA-OAI-FFZG:335511/Details |
---|---|
Matična publikacija: |
Translation studies and translation practice : proceedings of the 2nd International TRANSLATA Conference 2014. Part 1. New York ; Frankfurt am Main : Peter Lang, 2017. |
Glavni autori: | Seljan, Sanja (-), Stančić, Hrvoje (Author), Dunđer, Ivan |
Vrsta građe: | Članak |
Jezik: | eng |
LEADER | 02733naa a22002777i 4500 | ||
---|---|---|---|
003 | HR-ZaFF | ||
005 | 20181230133553.0 | ||
008 | 150109s2017 au 1 eng|d | ||
999 | |c 335511 |d 335508 | ||
035 | |a (CROSBI)704002 | ||
040 | |a HR-ZaFF |b hrv |c HR-ZaFF |e ppiak | ||
100 | 1 | |a Seljan, Sanja |9 430 | |
245 | 1 | 0 | |a Extracting terminology by language independent methods / |c Sanja Seljan ; Ivan Dunđer, Hrvoje Stančić. |
246 | 3 | |i Naslov na engleskom: |a Extracting terminology by language independent methods | |
300 | |a 141-147 |f str. | ||
500 | |a MZOS i potpora Sveučilišta | ||
520 | |a Automatic extraction of corpus-based terminology can help in building terminology lists which represent valuable resource for the research, education and practical implementation. Specific terminology lists represent an intermediate step between the free text and the controlled vocabulary. Such lists can be used in information retrieval, in document indexing, in machine learning, in education, or extended to cross-language information access. Terminology extraction could be performed on monolingual or bilingual/ multilingual texts by . various terminology extraction methods relying on statistical or language approaches, or on hybrid model. Evaluation of extracted terminology candidates requires considerable human expertise in evaluation and final compilation. The paper presents automatic extraction process from monolingual text performed by three language independent tools, but relying on different principles. The research is conducted on the specific text domain relating to medical documentation consisting of reports, approvals and decisions on chemical and pharmaceutical documentation and instructions of use. After the digitization process and use of OCR techniques, the automatic extraction process is performed by use of three language independent tools. Results are compared and evaluated. Results are discussed in the frame of possible integration into more complex information system. | ||
536 | |a Projekt MZOS |f 130-1300646-0909 | ||
546 | |a ENG | ||
690 | |a 5.04 | ||
693 | |a automatic terminology extraction, statistical tools, language independent methods, evaluation, indexing |l hrv |2 crosbi | ||
693 | |a automatic terminology extraction, statistical tools, language independent methods, evaluation, indexing |l eng |2 crosbi | ||
700 | 1 | |a Stančić, Hrvoje |4 aut |9 431 | |
700 | 1 | |9 1065 |a Dunđer, Ivan |4 aut | |
773 | 0 | |a International Translata Conference (2 ; 2014. ; Innsbruck, Austrija) |t Translation studies and translation practice : proceedings of the 2nd International TRANSLATA Conference 2014. Part 1. |d New York ; Frankfurt am Main : Peter Lang, 2017. |z 978363168097 | |
942 | |c RZB |u 2 |v Recenzija |t 1.08 |