Investigating language independence in HMM PoS/MSD-tagging

The paper presents an investigation of functional dependencies in morphosyntactic tagging using hidden Markov models. Starting from a well known fact that the HMM tagging paradigm relies on lexical knowledge acquired from training corpora and stored in form of transition and emission matrices, also...

Full description

Permalink: http://skupni.nsk.hr/Record/ffzg.KOHA-OAI-FFZG:315628/Details
Matična publikacija: Proceedings of the 30th International Conference on Information Technology Interfaces
Zagreb : SRCE University Computer Centre, University of Zagreb, 2008
Glavni autori: Agić, Željko (-), Tadić, Marko (Author), Dovedan Han, Zdravko
Vrsta građe: Članak
Jezik: eng
Online pristup: http://bib.irb.hr/datoteka/348726.2008-ITI-ZAMTZD-final.pdf
LEADER 02385naa a2200301uu 4500
008 131111s2008 xx 1 eng|d
035 |a (CROSBI)348726 
040 |a HR-ZaFF  |b hrv  |c HR-ZaFF  |e ppiak 
100 1 |9 495  |a Agić, Željko 
245 1 0 |a Investigating language independence in HMM PoS/MSD-tagging /  |c Agić, Željko ; Tadić, Marko ; Dovedan, Zdravko. 
246 3 |i Naslov na engleskom:  |a Investigating Language Independence in HMM PoS/MSD-Tagging 
300 |a 657-662  |f str. 
520 |a The paper presents an investigation of functional dependencies in morphosyntactic tagging using hidden Markov models. Starting from a well known fact that the HMM tagging paradigm relies on lexical knowledge acquired from training corpora and stored in form of transition and emission matrices, also called a language model, in the experiment, we apply the TnT trigram tagger on creating language models for seven different languages from the MULTEXT East v3 project translations of George Orwell’ s 1984. – Czech, Estonian, Hungarian, Romanian, Serbian, Slovene and original English version. We then use these language models in the tagging procedure and obtain details on various relations between training corpora statistics, training outputs and outputs of the tagging procedure. 
536 |a Projekt MZOS  |f 036-1300646-1986 
536 |a Projekt MZOS  |f 130-1300646-0645 
536 |a Projekt MZOS  |f 130-1300646-1776 
546 |a ENG 
690 |a 2.09 
690 |a 5.04 
690 |a 6.03 
693 |a language independence, part-of-speech tagging, morphosyntactic tagging, hidden Markov models  |l hrv  |2 crosbi 
693 |a language independence, part-of-speech tagging, morphosyntactic tagging, hidden Markov models  |l eng  |2 crosbi 
773 0 |a 30th International Conference on Information Technology Interfaces (ITI 2008) (23-26.06.2008. ; Cavtat / Dubrovnik, Hrvatska)  |t Proceedings of the 30th International Conference on Information Technology Interfaces  |d Zagreb : SRCE University Computer Centre, University of Zagreb, 2008  |n Lužar-Stiffler, Vesna ; Hljuz Dobrić, Vesna ; Bekić, Zoran  |z 978-953-7138-12-7  |g str. 657-662 
700 1 |9 888  |a Tadić, Marko  |4 aut 
700 1 |9 415  |a Dovedan Han, Zdravko  |4 aut 
856 |u http://bib.irb.hr/datoteka/348726.2008-ITI-ZAMTZD-final.pdf 
942 |c RZB  |u 2  |v Recenzija  |z Znanstveni - Predavanje - CijeliRad  |t 1.08 
999 |c 315628  |d 315626