Croatian Dependency Treebank 2.0: New Annotation Guidelines for Improved Parsing

We present a new version of the Croatian Dependency Treebank. It constitutes a slight departure from the previously closely observed Prague Dependency Treebank syntactic layer annotation guidelines as we introduce a new subset of syntactic tags on top of the existing tagset. These new tags are used...


Permalink: http://skupni.nsk.hr/Record/ffzg.KOHA-OAI-FFZG:335539/Details
Host publication: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014)
Reykjavik, Iceland : European Language Resources Association (ELRA), 2014
Main authors: Agić, Željko (-), Berović, Daša (Author), Merkler, Danijela, Tadić, Marko
Material type: Article
Language: eng
Online access: http://bib.irb.hr/datoteka/698034.694_Paper.pdf
http://www.lrec-conf.org/proceedings/lrec2014/pdf/694_Paper.pdf
LEADER 02705naa a22003377i 4500
003 HR-ZaFF
005 20180114222648.0
008 150109s2014 ic 1 eng|d
999 |c 335539  |d 335536 
035 |a (CROSBI)698034 
040 |a HR-ZaFF  |b hrv  |c HR-ZaFF  |e ppiak 
100 1 |9 495  |a Agić, Željko 
245 1 0 |a Croatian Dependency Treebank 2.0: New Annotation Guidelines for Improved Parsing /  |c Agić, Željko ; Berović, Daša ; Merkler, Danijela ; Tadić, Marko. 
246 3 |i Naslov na engleskom:  |a Croatian Dependency Treebank 2.0: New Annotation Guidelines for Improved Parsing 
300 |a 2313-2319  |f str. 
520 |a We present a new version of the Croatian Dependency Treebank. It constitutes a slight departure from the previously closely observed Prague Dependency Treebank syntactic layer annotation guidelines as we introduce a new subset of syntactic tags on top of the existing tagset. These new tags are used in explicit annotation of subordinate clauses via subordinate conjunctions. Introducing the new annotation to Croatian Dependency Treebank, we also modify head attachment rules addressing subordinate conjunctions and subordinate clause predicates. In an experiment with data-driven dependency parsing, we show that implementing these new annotation guidelines leads to a statistically significant improvement in parsing accuracy. We also observe a substantial improvement in inter-annotator agreement, facilitating more consistent annotation in further treebank development. 
536 |a Projekt MZOS  |f 130-1300646-0645 
536 |a Projekt MZOS  |f 130-1300646-1776 
546 |a ENG 
690 |a 6.03 
690 |a 5.04 
693 |a dependency treebank, dependency parsing, Croatian language  |l hrv  |2 crosbi 
693 |a dependency treebank, dependency parsing, Croatian language  |l eng  |2 crosbi 
700 1 |a Berović, Daša  |4 aut  |9 839 
700 1 |a Merkler, Danijela  |4 aut  |9 868 
700 1 |a Tadić, Marko  |4 aut  |9 888 
773 0 |a Ninth International Conference on Language Resources and Evaluation (LREC 2014) (26-31.05.2014. ; Reykjavik, Island)  |t Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014)  |d Reykjavik, Iceland : European Language Resources Association (ELRA), 2014  |n Calzolari, Nicoletta ; Choukri, Khalid ; Declerck, Thierry ; Loftsson, Hrafn ; Maegaard, Bente ; Mariani, Joseph ; Moreno, Asuncion ; Odijk, Jan ; Piperidis, Stelios  |z 978-2-9517408-8-4  |g str. 2313-2319 
856 |u http://bib.irb.hr/datoteka/698034.694_Paper.pdf 
856 |u http://www.lrec-conf.org/proceedings/lrec2014/pdf/694_Paper.pdf 
942 |c RZB  |u 2  |v Recenzija  |z Znanstveni - Poster - CijeliRad  |t 1.08 
962 |w WOS:000355611003145