Babel Treebank of Public Messages in Croatian

The paper presents the process of constructing a publicly available treebank of public messages written in Croatian. The messages were collected from various electronic sources – e-mail, blog, Facebook and SMS – and published on the Zagreb Museum of Contemporary Art LED facade within the Babel art p...

Full description

Permalink: http://skupni.nsk.hr/Record/ffzg.KOHA-OAI-FFZG:310290/Details
Matična publikacija: Procedia -- Social and Behavioral Sciences
95C (2013), str. 490-497
Glavni autori: Merkler, Danijela (-), Agić, Ana (Author), Agić, Željko
Vrsta građe: Članak
Jezik: eng
LEADER 02095naa a2200277uu 4500
008 131105s2013 xx eng|d
022 |a 1877-0428 
024 |2 doi  |a 10.1016/j.sbspro.2013.10.673 
035 |a (CROSBI)651264 
040 |a HR-ZaFF  |b hrv  |c HR-ZaFF  |e ppiak 
100 1 |9 868  |a Merkler, Danijela 
245 1 0 |a Babel Treebank of Public Messages in Croatian /  |c Merkler, Danijela ; Agić, Željko ; Agić, Ana. 
246 3 |i Naslov na engleskom:  |a Babel Treebank of Public Messages in Croatian 
300 |a 490-497  |f str. 
363 |a 95C  |i 2013 
520 |a The paper presents the process of constructing a publicly available treebank of public messages written in Croatian. The messages were collected from various electronic sources – e-mail, blog, Facebook and SMS – and published on the Zagreb Museum of Contemporary Art LED facade within the Babel art project. The project aimed to use the facade as an open-space blog or social interface for enabling citizens to publicly express their views. Construction and current state of the treebank is presented along with future work plans. A comparison of Babel Treebank with Croatian Dependency Treebank and SETimes.HR treebank regarding differing domains and annotation schemes is briefly sketched. The treebank is used as a test platform for introducing a new standard for syntactic annotation of Croatian texts. An experiment with morphosyntactic tagging and dependency parsing of the treebank is conducted, providing first insight to computational processing of non-standard text in Croatian. 
546 |a ENG 
690 |a 5.04 
690 |a 6.03 
693 |a dependency treebank, dependency parsing ; public messages, non-standard text, Croatian language  |l hrv  |2 crosbi 
693 |a dependency treebank, dependency parsing ; public messages, non-standard text, Croatian language  |l eng  |2 crosbi 
700 1 |a Agić, Ana  |4 aut 
700 1 |9 495  |a Agić, Željko  |4 aut 
773 0 |t Procedia -- Social and Behavioral Sciences  |x 1877-0428  |g 95C (2013), str. 490-497 
942 |c CLA  |t 1.01  |u 2  |z Znanstveni - clanak 
999 |c 310290  |d 310288