RECSA: Resource for Evaluating Cross-lingual Semantic Annotation

In recent years large repositories of structured knowledge (DBpedia, Freebase, YAGO) have become a valuable resource for language technologies, especially for the automatic aggregation of knowledge from textual data. One essential component of language technologies, which leverage such knowledge bas...

Full description

Permalink: http://skupni.nsk.hr/Record/ffzg.KOHA-OAI-FFZG:335611/Details
Matična publikacija: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Reykjavik, Iceland : European Language Resources Association, ELRA, 2014
Glavni autori: Rettinger, Achim (-), Zhang, Lei (Author), Berović, Daša, Srebačić, Matea, Merkler, Danijela, Tadić, Marko
Vrsta građe: Članak
Jezik: eng
Online pristup: http://www.lrec-conf.org/proceedings/lrec2014/pdf/998_Paper.pdf
LEADER 03211naa a2200313uu 4500
003 HR-ZaFF
005 20180114222907.0
008 150109s2014 xx 1 eng|d
999 |c 335611  |d 335608 
035 |a (CROSBI)701194 
040 |a HR-ZaFF  |b hrv  |c HR-ZaFF  |e ppiak 
100 1 |a Rettinger, Achim 
245 1 0 |a RECSA: Resource for Evaluating Cross-lingual Semantic Annotation /  |c Rettinger, Achim ; Zhang, Lei ; Berović, Daša ; Merkler, Danijela ; Srebačić, Matea ; Tadić, Marko. 
300 |a 4000-4003  |f str. 
520 |a In recent years large repositories of structured knowledge (DBpedia, Freebase, YAGO) have become a valuable resource for language technologies, especially for the automatic aggregation of knowledge from textual data. One essential component of language technologies, which leverage such knowledge bases, is the linking of words or phrases in specific text documents with elements from the knowledge base (KB). We call this semantic annotation. In the same time, initiatives like Wikidata try to make those knowledge bases less language dependent in order to allow cross-lingual or language independent knowledge access. This poses a new challenge to semantic annotation tools which typically are language dependent and link documents in one language to a structured knowledge base grounded in the same language. Ultimately, the goal is to construct cross-lingual semantic annotation tools that can link words or phrases in one language to a structured knowledge database in any other language or to a language independent representation. To support this line of research we developed what we believe could serve as a gold standard Resource for Evaluating Cross- lingual Semantic Annotation (RECSA). We compiled a hand- annotated parallel corpus of 300 news articles in three languages with cross-lingual semantic groundings to the English Wikipedia and DBPedia. We hope that this new language resource, which is freely available, will help to establish a standard test set and methodology to comparatively evaluate cross-lingual semantic annotation technologies. 
536 |a Projekt MZOS  |f 130-1300646-0645 
546 |a ENG 
690 |a 6.03 
693 |a cross-lingual semantic annotation, evaluation resource, Linked Open Data  |l hrv  |2 crosbi 
693 |a cross-lingual semantic annotation, evaluation resource, Linked Open Data  |l eng  |2 crosbi 
700 1 |a Zhang, Lei  |4 aut 
700 1 |a Berović, Daša  |4 aut  |9 839 
700 1 |a Srebačić, Matea  |4 aut  |9 991 
700 1 |a Merkler, Danijela  |4 aut  |9 868 
700 1 |a Tadić, Marko  |4 aut  |9 888 
773 0 |a Ninth International Conference on Language Resources and Evaluation (LREC'14) (26-31.05.2014. ; Reykjavik, Iceland)  |t Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)  |d Reykjavik, Iceland : European Language Resources Association, ELRA, 2014  |n Calzolari, N., Choukri, K., Declerck, T., Loftsson, H., Maegaard, B., Mariani, J., Moreno, A., Odijk, J., Piperidis, S.  |z 978-2-9517408-8-4  |g str. 4000-4003 
856 |u http://www.lrec-conf.org/proceedings/lrec2014/pdf/998_Paper.pdf 
942 |c RZB  |u 2  |v Recenzija  |z Znanstveni - Poster - CijeliRad  |t 1.08 
962 |w WOS:000355611005105