Procedures in building the Croatian-English parallel corpus
This contribution gives a survey of procedures and formats used in building the Croatian-English parallel corpus which is being collected at the Institute of Linguistics at the Philosophical Faculty, University of Zagreb. The primary text source is the newspaper Croatia Weekly which has been publish...
Permalink: | http://skupni.nsk.hr/Record/ffzg.KOHA-OAI-FFZG:312266/Details |
---|---|
Matična publikacija: |
Text Corpora and Multilingual Lexicography Benjamins Current Topics |
Glavni autor: | Tadić, Marko (-) |
Vrsta građe: | Članak |
Jezik: | eng |
LEADER | 01638naa a2200229uu 4500 | ||
---|---|---|---|
008 | 131111s2007 xx eng|d | ||
020 | |a 978 90 272 2238 1 | ||
035 | |a (CROSBI)419754 | ||
040 | |a HR-ZaFF |b hrv |c HR-ZaFF |e ppiak | ||
100 | 1 | |a Tadić, Marko | |
245 | 1 | 0 | |a Procedures in building the Croatian-English parallel corpus / |c Tadić, Marko. |
246 | 3 | |i Naslov na engleskom: |a Procedures in building the Croatian-English parallel corpus | |
300 | |a 93-107 |f str. | ||
520 | |a This contribution gives a survey of procedures and formats used in building the Croatian-English parallel corpus which is being collected at the Institute of Linguistics at the Philosophical Faculty, University of Zagreb. The primary text source is the newspaper Croatia Weekly which has been published from the beginning of 1998 by HIKZ (Croatian Institute for Information and Culture). After a quick survey of existing English-Croatian parallel corpora, the article copes with procedures involved in text conversion and text encoding, particularly the alignment. There are several recent suggestions for alignment encoding, and they are listed and elaborated at the end of the article. | ||
536 | |a Projekt MZOS |f 130-1300646-0645 | ||
546 | |a ENG | ||
690 | |a 6.03 | ||
693 | |a parallel corpus, Croatian, English, building |l hrv |2 crosbi | ||
693 | |a parallel corpus, Croatian, English, building |l eng |2 crosbi | ||
773 | 0 | |t Text Corpora and Multilingual Lexicography |d Amsterdam, Philadelphia : Benjamins, 2007 |k Benjamins Current Topics |h 160 |n Teubert, Wolfgang |z 978 90 272 2238 1 |g str. 93-107 | |
942 | |c POG |t 1.16.1 |u 1 |z Znanstveni | ||
999 | |c 312266 |d 312264 |