Statistical machine translation of croatian weather forecasts: how much data do we need?

This research is the first step towards developing a system for translating Croatian weather forecasts into multiple languages. This step deals with the Croatian-English language pair. The parallel corpus consists of a one-year sample of the weather forecasts for the Adriatic, con- sisting of 7, 893...

Full description

Permalink: http://skupni.nsk.hr/Record/ffzg.KOHA-OAI-FFZG:308956/Details
Matična publikacija: CIT. Journal of computing and information technology
18 (2010), 4 ; str. 303-308
Glavni autori: Ljubešić, Nikola, informatičar (-), Boras, Damir (Author), Bago, Petra
Vrsta građe: Članak
Jezik: eng
Online pristup: http://bib.irb.hr/datoteka/507907.ljubesic10a-statistical.pdf
LEADER 02084naa a2200289uu 4500
008 131105s2010 xx eng|d
022 |a 1330-1136 
024 |2 doi  |a 10.2498/cit.1001914 
035 |a (CROSBI)507907 
040 |a HR-ZaFF  |b hrv  |c HR-ZaFF  |e ppiak 
100 1 |9 445  |a Ljubešić, Nikola,   |c informatičar 
245 1 0 |a Statistical machine translation of croatian weather forecasts: how much data do we need? /  |c Ljubešić, Nikola ; Bago, Petra ; Boras, Damir. 
246 3 |i Naslov na engleskom:  |a Statistical Machine Translation of Croatian Weather Forecasts: How Much Data Do We Need? 
300 |a 303-308  |f str. 
363 |a 18  |b 4  |i 2010 
520 |a This research is the first step towards developing a system for translating Croatian weather forecasts into multiple languages. This step deals with the Croatian-English language pair. The parallel corpus consists of a one-year sample of the weather forecasts for the Adriatic, con- sisting of 7, 893 sentence pairs. Evaluation is performed by the automatic evaluation measures BLUE, NIST and METEOR, as well as by manually evaluating a sample of 200 translations. We have shown that with a small- sized training set and the state-of-the art Moses system, decod- ing can be done with 96% accuracy concerning adequacy and fluency. Additional improvement is expected by increasing the training set size. Finally, the correlation of the recorded evaluation measures is explored. 
536 |a Projekt MZOS  |f 130-1301679-1380 
546 |a ENG 
690 |a 5.04 
693 |a statistical machine translation, weather forecast, automatic evaluation, human evaluation  |l hrv  |2 crosbi 
693 |a statistical machine translation, weather forecast, automatic evaluation, human evaluation  |l eng  |2 crosbi 
773 0 |t CIT. Journal of computing and information technology  |x 1330-1136  |g 18 (2010), 4 ; str. 303-308 
700 1 |9 418  |a Boras, Damir  |4 aut 
700 1 |9 474  |a Bago, Petra  |4 aut 
856 |u http://bib.irb.hr/datoteka/507907.ljubesic10a-statistical.pdf 
942 |c CLA  |t 1.01  |u 2  |z Znanstveni - clanak 
999 |c 308956  |d 308954