Towards sentiment analysis of financial texts in Croatian
The paper presents results of an experiment dealing with sentiment analysis of Croatian text from the domain of finance. The goal of the experiment was to design a system model for automatic detection of general sentiment and polarity phrases in these texts. We have assembled a document collection f...
Permalink: | http://skupni.nsk.hr/Record/ffzg.KOHA-OAI-FFZG:316611/Details |
---|---|
Parent publication: | Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC2010). Valletta : European Language Resources Association, 2010 |
Main authors: | Agić, Željko; Tadić, Marko (Author); Ljubešić, Nikola, computer scientist |
Material type: | Article |
Language: | eng |
Online access: | http://bib.irb.hr/datoteka/455003.876_Paper.pdf http://www.lrec-conf.org/proceedings/lrec2010/pdf/876_Paper.pdf |
LEADER | 02884naa a2200301uu 4500 | ||
---|---|---|---|
008 | 131111s2010 xx 1 eng|d | ||
035 | |a (CROSBI)455003 | ||
040 | |a HR-ZaFF |b hrv |c HR-ZaFF |e ppiak | ||
100 | 1 | |9 495 |a Agić, Željko | |
245 | 1 | 0 | |a Towards sentiment analysis of financial texts in Croatian / |c Agić, Željko ; Ljubešić, Nikola ; Tadić, Marko. |
246 | 3 | |i Naslov na engleskom: |a Towards Sentiment Analysis of Financial Texts in Croatian | |
300 | |a 1164-1167 |f str. | ||
520 | |a The paper presents results of an experiment dealing with sentiment analysis of Croatian text from the domain of finance. The goal of the experiment was to design a system model for automatic detection of general sentiment and polarity phrases in these texts. We have assembled a document collection from web sources writing on the financial market in Croatia and manually annotated articles from a subset of that collection for general sentiment. Additionally, we have manually annotated a number of these articles for phrases encoding positive or negative sentiment within a text. In the paper, we provide an analysis of the compiled resources. We show a statistically significant correspondence (1) between the overall market trend on the Zagreb Stock Exchange and the number of positively and negatively accented articles within periods of trend and (2) between the general sentiment of articles and the number of polarity phrases within those articles. We use this analysis as an input for designing a rule-based local grammar system for automatic detection of polarity phrases and evaluate it on held out data. The system achieves F1-scores of 0.61 (P: 0.94, R: 0.45) and 0.63 (P: 0.97, R: 0.47) on positive and negative polarity phrases. | ||
536 | |a Projekt MZOS |f 130-1300646-0645 | ||
536 | |a Projekt MZOS |f 130-1300646-1776 | ||
536 | |a Projekt MZOS |f 130-1301679-1380 | ||
546 | |a ENG | ||
690 | |a 5.04 | ||
690 | |a 6.03 | ||
693 | |a sentiment analysis, financial texts, Croatian language |l hrv |2 crosbi | ||
693 | |a sentiment analysis, financial texts, Croatian language |l eng |2 crosbi | ||
773 | 0 | |a Seventh International Conference on Language Resources and Evaluation (19.-21.05.2010. ; Valletta, Malta) |t Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC2010) |d Valletta : European Language Resources Association, 2010 |n Calzolari, Nicoletta ; Choukri, Khalid ; Maegaard, Bente ; Mariani, Joseph ; Odijk, Jan ; Piperidis, Stelios ; Rosner, Mike ; Tapias, Daniel |z 2-9517408-6-7 |g str. 1164-1167 | |
700 | 1 | |9 888 |a Tadić, Marko |4 aut | |
700 | 1 | |9 445 |a Ljubešić, Nikola, |c informatičar |4 aut | |
856 | |u http://bib.irb.hr/datoteka/455003.876_Paper.pdf | ||
856 | |u http://www.lrec-conf.org/proceedings/lrec2010/pdf/876_Paper.pdf | ||
942 | |c RZB |u 2 |v Recenzija |z Znanstveni - Poster - CijeliRad |t 1.08 | ||
999 | |c 316611 |d 316609 |
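
The abstract in field 520 reports F1-scores together with the precision (P) and recall (R) they come from. As a quick consistency check, the sketch below recomputes F1 from those values, assuming the standard definition of F1 as the harmonic mean of precision and recall (the record itself does not state the formula). The function name `f1_score` and the `reported` dictionary are illustrative and not part of the record.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (standard F1; assumed here)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (precision, recall, reported F1) as given in the abstract, field 520.
reported = {
    "positive": (0.94, 0.45, 0.61),
    "negative": (0.97, 0.47, 0.63),
}

for label, (p, r, f1_reported) in reported.items():
    f1 = f1_score(p, r)
    print(f"{label}: computed F1 = {f1:.2f}, reported F1 = {f1_reported:.2f}")
```

Under that assumption, both recomputed values round to the figures given in the abstract. The gap between high precision and much lower recall is what pulls each F1-score well below its precision.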