Evaluation of similarity metrics for programming code plagiarism detection method
This paper shortly presents source code plagiarism detection method based on the low-level language. The similarity or distance metric that is used to calculate similarity coefficient between two source files has great impact on method's performance and results. This paper analyzes precision an...
Permalink: | http://skupni.nsk.hr/Record/ffzg.KOHA-OAI-FFZG:318257/Details |
---|---|
Matična publikacija: |
CECIIS 2011 Proceedings Varaždin : Faculty of Organization and Informatics, 2011 |
Glavni autor: | Juričić, Vedran (-) |
Vrsta građe: | Članak |
Jezik: | eng |
LEADER | 01595naa a2200193uu 4500 | ||
---|---|---|---|
008 | 131111s2011 xx 1 eng|d | ||
035 | |a (CROSBI)620650 | ||
040 | |a HR-ZaFF |b hrv |c HR-ZaFF |e ppiak | ||
100 | 1 | |9 489 |a Juričić, Vedran | |
245 | 1 | 0 | |a Evaluation of similarity metrics for programming code plagiarism detection method / |c Juričić, Vedran. |
246 | 3 | |i Naslov na engleskom: |a Evaluation of similarity metrics for programming code plagiarism detection method | |
300 | |a 83-88 |f str. | ||
520 | |a This paper shortly presents source code plagiarism detection method based on the low-level language. The similarity or distance metric that is used to calculate similarity coefficient between two source files has great impact on method's performance and results. This paper analyzes precision and recall of four most commonly used metrics, Levenstein distance, Cosine similarity, NGram similarity and Greedy String Tilling. Testing is based on various test cases that represent the most frequent code modification techniques. | ||
546 | |a ENG | ||
693 | |a agiarism detection, similarity, source code, similarity metric |l hrv |2 crosbi | ||
693 | |a agiarism detection, similarity, source code, similarity metric |l eng |2 crosbi | ||
773 | 0 | |a Central European Conference on Information and Intelligent Systems (21-23.9.2011 ; Varaždin, Hrvatska) |t CECIIS 2011 Proceedings |d Varaždin : Faculty of Organization and Informatics, 2011 |n Tihomir Hunjak, Sandra Lovrenčić, Igor Tomičić |x 1847-2001 |g str. 83-88 | |
942 | |c RZB |u 2 |v Recenzija |z Znanstveni - Predavanje - CijeliRad |t 1.08 | ||
999 | |c 318257 |d 318255 |