Abstract
In this work we compare nine tools for detecting source code plagiarism: the plagiarism or copy detection tools CPD, JPlag, Sherlock, Marble, Moss, Plaggie and SIM, and two baselines, one based on the Unix tool diff and one based on the difflib module from the Python Standard Library. We provide visualizations of the output of these tools and compare their performance on different tasks using both the F-measure and the area under the precision-recall curve (AUC-PR), identifying the best performing tools.
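To illustrate how a difflib-based baseline and the F-measure fit together, the following is a minimal sketch and not the authors' implementation. It assumes that similarity is taken from `difflib.SequenceMatcher.ratio()` and that a pair is flagged once it reaches a fixed threshold. The abstract specifies neither choice, and the helper names, toy submissions and threshold are hypothetical.

```python
import difflib
from itertools import combinations


def similarity(a: str, b: str) -> float:
    """Similarity in [0, 1]; using SequenceMatcher.ratio() is an assumption."""
    return difflib.SequenceMatcher(None, a, b).ratio()


def flag_pairs(submissions: dict, threshold: float) -> set:
    """Return every pair of submission names whose similarity reaches the threshold."""
    return {
        frozenset((x, y))
        for x, y in combinations(sorted(submissions), 2)
        if similarity(submissions[x], submissions[y]) >= threshold
    }


def f_measure(predicted: set, actual: set, beta: float = 1.0) -> float:
    """F-beta score of the flagged pairs against the known plagiarised pairs."""
    true_positives = len(predicted & actual)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(predicted)
    recall = true_positives / len(actual)
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Toy data invented for illustration: "b" is a copy of "a" with renamed variables.
submissions = {
    "a": "def total(xs):\n    s = 0\n    for x in xs:\n        s += x\n    return s\n",
    "b": "def total(ys):\n    s = 0\n    for y in ys:\n        s += y\n    return s\n",
    "c": "print('hello world')\n",
}
known_plagiarism = {frozenset(("a", "b"))}

flagged = flag_pairs(submissions, threshold=0.8)  # threshold chosen arbitrarily
score = f_measure(flagged, known_plagiarism)
print(sorted(sorted(pair) for pair in flagged), score)
```

Sweeping the threshold over the observed similarity values and recording precision and recall at each step would give the points of a precision-recall curve, from which an AUC-PR value can be computed.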
Original language | English |
---|---|
Title of host publication | CSERC '17: Proceedings of the 6th Computer Science Education Research Conference |
Editors | Vreda Pieterse, Marko van Eekelen, Michalis Giannakos |
Publisher | Association for Computing Machinery |
Pages | 73-82 |
Number of pages | 10 |
ISBN (Electronic) | 9781450363389 |
Publication status | Published - 14 Nov 2017 |
Event | 6th Computer Science Education Research Conference, CSERC 2017, Helsinki, Finland. Duration: 13 Nov 2017 → 14 Nov 2017 |
Publication series
Name | ACM International Conference Proceeding Series |
---|---|
Conference
Conference | 6th Computer Science Education Research Conference, CSERC 2017 |
---|---|
Country/Territory | Finland |
City | Helsinki |
Period | 13/11/17 → 14/11/17 |
Bibliographical note
Publisher Copyright: © 2017 Association for Computing Machinery.
Keywords
- Empirical study
- Program plagiarism detection
- Quantitative comparison
- Tools