Abstract
Authorship attribution, i.e., determining the author of a document automatically based on a reference corpus, is an established topic in computational linguistics and Digital Humanities. However, state of the art techniques are generally applied to documents of 5000 words or more. This research explores a method to attribute authorship on paragraph level, using simulated annealing to incrementally increase the accuracy of classification.
Original language | English |
---|---|
Title of host publication | Proceedings of Benelearn 2016 |
Number of pages | 3 |
Publication status | Published - Sept 2016 |
Event | The 25th Belgian-Dutch Conference on Machine Learning (Benelearn) - Katholieke Universiteit Leuven, Kortrijk, Belgium Duration: 12 Sept 2016 → 13 Sept 2016 https://www.kuleuven-kulak.be/benelearn/ |
Conference
Conference | The 25th Belgian-Dutch Conference on Machine Learning (Benelearn) |
---|---|
Country/Territory | Belgium |
City | Kortrijk |
Period | 12/09/16 → 13/09/16 |
Internet address |
Keywords
- author attribution
- Enlightenment philosophy
- simulated annealing