Semi-Automatic Data Annotation guided by Feature Space Projection

B. Benato, J. Gomes, A. Telea, A. Falcao

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

Data annotation using visual inspection (supervision) of each training sample can be laborious. Interactive solutions alleviate this by helping experts propagate labels from a few supervised samples to unlabeled ones based solely on the visual analysis of their feature space projection (with no further sample supervision). We present a semi-automatic data annotation approach based on suitable feature space projection and semi-supervised label estimation. We validate our method on the popular MNIST dataset and on images of human intestinal parasites with and without fecal impurities, a large and diverse dataset that makes classification very hard. We evaluate two approaches for semi-supervised learning from the latent and projection spaces, to choose the one that best reduces user annotation effort and also increases classification accuracy on unseen data. Our results demonstrate the added-value of visual analytics tools that combine complementary abilities of humans and machines for more effective machine learning.
Original languageEnglish
Article number107612
Number of pages11
JournalPattern Recognition
Volume109
DOIs
Publication statusPublished - 2020

Keywords

  • semi-supervised learning
  • unsupervised feature learning
  • interactive data annotation
  • autoencoder-neutral networks
  • data visualization

Fingerprint

Dive into the research topics of 'Semi-Automatic Data Annotation guided by Feature Space Projection'. Together they form a unique fingerprint.

Cite this