Empirical Study: Visual Analytics for Comparing Stacking to Blending Ensemble Learning

Angelos Chatzimparmpas, Rafael Messias Martins, Kostiantyn Kucher, Andreas Kerren

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademic

Abstract

Stacked generalization (also called stacking) is an ensemble method in machine learning that uses a metamodel to combine the predictive results of heterogeneous base models arranged in at least one layer. K-fold cross-validation is employed at the various stages of training in this method. Nonetheless, another validation strategy is to try out several splits of data leading to different train and test sets for the base models and then use only the latter to train the metamodel—this is known as blending. In this work, we present a modification of an existing visual analytics system, entitled StackGenVis, that now supports the process of composing robust and diverse ensembles of models with both aforementioned methods. We have built multiple ensembles using our system with the two respective methods, and we tested the performance with six small- to large-sized data sets. The results indicate that stacking is significantly more powerful than blending based on three performance metrics. However, the training times of the base models and the final ensembles are lower and more stable during various train/test splits in blending rather than stacking.
Original languageEnglish
Title of host publication2021 23rd International Conference on Control Systems and Computer Science (CSCS)
PublisherIEEE Electromagnetic Compatibility Society
Pages1-8
Number of pages8
ISBN (Print)978-1-6654-3940-4
DOIs
Publication statusPublished - 28 May 2021
Event2021 23rd International Conference on Control Systems and Computer Science (CSCS) - Bucharest, Romania
Duration: 26 May 202128 May 2021

Conference

Conference2021 23rd International Conference on Control Systems and Computer Science (CSCS)
Period26/05/2128/05/21

Keywords

  • Training
  • Measurement
  • Computer science
  • Visual analytics
  • Computational modeling
  • Stacking
  • Machine learning

Fingerprint

Dive into the research topics of 'Empirical Study: Visual Analytics for Comparing Stacking to Blending Ensemble Learning'. Together they form a unique fingerprint.

Cite this