Stacking bagged and dagged models

Ting, Kai Ming; Witten, Ian H.

Item

Stacking bagged and dagged models

Ting, Kai Ming
;
Witten, Ian H.

Abstract

In this paper, we investigate the method of stacked generalization in combining models derived from different subsets of a training dataset by a single learning algorithm, as well as different algorithms. The simplest way to combine predictions from competing models is majority vote, and the effect of the sampling regime used to generate training subsets has already been studied in this context-when bootstrap samples are used the method is called bagging, and for disjoint samples we call it dagging. This paper extends these studies to stacked generalization, where a learning algorithm is employed to combine the models. This yields new methods dubbed bag-stacking and dag-stacking. We demonstrate that bag-stacking and dag-stacking can be effective for classification tasks even when the training samples cover just a small fraction of the full dataset. In contrast to earlier bagging results, we show that bagging and bag-stacking work for stable as well as unstable learning algorithms, as do dagging and dag-stacking. We find that bag-stacking (dag-stacking) almost always has higher predictive accuracy than bagging (dagging), and we also show that bag-stacking models derived using two different algorithms is more effective than conventional bagging.

Type

Working Paper

Series

Computer Science Working Papers

Citation

Ting, K.M. & Witten, I.H. (1997). Stacking bagged and dagged models. (Working paper 97/09). Hamilton, New Zealand: University of Waikato, Department of Computer Science.

Date

1997-03

Stacking bagged and dagged models

Ting, Kai Ming
;
Witten, Ian H.

Abstract

Type

Type of thesis

Series

Citation

Date

Publisher

Degree

Supervisors

Rights

Files

Permanent link

DOI

Publisher version

Collections