Spate, J., Gibert, K., Sànchèz-Marr, M., Frank, E., Comas, J., Athanasiadis, I. & Letcher, R. (2006).Data Mining as a Tool for Environmental Scientists. In: Voinov, A., Jakeman, A.J., Rizzoli, A.E. (eds). Proceedings of the iEMSs Third Biennial Meeting: "Summit on Environmental Modelling and Software". International Environmental Modelling and Software Society, Burlington, USA, July 2006.
Permanent Research Commons link: http://hdl.handle.net/10289/7682
Over recent years a huge library of data mining algorithms has been developed to tackle a variety of problems in fields such as medical imaging and network traffic analysis. Many of these techniques are far more flexible than more classical modelling approaches and could be usefully applied to data-rich environmental problems. Certain techniques such as Artificial Neural Networks, Clustering, Case-Based Reasoning and more recently Bayesian Decision Networks have found application in environmental modelling while other methods, for example classification and association rule extraction, have not yet been taken up on any wide scale. We propose that these and other data mining techniques could be usefully applied to difficult problems in the field. This paper introduces several data mining concepts and briefly discusses their application to environmental modelling, where data may be sparse, incomplete, or heterogenous.
International Environmental Modelling and Software Society
© 2006 the authors.