Loading...
Thumbnail Image
Item

Analysing chromatographic data using data mining to monitor petroleum content in water

Abstract
Chromatography is an important analytical technique that has widespread use in environmental applications. A typical application is the monitoring of water samples to determine if they contain petroleum. These tests are mandated in many countries to enable environmental agencies to determine if tanks used to store petrol are leaking into local water systems. Chromatographic techniques, typically using gas or liquid chromatography coupled with mass spectrometry, allow an analyst to detect a vast array of compounds—potentially in the order of thousands. Accurate analysis relies heavily on the skills of a limited pool of experienced analysts utilising semi-automatic techniques to analyse these datasets—making the outcomes subjective. The focus of current laboratory data analysis systems has been on refinements of existing approaches. The work described here represents a paradigm shift achieved through applying data mining techniques to tackle the problem. These techniques are compelling because the efficacy of preprocessing methods, which are essential in this application area, can be objectively evaluated. This paper presents preliminary results using a data mining framework to predict the concentrations of petroleum compounds in water samples. Experiments demonstrate that the framework can be used to produce models of sufficient accuracy—measured in terms of root mean squared error and correlation coefficients—to offer the potential for significantly reducing the time spent by analysts on this task.
Type
Conference Contribution
Type of thesis
Series
Citation
Holems, G., Fletcher, D., Reutemann, P. & Frank, E. (2009). Analysing chromatographic data using data mining to monitor petroleum content in water. In Proceedings of 4th International ICSC Symposium on Information Thessaloniki, Greece, May 28-29, 2009 (pp. 278-290). Berlin: Springer.
Date
2009
Publisher
Springer
Degree
Supervisors
Rights
This is an author’s version of an article published in the Proceedings of 4th International ICSC Symposium on Information Thessaloniki, Greece, May 28-29, 2009. ©2009 Springer-Verlag Berlin Heidelberg.