Research Commons
      • Browse 
        • Communities & Collections
        • Titles
        • Authors
        • By Issue Date
        • Subjects
        • Types
        • Series
      • Help 
        • About
        • Collection Policy
        • OA Mandate Guidelines
        • Guidelines FAQ
        • Contact Us
      • My Account 
        • Sign In
        • Register
      View Item 
      •   Research Commons
      • University of Waikato Research
      • Computing and Mathematical Sciences
      • Computing and Mathematical Sciences Papers
      • View Item
      •   Research Commons
      • University of Waikato Research
      • Computing and Mathematical Sciences
      • Computing and Mathematical Sciences Papers
      • View Item
      JavaScript is disabled for your browser. Some features of this site may not work without it.

      GNUsmail: Open framework for on-line email classification

      Carmona-Cejudo, José M.; Baena-García, Manuel; del Campo-Ávila, José; Morales-Bueno, Rafael; Bifet, Albert
      Thumbnail
      Files
      GNUsmail.pdf
      171.3Kb
      DOI
       10.3233/978-1-60750-606-5-1141
      Find in your library  
      Citation
      Export citation
      Carmona-Cejudo, J. M., Baena-García, M., del Campo-Ávila, J., Morales-Bueno, R., & Bifet, A. (2011). GNUsmail: Open framework for on-line email classification. In Frontiers in Artificial Intelligence and Applications (pp. 1141–1142). IOS Press. http://doi.org/10.3233/978-1-60750-606-5-1141
      Permanent Research Commons link: https://hdl.handle.net/10289/5411
      Abstract
      Real-time classification of massive email data is a challenging task that presents its own particular difficulties. Since email data presents an important temporal component, several problems arise: emails arrive continuously, and the criteria used to classify those emails can change, so the learning algorithms have to be able to deal with concept drift. Our problem is more general than spam detection, which has received much more attention in the literature.

      In this paper we present GNUsmail, an open-source extensible framework for email classification, which structure supports incremental and on-line learning. This framework enables the incorporation of algorithms developed by other researchers, such as those included in WEKA and MOA. We evaluate this framework, characterized by two overlapping phases (pre-processing and learning), using the ENRON dataset, and we compare the results achieved by WEKA and MOA algorithms.
      Date
      2011
      Type
      Conference Contribution
      Publisher
      IOS Press
      Rights
      This article has been published in the Proceedings of ECAI 2010 - 19th European Conference on Artificial Intelligence. © 2010 The authors and IOS Press.
      Collections
      • Computing and Mathematical Sciences Papers [1454]
      Show full item record  

      Usage

      Downloads, last 12 months
      35
       
       
       

      Usage Statistics

      For this itemFor all of Research Commons

      The University of Waikato - Te Whare Wānanga o WaikatoFeedback and RequestsCopyright and Legal Statement