Research Commons
      • Browse 
        • Communities & Collections
        • Titles
        • Authors
        • By Issue Date
        • Subjects
        • Types
        • Series
      • Help 
        • About
        • Collection Policy
        • OA Mandate Guidelines
        • Guidelines FAQ
        • Contact Us
      • My Account 
        • Sign In
        • Register
      View Item 
      •   Research Commons
      • University of Waikato Research
      • Computing and Mathematical Sciences
      • Computer Science Working Paper Series
      • 2013 Working Papers
      • View Item
      •   Research Commons
      • University of Waikato Research
      • Computing and Mathematical Sciences
      • Computer Science Working Paper Series
      • 2013 Working Papers
      • View Item
      JavaScript is disabled for your browser. Some features of this site may not work without it.

      Text categorization and similarity analysis: implementation and evaluation

      Fowke, Michael; Hinze, Annika; Heese, Ralf
      Thumbnail
      Files
      uow-cs-wp-2013-10.pdf
      706.6Kb
      Find in your library  
      Citation
      Export citation
      Fowke, M., Hinze, A., & Heese, R. (2013). Text categorization and similarity analysis: implementation and evaluation. (Working paper 10/2013). Hamilton, New Zealand: University of Waikato, Department of Computer Science.
      Permanent Research Commons link: https://hdl.handle.net/10289/8430
      Abstract
      This report covers the implementation of software that aims to identify document versions and se-mantically related documents. This is important due to the increasing amount of digital information. Key criteria were that the software was fast and required limited disk space. Previous research de-termined that the Simhash algorithm was the most appropriate for this application so this method was implemented. The structure of each component was well defined with the inputs and outputs constant and the result was a software system that can have interchangeable parts if required.
      Date
      2013-12
      Type
      Working Paper
      Series
      Computer Science Working Papers
      Report No.
      10/2013
      Publisher
      University of Waikato, Department of Computer Science
      Rights
      © 2013 Michael Fowke, Annika Hinze, Ralf Heese.
      Collections
      • 2013 Working Papers [13]
      Show full item record  

      Usage

      Downloads, last 12 months
      52
       
       

      Usage Statistics

      For this itemFor all of Research Commons

      The University of Waikato - Te Whare Wānanga o WaikatoFeedback and RequestsCopyright and Legal Statement