Research Commons
      • Browse 
        • Communities & Collections
        • Titles
        • Authors
        • By Issue Date
        • Subjects
        • Types
        • Series
      • Help 
        • About
        • Collection Policy
        • OA Mandate Guidelines
        • Guidelines FAQ
        • Contact Us
      • My Account 
        • Sign In
        • Register
      View Item 
      •   Research Commons
      • University of Waikato Research
      • Computing and Mathematical Sciences
      • Computer Science Working Paper Series
      • 1992 Working Papers
      • View Item
      •   Research Commons
      • University of Waikato Research
      • Computing and Mathematical Sciences
      • Computer Science Working Paper Series
      • 1992 Working Papers
      • View Item
      JavaScript is disabled for your browser. Some features of this site may not work without it.

      Semantic and generative models for lossy text compression

      Witten, Ian H.; Bell, Timothy C.; Moffat, Alistair; Smith, Tony C.; Nevill-Manning, Craig G.
      Thumbnail
      Files
      uow-cs-wp-1992-08.pdf
      Published version, 1.737Mb
      Find in your library  
      Citation
      Export citation
      Witten, I. H., Bell, T. C., Moffat, A., Smith, T. C., & Nevill-Manning, C. G. (1992). Semantic and generative models for lossy text compression (Computer Science Working Papers 92/8). Hamilton, New Zealand: Department of Computer Science, University of Waikato.
      Permanent Research Commons link: https://hdl.handle.net/10289/9911
      Abstract
      The apparent divergence between the research paradigms of text and image compression has led us to consider the potential for applying methods developed for one domain to the other. This paper examines the idea of "lossy" text compression, which transmits an approximation to the input text rather than the text itself. In image coding, lossy techniques have proven to yield compression factors that are vastly superior to those of the best lossless schemes, and we show that this a also the case for text. Two different methods are described here, one inspired by the use of fractals in image compression. They can be combined into an extremely effective technique that provides much better compression than the present state of the art and yet preserves a reasonable degree of match between the original and received text. The major challenge for lossy text compression is identified as the reliable evaluation of the quality of this match.
      Date
      1992
      Type
      Working Paper
      Series
      Computer Science Working Papers
      Report No.
      92/8
      Publisher
      Department of Computer Science, University of Waikato
      Rights
      © 1992 by Ian H. Witten, Timothy C. Bell, Alistair Moffat, Tony C. Smith & Craig G. Nevill-Manning.
      Collections
      • 1992 Working Papers [8]
      Show full item record  

      Usage

      Downloads, last 12 months
      63
       
       

      Usage Statistics

      For this itemFor all of Research Commons

      The University of Waikato - Te Whare Wānanga o WaikatoFeedback and RequestsCopyright and Legal Statement