Now showing items 1-4 of 4

  • Adaptive models of English text

    Teahan, W.J.; Cleary, John G. (Department of Computer Science, The University of Waikato, 1997-11)
    High quality models of English text with performance approaching that of humans is important for many applications including spelling correction, speech recognition, OCR, and encryption. A number of different statistical ...
  • A compression-based algorithm for Chinese word segmentation

    Teahan, W.J.; Wen, Yingying; McNab, Rodger J.; Witten, Ian H. (Computer Science, University of Waikato, 1999-09)
    The Chinese language is written without using spaces or other word delimiters. Although a text may be thought of as a corresponding sequence of words, there is considerable ambiguity in the placement of boundaries. ...
  • Correcting English text using PPM models

    Teahan, W.J.; Inglis, Stuart J.; Cleary, John G.; Holmes, Geoffrey (Computer Science, University of Waikato, 1997-11)
    An essential component of many applications in natural language processing is a language modeler able to correct errors in the text being processed. For optical character recognition (OCR), poor scanning quality or extraneous ...
  • Tag based models of English text

    Teahan, W.J.; Cleary, John G. (1997-11)
    The problem of compressing English text is important both because of the ubiquity of English as a target for compression and because of the light that compression can shed on the structure of English. English text is ...

W.J. Teahan has 6 co-authors in Research Commons.