Research Commons
      • Browse 
        • Communities & Collections
        • Titles
        • Authors
        • By Issue Date
        • Subjects
        • Types
        • Series
      • Help 
        • About
        • Collection Policy
        • OA Mandate Guidelines
        • Guidelines FAQ
        • Contact Us
      • My Account 
        • Sign In
        • Register
      View Item 
      •   Research Commons
      • University of Waikato Research
      • Computing and Mathematical Sciences
      • Computing and Mathematical Sciences Papers
      • View Item
      •   Research Commons
      • University of Waikato Research
      • Computing and Mathematical Sciences
      • Computing and Mathematical Sciences Papers
      • View Item
      JavaScript is disabled for your browser. Some features of this site may not work without it.

      Scalable browsing for large collections: a case study

      Paynter, Gordon W.; Witten, Ian H.; Cunningham, Sally Jo; Buchanan, George
      Thumbnail
      Files
      00GP-IHW-SJC-GB-Scalablebro.pdf
      282.1Kb
      DOI
       10.1145/336597.336666
      Link
       portal.acm.org
      Find in your library  
      Citation
      Export citation
      Paynter, G.W., Witten, I.H., Cunningham, S.J. & Buchanan, G. (2000). Scalable browsing for large collections: a case study. In Proceedings of the fifth ACM conference on Digital libraries, San Antonio, Texas, United States, June 02 - 07, 2000(pp. 215-223). New York: ACM
      Permanent Research Commons link: https://hdl.handle.net/10289/1304
      Abstract
      Phrase browsing techniques use phrases extracted automatically from a large information collection as a basis for browsing and accessing it. This paper describes a case study that uses an automatically constructed phrase hierarchy to facilitate browsing of an ordinary large Web site. Phrases are extracted from the full text using a novel combination of rudimentary syntactic processing and sequential grammar induction techniques. The interface is simple, robust and easy to use. To convey a feeling for the quality of the phrases that are generated automatically, a thesaurus used by the organization responsible for the Web site is studied and its degree of overlap with the phrases in the hierarchy is analyzed. Our ultimate goal is to amalgamate hierarchical phrase browsing and hierarchical thesaurus browsing: the latter provides an authoritative domain vocabulary and the former augments coverage in areas the thesaurus does not reach.
      Date
      2000
      Type
      Conference Contribution
      Publisher
      ACM
      Rights
      This is an author’s version of an article published in Proceedings of the fifth ACM conference on Digital libraries, San Antonio, Texas, United States, June 02 - 07, 2000.
      Collections
      • Computing and Mathematical Sciences Papers [1455]
      Show full item record  

      Usage

      Downloads, last 12 months
      79
       
       
       

      Usage Statistics

      For this itemFor all of Research Commons

      The University of Waikato - Te Whare Wānanga o WaikatoFeedback and RequestsCopyright and Legal Statement