Research Commons
      • Browse 
        • Communities & Collections
        • Titles
        • Authors
        • By Issue Date
        • Subjects
        • Types
        • Series
      • Help 
        • About
        • Collection Policy
        • OA Mandate Guidelines
        • Guidelines FAQ
        • Contact Us
      • My Account 
        • Sign In
        • Register
      View Item 
      •   Research Commons
      • University of Waikato Research
      • Computing and Mathematical Sciences
      • Computing and Mathematical Sciences Papers
      • View Item
      •   Research Commons
      • University of Waikato Research
      • Computing and Mathematical Sciences
      • Computing and Mathematical Sciences Papers
      • View Item
      JavaScript is disabled for your browser. Some features of this site may not work without it.

      Mining Domain-Specific Thesauri from Wikipedia: A case study

      Milne, David N.; Medelyan, Olena; Witten, Ian H.
      Thumbnail
      Files
      Mining Domain-specific thesauri from wikipedia.pdf
      342.0Kb
      DOI
       10.1109/WI.2006.119
      Link
       portal.acm.org
      Find in your library  
      Citation
      Export citation
      Milne, D., Medelyan, O. & Witten, I.H. (2006). Mining Domain-Specific Thesauri from Wikipedia: A case study. In Proceedings of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence, Hong Kong, China, December 18 - 22, 2006(pp. 442-448). Washington DC: IEEE Computer Society.
      Permanent Research Commons link: https://hdl.handle.net/10289/1347
      Abstract
      Domain-specific thesauri are high-cost, high-maintenance, high-value knowledge structures. We show how the classic thesaurus structure of terms and links can be mined automatically from Wikipedia. In a comparison with a professional thesaurus for agriculture we find that Wikipedia contains a substantial proportion of its concepts and semantic relations; furthermore it has impressive coverage of contemporary documents in the domain. Thesauri derived using our techniques capitalize on existing public efforts and tend to reflect contemporary language usage better than their costly, painstakingly-constructed manual counterparts.
      Date
      2006
      Type
      Conference Contribution
      Publisher
      IEEE Computer Society
      Rights
      This article has been published in the Proceedings of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence, Hong Kong, China, December 18-22, 2006. Copyright © IEEE Computer Society.
      Collections
      • Computing and Mathematical Sciences Papers [1455]
      Show full item record  

      Usage

      Downloads, last 12 months
      146
       
       
       

      Usage Statistics

      For this itemFor all of Research Commons

      The University of Waikato - Te Whare Wānanga o WaikatoFeedback and RequestsCopyright and Legal Statement