Research Commons
      • Browse 
        • Communities & Collections
        • Titles
        • Authors
        • By Issue Date
        • Subjects
        • Types
        • Series
      • Help 
        • About
        • Collection Policy
        • OA Mandate Guidelines
        • Guidelines FAQ
        • Contact Us
      • My Account 
        • Sign In
        • Register
      View Item 
      •   Research Commons
      • University of Waikato Research
      • Computing and Mathematical Sciences
      • Computing and Mathematical Sciences Papers
      • View Item
      •   Research Commons
      • University of Waikato Research
      • Computing and Mathematical Sciences
      • Computing and Mathematical Sciences Papers
      • View Item
      JavaScript is disabled for your browser. Some features of this site may not work without it.

      Constructing a focused taxonomy from a document collection

      Medelyan, Olena; Manion, Steve; Broekstra, Jeen; Divoli, Anna; Huang, Anna-Lan; Witten, Ian H.
      Thumbnail
      Files
      13-OM-et al-focusedtaxonomies.pdf
      778.1Kb
      DOI
       10.1007/978-3-642-38288-8_25
      Link
       link.springer.com
      Find in your library  
      Citation
      Export citation
      Medelyan, O., Manion, S., Broekstra, J., Divoli, A., Huang, A.-L., & Witten, I. H. (2013). Constructing a focused taxonomy from a document collection. In P. Cimiano et al. (Eds.): ESWC 2013, LNCS 7882 (pp. 367-381). Berlin, Germany: Springer-Verlag Berlin Heidelberg.
      Permanent Research Commons link: https://hdl.handle.net/10289/7975
      Abstract
      We describe a new method for constructing custom taxonomies from document collections. It involves identifying relevant concepts and entities in text; linking them to knowledge sources like Wikipedia, DBpedia, Freebase, and any supplied taxonomies from related domains; disambiguating conflicting concept mappings; and selecting semantic relations that best group them hierarchically. An RDF model supports interoperability of these steps, and also provides a flexible way of including existing NLP tools and further knowledge sources. From 2000 news articles we construct a custom taxonomy with 10,000 concepts and 12,700 relations, similar in structure to manually created counterparts. Evaluation by 15 human judges shows the precision to be 89% and 90% for concepts and relations respectively; recall was 75% with respect to a manually generated taxonomy for the same domain.
      Date
      2013
      Type
      Conference Contribution
      Publisher
      Springer-Verlag Berlin Heidelberg
      Rights
      This is an author’s accepted version. The original publication is available at www.springerlink.com.
      Collections
      • Computing and Mathematical Sciences Papers [1455]
      Show full item record  

      Usage

      Downloads, last 12 months
      50
       
       
       

      Usage Statistics

      For this itemFor all of Research Commons

      The University of Waikato - Te Whare Wānanga o WaikatoFeedback and RequestsCopyright and Legal Statement