Show simple item record  

dc.contributor.authorMedelyan, Olena
dc.contributor.authorWitten, Ian H.
dc.date.accessioned2009-01-04T21:58:38Z
dc.date.available2009-01-04T21:58:38Z
dc.date.issued2008-03
dc.identifier.citationMedelyan, O. & Witten, I.H. (2008). Domain-independent authomatic keyphrase indexing with small training sets. Journal of American Society for Information Science and Technology, 59(7), 1026-1040.en
dc.identifier.urihttps://hdl.handle.net/10289/1734
dc.description.abstractKeyphrases are widely used in both physical and digital libraries as a brief, but precise, summary of documents. They help organize material based on content, provide thematic access, represent search results, and assist with navigation. Manual assignment is expensive because trained human indexers must reach an understanding of the document and select appropriate descriptors according to defined cataloging rules. We propose a new method that enhances automatic keyphrase extraction by using semantic information about terms and phrases gleaned from a domain-specific thesaurus. The key advantage of the new approach is that it performs well with very little training data. We evaluate it on a large set of manually indexed documents in the domain of agriculture, compare its consistency with a group of six professional indexers, and explore its performance on smaller collections of documents in other domains and of French and Spanish documents.en
dc.language.isoen
dc.publisherWiley InterScienceen_NZ
dc.relation.urihttp://www3.interscience.wiley.com/cgi-bin/fulltext/117935647/PDFSTARTen
dc.subjectcomputer scienceen
dc.subjectphrasesen
dc.subjectindex termsen
dc.subjectautomatic indexingen
dc.subjectsubject indexingen
dc.subjectcontrolled vocabulariesen
dc.subjectMachine learning
dc.subjectMachine learning
dc.titleDomain-independent automatic keyphrase indexing with small training setsen
dc.typeJournal Articleen
dc.identifier.doi10.1002/asi.20790en
dc.relation.isPartOfJournal of the American Society for information Science and Technologyen_NZ
pubs.begin-page1026en_NZ
pubs.elements-id32951
pubs.end-page1040en_NZ
pubs.issue7en_NZ
pubs.volume59en_NZ
uow.identifier.article-no7en_NZ


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record