An Evaluation of Document Keyphrase Sets

dc.contributor.author: Jones, Steve
dc.contributor.author: Paynter, Gordon W.
dc.date.accessioned: 2008-12-04T03:22:33Z
dc.date.available: 2008-12-04T03:22:33Z
dc.date.issued: 2003
dc.description.abstract: Keywords and keyphrases have many useful roles as document surrogates and descriptors, but the manual production of keyphrase metadata for large digital library collections is at best expensive and time-consuming, and at worst logistically impossible. Algorithms for keyphrase extraction like Kea and Extractor produce a set of phrases that are associated with a document. Though these sets are often utilized as a group, keyphrase extraction is usually evaluated by measuring the quality of individual keyphrases. This paper reports an assessment that asks human assessors to rate entire sets of keyphrases produced by Kea, Extractor and document authors. The results provide further evidence that human assessors rate all three sources highly (with some caveats), but show that the relationship between the quality of the phrases in a set and the set as a whole is not always simple. Choosing the best individual phrases will not necessarily produce the best set; combinations of lesser phrases may result in better overall quality. [en_US]
dc.identifier.citation: Jones, S. & Paynter, G.W. (2003). An evaluation of document keyphrase sets. Journal of Digital Information, 4(1). [en_US]
dc.identifier.uri: https://hdl.handle.net/10289/1525
dc.language.iso: en
dc.publisher: British Computer Society [en_NZ]
dc.relation.isPartOf: Journal of Digital Information [en_NZ]
dc.relation.uri: http://journals.tdl.org/jodi/article/view/jodi-107/92 [en_US]
dc.rights: This is an article published in the Journal of Digital Information. The original publication is available at http://journals.tdl.org/jodi/index [en_US]
dc.subject: computer science [en_US]
dc.subject: digital libraries [en_US]
dc.subject: automatic keyphrase extraction [en_US]
dc.subject: author keyphrases [en_US]
dc.subject: subjective evaluation [en_US]
dc.title: An Evaluation of Document Keyphrase Sets [en_US]
dc.type: Conference Contribution [en_US]
dspace.entity.type: Publication
pubs.begin-page: 1 [en_NZ]
pubs.edition: February [en_NZ]
pubs.end-page: 17 [en_NZ]
pubs.issue: 1 [en_NZ]
pubs.volume: 4 [en_NZ]
