Loading...
Thumbnail Image
Item

Domain-specific keyphrase extraction

Abstract
Keyphrases are an important means of document summarization, clustering, and topic search. Only a small minority of documents have author-assigned keyphrases, and manually assigning keyphrases to existing documents is very laborious. Therefore it is highly desirable to automate the keyphrase extraction process. This paper shows that a simple procedure for keyphrase extraction based on the naive Bayes learning scheme performs comparably to the state of the art. It goes on to explain how this procedure's performance can be boosted by automatically tailoring the extraction process to the particular document collection at hand. Results on a large collection of technical reports in computer science show that the quality of the extracted keyphrases improves significantly when domain-specific information is exploited.
Type
Conference Contribution
Type of thesis
Series
Citation
Frank, E., Paynter, G.W., Witten, I.H., Gutwin, C. & Nevill-Manning, C.G.(1999). Domain-specific keyphrase extraction. In Proceeding of 16th International Joint Conference on Artificial Intelligence, Stockholm, Sweden(pp.668-673). San Francisco, USA: Morgan Kaufmann Publishers.
Date
1999
Publisher
Morgan Kaufmann Publishers Inc., San Francisco, CA, USA
Degree
Supervisors
Rights
This article has been published in Proceeding of 16th International Joint Conference on Artificial Intelligence, Stockholm, Sweden. ©1999 Morgan Kaufmann Publishers.