Predicting Library of Congress Classifications from Library of Congress Subject Headings

dc.contributor.authorFrank, Eibe
dc.contributor.authorPaynter, Gordon W.
dc.date.accessioned2008-10-10T02:37:50Z
dc.date.available2008-10-10T02:37:50Z
dc.date.issued2003-01
dc.description.abstractThis paper addresses the problem of automatically assigning a Library of Congress Classification (LCC) to work given its set of Library of Congress Subject Headings (LCSH). LCC are organized in a tree: the root node of this hierarchy comprises all possible topics, and leaf nodes correspond to the most specialized topic areas defined. We describe a procedure that, given a resource identified by its LCSH, automatically places that resource in the LCC hierarchy. The procedure uses machine learning techniques and training data from a large library catalog to learn a classification model mapping from sets of LCSH to nodes in the LCC tree. We present empirical results for our technique showing its accuracy on an independent collection of 50,000 LCSH/LCC pairs.en_US
dc.format.mimetypeapplication/pdf
dc.identifier.citationFrank, E. & Paynter, G. (2003). Predicting Library of Congress Classifications from Library of Congress Subject Headings. (Working paper 01/03). Hamilton, New Zealand: University of Waikato, Department of Computer Science.en_US
dc.identifier.issn1170-487X
dc.identifier.urihttps://hdl.handle.net/10289/1011
dc.language.isoen
dc.publisherUniversity of Waikatoen_NZ
dc.relation.ispartofseriesComputer Science Working Papers
dc.subjectcomputer scienceen_US
dc.subjectMachine learning
dc.titlePredicting Library of Congress Classifications from Library of Congress Subject Headingsen_US
dc.typeWorking Paperen_US
dspace.entity.typePublication
pubs.place-of-publicationHamiltonen_NZ
uow.relation.series01/03

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
uow-cs-wp-2003-01.pdf
Size:
272.46 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.79 KB
Format:
Item-specific license agreed upon to submission
Description: