dc.contributor.advisor: Bainbridge, David
dc.contributor.author: Lin, Leo [en_NZ]
dc.date.accessioned: 2009-04-06T10:59:28Z
dc.date.available: 2009-09-02T14:44:49Z
dc.date.issued: 2009 [en_NZ]
dc.identifier.citation: Lin, L. (2009). Improving Digital Library Support for Historic Newspaper Collections (Thesis, Master of Science (MSc)). The University of Waikato, Hamilton, New Zealand. Retrieved from https://hdl.handle.net/10289/3262 [en]
dc.identifier.uri: https://hdl.handle.net/10289/3262
dc.description: DVD-ROM Appendix available with the print copy of this thesis.
dc.description.abstract: National and international initiatives are underway around the globe to digitise the vast treasure troves of historical artefacts they contain and make them available as digital libraries (DLs). The developed DLs are often constructed from facsimile pages with pre-existing metadata, such as historic newspapers stored on microfiche or generated from the non-destructive scanning of precious manuscripts. Access to the source documents is therefore limited to methods constructed from the metadata. Other projects look to introduce full-text indexing through the application of off-the-shelf commercial Optical Character Recognition (OCR) software. While this has greater potential for the end user experience over the metadata-only versions, the approach currently taken is best effort in the time available rather than a process informed by detailed analysis of the issues. In this thesis, we investigate if a richer level of support and service can be achieved by more closely integrating image processing techniques with DL software. The thesis presents a variety of experiments, implemented within the recently published open-source OCR System (Ocropus). In particular, existing segmentation algorithms are compared against our own based on Hough Transform, using our own created corpus gathered from different major online digital historic newspaper archives. [en_NZ]
dc.format.mimetype: application/pdf
dc.language.iso: en
dc.publisher: The University of Waikato [en_NZ]
dc.rights: All items in Research Commons are provided for private study and research purposes and are protected by copyright with all rights reserved unless otherwise indicated.
dc.subject: OCR [en_NZ]
dc.subject: historical newspaper [en_NZ]
dc.subject: digital library support [en_NZ]
dc.subject: document segmentation [en_NZ]
dc.title: Improving Digital Library Support for Historic Newspaper Collections [en_NZ]
dc.type: Thesis [en_NZ]
thesis.degree.discipline: SCMS [en_NZ]
thesis.degree.grantor: University of Waikato [en_NZ]
thesis.degree.level: Masters
thesis.degree.name: Master of Science (MSc) [en_NZ]
uow.date.accession: 2009-04-06T10:59:28Z [en_NZ]
uow.date.available: 2009-09-02T14:44:49Z [en_NZ]
uow.identifier.adt: http://adt.waikato.ac.nz/public/adt-uow20090406.105928 [en_NZ]
pubs.place-of-publication: Hamilton, New Zealand [en_NZ]

