Bi-level document image compression using layout information

dc.contributor.authorInglis, Stuart J.en_NZ
dc.contributor.authorWitten, Ian H.en_NZ
dc.contributor.editorStorer, J.A.en_NZ
dc.contributor.editorCohn, M.en_NZ
dc.coverage.spatialSnowbird, UTen_NZ
dc.date.accessioned2017-05-23T22:11:05Z
dc.date.available1996en_NZ
dc.date.available2017-05-23T22:11:05Z
dc.date.issued1996en_NZ
dc.description.abstractMost bi-level images stored on computers today comprise scanned text, and are stored using generic bi-level image technology based either on classical run-length coding, such as the CCITT Group 4 method, or on modern schemes such as JBIG that predict pixels from their local image context. However, image compression methods that are tailored specifically for images known to contain printed text can provide noticeably superior performance because they effectively enlarge the context to the character level, at least for those predictions for which such a context is relevant. To deal effectively with general documents that contain text and pictures, it is necessary to detect layout and structural information from the image, and employ different compression techniques for different parts of the image. The authors extend previous work in document image compression in two ways. First, we include automatic discrimination between text and non-text zones in an image. Second, the system is tested on a large real-world image corpus.
dc.format.mimetypeapplication/pdf
dc.identifier.citationInglis, S. J., & Witten, I. H. (1996). Bi-level document image compression using layout information. In J. A. Storer & M. Cohn (Eds.), Proceedings of the DCC ’96, Data Compression Conference (pp. 442–450). Washington, DC, USA: IEEE. https://doi.org/10.1109/DCC.1996.488374en
dc.identifier.doi10.1109/DCC.1996.488374en_NZ
dc.identifier.isbn0-8186-7358-3en_NZ
dc.identifier.urihttps://hdl.handle.net/10289/11074
dc.language.isoen
dc.publisherIEEEen_NZ
dc.relation.isPartOfProceedings of the DCC '96, Data Compression Conferenceen_NZ
dc.rightsThis is an author’s accepted version of an article published in Proceedings of the DCC '96, Data Compression Conference. © 1996 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.
dc.source6th Data Compression Conference (DCC 96)en_NZ
dc.subjectScience & Technologyen_NZ
dc.subjectTechnologyen_NZ
dc.subjectComputer Science, Information Systemsen_NZ
dc.subjectComputer Scienceen_NZ
dc.subjectMachine learning
dc.subjectMachine learning
dc.titleBi-level document image compression using layout informationen_NZ
dc.typeConference Contribution
pubs.begin-page442
pubs.elements-id191927
pubs.end-page450
pubs.finish-date1996-04-03en_NZ
pubs.organisational-group/Waikato
pubs.organisational-group/Waikato/2018 PBRF
pubs.organisational-group/Waikato/FCMS
pubs.organisational-group/Waikato/FCMS/2018 PBRF - FCMS
pubs.organisational-group/Waikato/FCMS/Computer Science
pubs.place-of-publicationWashington, DC, USA
pubs.publication-statusPublisheden_NZ
pubs.start-date1996-03-31en_NZ
uow.verification.statusunverified
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Bi-level document.pdf
Size:
185.25 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Research Commons Deposit Agreement 2017.pdf
Size:
188.11 KB
Format:
Adobe Portable Document Format
Description: