Bi-level document image compression using layout information

Inglis, Stuart J.; Witten, Ian H.

Bi-level document image compression using layout information

Authors

Inglis, Stuart J.

Witten, Ian H.

Files

uow-cs-wp-1996-01.pdf (16.54 MB)

Permanent Link

https://hdl.handle.net/10289/1154

Abstract

Most bi-level images stored on computers today comprise scanned text, and their number is escalating because of the drive to archive large volumes of paper-based material electronically. These documents are stored using generic bi-level image technology, based either on classical run-length coding, such as the CCITT Group 4 method, or on modern schemes such as JBIG that predict pixels from their local image context. However, image compression methods that are tailored specifically for images known to contain printed text can provide noticeably superior performance because they effectively enlarge the context to the character level, at least for those predictions for which such a context is relevant. To deal effectively with general documents that contain text and pictures, it is necessary to detect layout and structural information from the image, and employ different compression techniques for different parts of the image. Such techniques are called document image compression methods.

Citation

Jones, S. & Marsh, S. (1996). Bi-level document image compression using layout information. (Working paper 96/01). Hamilton, New Zealand: University of Waikato, Department of Computer Science.

Type

Working Paper

Series name

Computer Science Working Papers

Date

1996-01

Bi-level document image compression using layout information

Authors

Files

Permanent Link

Publisher link

Rights

Abstract

Citation

Type

Series name

Date

Publisher

Degree

Type of thesis

Supervisor