Bi-level document image compression using layout information

Inglis, Stuart J.; Witten, Ian H.

doi:10.1109/DCC.1996.488374

Bi-level document image compression using layout information

Authors

Inglis, Stuart J.

Witten, Ian H.

Files

Bi-level document.pdf (185.25 KB)

Permanent Link

https://hdl.handle.net/10289/11074

DOI

10.1109/DCC.1996.488374

Rights

This is an author’s accepted version of an article published in Proceedings of the DCC '96, Data Compression Conference. © 1996 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

Abstract

Most bi-level images stored on computers today comprise scanned text, and are stored using generic bi-level image technology based either on classical run-length coding, such as the CCITT Group 4 method, or on modern schemes such as JBIG that predict pixels from their local image context. However, image compression methods that are tailored specifically for images known to contain printed text can provide noticeably superior performance because they effectively enlarge the context to the character level, at least for those predictions for which such a context is relevant. To deal effectively with general documents that contain text and pictures, it is necessary to detect layout and structural information from the image, and employ different compression techniques for different parts of the image. The authors extend previous work in document image compression in two ways. First, we include automatic discrimination between text and non-text zones in an image. Second, the system is tested on a large real-world image corpus.

Citation

Inglis, S. J., & Witten, I. H. (1996). Bi-level document image compression using layout information. In J. A. Storer & M. Cohn (Eds.), Proceedings of the DCC ’96, Data Compression Conference (pp. 442–450). Washington, DC, USA: IEEE. https://doi.org/10.1109/DCC.1996.488374

Type

Conference Contribution

Date

1996

Publisher

IEEE

Bi-level document image compression using layout information

Authors

Files

Permanent Link

DOI

Publisher link

Rights

Abstract

Citation

Type

Series name

Date

Publisher

Degree

Type of thesis

Supervisor