dc.contributor.author | Witten, Ian H. | |
dc.contributor.author | Moffat, Alistair | |
dc.contributor.author | Bell, Timothy C. | |
dc.date.accessioned | 2010-10-13T01:09:06Z | |
dc.date.available | 2010-10-13T01:09:06Z | |
dc.date.issued | 1995 | |
dc.identifier.citation | Witten, I.H., Moffat, A. & Bell, T.C. (1995). Compression and full-text indexing for Digital Libraries. SIGOIS Bulletin, 15(1), 11-13. | en_NZ |
dc.identifier.uri | https://hdl.handle.net/10289/4689 | |
dc.description.abstract | This chapter has demonstrated the feasibility of full-text indexing of large information bases. The use of modern compression techniques means that there is no space penalty: large document databases can be compressed and indexed in less than a third of the space required by the originals. Surprisingly, there is little or no time penalty either: querying can be faster because less information needs to be read from disk. Simple queries can be answered in a second; more complex ones with more query terms may take a few seconds. One important application is the creation of static databases on CD-ROM, and a 1.5 gigabyte document database can be compressed onto a standard 660 megabyte CD-ROM.
Creating a compressed and indexed document database containing hundreds of thousands of documents and gigabytes of data takes a few hours. Whereas retrieval can be done on ordinary workstations, creation requires a machine with a fair amount of main memory. | en_NZ |
dc.language.iso | en | |
dc.publisher | Springer | en_NZ |
dc.relation.uri | http://portal.acm.org/citation.cfm?id=185057.185061 | en_NZ |
dc.subject | computer science | en_NZ |
dc.subject | Digital Libraries | en_NZ |
dc.subject | compression | en_NZ |
dc.title | Compression and full-text indexing for Digital Libraries | en_NZ |
dc.type | Conference Contribution | en_NZ |
dc.identifier.doi | 10.1145/185057.185061 | en_NZ |