Searching ... in a Web

Witten, Ian H.

doi:10.3217/jucs-014-10-1739

Searching ... in a Web

Authors

Witten, Ian H.

Permanent Link

https://hdl.handle.net/10289/1774

DOI

10.3217/jucs-014-10-1739

Abstract

Search engines—“web dragons”—are the portals through which we access society’s treasure trove of information. They do not publish the algorithms they use to sort and filter information, yet what they do and how they do it are amongst the most important questions of our time. They deal not just with information per se, but evaluate it in order to prioritize it for the user. To do this they assess the prestige of each web page in terms of who links to it. This article explains in non-technical terms what is known about how web search engines work. We describe the dominant way of measuring prestige, relating it to the experience of a surfer condemned to click randomly around the web forever—and also to standard techniques of bibliometric evaluation. We review alternatives: some strive to identify subcommunities of the web; others learn based on implicit user feedback. We also takes a critical look at how people use search engines, and identify issues of bias, privacy, and personalization that crucially affect our world of information today.

Citation

Witten, I. H. (2008). Search ... in a Web. Journal of Universal Computer Science, 14(10), 1739-1762.

Type

Journal Article

Date

2008

Publisher

Institut fuer Informationssysteme und Computer Medien

Searching ... in a Web

Authors

Permanent Link

DOI

Publisher link

Rights

Abstract

Citation

Type

Series name

Date

Publisher

Degree

Type of thesis

Supervisor