Estimating Required Recall of Information Retrieval and Extraction for Successful Knowledge Acquisition from the Web
Information on the web is not only abundant but also redundant. This redundancy has an important consequence for the relation between the recall of an information gathering system and its capacity to harvest the core information of a given domain of knowledge. In this paper we present a new idea for estimating the web coverage that a knowledge acquisition system needs in order to achieve a desired coverage of the core information the web contains.