Abstract
Most probabilistic retrieval models incorporate information about the occurrence of index terms in relevant and non‐relevant documents. In this paper we consider the situation where no relevance information is available, that is, at the start of the search. Based on a probabilistic model, strategies are proposed for the initial search and an intermediate search. Retrieval experiments with the Cranfield collection of 1,400 documents show that this initial search strategy is better than conventional search strategies both in terms of retrieval effectiveness and in terms of the number of queries that retrieve relevant documents. The intermediate search is shown to be a useful substitute for a relevance feedback search. Experiments with queries that do not retrieve relevant documents at high rank positions indicate that a cluster search would be an effective alternative strategy.
Keywords
Affiliated Institutions
Related Publications
Indexing by latent semantic analysis
A new method for automatic indexing and retrieval is described. The approach is to take advantage of implicit higher-order structure in the association of terms with documents (...
Evaluation of Text Summarization in a Cross-lingual Information Retrieval Framework
We report on research in multi-document summarization and on evaluation of summarization in the framework of cross-lingual information retrieval. This work was carried out durin...
An evaluation of retrieval effectiveness for a full-text document-retrieval system
An evaluation of a large, operational full-text document-retrieval system (containing roughly 350,000 pages of text) shows the system to be retrieving less than 20 percent of th...
Document Language Models, Query Models, and Risk Minimization for Information Retrieval
We present a framework for information retrieval that combines document models and query models using a probabilistic ranking function based on Bayesian decision theory. The fra...
Using Linear Algebra for Intelligent Information Retrieval
Currently, most approaches to retrieving textual materials from scientific databases depend on a lexical match between words in users’ requests and those in or assigned to docum...
Publication Info
- Year
- 1979
- Type
- article
- Volume
- 35
- Issue
- 4
- Pages
- 285-295
- Citations
- 432
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.1108/eb026683