Abstract

A system is proposed that combines textual and visual statistics in a single index vector for content-based search of a WWW image database. Textual statistics are captured in vector form using latent semantic indexing based on text in the containing HTML document. Visual statistics are captured in vector form using color and orientation histograms. By using an integrated approach, it becomes possible to take advantage of possible statistical couplings between the content of the document (latent semantic content) and the contents of images (visual statistics). The combined approach allows improved performance in conducting content-based search. Search performance experiments are reported for a database containing 350,000 images collected from the WWW.

Keywords

Computer scienceInformation retrievalContent (measure theory)Content-based image retrievalArtificial intelligenceImage (mathematics)Image retrievalComputer visionMathematics

Affiliated Institutions

Related Publications

Publication Info

Year
1999
Type
article
Volume
75
Issue
1-2
Pages
86-98
Citations
130
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

130
OpenAlex

Cite This

Stan Sclaroff, Marco La Cascia, Saratendu Sethi et al. (1999). Unifying Textual and Visual Cues for Content-Based Image Retrieval on the World Wide Web. Computer Vision and Image Understanding , 75 (1-2) , 86-98. https://doi.org/10.1006/cviu.1999.0765

Identifiers

DOI
10.1006/cviu.1999.0765