Subject and citation indexing. Part I: The clustering structure of composite representations in the Cystic Fibrosis Document Collection

1991 Journal of the American Society for Information Science 10 citations

Abstract

The presence of clustering structure in the cystic fibrosis (CF) Document Collection is evaluated as a function of the exhaustivity of five composite representations. The composite representations are constructed from two subject descriptions, based on Medical Subject Headings and subheadings, and two citation indexes, based on the complete set of references in and a comprehensive set of citations to each document. Experimental results reveal observable evidence for clustering structure at all exhaustivity levels of all composite representations but also show that the evidence for clustering structure diminishes as the exhaustivity of each representation is decreased. The representation composed of references and citations shows less evidence of clustering structure at the exhaustive level but more uniform evidence of clustering structure over a wide range of exhaustivity levels than composite representations that include subject descriptions. The structures imposed on the CF Document Collection by all composite representations satisfy the necessary condition for a meaningful clustering outcome. © 1991 John Wiley & Sons, Inc.

Keywords

Cluster analysisSubject (documents)Computer scienceInformation retrievalRepresentation (politics)Field (mathematics)Document clusteringSet (abstract data type)CitationNatural language processingArtificial intelligenceMathematicsPure mathematicsWorld Wide Web

Affiliated Institutions

Related Publications

Publication Info

Year
1991
Type
article
Volume
42
Issue
9
Pages
669-675
Citations
10
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

10
OpenAlex
0
Influential
6
CrossRef

Cite This

W. M. Shaw (1991). Subject and citation indexing. Part I: The clustering structure of composite representations in the Cystic Fibrosis Document Collection. Journal of the American Society for Information Science , 42 (9) , 669-675. https://doi.org/10.1002/(sici)1097-4571(199110)42:9<669::aid-asi5>3.0.co;2-y

Identifiers

DOI
10.1002/(sici)1097-4571(199110)42:9<669::aid-asi5>3.0.co;2-y

Data Quality

Data completeness: 81%