Abstract

This paper presents a corpus-based approach to word sense disambiguation where a decision tree assigns a sense to an ambiguous word based on the bigrams that occur nearby. This approach is evaluated using the sense-tagged corpora from the 1998 SENSEVAL word sense disambiguation exercise. It is more accurate than the average results reported for 30 of 36 words, and is more accurate than the best results for 19 of 36 words.

Keywords

BigramWord-sense disambiguationWord (group theory)Computer scienceNatural language processingArtificial intelligenceDecision treeSense (electronics)Tree (set theory)SemEvalLinguisticsMathematicsWordNet

Affiliated Institutions

Related Publications

Word Space

Representations for semantic information about words are necessary for many applications of neural networks in natural language processing. This paper describes an efficient, co...

1992 Neural Information Processing Systems 212 citations

Publication Info

Year
2001
Type
article
Pages
1-8
Citations
129
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

129
OpenAlex

Cite This

Ted Pedersen (2001). A decision tree of bigrams is an accurate predictor of word sense. , 1-8. https://doi.org/10.3115/1073336.1073347

Identifiers

DOI
10.3115/1073336.1073347