Abstract
This paper presents a corpus-based approach to word sense disambiguation where a decision tree assigns a sense to an ambiguous word based on the bigrams that occur nearby. This approach is evaluated using the sense-tagged corpora from the 1998 SENSEVAL word sense disambiguation exercise. It is more accurate than the average results reported for 30 of 36 words, and is more accurate than the best results for 19 of 36 words.
Keywords
Affiliated Institutions
Related Publications
Word Space
Representations for semantic information about words are necessary for many applications of neural networks in natural language processing. This paper describes an efficient, co...
Enriching Word Vectors with Subword Information
Continuous word representations, trained on large unlabeled corpora are useful for many natural language processing tasks. Popular models that learn such representations ignore ...
The grammar of sense: Using part-of-speech tags as a first step in semantic disambiguation
This paper describes two experiments: one exploring the amount of information relevant to sense disambiguation contained in the part-of-speech field of entries in a Machine Read...
Semantic ambiguity effects in word identification.
The influence of semantic ambiguity on word identification processes was explored in a series of word naming and lexical-decision experiments. There was no reliable ambiguity ef...
Dependency-Based Construction of Semantic Space Models
Traditionally, vector-based semantic space models use word co-occurrence counts from large corpora to represent lexical meaning. In this article we present a novel framework for...
Publication Info
- Year
- 2001
- Type
- article
- Pages
- 1-8
- Citations
- 129
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.3115/1073336.1073347