Abstract

It is often assumed that class imbalances are responsible for significant losses of performance in standard classifiers. The purpose of this paper is to the question whether class imbalances are truly responsible for this degradation or whether it can be explained in some other way. Our experiments suggest that the problem is not directly caused by class imbalances, but rather, that class imbalances may yield small disjuncts which, in turn, will cause degradation. We argue that, in order to improve classifier performance, it may, then, be more useful to focus on the small disjuncts problem than it is to focus on the class imbalance problem. We experiment with a method that takes the small disjunct problem into consideration, and show that, indeed, it yields a performance superior to the performance obtained using standard or advanced solutions to the class imbalance problem.

Keywords

Computer scienceClass (philosophy)Focus (optics)Classifier (UML)Artificial intelligenceMachine learning

Affiliated Institutions

Related Publications

Publication Info

Year
2004
Type
article
Volume
6
Issue
1
Pages
40-49
Citations
669
Access
Closed

External Links

Social Impact

Altmetric

Social media, news, blog, policy document mentions

Citation Metrics

669
OpenAlex

Cite This

Taeho Jo, Nathalie Japkowicz (2004). Class imbalances versus small disjuncts. ACM SIGKDD Explorations Newsletter , 6 (1) , 40-49. https://doi.org/10.1145/1007730.1007737

Identifiers

DOI
10.1145/1007730.1007737