Abstract

Fraud in financial services, especially account opening fraud, poses major operational and reputational risks. Static rules struggle to adapt to evolving tactics, missing novel patterns and generating excessive false positives. Machine learning promises adaptive detection, but deployment faces severe class imbalance: in the NeurIPS 2022 BAF Base benchmark used here, fraud prevalence is 1.10%. Standard metrics (accuracy, f1_weighted) can look strong while doing little for the minority class. We compare Logistic Regression, SVM (RBF), Random Forest, LightGBM, and a GRU model on N = 1,000,000 accounts under a unified preprocessing pipeline. All models are trained to minimize their respective loss functions, and configurations are selected on a stratified development set by weighted F1-score (f1_weighted). For the four classical models, class weighting in the loss (class_weight ∈ {None, 'balanced'}) is treated as a hyperparameter and tuned. The GRU is trained with a fixed class-weighted cross-entropy loss that up-weights fraud cases. Both model families therefore leverage weighted training objectives, while their final hyperparameters are consistently selected by the same f1_weighted metric. Despite similar AUCs and aligned feature importance across families, the classical models converge to high-precision, low-recall solutions (1–6% fraud recall), whereas the GRU recovers 78% recall at 5% precision (AUC = 0.8800). Under extreme imbalance, objective choice and operating point matter at least as much as architecture.
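
As a rough illustration of the training setup described in the abstract, the sketch below pairs (i) class_weight ∈ {None, 'balanced'} tuned as a hyperparameter with f1_weighted model selection, shown here with scikit-learn's GridSearchCV, and (ii) a fixed class-weighted cross-entropy objective for a small GRU in PyTorch. The libraries, grid values, class-weight magnitude, and network sizes are illustrative assumptions, not details taken from the paper.

```python
# Illustrative sketch only: hyperparameter values, the class-weight magnitude,
# and the GRU architecture are assumptions, not the paper's configuration.
import torch
import torch.nn as nn
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, StratifiedKFold

# --- Classical models: class_weight in {None, 'balanced'} treated as a hyperparameter ---
param_grid = {
    "C": [0.1, 1.0, 10.0],               # illustrative regularization grid
    "class_weight": [None, "balanced"],  # loss weighting is itself tuned
}
search = GridSearchCV(
    LogisticRegression(max_iter=1000),
    param_grid,
    scoring="f1_weighted",               # configurations selected by weighted F1
    cv=StratifiedKFold(n_splits=5, shuffle=True, random_state=0),
)
# search.fit(X_train, y_train)           # X_train, y_train: preprocessed BAF features/labels

# --- GRU: fixed class-weighted cross-entropy that up-weights fraud cases ---
# With ~1.1% fraud prevalence the positive class needs a much larger weight;
# 90.0 is a placeholder, not the value used in the paper.
class_weights = torch.tensor([1.0, 90.0])
criterion = nn.CrossEntropyLoss(weight=class_weights)

class FraudGRU(nn.Module):
    def __init__(self, n_features: int, hidden: int = 64):
        super().__init__()
        self.gru = nn.GRU(n_features, hidden, batch_first=True)
        self.head = nn.Linear(hidden, 2)  # two logits: legitimate vs. fraud

    def forward(self, x):                 # x: (batch, seq_len, n_features)
        _, h = self.gru(x)
        return self.head(h[-1])

# loss = criterion(FraudGRU(n_features=32)(batch_x), batch_y)  # one training step's loss
```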

Publication Info

Year: 2025
Type: article
Volume: 13
Issue: 12
Pages: 290-290
Citations: 0
Access: Closed

Cite This

Qiang Shen, Yijun Gao, Qingqing Mao et al. (2025). Objective over Architecture: Fraud Detection Under Extreme Imbalance in Bank Account Opening. Computation, 13(12), 290-290. https://doi.org/10.3390/computation13120290

Identifiers

DOI: 10.3390/computation13120290