Accelerating the XGBoost algorithm using GPU computing

Abstract

We present a CUDA-based implementation of a decision tree construction algorithm within the gradient boosting library XGBoost. The tree construction algorithm is executed entirely on the graphics processing unit (GPU) and shows high performance with a variety of datasets and settings, including sparse input matrices. Individual boosting iterations are parallelised by combining two approaches: an interleaved approach is used for shallow trees, and the algorithm switches to a more conventional radix sort-based approach for larger depths. We show speedups of between 3× and 6× using a Titan X compared to a 4-core i7 CPU, and 1.2× using a Titan X compared to two Xeon CPUs (24 cores). We show that it is possible to process the Higgs dataset (10 million instances, 28 features) entirely within GPU memory. The algorithm is made available as a plug-in within the XGBoost library and fully supports all XGBoost features, including classification, regression and ranking tasks.
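The sort-based idea mentioned in the abstract can be illustrated with a small CPU sketch. This is not the authors' CUDA code: it is a plain Python stand-in that assumes the standard second-order XGBoost split gain with L2 regularisation `lam`, uses `sorted` in place of a GPU radix sort, and uses a running prefix sum in place of a parallel scan. The function name `best_split` and its parameters are hypothetical and chosen only for illustration.

```python
from itertools import accumulate


def best_split(values, grads, hess, lam=1.0):
    """Find the best threshold on a single feature (illustrative sketch).

    values: feature value per instance
    grads, hess: first- and second-order gradient statistics per instance
    lam: assumed L2 regularisation term in the gain formula
    Returns (gain, threshold); threshold is None if no split improves on 0.
    """
    if not values:
        return 0.0, None

    # Sort instance indices by feature value (stand-in for a GPU radix sort).
    order = sorted(range(len(values)), key=lambda i: values[i])

    # Inclusive prefix sums of the gradient statistics in sorted order
    # (stand-in for a parallel scan).
    g_prefix = list(accumulate(grads[i] for i in order))
    h_prefix = list(accumulate(hess[i] for i in order))
    g_total, h_total = g_prefix[-1], h_prefix[-1]
    parent_score = g_total ** 2 / (h_total + lam)

    best_gain, best_threshold = 0.0, None
    for k in range(len(order) - 1):
        lo, hi = values[order[k]], values[order[k + 1]]
        if lo == hi:
            continue  # no threshold can separate equal feature values
        g_left, h_left = g_prefix[k], h_prefix[k]
        g_right, h_right = g_total - g_left, h_total - h_left
        gain = 0.5 * (
            g_left ** 2 / (h_left + lam)
            + g_right ** 2 / (h_right + lam)
            - parent_score
        )
        if gain > best_gain:
            best_gain, best_threshold = gain, (lo + hi) / 2
    return best_gain, best_threshold


if __name__ == "__main__":
    gain, threshold = best_split(
        [3.0, 1.0, 4.0, 2.0], [1.0, -1.0, 1.0, -1.0], [1.0, 1.0, 1.0, 1.0]
    )
    print(gain, threshold)
```

Because each candidate split needs only the prefix sums at its position, every position can be evaluated independently once the values are sorted, which is what makes this formulation a natural fit for a GPU.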

Keywords

CUDA, Computer science, Xeon, Titan (rocket family), Parallel computing, Graphics processing unit, Boosting (machine learning), General-purpose computing on graphics processing units, Xeon Phi, Algorithm, sort, Speedup, Decision tree, Gradient boosting, Multi-core processor, Graphics, Artificial intelligence, Computer graphics (images), Random forest, Database

Affiliated Institutions

University of Waikato, NZ

Related Publications

GPU-acceleration for Large-scale Tree Boosting

Huan Zhang , Si Si , Cho‐Jui Hsieh

In this paper, we present a novel massively parallel algorithm for accelerating the decision tree building procedure on GPUs (Graphics Processing Units), which is a crucial step...

2017 arXiv (Cornell University) 61 citations

Evaluating the use of GPUs in liver image segmentation and HMMER database searches

John Paul Walters , Vidyananth Balu , Suryaprakash Kompalli +1 more

In this paper we present the results of parallelizing two life sciences applications, Markov random fields-based (MRF) liver segmentation and HMMER's Viterbi algorithm, using GP...

2009 2009 IEEE International Symposium on ... 61 citations

LightGBM: A Highly Efficient Gradient Boosting Decision Tree

Guolin Ke , Qi Meng , Thomas Finley +5 more

Gradient Boosting Decision Tree (GBDT) is a popular machine learning algorithm, and has quite a few effective implementations such as XGBoost and pGBRT. Although many engineerin...

2017 HAL (Le Centre pour la Communication ... 9477 citations

Neural GPUs Learn Algorithms

Łukasz Kaiser , Ilya Sutskever

Abstract: Learning an algorithm from examples is a fundamental problem that has been widely studied. Recently it has been addressed using neural networks, in particular by Neura...

2016 arXiv (Cornell University) 63 citations

In-Datacenter Performance Analysis of a Tensor Processing Unit

Norman P. Jouppi , Cliff Young , Nishant Patil +73 more

Many architects believe that major improvements in cost-energy-performance must now come from domain-specific hardware. This paper evaluates a custom ASIC---called a Tensor Proc...

2017 4222 citations

Publication Info

Year: 2017
Type: article
Volume: 3
Pages: e127-e127
Citations: 304
Access: Closed

Cite This

APA Style

Rory Mitchell, Eibe Frank (2017). Accelerating the XGBoost algorithm using GPU computing. PeerJ Computer Science, 3, e127-e127. https://doi.org/10.7717/peerj-cs.127

Identifiers

DOI: 10.7717/peerj-cs.127