Difference between revisions of "XGBoost; eXtreme Gradient Boosted trees"

Line 17:
 
* Three main forms of gradient boosting:
 
** [[Gradient Boosting Machine (GBM)]]
 
** Stochastic Gradient Boosting  
+
** [http://machinelearningmastery.com/gentle-introduction-gradient-boosting-algorithm-machine-learning/ Stochastic Gradient Boosting] - a key insight from bagging ensembles and random forests is that trees can be greedily built from subsamples of the training dataset
** Regularized Gradient Boosting
+
** [http://scikit-learn.org/stable/auto_examples/ensemble/plot_gradient_boosting_regularization.html Regularized Gradient Boosting] - regularization via shrinkage improves performance considerably. In combination with shrinkage, stochastic gradient boosting can produce more accurate models by reducing the variance via [[Multiclassifiers; Ensembles and Hybrids; Bagging, Boosting, and Stacking | bagging]].
  
  

Revision as of 15:56, 27 July 2020



Its name stands for eXtreme Gradient Boosting. It was developed by Tianqi Chen and is now part of a wider collection of open-source libraries developed by the Distributed Machine Learning Community (DMLC). XGBoost is a scalable and accurate implementation of gradient boosting machines. It was built for the sole purpose of model performance and computational speed, and it has pushed the limits of computing power for boosted-tree algorithms. Specifically, it was engineered to make full use of memory and hardware resources for tree boosting. The implementation offers several advanced features for model tuning, computing environments and algorithm enhancement. It can perform the three main forms of gradient boosting (Gradient Boosting (GB), Stochastic GB and Regularized GB), and it is robust enough to support fine-tuning and the addition of regularization parameters. According to Tianqi Chen, the latter is what makes it superior to, and different from, other libraries. XGBoost, a Top Machine Learning Method on Kaggle, Explained | Ilan Reinstein - KDnuggets
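The three forms of gradient boosting named above can be illustrated with a small sketch. This is not XGBoost's implementation: it is a minimal pure-Python booster on one-dimensional data, assuming squared-error loss and one-split regression stumps. The names <code>fit_stump</code>, <code>boost</code>, <code>learning_rate</code> and <code>subsample</code> are chosen for illustration only. Plain GB uses every row and full-size steps, stochastic GB fits each stump on a random subsample, and regularized GB (here, shrinkage) scales each stump's contribution down.

```python
import random

def fit_stump(xs, residuals):
    """Least-squares fit of a one-split regression stump to the residuals."""
    best = None
    for threshold in sorted(set(xs)):
        left = [r for x, r in zip(xs, residuals) if x <= threshold]
        right = [r for x, r in zip(xs, residuals) if x > threshold]
        if not left or not right:
            continue
        lmean = sum(left) / len(left)
        rmean = sum(right) / len(right)
        sse = (sum((r - lmean) ** 2 for r in left)
               + sum((r - rmean) ** 2 for r in right))
        if best is None or sse < best[0]:
            best = (sse, threshold, lmean, rmean)
    if best is None:  # no valid split: fall back to a constant prediction
        mean = sum(residuals) / len(residuals)
        return (float("inf"), mean, mean)
    return best[1:]

def predict_stump(stump, x):
    threshold, left_value, right_value = stump
    return left_value if x <= threshold else right_value

def boost(xs, ys, n_rounds=50, learning_rate=1.0, subsample=1.0, seed=0):
    """Gradient boosting for squared error.

    learning_rate < 1.0 applies shrinkage (regularized GB);
    subsample < 1.0 fits each stump on a random subset (stochastic GB).
    """
    rng = random.Random(seed)
    base = sum(ys) / len(ys)          # initial constant model
    preds = [base] * len(xs)
    stumps = []
    for _ in range(n_rounds):
        # For squared error, the negative gradient is the residual.
        residuals = [y - p for y, p in zip(ys, preds)]
        if subsample < 1.0:
            k = max(2, int(subsample * len(xs)))
            idx = rng.sample(range(len(xs)), k)
        else:
            idx = range(len(xs))
        stump = fit_stump([xs[i] for i in idx], [residuals[i] for i in idx])
        stumps.append(stump)
        preds = [p + learning_rate * predict_stump(stump, x)
                 for p, x in zip(preds, xs)]
    return base, learning_rate, stumps

def predict(model, x):
    base, learning_rate, stumps = model
    return base + learning_rate * sum(predict_stump(s, x) for s in stumps)

def mse(model, xs, ys):
    return sum((predict(model, x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

# Toy data (hypothetical): y = x^2 on a small grid.
xs = [i / 10 for i in range(40)]
ys = [x * x for x in xs]

plain = boost(xs, ys)                                        # Gradient Boosting
stochastic = boost(xs, ys, learning_rate=0.1, subsample=0.5)  # Stochastic GB
regularized = boost(xs, ys, learning_rate=0.1)               # shrinkage only
```

In the XGBoost Python package, the corresponding controls are, as far as the author of this sketch is aware, parameters such as <code>learning_rate</code> (shrinkage), <code>subsample</code> (row subsampling), and <code>reg_lambda</code> / <code>reg_alpha</code> (L2 / L1 penalties on leaf weights); the sketch above does not model those penalties.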

xgb1.png