Difference between revisions of "Gradient Boosting Machine (GBM)"

From
Jump to: navigation, search
m (BPeat moved page Boosted Decision Tree Regression to Gradient Boosting Machine (GBM) without leaving a redirect)
Line 8: Line 8:
 
* [http://en.wikipedia.org/wiki/Boosting_(machine_learning) Boosting | Wikipedia]
 
* [http://en.wikipedia.org/wiki/Boosting_(machine_learning) Boosting | Wikipedia]
 
* [http://docs.microsoft.com/en-us/azure/machine-learning/studio-module-reference/boosted-decision-tree-regression Boosted Decision Tree Regression | Microsoft]
 
* [http://docs.microsoft.com/en-us/azure/machine-learning/studio-module-reference/boosted-decision-tree-regression Boosted Decision Tree Regression | Microsoft]
 +
* [http://towardsdatascience.com/boosting-algorithm-gbm-97737c63daa3 Boosting algorithm: GBM | SauceCat - Towards Data Science]
  
  
It is also known as MART (Multiple Additive Regression Trees), Boosted Decision Tree Regression, and Gradient Boosted Regression Trees (GBRT).
+
It is also known as Multiple Additive Regression Trees (MART), Boosted Decision Tree Regression, and Gradient Boosted Regression Trees (GBRT).
  
 
the ensemble is a collection of models that do not predict the real objective field of the ensemble, but rather the improvements needed for the function that computes this objective. As shown in the image above, the modeling process starts by assigning some initial values to this function, and creates a model to predict which gradient will improve the function results. The next iteration considers both the initial values and these corrections as its original state, and looks for the next gradient to improve the prediction function results even further. The process stops when the prediction function results match the real values or the number of iterations reaches a limit. As a consequence, all the models in the ensemble will always have a numeric objective field, the gradient for this function. The real objective field of the problem will then be computed by adding up the contributions of each model weighted by some coefficients. If the problem is a classification, each category (or class) in the objective field has its own subset of models in the ensemble whose goal is adjusting the function to predict this category. [http://blog.bigml.com/2017/03/14/introduction-to-boosted-trees/ Introduction to Boosted Trees | bigML]
 
the ensemble is a collection of models that do not predict the real objective field of the ensemble, but rather the improvements needed for the function that computes this objective. As shown in the image above, the modeling process starts by assigning some initial values to this function, and creates a model to predict which gradient will improve the function results. The next iteration considers both the initial values and these corrections as its original state, and looks for the next gradient to improve the prediction function results even further. The process stops when the prediction function results match the real values or the number of iterations reaches a limit. As a consequence, all the models in the ensemble will always have a numeric objective field, the gradient for this function. The real objective field of the problem will then be computed by adding up the contributions of each model weighted by some coefficients. If the problem is a classification, each category (or class) in the objective field has its own subset of models in the ensemble whose goal is adjusting the function to predict this category. [http://blog.bigml.com/2017/03/14/introduction-to-boosted-trees/ Introduction to Boosted Trees | bigML]

Revision as of 14:48, 12 January 2019

YouTube search...


It is also known as Multiple Additive Regression Trees (MART), Boosted Decision Tree Regression, and Gradient Boosted Regression Trees (GBRT).

the ensemble is a collection of models that do not predict the real objective field of the ensemble, but rather the improvements needed for the function that computes this objective. As shown in the image above, the modeling process starts by assigning some initial values to this function, and creates a model to predict which gradient will improve the function results. The next iteration considers both the initial values and these corrections as its original state, and looks for the next gradient to improve the prediction function results even further. The process stops when the prediction function results match the real values or the number of iterations reaches a limit. As a consequence, all the models in the ensemble will always have a numeric objective field, the gradient for this function. The real objective field of the problem will then be computed by adding up the contributions of each model weighted by some coefficients. If the problem is a classification, each category (or class) in the objective field has its own subset of models in the ensemble whose goal is adjusting the function to predict this category. Introduction to Boosted Trees | bigML

https://littleml.files.wordpress.com/2017/03/boosted-trees-process.png?w=497