# Difference between revisions of "Tree-based..."

(Created page with "[http://www.youtube.com/results?search_query=Decision+Tree+Regression YouTube search...] * AI Solver * Capabilities the ensemble is a collection of models that do n...") |
|||

Line 4: | Line 4: | ||

* [[Capabilities]] | * [[Capabilities]] | ||

− | + | An ensemble is a collection of models that do not predict the real objective field of the ensemble, but rather the improvements needed for the function that computes this objective. As shown in the image above, the modeling process starts by assigning some initial values to this function, and creates a model to predict which gradient will improve the function results. The next iteration considers both the initial values and these corrections as its original state, and looks for the next gradient to improve the prediction function results even further. The process stops when the prediction function results match the real values or the number of iterations reaches a limit. As a consequence, all the models in the ensemble will always have a numeric objective field, the gradient for this function. The real objective field of the problem will then be computed by adding up the contributions of each model weighted by some coefficients. If the problem is a classification, each category (or class) in the objective field has its own subset of models in the ensemble whose goal is adjusting the function to predict this category. [http://blog.bigml.com/2017/03/14/introduction-to-boosted-trees/ Introduction to Boosted Trees | bigML] | |

https://littleml.files.wordpress.com/2017/03/boosted-trees-process.png?w=497 | https://littleml.files.wordpress.com/2017/03/boosted-trees-process.png?w=497 |

## Revision as of 18:11, 3 June 2018

An ensemble is a collection of models that do not predict the real objective field of the ensemble, but rather the improvements needed for the function that computes this objective. As shown in the image above, the modeling process starts by assigning some initial values to this function, and creates a model to predict which gradient will improve the function results. The next iteration considers both the initial values and these corrections as its original state, and looks for the next gradient to improve the prediction function results even further. The process stops when the prediction function results match the real values or the number of iterations reaches a limit. As a consequence, all the models in the ensemble will always have a numeric objective field, the gradient for this function. The real objective field of the problem will then be computed by adding up the contributions of each model weighted by some coefficients. If the problem is a classification, each category (or class) in the objective field has its own subset of models in the ensemble whose goal is adjusting the function to predict this category. Introduction to Boosted Trees | bigML

https://littleml.files.wordpress.com/2017/03/boosted-trees-process.png?w=497