Difference between revisions of "Decision Jungle"

From
Jump to: navigation, search
(Created page with "[http://www.youtube.com/results?search_query=+Linear+Regression+artificial+intelligence YouTube search...] * AI Solver ** ...predict categories * Capabilities *...")
 
Line 1: Line 1:
[http://www.youtube.com/results?search_query=+Linear+Regression+artificial+intelligence YouTube search...]
+
[http://www.youtube.com/results?search_query=Decision+Jungle+artificial+intelligence YouTube search...]
  
 
* [[AI Solver]]
 
* [[AI Solver]]
 
** [[...predict categories]]
 
** [[...predict categories]]
 
* [[Capabilities]]  
 
* [[Capabilities]]  
* [http://docs.microsoft.com/en-us/azure/machine-learning/studio-module-reference/linear-regression Linear Regression | Microsoft]
 
  
Linear regression is a common statistical method, which has been adopted in machine learning and enhanced with many new methods for fitting the line and measuring error. In the most basic sense, regression refers to prediction of a numeric target. Linear regression is still a good choice when you want a very simple model for a basic predictive task. Linear regression also tends to work well on high-dimensional, sparse data sets lacking complexity. Azure Machine Learning Studio supports a variety of regression models, in addition to linear regression. However, the term "regression" can be interpreted loosely, and some types of regression provided in other tools are not supported in Studio.
+
Randomized decision trees and forests have a rich history in machine learning and have seen considerable success in application, perhaps particularly so for computer vision. However, they face a fundamental limitation: given enough data, the number of nodes in decision trees will grow exponentially with depth. For certain applications, for example on mobile or embedded processors, memory is a limited resource, and so the exponential growth of trees limits their depth, and thus their potential accuracy. Decision jungles, revisiting the idea of ensembles of rooted decision directed acyclic graphs (DAGs), and shows these to be compact and powerful discriminative models for classification. Unlike conventional decision trees that only allow one path to every node, a DAG in a decision jungle allows multiple paths from the root to each leaf. We present and compare two new node merging algorithms that jointly optimize both the features and the structure of the DAGs efficiently. During training, node splitting and node merging are driven by the minimization of exactly the same objective function, here the weighted sum of entropies at the leaves. Results on varied datasets show that, compared to decision forests and several other baselines, decision jungles require dramatically less memory while considerably improving generalization. [http://www.microsoft.com/en-us/research/publication/decision-jungles-compact-and-rich-models-for-classification/ Decision Jungles: Compact and Rich Models for Classification | Microsoft]
 +
 
 +
<youtube>q7BwufUxteE</youtube>
 +
 
 +
== Two-Class Decision Jungle ==
  
https://cdn-images-1.medium.com/max/640/1*eeIvlwkMNG1wSmj3FR6M2g.gif
+
* [http://docs.microsoft.com/en-us/azure/machine-learning/studio-module-reference/linear-regression Linear Regression | Microsoft]
  
<youtube>CtKeHnfK5uA</youtube>
+
<youtube>ErVC1Tuj4IQ</youtube>
<youtube>uwwWVAgJBcM</youtube>
 

Revision as of 19:52, 3 June 2018

YouTube search...

Randomized decision trees and forests have a rich history in machine learning and have seen considerable success in application, perhaps particularly so for computer vision. However, they face a fundamental limitation: given enough data, the number of nodes in decision trees will grow exponentially with depth. For certain applications, for example on mobile or embedded processors, memory is a limited resource, and so the exponential growth of trees limits their depth, and thus their potential accuracy. Decision jungles, revisiting the idea of ensembles of rooted decision directed acyclic graphs (DAGs), and shows these to be compact and powerful discriminative models for classification. Unlike conventional decision trees that only allow one path to every node, a DAG in a decision jungle allows multiple paths from the root to each leaf. We present and compare two new node merging algorithms that jointly optimize both the features and the structure of the DAGs efficiently. During training, node splitting and node merging are driven by the minimization of exactly the same objective function, here the weighted sum of entropies at the leaves. Results on varied datasets show that, compared to decision forests and several other baselines, decision jungles require dramatically less memory while considerably improving generalization. Decision Jungles: Compact and Rich Models for Classification | Microsoft

Two-Class Decision Jungle