Difference between revisions of "Regularization"
(→Adversarial Regularization (AdvReg)) |
m |
||
| (5 intermediate revisions by the same user not shown) | |||
| Line 17: | Line 17: | ||
* add more data | * add more data | ||
| − | * use [[Data | + | * use [[Data Quality#Batch Norm(alization) & Standardization|Batch Norm(alization) & Standardization]] |
| − | |||
* use architectures that generalize well | * use architectures that generalize well | ||
* reduce architecture complexity | * reduce architecture complexity | ||
| − | * add Regularization | + | * add [[Regularization]] |
| − | ** [[L1 and L2 Regularization]] - update the general cost function by adding another term known as the regularization term. | + | ** [[L1 and L2 Regularization]] - update the general cost function by adding another term known as the regularization term. |
| − | ** | + | ** Dropout - at every iteration, it randomly selects some nodes and temporarily removes the nodes (along with all of their incoming and outgoing connections) |
| − | ** [[Data Augmentation]] | + | ** [[Data Augmentation, Data Labeling, and Auto-Tagging|Data Augmentation]] |
| − | ** [[Early Stopping]] | + | ** [[Early Stopping]] |
Regularization is a technique which makes slight modifications to the learning algorithm such that the model generalizes better. This in turn improves the model’s performance on the unseen data as well. [http://www.analyticsvidhya.com/blog/2018/04/fundamentals-deep-learning-regularization-techniques/ An Overview of Regularization Techniques in Deep Learning (with Python code) | Shubham Jain] | Regularization is a technique which makes slight modifications to the learning algorithm such that the model generalizes better. This in turn improves the model’s performance on the unseen data as well. [http://www.analyticsvidhya.com/blog/2018/04/fundamentals-deep-learning-regularization-techniques/ An Overview of Regularization Techniques in Deep Learning (with Python code) | Shubham Jain] | ||
| Line 39: | Line 38: | ||
<youtube>u73PU6Qwl1I</youtube> | <youtube>u73PU6Qwl1I</youtube> | ||
| − | <youtube> | + | <youtube>dEhGM708xUs</youtube> |
| + | <youtube>4nqD5TBlOWU</youtube> | ||
<youtube>ctmNq7FgbvI</youtube> | <youtube>ctmNq7FgbvI</youtube> | ||
<youtube>KIoz_aa1ed4</youtube> | <youtube>KIoz_aa1ed4</youtube> | ||
Latest revision as of 19:18, 19 September 2020
Youtube search... ...Google search
Good practices for addressing the Overfitting Challenge:
- add more data
- use Batch Norm(alization) & Standardization
- use architectures that generalize well
- reduce architecture complexity
- add Regularization
- L1 and L2 Regularization - update the general cost function by adding another term known as the regularization term.
- Dropout - at every iteration, it randomly selects some nodes and temporarily removes the nodes (along with all of their incoming and outgoing connections)
- Data Augmentation
- Early Stopping
Regularization is a technique which makes slight modifications to the learning algorithm such that the model generalizes better. This in turn improves the model’s performance on the unseen data as well. An Overview of Regularization Techniques in Deep Learning (with Python code) | Shubham Jain
A machine learning model can overcome underfitting by adding more parameters, although its complexity increases and will require more efforts for interpretation. However, a real dilemma of a data scientist is that minimizing the prediction errors which are decomposed due to the bias and/or variance somehow turns into overfitting problems. Lasso, Ridge, and Elastic Net are popular ways of regularized statistical modeling approaches... Regression Analysis: Lasso, Ridge, and Elastic Net | Sung Kim
- Regression Models:
Adversarial Regularization (AdvReg)
- SMART - Multi-Task Deep Neural Networks (MT-DNN)
- Adversarial regularization for image classification | Google