L1 and L2 Regularization

[http://www.youtube.com/results?search_query=L1+L2+Regularization+Dropout+Overfitting YouTube search...] ...Google search
Good practices for addressing overfitting:
- add more data
- use Data Augmentation
- use Batch Normalization (see the sketch after this list)
- use architectures that generalize well
- reduce architecture complexity
- add Regularization, where common techniques include:
  - L1 and L2 Regularization - update the general cost function by adding a penalty term, known as the regularization term, that discourages large weights; L1 penalizes the sum of absolute weight values and L2 the sum of squared weight values (see the formulas and sketch after this list)
  - Dropout - at every training iteration, randomly select some nodes and temporarily remove them, along with all of their incoming and outgoing connections (see the sketch after this list)
  - Data Augmentation - enlarge the training set with randomly transformed copies of existing examples, such as flipped or rotated images (see the sketch after this list)
  - Early Stopping - halt training once performance on a held-out validation set stops improving (see the sketch after this list)
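For the Batch Normalization item above, a minimal sketch assuming TensorFlow/Keras is available; the layer sizes and input shape are illustrative assumptions, not values from this page:

```python
import tensorflow as tf
from tensorflow.keras import layers

model = tf.keras.Sequential([
    layers.Dense(64, input_shape=(20,)),
    # Normalize each batch's activations toward zero mean / unit variance
    # (with a learned scale and shift), which stabilizes training and has a
    # mild regularizing effect.
    layers.BatchNormalization(),
    layers.Activation("relu"),
    layers.Dense(1, activation="sigmoid"),
])
```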
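For L1 and L2 Regularization, the penalized cost functions can be written as follows; this is a sketch using J(w) for the unregularized cost and lambda for the regularization strength, notation this page does not itself fix:

```latex
% L1 (lasso): penalize the sum of absolute weight values; encourages sparsity.
J_{L1}(w) = J(w) + \lambda \sum_{i} |w_i|

% L2 (ridge / weight decay): penalize the sum of squared weight values;
% encourages many small weights rather than a few large ones.
J_{L2}(w) = J(w) + \lambda \sum_{i} w_i^2
```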
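A minimal code sketch of L1 and L2 Regularization, assuming TensorFlow/Keras; the layer sizes, input shape, and 0.01 penalty strengths are illustrative assumptions:

```python
import tensorflow as tf
from tensorflow.keras import layers, regularizers

model = tf.keras.Sequential([
    # L2 penalty: adds 0.01 * sum(w^2) over this layer's weights to the loss.
    layers.Dense(64, activation="relu", input_shape=(20,),
                 kernel_regularizer=regularizers.l2(0.01)),
    # L1 penalty: adds 0.01 * sum(|w|); tends to drive some weights to exactly zero.
    layers.Dense(64, activation="relu",
                 kernel_regularizer=regularizers.l1(0.01)),
    layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy")
```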
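A minimal sketch of Dropout, again assuming TensorFlow/Keras; the 0.5 drop rate is a common illustrative default, not a value from this page:

```python
import tensorflow as tf
from tensorflow.keras import layers

model = tf.keras.Sequential([
    layers.Dense(128, activation="relu", input_shape=(20,)),
    # At each training step, roughly 50% of this layer's outputs are zeroed at
    # random, removing those nodes and their connections for that step;
    # dropout is automatically disabled at inference time.
    layers.Dropout(0.5),
    layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy")
```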
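A minimal sketch of Data Augmentation for images, assuming a recent TensorFlow (2.6+) where these preprocessing layers live under tf.keras.layers; the choice of transforms and their ranges are illustrative assumptions:

```python
import tensorflow as tf
from tensorflow.keras import layers

# Applied during training, each image passes through random transforms,
# so the model rarely sees the exact same example twice.
augment = tf.keras.Sequential([
    layers.RandomFlip("horizontal"),  # mirror images left/right at random
    layers.RandomRotation(0.1),       # rotate by up to +/-10% of a full turn
    layers.RandomZoom(0.1),           # zoom in/out by up to 10%
])
```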
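A minimal sketch of Early Stopping, assuming TensorFlow/Keras; the monitored metric and patience value are illustrative assumptions:

```python
import tensorflow as tf

early_stop = tf.keras.callbacks.EarlyStopping(
    monitor="val_loss",         # watch the validation loss
    patience=5,                 # stop after 5 epochs without improvement
    restore_best_weights=True,  # roll back to the best epoch's weights
)
# Used by passing it to training, e.g.:
# model.fit(x_train, y_train, validation_data=(x_val, y_val),
#           epochs=100, callbacks=[early_stop])
```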