Difference between revisions of "L1 and L2 Regularization"

From
Jump to: navigation, search
(Created page with "[http://www.youtube.com/results?search_query=L1+L2+Regularization+Dropout+Overfitting Youtube search...] [http://www.google.com/search?q=L1+L2+Regularization+Dropout+deep+mach...")
(No difference)

Revision as of 12:57, 30 December 2018

Youtube search... ...Google search


Good practices for addressing overfitting:

  • add more data
  • use Data Augmentation
  • use batch normalization
  • use architectures that generalize well
  • reduce architecture complexity
  • add Regularization
    • L1 and L2 Regularization - update the general cost function by adding another term known as the regularization term.
    • Dropout - at every iteration, it randomly selects some nodes and temporarily removes the nodes (along with all of their incoming and outgoing connections)
    • Data Augmentation
    • Early Stopping