L1 and L2 Regularization

From
Revision as of 12:57, 30 December 2018 by BPeat (talk | contribs) (Created page with "[http://www.youtube.com/results?search_query=L1+L2+Regularization+Dropout+Overfitting Youtube search...] [http://www.google.com/search?q=L1+L2+Regularization+Dropout+deep+mach...")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

Youtube search... ...Google search


Good practices for addressing overfitting:

  • add more data
  • use Data Augmentation
  • use batch normalization
  • use architectures that generalize well
  • reduce architecture complexity
  • add Regularization
    • L1 and L2 Regularization - update the general cost function by adding another term known as the regularization term.
    • Dropout - at every iteration, it randomly selects some nodes and temporarily removes the nodes (along with all of their incoming and outgoing connections)
    • Data Augmentation
    • Early Stopping