Difference between revisions of "Data Preprocessing"
Line 1:
− [http://www.youtube.com/results?
+ [http://www.youtube.com/results?search_query=Data+Preprocessing+Feature+Exploration+machine+learning+ML YouTube search...]
+ [http://www.google.com/search?q=Data+Preprocessing+Feature+Exploration+machine+learning+ML ...Google search]
  * [[Datasets]]
+ * [[Batch Norm(alization) & Standardization]]
+ * [[Data Preprocessing & Feature Exploration/Learning]]
  * [[Hyperparameters]]
+ * [[Data Augmentation]]
+ * [[Visualization]]
+ * [[Master Data Management (MDM) / Feature Store / Data Lineage / Data Catalog]]
  * [https://www.kaggle.com/rtatman/data-cleaning-challenge-json-txt-and-xls/ Data Cleaning Challenge: .json, .txt and .xls | Rachael Tatman]
  * The Passenger Screening Kaggle challenge [http://www.kaggle.com/c/passenger-screening-algorithm-challenge/discussion/45805 1st place solution] was won in part due to data preparation/generation.
Revision as of 11:25, 20 January 2019
YouTube search... ...Google search
- Datasets
- Batch Norm(alization) & Standardization
- Data Preprocessing & Feature Exploration/Learning
- Hyperparameters
- Data Augmentation
- Visualization
- Master Data Management (MDM) / Feature Store / Data Lineage / Data Catalog
- Data Cleaning Challenge: .json, .txt and .xls | Rachael Tatman
- The 1st place solution to the Passenger Screening Kaggle challenge won in part because of its data preparation/generation.
- Notes on Feature Preprocessing: The What, the Why, and the How | Matthew Mayo - KDnuggets
Splitting Data - training and testing sets
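As a rough illustration of splitting data into training and testing sets, the sketch below shuffles the rows with a fixed seed and holds out a fraction for testing. The function name `train_test_split`, the `test_fraction` and `seed` parameters, and the 80/20 default are assumptions made for this example, not something prescribed by this page; libraries such as scikit-learn provide their own, more complete versions.

```python
import random


def train_test_split(rows, test_fraction=0.2, seed=0):
    """Shuffle rows and split them into (train, test) lists.

    Hypothetical helper for illustration only; the parameter names and
    the 0.2 default are assumptions, not taken from the source page.
    """
    if not 0.0 < test_fraction < 1.0:
        raise ValueError("test_fraction must be strictly between 0 and 1")

    shuffled = list(rows)                   # copy so the caller's data is untouched
    random.Random(seed).shuffle(shuffled)   # seeded shuffle for reproducibility

    # Hold out at least one row for testing.
    n_test = max(1, round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


if __name__ == "__main__":
    train, test = train_test_split(range(10), test_fraction=0.2, seed=42)
    print(len(train), "training rows,", len(test), "testing rows")
```

Shuffling before the split matters when the rows are stored in a meaningful order (for example sorted by label or by date); for time series a chronological split is usually the safer choice.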
Sparse Coding - Feature extraction
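One common way to frame sparse coding as feature extraction is to represent a signal as a combination of a few dictionary atoms by minimizing 0.5 * ||x - D a||^2 + lam * ||a||_1 over the code a. The sketch below solves this with ISTA (iterative soft-thresholding), which is one choice among several solvers. The names `sparse_code` and `soft_threshold`, the fixed dictionary, and the values of `lam` and `n_iter` are all assumptions for illustration; the dictionary is taken as given and non-zero, and learning it from data is not shown.

```python
def soft_threshold(value, threshold):
    """Shrink value toward zero by threshold (proximal step for the L1 penalty)."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0


def sparse_code(dictionary, signal, lam=0.5, n_iter=200):
    """Approximate the sparse code of `signal` with ISTA.

    dictionary: list of rows, shape (n_samples, n_atoms); assumed non-zero.
    signal:     list of length n_samples.
    Hypothetical helper for illustration; not the page author's method.
    """
    n_rows = len(dictionary)
    n_atoms = len(dictionary[0])

    # Squared Frobenius norm: a simple upper bound on the Lipschitz
    # constant of the gradient, so 1 / lipschitz is a safe step size.
    lipschitz = sum(v * v for row in dictionary for v in row)

    code = [0.0] * n_atoms
    for _ in range(n_iter):
        # residual = D a - x
        residual = [
            sum(dictionary[i][j] * code[j] for j in range(n_atoms)) - signal[i]
            for i in range(n_rows)
        ]
        # gradient of the quadratic term = D^T (D a - x)
        gradient = [
            sum(dictionary[i][j] * residual[i] for i in range(n_rows))
            for j in range(n_atoms)
        ]
        # gradient step followed by soft-thresholding
        code = [
            soft_threshold(code[j] - gradient[j] / lipschitz, lam / lipschitz)
            for j in range(n_atoms)
        ]
    return code


if __name__ == "__main__":
    identity = [[1.0, 0.0], [0.0, 1.0]]
    print(sparse_code(identity, [3.0, 0.2], lam=0.5))
```

With an identity dictionary the minimizer is simply the soft-thresholded signal, so the small second component is driven to zero while the large first component is kept (shrunk by `lam`). The resulting code vector is what would be passed on as the extracted feature representation.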