Difference between revisions of "Data Preprocessing"
m |
|||
| Line 10: | Line 10: | ||
* [http://www.kaggle.com/rtatman/data-cleaning-challenge-json-txt-and-xls/ Data Cleaning Challenge: .json, .txt and .xls | Rachael Tatman] | * [http://www.kaggle.com/rtatman/data-cleaning-challenge-json-txt-and-xls/ Data Cleaning Challenge: .json, .txt and .xls | Rachael Tatman] | ||
* The Passenger Screening Kaggle challenge [http://www.kaggle.com/c/passenger-screening-algorithm-challenge/discussion/45805 1st place solution] was won in part due to data preparation/generation. | * The Passenger Screening Kaggle challenge [http://www.kaggle.com/c/passenger-screening-algorithm-challenge/discussion/45805 1st place solution] was won in part due to data preparation/generation. | ||
| − | |||
* [[Datasets]] | * [[Datasets]] | ||
* [[Batch Norm(alization) & Standardization]] | * [[Batch Norm(alization) & Standardization]] | ||
Revision as of 09:22, 16 February 2019
YouTube search... ...Google search
- Data Cleaning Challenge: .json, .txt and .xls | Rachael Tatman
- The Passenger Screening Kaggle challenge 1st place solution was won in part due to data preparation/generation.
- Datasets
- Batch Norm(alization) & Standardization
- Feature Exploration/Learning
- Hyperparameters
- Data Augmentation
- Visualization
- Master Data Management (MDM) / Feature Store / Data Lineage / Data Catalog
Splitting Data - training and testing sets