Data Preprocessing
Revision as of 06:08, 6 March 2019
- Data Cleaning Challenge: .json, .txt and .xls | Rachael Tatman
- The 1st place solution in the Passenger Screening Kaggle challenge won in part because of its data preparation/generation.
- Datasets
- Batch Norm(alization) & Standardization
- Feature Exploration/Learning
- Hyperparameters
- Data Augmentation
- Visualization
- Python
- Master Data Management (MDM) / Feature Store / Data Lineage / Data Catalog
Splitting Data - training and testing sets
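As a minimal sketch of what splitting into training and testing sets means in practice, the example below shuffles a dataset and holds out a fraction of it for testing. The function name `split_train_test`, the 20% test fraction, and the seed are illustrative assumptions and do not come from this page.

```python
import random


def split_train_test(rows, test_fraction=0.2, seed=0):
    """Shuffle a copy of rows and return (train, test) lists.

    Hypothetical helper for illustration; the default fraction and
    seed are assumptions, not recommendations from the source.
    """
    if not 0.0 < test_fraction < 1.0:
        raise ValueError("test_fraction must be between 0 and 1")
    shuffled = list(rows)  # copy so the caller's data is left untouched
    random.Random(seed).shuffle(shuffled)  # seeded for reproducibility
    n_test = max(1, round(len(shuffled) * test_fraction))
    # The first n_test shuffled items become the test set, the rest train.
    return shuffled[n_test:], shuffled[:n_test]


samples = list(range(10))
train, test = split_train_test(samples, test_fraction=0.2, seed=42)
print(len(train), len(test))
```

Fixing the seed keeps the split reproducible between runs. Random shuffling is usually inappropriate for time-series data, where the test set is normally taken from the most recent observations instead. Libraries such as scikit-learn provide a ready-made `train_test_split` for the same purpose.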
Time-Series Data
- Time-based Algorithms
- A Comparison of Time Series Databases and Netsil’s Use of Druid | Netsil
- Microsoft announces the general availability of Azure Time Series Insights | Ryan Waite - Microsoft
- Top 10 Time Series Databases | Outlyer