Difference between revisions of "Data Preprocessing"
Line 21 (old) / Line 21 (new):
  <youtube>0xVqLJe9_CY</youtube>
− (empty line removed)
  <youtube>TK-2189UcKk</youtube>
  <youtube>7fcWfUavO7E</youtube>
Line 48 (old) / Line 47 (new):
  http://azurecomcdn.azureedge.net/mediahandler/acomblog/media/Default/blog/578a09a1-f144-4a62-98cb-e6e3ed774817.png
+
+ == SQL Database Optimization ==
+
+ <youtube>Rw3ewEXOKC8</youtube>
+ <youtube>dUrLYznFbpQ</youtube>
Revision as of 10:48, 20 April 2019
- Data Cleaning Challenge: .json, .txt and .xls | Rachael Tatman
- The 1st place solution to the Passenger Screening Kaggle challenge won in part due to data preparation/generation.
- Datasets
- Batch Norm(alization) & Standardization
- Feature Exploration/Learning
- Hyperparameters
- Data Augmentation
- Visualization
- Python
- Master Data Management (MDM) / Feature Store / Data Lineage / Data Catalog
Splitting Data - training and testing sets
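As a minimal sketch of what this heading refers to, the snippet below shuffles a dataset and holds out a fraction of it for testing. The function name `train_test_split`, the `test_fraction` and `seed` parameters, and the toy `samples` list are all illustrative assumptions, not something taken from this page or from a particular library.

```python
import random


def train_test_split(rows, test_fraction=0.2, seed=0):
    """Shuffle a copy of `rows` and return (train, test) lists.

    Hypothetical helper for illustration only; the names and the
    default 80/20 split are assumptions, not a prescribed method.
    """
    if not 0.0 < test_fraction < 1.0:
        raise ValueError("test_fraction must be strictly between 0 and 1")
    shuffled = list(rows)                   # copy so the caller's data is untouched
    random.Random(seed).shuffle(shuffled)   # seeded shuffle for reproducibility
    n_test = max(1, round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


# Toy data standing in for real examples.
samples = list(range(10))
train, test = train_test_split(samples, test_fraction=0.2, seed=42)
print(len(train), len(test))
```

The test rows are never seen during training, so the score on them estimates how the model behaves on new data. A random shuffle like this one suits independent samples; for time-ordered data a chronological cut is usually preferred so that the model is not evaluated on the past of what it trained on.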
Time-Series Data
- Time-based Algorithms
- A Comparison of Time Series Databases and Netsil’s Use of Druid | Netsil
- Microsoft announces the general availability of Azure Time Series Insights | Ryan Waite - Microsoft
- Top 10 Time Series Databases | Outlyer
SQL Database Optimization