Difference between revisions of "Data Preprocessing"
(→Time-Series Data) |
|||
| Line 46: | Line 46: | ||
http://azurecomcdn.azureedge.net/mediahandler/acomblog/media/Default/blog/578a09a1-f144-4a62-98cb-e6e3ed774817.png | http://azurecomcdn.azureedge.net/mediahandler/acomblog/media/Default/blog/578a09a1-f144-4a62-98cb-e6e3ed774817.png | ||
| + | |||
| + | == Pandas == | ||
| + | [http://pandas.pydata.org pandas.pydata.org] | ||
| + | http://i.pinimg.com/originals/39/08/5c/39085c27945ad3eb49e0de7dff6f0b0e.png | ||
Revision as of 05:59, 6 March 2019
YouTube search... ...Google search
- Data Cleaning Challenge: .json, .txt and .xls | Rachael Tatman
- The Passenger Screening Kaggle challenge 1st place solution was won in part due to data preparation/generation.
- Datasets
- Batch Norm(alization) & Standardization
- Feature Exploration/Learning
- Hyperparameters
- Data Augmentation
- Visualization
- Master Data Management (MDM) / Feature Store / Data Lineage / Data Catalog
Splitting Data - training and testing sets
Time-Series Data
- Time-based Algorithms
- A Comparison of Time Series Databases and Netsil’s Use of Druid | Netsil
- Microsoft announces the general availability of Azure Time Series Insights | Ryan Waite - Microsoft
- Top 10 Time Series Databases | Outlyer