Difference between revisions of "Google Facets"
m |
m |
||
Line 11: | Line 11: | ||
** [http://goo.gle/38ZUlTD Hands-on labs] | ** [http://goo.gle/38ZUlTD Hands-on labs] | ||
** [http://github.com/PAIR-code/facets Facets - GitHub] | ** [http://github.com/PAIR-code/facets Facets - GitHub] | ||
− | * [[ | + | * [[Data Science]] |
− | * [[Benchmarks]] | + | ** [[Data Governance]] |
− | * [[Batch Norm(alization) & Standardization]] | + | *** [[Benchmarks]] |
− | * [[ | + | *** [[Data Preprocessing]] |
− | * [[Feature | + | **** [[Feature Exploration/Learning]] ...inspection, data profiling, selection |
+ | **** [[Data Quality]] ...[[AI Verification and Validation|validity]], accuracy, [[Data Quality#Data Cleaning|cleaning]], completeness, consistency, uniformity, [[Data Quality#Data Encoding|encoding]], [[Data Quality#Zero Padding|padding]], [[Data Quality#Data Augmentation, Data Labeling, and Auto-Tagging|augmentation, labeling, auto-tagging]], [[Data Quality#Batch Norm(alization) & Standardization| normalization, standardization]], and [[Data Quality#Imbalanced Data|imbalanced data]] | ||
+ | *** [[Bias and Variances]] | ||
+ | *** [[Master Data Management (MDM) / Feature Store / Data Lineage / Data Catalog]] | ||
+ | **** [[Natural Language Processing (NLP)#Managed Vocabularies |Managed Vocabularies]] | ||
+ | **** [[Datasets]] | ||
+ | *** [[Privacy in Data Science]] | ||
+ | *** [[Data Interoperability]] | ||
+ | *** [[Excel - Data Analysis]] | ||
+ | * [[Visualization]] | ||
* [[Hyperparameter]]s | * [[Hyperparameter]]s | ||
− | |||
− | |||
− | |||
* [http://towardsdatascience.com/visualize-your-data-with-facets-d11b085409bc Visualize your data with Facets | Yufeng G - Towards Data Science] | * [http://towardsdatascience.com/visualize-your-data-with-facets-d11b085409bc Visualize your data with Facets | Yufeng G - Towards Data Science] | ||
Revision as of 19:34, 19 September 2020
YouTube search... ...Google search
- Facets | Google
- Data Science
- Data Governance
- Benchmarks
- Data Preprocessing
- Feature Exploration/Learning ...inspection, data profiling, selection
- Data Quality ...validity, accuracy, cleaning, completeness, consistency, uniformity, encoding, padding, augmentation, labeling, auto-tagging, normalization, standardization, and imbalanced data
- Bias and Variances
- Master Data Management (MDM) / Feature Store / Data Lineage / Data Catalog
- Privacy in Data Science
- Data Interoperability
- Excel - Data Analysis
- Data Governance
- Visualization
- Hyperparameters
- Visualize your data with Facets | Yufeng G - Towards Data Science
Facets contains two robust visualizations to aid in understanding and analyzing machine learning datasets.
- Facets Overview - get a sense of the shape of each feature of your dataset
- Facets Dive - explore individual observations
Facets Overview
Overview takes input feature data from any number of datasets, analyzes them feature by feature and visualizes the analysis. Overview gives users a quick understanding of the distribution of values across the features of their dataset(s). Uncover several uncommon and common issues such as unexpected feature values, missing feature values for a large number of observation, training/serving skew and train/test/validation set skew.
Facets Dive
Dive is a tool for interactively exploring large numbers of data points at once. Dive provides an interactive interface for exploring the relationship between data points across all of the different features of a dataset. Each individual item in the visualization represents a data point. Position items by "faceting" or bucketing them in multiple dimensions by their feature values. Success stories of Dive include the detection of classifier failure, identification of systematic errors, evaluating ground truth and potential new signals for ranking.