Difference between revisions of "Datasets"

From
Jump to: navigation, search
m
m
Line 11: Line 11:
 
** [[Data Governance]]
 
** [[Data Governance]]
 
*** [[Data Science]]
 
*** [[Data Science]]
 +
*** [[Master Data Management  (MDM) / Feature Store / Data Lineage / Data Catalog]]
 +
*** [[Natural Language Processing (NLP)#Managed Vocabularies |Managed Vocabularies]]
 
*** [[Benchmarks]]
 
*** [[Benchmarks]]
 
*** [[Batch Norm(alization) & Standardization]]
 
*** [[Batch Norm(alization) & Standardization]]
Line 19: Line 21:
 
* [[Visualization]]
 
* [[Visualization]]
 
** [[Google Facets| Facets]] | [[Google]]...contains two robust [[Visualization]]s to aid in understanding and analyzing machine learning datasets.
 
** [[Google Facets| Facets]] | [[Google]]...contains two robust [[Visualization]]s to aid in understanding and analyzing machine learning datasets.
* [[Master Data Management  (MDM) / Feature Store / Data Lineage / Data Catalog]]
 
* [[Natural Language Processing (NLP)#Managed Vocabularies |Managed Vocabularies]]
 
 
* [http://www.openml.org/search?type=data OpenML datasets]
 
* [http://www.openml.org/search?type=data OpenML datasets]
 
* [http://pathmind.com/wiki/datasets-ml Datasets and Machine Learning | Chris Nicholson - A.I. Wiki pathmind]
 
* [http://pathmind.com/wiki/datasets-ml Datasets and Machine Learning | Chris Nicholson - A.I. Wiki pathmind]

Revision as of 11:17, 7 September 2020

YouTube search... ...Google search


Datasets (often in combination with algorithms) are becoming more important themselves and can sometimes be seen as the primary intellectual output of the research. The revelations about Cambridge Analytica highlights the importance of datasets and data collection. Reference also: Privacy in Data Science

Sources

Networks

Articles