Difference between revisions of "Feature Exploration/Learning"

From
Jump to: navigation, search
m
m
Line 54: Line 54:
 
||
 
||
 
<youtube>WVclIFyCCOo</youtube>
 
<youtube>WVclIFyCCOo</youtube>
<b>HH2
+
<b>Visualize your Data with Facets
</b><br>BB2
+
</b><br>In this episode of AI Adventures, Yufeng explains how to use Facets, a project from Google Research, to visualize your dataset, find interesting relationships, and clean your data for machine learning.  Learn more through our hands-on labs → http://goo.gle/38ZUlTD  Associated Medium post "Visualize your data with Facets": http://goo.gl/7FDWwk  Get Facets on GitHub: http://goo.gl/Xi8dTu
 +
Play with Facets in the browser: http://goo.gl/fFLCEV    Watch more AI Adventures on the playlist: http://goo.gl/UC5usG  Subscribe to get all the episodes as they come out: http://goo.gl/S0AS51  #AIAdventures
 
|}
 
|}
 
|}<!-- B -->
 
|}<!-- B -->
Line 63: Line 64:
 
||
 
||
 
<youtube>KvZ2KSxlWBY</youtube>
 
<youtube>KvZ2KSxlWBY</youtube>
<b>HH3
+
<b>Stephen Elston - Data Visualization and Exploration with [[Python]]
</b><br>BB3
+
</b><br>Visualization is an essential method in any data scientist’s toolbox and is a key data exploration method and is a powerful tool for presentation of results and understanding problems with analytics. Attendees are introduced to [[Python]] visualization packages, Matplotlib, Pandas, and Seaborn. [http://github.com/StephenElston/ExploringDataWithPython The Jupyter notebook]  Visualization of complex real-world datasets presents a number of challenges to data scientists. By developing skills in data visualization, data scientists can confidently explore and understand the relationships in complex data sets. Using the [[Python]] matplotlib, pandas plotting and seaborn packages attendees will learn to: • Explore complex data sets with visualization, to develop understanding of the inherent relationships. • Create multiple views of data to highlight different aspects of the inherent relationships, with different graph types. • Use plot aesthetics to project multiple dimensions. • Apply conditioning or faceting methods to project multiple dimensions  www.pydata.org
 
|}
 
|}
 
|<!-- M -->
 
|<!-- M -->
Line 71: Line 72:
 
||
 
||
 
<youtube>yQsOFWqpjkE</youtube>
 
<youtube>yQsOFWqpjkE</youtube>
<b>HH4
+
<b>The Best Way to Visualize a Dataset Easily
</b><br>BB4
+
</b><br>[[Creatives#Siraj Raval|Siraj Raval]]  In this video, we'll visualize a dataset of body metrics collected by giving people a fitness tracking device. We'll go over the steps necessary to preprocess the data, then use a technique called T-SNE to reduce the dimensionality of our data so we can visualize it.
 +
 
 
|}
 
|}
 
|}<!-- B -->
 
|}<!-- B -->
  
 
= <span id="Feature Selection"></span>Feature Selection =
 
= <span id="Feature Selection"></span>Feature Selection =
 +
[http://www.youtube.com/results?search_query=Feature+Selection+machine+learning+ML YouTube search...]
 +
[http://www.google.com/search?q=Feature+Selection+machine+learning+ML ...Google search]
 +
 +
{|<!-- T -->
 +
| valign="top" |
 +
{| class="wikitable" style="width: 550px;"
 +
||
 
<youtube>V0u6bxQOUJ8</youtube>
 
<youtube>V0u6bxQOUJ8</youtube>
 +
<b>HH1
 +
</b><br>BB1
 +
|}
 +
|<!-- M -->
 +
| valign="top" |
 +
{| class="wikitable" style="width: 550px;"
 +
||
 
<youtube>_XOKz5VlTQY</youtube>
 
<youtube>_XOKz5VlTQY</youtube>
 +
<b>HH2
 +
</b><br>BB2
 +
|}
 +
|}<!-- B -->
 +
{|<!-- T -->
 +
| valign="top" |
 +
{| class="wikitable" style="width: 550px;"
 +
||
 
<youtube>arhdVDsPLVI</youtube>
 
<youtube>arhdVDsPLVI</youtube>
 +
<b>HH3
 +
</b><br>BB3
 +
|}
 +
|<!-- M -->
 +
| valign="top" |
 +
{| class="wikitable" style="width: 550px;"
 +
||
 
<youtube>TsqTuwTKFSs</youtube>
 
<youtube>TsqTuwTKFSs</youtube>
 +
<b>HH4
 +
</b><br>BB4
 +
|}
 +
|}<!-- B -->
 +
{|<!-- T -->
 +
| valign="top" |
 +
{| class="wikitable" style="width: 550px;"
 +
||
 
<youtube>YaKMeAlHgqQ</youtube>
 
<youtube>YaKMeAlHgqQ</youtube>
 +
<b>HH5
 +
</b><br>BB5
 +
|}
 +
|<!-- M -->
 +
| valign="top" |
 +
{| class="wikitable" style="width: 550px;"
 +
||
 +
<youtube>ID6</youtube>
 +
<b>HH6
 +
</b><br>BB6
 +
|}
 +
|}<!-- B -->
  
== <span id="Sparse Coding - Feature Extraction"></span>Sparse Coding - Feature Extraction ==
 
  
  
 +
== <span id="Sparse Coding - Feature Extraction"></span>Sparse Coding - Feature Extraction ==
  
 
{|<!-- T -->
 
{|<!-- T -->

Revision as of 20:50, 21 September 2020

YouTube search... ...Google search

A feature is an individual measurable property or characteristic of a phenomenon being observed. The concept of a “feature” is related to that of an explanatory variable, which is used in statistical techniques such as linear regression. Feature vectors combine all of the features for a single row into a numerical vector. Part of the art of choosing features is to pick a minimum set of independent variables that explain the problem. If two variables are highly correlated, either they need to be combined into a single feature, or one should be dropped. Sometimes people perform principal component analysis to convert correlated variables into a set of linearly uncorrelated variables. Some of the transformations that people use to construct new features or reduce the dimensionality of feature vectors are simple. For example, subtract Year of Birth from Year of Death and you construct Age at Death, which is a prime independent variable for lifetime and mortality analysis. In other cases, feature construction may not be so obvious. Machine learning algorithms explained | Martin Heller - InfoWorld

AI Explained: Feature Importance
Fiddler Labs Learn more about feature importance, the different techniques, and the pros and cons of each. #ExplainableAI

Visualize your Data with Facets
In this episode of AI Adventures, Yufeng explains how to use Facets, a project from Google Research, to visualize your dataset, find interesting relationships, and clean your data for machine learning. Learn more through our hands-on labs → http://goo.gle/38ZUlTD Associated Medium post "Visualize your data with Facets": http://goo.gl/7FDWwk Get Facets on GitHub: http://goo.gl/Xi8dTu Play with Facets in the browser: http://goo.gl/fFLCEV Watch more AI Adventures on the playlist: http://goo.gl/UC5usG Subscribe to get all the episodes as they come out: http://goo.gl/S0AS51 #AIAdventures

Stephen Elston - Data Visualization and Exploration with Python
Visualization is an essential method in any data scientist’s toolbox and is a key data exploration method and is a powerful tool for presentation of results and understanding problems with analytics. Attendees are introduced to Python visualization packages, Matplotlib, Pandas, and Seaborn. The Jupyter notebook Visualization of complex real-world datasets presents a number of challenges to data scientists. By developing skills in data visualization, data scientists can confidently explore and understand the relationships in complex data sets. Using the Python matplotlib, pandas plotting and seaborn packages attendees will learn to: • Explore complex data sets with visualization, to develop understanding of the inherent relationships. • Create multiple views of data to highlight different aspects of the inherent relationships, with different graph types. • Use plot aesthetics to project multiple dimensions. • Apply conditioning or faceting methods to project multiple dimensions www.pydata.org

The Best Way to Visualize a Dataset Easily
Siraj Raval In this video, we'll visualize a dataset of body metrics collected by giving people a fitness tracking device. We'll go over the steps necessary to preprocess the data, then use a technique called T-SNE to reduce the dimensionality of our data so we can visualize it.

Feature Selection

YouTube search... ...Google search

HH1
BB1

HH2
BB2

HH3
BB3

HH4
BB4

HH5
BB5

HH6
BB6


Sparse Coding - Feature Extraction

HH1
BB1

HH2
BB2