Difference between revisions of "Feature Exploration/Learning"

Revision as of 20:21, 19 September 2020

Feature selection | Wikipedia
Notes on Feature Preprocessing: The What, the Why, and the How | Matthew Mayo - KDnuggets
Evaluating Machine Learning Models
Automated Machine Learning (AML) - AutoML
Recursive Feature Elimination (RFE)
Principal Component Analysis (PCA)
Representation Learning
Feature Engineering and Selection: A Practical Approach for Predictive Models | Max Kuhn and Kjell Johnson
Jon Tupitza's Famous Jupyter Notebooks:
AI Governance
- Data Science / Data Governance
  - Benchmarks
  - Data Preprocessing
    - Feature Exploration/Learning ...inspection, data profiling, selection
    - Data Quality ...validity, accuracy, cleaning, completeness, consistency, uniformity, encoding, padding, augmentation, labeling, auto-tagging, normalization, standardization, and imbalanced data
  - Bias and Variances
  - Master Data Management (MDM) / Feature Store / Data Lineage / Data Catalog
    - Managed Vocabularies
    - Datasets
  - Privacy in Data Science
  - Data Interoperability
  - Excel - Data Analysis
Visualization
Tools: Paxata, Trifacta, alteryx, databricks, Qubole

A feature is an individual measurable property or characteristic of a phenomenon being observed. The concept of a “feature” is related to that of an explanatory variable, which is used in statistical techniques such as linear regression. Feature vectors combine all of the features for a single row into a numerical vector. Part of the art of choosing features is to pick a minimum set of independent variables that explain the problem. If two variables are highly correlated, either they need to be combined into a single feature, or one should be dropped. Sometimes people perform principal component analysis to convert correlated variables into a set of linearly uncorrelated variables. Some of the transformations that people use to construct new features or reduce the dimensionality of feature vectors are simple. For example, subtract Year of Birth from Year of Death and you construct Age at Death, which is a prime independent variable for lifetime and mortality analysis. In other cases, feature construction may not be so obvious. Machine learning algorithms explained | Martin Heller - InfoWorld

@@ Line 23: / Line 23: @@
 ** [http://github.com/JonTupitza/Data-Science-Process/blob/master/04-EDA-Correlation-Analysis.ipynb Exploratory Data Analysis - Correlation]
 ** [http://github.com/JonTupitza/Data-Science-Process/blob/master/05-Feature-Selection.ipynb Feature Selection Techniques]
-* [[Datasets]]
+* [[AI Governance]]
-* [[Batch Norm(alization) & Standardization]]
+** [[Data Science]] / [[Data Governance]]
-* [[Data Preprocessing]]
+*** [[Benchmarks]]
-* [[Master Data Management  (MDM) / Feature Store / Data Lineage / Data Catalog]]
+*** [[Data Preprocessing]]
-* [[Hyperparameter]]s
+**** [[Feature Exploration/Learning]] ...inspection, data profiling, selection
-* [[Data Augmentation, Data Labeling, and Auto-Tagging]]
+**** [[Data Quality]] ...[[AI Verification and Validation|validity]], accuracy, [[Data Quality#Data Cleaning|cleaning]], completeness, consistency, uniformity, [[Data Quality#Data Encoding|encoding]], [[Data Quality#Zero Padding|padding]],   [[Data Quality#Data Augmentation, Data Labeling, and Auto-Tagging|augmentation, labeling, auto-tagging]], [[Data Quality#Batch Norm(alization) & Standardization| normalization, standardization]], and [[Data Quality#Imbalanced Data|imbalanced data]]
+*** [[Bias and Variances]]
+*** [[Master Data Management (MDM) / Feature Store / Data Lineage / Data Catalog]]
+**** [[Natural Language Processing (NLP)#Managed Vocabularies |Managed Vocabularies]]
+**** [[Datasets]]
+*** [[Privacy in Data Science]]
+*** [[Data Interoperability]]
+*** [[Excel - Data Analysis]]
 * [[Visualization]]
 * Tools: Paxata, Trifacta, alteryx, databricks, Qubole

Difference between revisions of "Feature Exploration/Learning"

Revision as of 20:21, 19 September 2020

Sparse Coding - Feature extraction

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools