Difference between revisions of "Social Network Analysis (SNA)"
(→Capstone Project: Data and Knowledge Engineering) |
|||
| Line 23: | Line 23: | ||
== Capstone Project: Data and Knowledge Engineering == | == Capstone Project: Data and Knowledge Engineering == | ||
| − | Given an instance of set of nodes (users) in a social network graph, the aim is to find the influencing (important) users and to predict the likelihood of a future association (edge) between two nodes, knowing that there is no association between the nodes in the current state of the graph. | + | Problem Statement: Given an instance of set of nodes (users) in a social network graph, the aim is to find the influencing (important) users and to predict the likelihood of a future association (edge) between two nodes, knowing that there is no association between the nodes in the current state of the graph. |
* [http://github.com/Prerna237/SocialNetworkAnalysis Project Information | P. Mathur, Prerna, and S. Vallakatla - GitHub] | * [http://github.com/Prerna237/SocialNetworkAnalysis Project Information | P. Mathur, Prerna, and S. Vallakatla - GitHub] | ||
[[Python]] Libraries: | [[Python]] Libraries: | ||
| − | [[Python#Plotly| Plotly]] graphing library for making interactive, publication-quality graphs online. | + | * [[Python#Plotly| Plotly]] graphing library for making interactive, publication-quality graphs online. |
| − | [ | + | * [http://igraph.org/ IGraph] a collection of network analysis tools with the emphasis on efficiency, portability and ease of use. igraph is open source and free. |
| − | [[Python#NumPy| NumPy]] adds support for large, multi-dimensional arrays and matrices along with a large collection of high level mathematical functions to operate on these arrays. (dependency- Scikit) | + | * [[Python#NumPy| NumPy]] adds support for large, multi-dimensional arrays and matrices along with a large collection of high level mathematical functions to operate on these arrays. (dependency- Scikit) |
| − | [[Python#SciPy| SciPy]] an open source Python library used for scientific computing and technical computing.(dependency- Scikit) | + | * [[Python#SciPy| SciPy]] an open source Python library used for scientific computing and technical computing.(dependency- Scikit) |
| − | [[Python#scikit-learn| scikit-learn]] simple and efficient tool for data mining and data analysis. Used for dimensionality reduction and implementing machine learning algorithms. | + | * [[Python#scikit-learn| scikit-learn]] simple and efficient tool for data mining and data analysis. Used for dimensionality reduction and implementing machine learning algorithms. |
<youtube>XRMhgxW-C_M</youtube> | <youtube>XRMhgxW-C_M</youtube> | ||
Revision as of 17:36, 25 July 2019
YouTube search... ...Google search
- Math for Intelligence
- Finding Paul Revere
- NetworkX for creation, manipulation, and study of the structure, dynamics, and functions of complex networks
- Matplotlib for generating plots, histograms, power spectra, bar charts, errorcharts, scatterplots, etc
- Knowledge Graphs are very useful ways of presenting information about social networks.
Capstone Project: Data and Knowledge Engineering
Problem Statement: Given an instance of set of nodes (users) in a social network graph, the aim is to find the influencing (important) users and to predict the likelihood of a future association (edge) between two nodes, knowing that there is no association between the nodes in the current state of the graph.
Python Libraries:
- Plotly graphing library for making interactive, publication-quality graphs online.
- IGraph a collection of network analysis tools with the emphasis on efficiency, portability and ease of use. igraph is open source and free.
- NumPy adds support for large, multi-dimensional arrays and matrices along with a large collection of high level mathematical functions to operate on these arrays. (dependency- Scikit)
- SciPy an open source Python library used for scientific computing and technical computing.(dependency- Scikit)
- scikit-learn simple and efficient tool for data mining and data analysis. Used for dimensionality reduction and implementing machine learning algorithms.