Difference between revisions of "Social Network Analysis (SNA)"

From
Jump to: navigation, search
(Capstone Project: Data and Knowledge Engineering)
Line 23: Line 23:
  
 
== Capstone Project: Data and Knowledge Engineering ==
 
== Capstone Project: Data and Knowledge Engineering ==
Given an instance of set of nodes (users) in a social network graph, the aim is to find the influencing (important) users and to predict the likelihood of a future association (edge) between two nodes, knowing that there is no association between the nodes in the current state of the graph.
+
Problem Statement: Given an instance of set of nodes (users) in a social network graph, the aim is to find the influencing (important) users and to predict the likelihood of a future association (edge) between two nodes, knowing that there is no association between the nodes in the current state of the graph.
  
 
* [http://github.com/Prerna237/SocialNetworkAnalysis Project Information | P. Mathur, Prerna, and S. Vallakatla - GitHub]
 
* [http://github.com/Prerna237/SocialNetworkAnalysis Project Information | P. Mathur, Prerna, and S. Vallakatla - GitHub]
  
 
[[Python]] Libraries:
 
[[Python]] Libraries:
[[Python#Plotly| Plotly]] graphing library for making interactive, publication-quality graphs online.
+
* [[Python#Plotly| Plotly]] graphing library for making interactive, publication-quality graphs online.
[[Python#IGraph| IGraph]]  a collection of network analysis tools with the emphasis on efficiency, portability and ease of use. igraph is open source and free.
+
* [http://igraph.org/ IGraph]  a collection of network analysis tools with the emphasis on efficiency, portability and ease of use. igraph is open source and free.
[[Python#NumPy| NumPy]] adds support for large, multi-dimensional arrays and matrices along with a large collection of high level mathematical functions to operate on these arrays. (dependency- Scikit)
+
* [[Python#NumPy| NumPy]] adds support for large, multi-dimensional arrays and matrices along with a large collection of high level mathematical functions to operate on these arrays. (dependency- Scikit)
[[Python#SciPy| SciPy]] an open source Python library used for scientific computing and technical computing.(dependency- Scikit)
+
* [[Python#SciPy| SciPy]] an open source Python library used for scientific computing and technical computing.(dependency- Scikit)
[[Python#scikit-learn| scikit-learn]] simple and efficient tool for data mining and data analysis. Used for dimensionality reduction and implementing machine learning algorithms.
+
* [[Python#scikit-learn| scikit-learn]] simple and efficient tool for data mining and data analysis. Used for dimensionality reduction and implementing machine learning algorithms.
  
 
<youtube>XRMhgxW-C_M</youtube>
 
<youtube>XRMhgxW-C_M</youtube>

Revision as of 17:36, 25 July 2019

YouTube search... ...Google search

Capstone Project: Data and Knowledge Engineering

Problem Statement: Given an instance of set of nodes (users) in a social network graph, the aim is to find the influencing (important) users and to predict the likelihood of a future association (edge) between two nodes, knowing that there is no association between the nodes in the current state of the graph.

Python Libraries:

  • Plotly graphing library for making interactive, publication-quality graphs online.
  • IGraph a collection of network analysis tools with the emphasis on efficiency, portability and ease of use. igraph is open source and free.
  • NumPy adds support for large, multi-dimensional arrays and matrices along with a large collection of high level mathematical functions to operate on these arrays. (dependency- Scikit)
  • SciPy an open source Python library used for scientific computing and technical computing.(dependency- Scikit)
  • scikit-learn simple and efficient tool for data mining and data analysis. Used for dimensionality reduction and implementing machine learning algorithms.