Journal Home > Volume 23 , Issue 1

Random walks are a standard tool for modeling the spreading process in social and biological systems. But in the face of large-scale networks, to achieve convergence, iterative calculation of the transition matrix in random walk methods consumes a lot of time. In this paper, we propose a three-stage hierarchical community detection algorithm based on Partial Matrix Approximation Convergence (PMAC) using random walks. First, this algorithm identifies the initial core nodes in a network by classical measurement and then utilizes the error function of the partial transition matrix convergence of the core nodes to determine the number of random walks steps. As such, the PMAC of the core nodes replaces the final convergence of all the nodes in the whole matrix. Finally, based on the approximation convergence transition matrix, we cluster the communities around core nodes and use a closeness index to merge two communities. By recursively repeating the process, a dendrogram of the communities is eventually constructed. We validated the performance of the PMAC by comparing its results with those of two representative methods for three real-world networks with different scales.


menu
Abstract
Full text
Outline
About this article

Hierarchical Community Detection Based on Partial Matrix Convergence Using Random Walks

Show Author's information Wei ZhangFeng KongLiming YangYunfang Chen( )Mengyuan Zhang
School of Computer Science, Nanjing University of Posts and Telecommunications and Jiangsu High Technology Research Key Laboratory for Wireless Sensor Networks, Nanjing 210023, China.
School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing 210023, China.
Concordia Institution System Engineering (CIISE), Concordia University, Montreal QC H3G 1M8, Canada.

Abstract

Random walks are a standard tool for modeling the spreading process in social and biological systems. But in the face of large-scale networks, to achieve convergence, iterative calculation of the transition matrix in random walk methods consumes a lot of time. In this paper, we propose a three-stage hierarchical community detection algorithm based on Partial Matrix Approximation Convergence (PMAC) using random walks. First, this algorithm identifies the initial core nodes in a network by classical measurement and then utilizes the error function of the partial transition matrix convergence of the core nodes to determine the number of random walks steps. As such, the PMAC of the core nodes replaces the final convergence of all the nodes in the whole matrix. Finally, based on the approximation convergence transition matrix, we cluster the communities around core nodes and use a closeness index to merge two communities. By recursively repeating the process, a dendrogram of the communities is eventually constructed. We validated the performance of the PMAC by comparing its results with those of two representative methods for three real-world networks with different scales.

Keywords: clustering, community detection, random walks, transition matrix

References(25)

[1]
Y. Y. Ahn, J. P. Bagrow, and S. Lehmann, Link communities reveal multiscale complexity in networks, Nature, vol. 466, no. 7307, pp. 761-764, 2010.
[2]
G. W. Flake, S. Lawrence, C. L. Giles, and F. M. Coetzee, Self-organization and identification of Web communities, Computer, vol. 35, no. 3, pp. 66-70, 2002.
[3]
V. Spirin and L. A. Mirny, Protein complexes and functional modules in molecular networks, Proceedings of the National Academy of Sciences of the United States of America, vol. 100, no. 21, pp. 12123-12128, 2003.
[4]
M. E. J. Newman, Detecting community structure in networks, The European Physical Journal B-Condensed Matter and Complex Systems, vol. 38, no. 2, pp. 321-330, 2004.
[5]
S. Fortunato, Community detection in graphs, Physics Reports, vol. 486, no. 3, pp. 75-174, 2010.
[6]
M. E. J. Newman, Modularity and community structure in networks, Proceedings of the National Academy of Sciences of the United States of America, vol. 103, no. 23, pp. 8577-8582, 2006.
[7]
J. Leskovec, K. J. Lang, and M. Mahoney, Empirical com-parison of algorithms for network community detection, in Proc. 19th Int. Conf. on World Wide Web, Raleigh, USA, 2010, pp. 631-640.
DOI
[8]
N. Aston and W. Hu, Community detection in dynamic social networks, Communications and Network, vol. 6, no. 2, p. 124, 2014.
[9]
S. M. Van Dongen, Graph clustering by flow simulation, Ph.D. dissertation, University of Utrecht, Utrecht, the Netherlands, 2001.
[10]
H. Zhou and R. Lipowsky, Network brownian motion: A new method to measure vertex-vertex proximity and to identify communities and subcommunities, in Computational Science-ICCS, 2004, pp. 1062-1069.
DOI
[11]
P. Pons and M. Latapy, Computing communities in large networks using random walks, in Proc. 20th Int. Conf. on Computer and Information Sciences, Istanbul, Turkey, 2005, pp. 284-293.
DOI
[12]
M. Rosvall and C. T. Bergstrom, Maps of random walks on complex networks reveal community structure, Proceedings of the National Academy of Sciences of the United States of America, vol. 105, no. 4, pp. 1118-1123, 2008.
[13]
M. Alamgir and U. Von Luxburg, Multi-agent random walks for local clustering on graphs, in Proc. 10th Int. Conf. on IEEE Data Mining, Sydney, Australia, 2010, pp. 18-27.
DOI
[14]
J. C. Delvenne, S. N. Yaliraki, and M. Barahona, Stability of graph communities across time scales, Proceedings of the National Academy of Sciences of the United States of America, vol. 107, no. 29, pp. 12755-12760, 2010.
[15]
L. Backstrom and J. Leskovec, Supervised random walks: Predicting and recommending links in social networks, in Proc. 4th Int. Conf. on Web Search and Web Data Mining, Hong Kong, China, 2011, pp. 635-644.
DOI
[16]
B. Yang, J. Di, J. Liu, and D. Liu, Hierarchical community detection with applications to real-world network analysis, Data and Knowledge Engineering, vol. 83, pp. 20-38, 2013.
[17]
X. Fu, C. Wang, Z. Wang, and Z. Ming, Threshold random walkers for community structure detection in complex networks, Journal of Software, vol. 8, no. 2, pp. 286-295, 2013.
[18]
H. Park and K. Lee, Dependence clustering, a method revealing community structure with group dependence, Knowledge-Based Systems, vol. 60, pp. 58-72, 2014.
[19]
S. Wasserman and K. Faust, Social Network Analysis: Methods and Applications. Cambridge University Press, 1994.
DOI
[20]
R. L. Dobrushin, Central limit theorem for non-stationary Markov chains, I and II, Theory of Probability and Its Applications, vol. 1, no. 1, pp. 65-80, 1956.
[21]
K. Steinhaeuser and N. V. Chawla, Identifying and evaluating community structure in complex networks, Pattern Recognition Letters, vol. 31, no. 5, pp. 413-421, 2010.
[22]
M. E. J. Newman and M. Girvan, Finding and evaluating community structure in networks, Physical Review E, vol. 69, no. 2, p. 026113, 2004.
[23]
Institute of Web Science and Technologies at the University of KoblemeClandau, KONECT, http://konect.uni-koblenz.de/networks/.
[24]
M. Girvan and M. E. J. Newman, Community structure in social and biological networks, Proceedings of the National Academy of Sciences of Sciences of the United States of America, vol. 99, no. 12, pp. 7821-7826, 2002.
[25]
M. E. J. Newman, Fast algorithm for detecting community structure in networks, Physical Review E, vol. 69, no. 6, p. 066133, 2004.
Publication history
Copyright
Acknowledgements
Rights and permissions

Publication history

Received: 25 August 2016
Revised: 18 October 2016
Accepted: 20 October 2016
Published: 15 February 2018
Issue date: February 2018

Copyright

© The authors 2018

Acknowledgements

The authors would like to express their thanks to the anonymous reviewers for their constructive comments and suggestions. This work was supported by the National Natural Science Foundation of China (Nos. 61272422, 61572260, 61373017, and 61572261).

Rights and permissions

Return