Journal Home > Volume 26 , Issue 2

In recent years, multi-view clustering research has attracted considerable attention because of the rapidly growing demand for unsupervised analysis of multi-view data in practical applications. Despite the significant advances in multi-view clustering, two challenges still need to be addressed, i.e., how to make full use of the consistent and complementary information in multiple views and how to discriminate the contributions of different views and features in the same view to efficiently reveal the latent cluster structure of multi-view data for clustering. In this study, we propose a novel Two-level Weighted Collaborative Multi-view Fuzzy Clustering (TW-Co-MFC) approach to address the aforementioned issues. In TW-Co-MFC, a two-level weighting strategy is devised to measure the importance of views and features, and a collaborative working mechanism is introduced to balance the within-view clustering quality and the cross-view clustering consistency. Then an iterative optimization objective function based on the maximum entropy principle is designed for multi-view clustering. Experiments on real-world datasets show the effectiveness of the proposed approach.


menu
Abstract
Full text
Outline
About this article

TW-Co-MFC: Two-Level Weighted Collaborative Fuzzy Clustering Based on Maximum Entropy for Multi-View Data

Show Author's information Jie Hu( )Yi PanTianrui LiYan Yang
School of Information Science and Technology, Southwest Jiaotong University, Chengdu 611756, China.
Department of Computer Science, Georgia State University, Atlanta, GA 30302-3994, USA.

Abstract

In recent years, multi-view clustering research has attracted considerable attention because of the rapidly growing demand for unsupervised analysis of multi-view data in practical applications. Despite the significant advances in multi-view clustering, two challenges still need to be addressed, i.e., how to make full use of the consistent and complementary information in multiple views and how to discriminate the contributions of different views and features in the same view to efficiently reveal the latent cluster structure of multi-view data for clustering. In this study, we propose a novel Two-level Weighted Collaborative Multi-view Fuzzy Clustering (TW-Co-MFC) approach to address the aforementioned issues. In TW-Co-MFC, a two-level weighting strategy is devised to measure the importance of views and features, and a collaborative working mechanism is introduced to balance the within-view clustering quality and the cross-view clustering consistency. Then an iterative optimization objective function based on the maximum entropy principle is designed for multi-view clustering. Experiments on real-world datasets show the effectiveness of the proposed approach.

Keywords: multi-view clustering, fuzzy clustering, collaborative, weighting, maximum entropy

References(37)

[1]
J. Zhao, X. J. Xie, X. Xu, and S. L. Sun, Multi-view learning overview: Recent progress and new challenges, Inf. Fusion, vol. 38, pp. 43-54, 2017.
[2]
Y. Yang and H. Wang, Multi-view clustering: A survey, Big Data Mining and Analytics, vol. 1, no. 2, pp. 83-107, 2018.
[3]
S. Bettoumi, C. Jlassi, and N. Arous, Collaborative multi-view K-means clustering, Soft Comput., vol. 23, no. 3, pp. 937-945, 2019.
[4]
J. Yu, Z. C. Qin, T. Wan, and X. Zhang, Feature integration analysis of bag-of-features model for image retrieval, Neurocomputing, vol. 120, pp. 355-364, 2013.
[5]
Y. Z. Jiang, F. L. Chung, S. T. Wang, Z. H. Deng, J. Wang, and P. J. Qian, Collaborative fuzzy clustering from multiple weighted views, IEEE Trans. Cybern., vol. 45, no. 4, pp. 688-701, 2015.
[6]
B. Abu-Jamous, R. Fa, D. J. Roberts, and A. K. Nandi, Paradigm of tunable clustering using binarization of consensus partition matrices (Bi-CoPaM) for gene discovery, PLoS One, vol. 8, no. 2, p. e56432, 2013.
[7]
X. Cai, F. P. Nie, and H. Huang, Multi-view K-means clustering on big data, in Proc. 23rd Int. Joint Conf. Artificial Intelligence, Beijing, China, 2013, pp. 2598-2604.
[8]
X. J. Chen, X. F. Xu, J. Z. Huang, and Y. M. Ye, TW-k-means: Automated two-level variable weighting clustering algorithm for multiview data, IEEE Trans. Knowl. Data Eng., vol. 25, no. 4, pp. 932-944, 2013.
[9]
B. Jiang, F. Y. Qiu, and L. P. Wang, Multi-view clustering via simultaneous weighting on views and features, Appl. Soft Comput., vol. 47, pp. 304-315, 2016.
[10]
Y. M. Xu, C. D. Wang, and J. H. Lai, Weighted multi-view clustering with feature selection, Pattern Recognit., vol. 53, pp. 25-35, 2016.
[11]
S. D. Huang, Z. Kang, I. W. Tsang, and Z. L. Xu, Auto-weighted multi-view clustering via kernelized graph learning, Pattern Recognit., vol. 88, pp. 174-184, 2019.
[12]
H. Wang, Y. Yang, B. Liu, and H. Fujita, A study of graph-based system for multi-view clustering, Knowl. Based Syst., vol. 163, pp. 1009-1019, 2019.
[13]
W. Pedrycz, Collaborative fuzzy clustering, Pattern Recognit. Lett., vol. 23, no. 14, pp. 1675-1686, 2002.
[14]
C. D. Wang, J. H. Lai, and P. S. Yu, Multi-view clustering based on belief propagation, IEEE Trans. Knowl. Data Eng., vol. 28, no. 4, pp. 1007-1021, 2016.
[15]
G. Y. Zhang, C. D. Wang, D. Huang, W. S. Zheng, and Y. R. Zhou, TW-Co-k-means: Two-level weighted collaborative k-means for multi-view clustering, Knowl. Based Syst., vol. 150, pp. 127-138, 2018.
[16]
A. Cornuéjols, C. Wemmert, P. Gancarski, and Y. Bennani, Collaborative clustering: Why, when, what and how, Inf. Fusion, vol. 39, pp. 81-95, 2018.
[17]
S. Zeng, X. Y. Wang, H. Cui, C. J. Zheng, and D. Feng, A unified collaborative multikernel fuzzy clustering for multiview data, IEEE Trans. Fuzzy Syst., vol. 26, no. 3, pp. 1671-1687, 2018.
[18]
R. K. Xia, Y. Pan, L. Du, and J. Yin, Robust multi-view spectral clustering via low-rank and sparse decomposition. in Proc. 28th AAAI Conf. Artificial Intelligence, Québec City, Canada, 2014, pp. 2149-2155.
[19]
F. P. Nie, J. Li, and X. L. Li, Self-weighted multiview clustering with multiple graphs, in Proc. 26th Int. Joint Conf. Artificial Intelligence, Melbourne, Australia, 2017, pp. 2564-2570.
DOI
[20]
J. L. Xu, J. W. Han, F. P. Nie, and X. L. Li, Re-weighted discriminatively embedded K-means for multi-view clustering, IEEE Trans. Image Process., vol. 26, no. 6, pp. 3016-3027, 2017.
[21]
J. Hu, T. R. Li, C. Luo, H. Fujita, and Y. Yang, Incremental fuzzy cluster ensemble learning based on rough set theory, Knowl. Based Syst., vol. 132, pp. 144-155, 2017.
[22]
P. J. Qian, S. W. Sun, Y. Z. Jiang, K. H. Su, T. G. Ni, S. T. Wang, and R. F. Muzic Jr, Cross-domain, soft-partition clustering with diversity measure and knowledge reference, Pattern Recognit., vol. 50, pp. 155-177, 2016.
[23]
R. P. Li and M. Mukaidono, A maximum-entropy approach to fuzzy clustering, in Proc. 1995 IEEE Int. Conf. Fuzzy Systems, Yokohama, Japan, 1995, pp. 2227-2232.
[24]
X. B. Zhi, J. L. Fan, and F. Zhao, Fuzzy linear discriminant analysis-guided maximum entropy fuzzy clustering algorithm, Pattern Recognit., vol. 46, no. 6, pp. 1604-1615, 2013.
[25]
P. J. Qian, Y. Z. Jiang, Z. H. Deng, L. Z. Hu, S. W. Sun, S. Wang, and R. F. Muzic, Cluster prototypes and fuzzy memberships jointly leveraged cross-domain maximum entropy clustering, IEEE Trans. Cybern., vol. 46, no. 1, pp. 181-193, 2016.
[26]
P. J. Qian, J. X. Zhou, Y. Z. Jiang, F. Liang, K. F. Zhao, S. T. Wang, K. H. Su, and R. F. Muzic, Multi-view maximum entropy clustering by jointly leveraging inter-view collaborations and intra-view-weighted attributes, IEEE Access, vol. 6, pp. 28 594-28 610, 2018.
[27]
J. C. Bezdek, R. Ehrlich, and W. Full, FCM: The fuzzy c-means clustering algorithm, Comput. Geosci., vol. 10, nos. 2&3, pp. 191-203, 1984.
[28]
G. Cleuziou, M. Exbrayat, L. Martin, and J. H. Sublemontier, CoFKM: A centralized method for multiple-view clustering, in Proc. 9th IEEE Int. Conf. Data Mining, Miami, FL, USA, 2009, pp. 752-757.
DOI
[29]
S. Miyamoto, H. Ichihashi, and K. Honda, Algorithms for Fuzzy Clustering: Methods in C-Means Clustering with Applications. Berlin, Germany: Springer, 2008.
[30]
J. Wang, S. T. Wang, F. Chung, and Z. H. Deng, Fuzzy partition based soft subspace clustering and its applications in high dimensional data, Inf. Sci., vol. 246, pp. 133-154, 2013.
[31]
C. Blake and C. J. Merz, UCI repository of machine learning databases, http://archive.ics.uci.edu/ml/index.php, 1998.
[32]
D. Greene and P. Cunningham, Practical solutions to the problem of diagonal dominance in kernel document clustering, in Proc. 23rd Int. Conf. Machine Learning, New York, NY, USA, 2006, pp. 377-384.
DOI
[33]
S. F. Hussain, G. Bisson, and C. Grimal, An improved co-similarity measure for document clustering, in Proc. 9th Int. Conf. Machine Learning and Applications, Washington, DC, USA, 2010, pp. 190-197.
DOI
[34]
J. Huang, F. P. Nie, H. Huang, and C. Ding, Robust manifold nonnegative matrix factorization, ACM Trans. Knowl. Dis. Data, vol. 8, no. 3, pp. 1-21, 2014.
[35]
A. Strehl and J. Ghosh, Cluster ensembles—A knowledge reuse framework for combining multiple partitions, J. Mach. Learn. Res., vol. 3, pp. 583-617, 2003.
[36]
L. Hubert and P. Arabie, Comparing partitions, J. Classif., vol. 2, no. 1, pp. 193-218, 1985.
[37]
G. Tzortzis and A. Likas, Kernel-based weighted multi-view clustering, in Proc. IEEE 12th Int. Conf. Data Mining, Brussels, Belgium, 2012, pp. 675-684.
DOI
Publication history
Copyright
Acknowledgements
Rights and permissions

Publication history

Received: 08 December 2019
Accepted: 15 December 2019
Published: 24 July 2020
Issue date: April 2021

Copyright

© The author(s) 2021.

Acknowledgements

This work was supported by the National Natural Science Foundation of China (Nos. 61603313, 61772435, 61976182, and 61876157).

Rights and permissions

The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).

Return