Journal Home > Volume 3 , Issue 3
Purpose

The purpose of this paper is to detect abnormal data of complex and sophisticated industrial equipment with sensors quickly and accurately. Due to the rapid development of the Internet of Things, more and more equipment is equipped with sensors, especially more complex and sophisticated industrial equipment is installed with a large number of sensors. A large amount of monitoring data is quickly collected to monitor the operation of the equipment. How to detect abnormal data quickly and accurately has become a challenge.

Design/methodology/approach

In this paper, the authors propose an approach called Multiple Group Correlation-based Anomaly Detection (MGCAD), which can detect equipment anomaly quickly and accurately. The single-point anomaly degree of equipment and the correlation of each kind of data sequence are modeled by using multi-group correlation probability model (a probability distribution model which is helpful to the anomaly detection of equipment), and the anomaly detection of equipment is realized.

Findings

The simulation data set experiments based on real data show that MGCAD has better performance than existing methods in processing multiple monitoring data sequences.

Originality/value

The MGCAD method can detect abnormal data quickly and accurately, promote the intelligent level of smart articles and ultimately help to project the real world into cyber space in CrowdIntell Network.


menu
Abstract
Full text
Outline
About this article

An anomaly detection method to improve the intelligent level of smart articles based on multiple group correlation probability models

Show Author's information Xudong Lu1Shipeng Wang2Fengjian Kang1Shijun Liu3Hui Li4Xiangzhen Xu4Lizhen Cui4( )
School of Software, Shandong University, Jinan, China
School of Software, Research Center of Software and Data Engineering, Shandong University, Jinan, China
School of Computer Science and Technology, Shandong University, Jinan, China
Shandong University, Jinan, China

Abstract

Purpose

The purpose of this paper is to detect abnormal data of complex and sophisticated industrial equipment with sensors quickly and accurately. Due to the rapid development of the Internet of Things, more and more equipment is equipped with sensors, especially more complex and sophisticated industrial equipment is installed with a large number of sensors. A large amount of monitoring data is quickly collected to monitor the operation of the equipment. How to detect abnormal data quickly and accurately has become a challenge.

Design/methodology/approach

In this paper, the authors propose an approach called Multiple Group Correlation-based Anomaly Detection (MGCAD), which can detect equipment anomaly quickly and accurately. The single-point anomaly degree of equipment and the correlation of each kind of data sequence are modeled by using multi-group correlation probability model (a probability distribution model which is helpful to the anomaly detection of equipment), and the anomaly detection of equipment is realized.

Findings

The simulation data set experiments based on real data show that MGCAD has better performance than existing methods in processing multiple monitoring data sequences.

Originality/value

The MGCAD method can detect abnormal data quickly and accurately, promote the intelligent level of smart articles and ultimately help to project the real world into cyber space in CrowdIntell Network.

Keywords: Clustering, Internet of things, Anomaly detection, Intelligent level

References(18)

Aggarwal, C.C. and Yu, P.S. (2008), “Outlier detection with uncertain data”, Proceedings of the 2008 SIAM International Conference on Data Mining, Society for Industrial and Applied Mathematics, pp. 483-493.https://doi.org/10.1137/1.9781611972788.44
DOI

Atzori, L., Iera, A. and Morabito, G. (2010), “The internet of things: a survey”, Computer Networks, Vol. 54 No. 15, pp. 2787-2805.

Cacciari, M., Salam, G.P. and Soyez, G. (2008), “The anti-k_t jet clustering algorithm”, Journal of High Energy Physics, Vol. 4 No. 4, pp. 403-410.

Chai, Y., Miao, C., Sun, B., Zheng, Y. and Li, Q. (2017), “Crowd science and engineering: concept and research framework”, International Journal of Crowd Science, Vol. 1 No. 1, pp. 2-8.

Chakrabarti, K., Keogh, E., Mehrotra, S. and Pazzani, M. (2002), “Locally adaptive dimensionality reduction for indexing large time series databases”, ACM Transactions on Database Systems (TODS), Vol. 27 No. 2, pp. 188-228.

Ding, J., Liu, Y., Zhang, L. and Wang, J. (2014), “LCAD: a correlation based abnormal pattern detection approach for large amount of monitor data” in Asia-Pacific Web Conference, Springer, pp. 550-558.https://doi.org/10.1007/978-3-319-11116-2_51
DOI

Ding, J., Liu, Y., Zhang, L., Wang, J. and Liu, Y. (2016), “An anomaly detection approach for multiple monitoring data series based on latent correlation probabilistic model”, Applied Intelligence, Vol. 44 No. 2, pp. 340-361.

Faloutsos, C. (2000), “Fast time sequence indexing for arbitrary Lp norms”, Proc International Conference on Vldb.

Fisz, M. and Bartoszyński, R. (2018), Probability Theory and Mathematical Statistics, J. wiley.

Guha, S., Rastogi, R. and Shim, K. (1998), “CURE:an efficient clustering algorithm for large databases”, Information Systems, Vol. 26 No. 1, pp. 35-58.

Izakian, H. and Pedrycz, W. (2013), “Anomaly detection in time series data using a fuzzy c-means clustering”, Ifsa World Congress and Nafips Meeting, IEEE.https://doi.org/10.1109/IFSA-NAFIPS.2013.6608627
DOI

Kanungo, T., Mount, D.M., Netanyahu, N.S., Piatko, C.D., Silverman, R. and Wu, A.Y. (2002), “An efficient k-means clustering algorithm: analysis and implementation”, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 24 No. 7, pp. 881-892.

Ma, J. and Perkins, S. (2003), “Online novelty detection on temporal sequences”, Acm sigkdd International Conference on Knowledge Discovery and Data Mining.https://doi.org/10.1145/956750.956828
DOI

Peterson, L. (2009), “K-nearest neighbor”, Scholarpedia, Vol. 4 No. 2, p. 1883.

Pokrajac, D., Lazarevic, A. and Latecki, L.J. (2007), “Incremental local outlier detection for data streams”, IEEE Symposium on Computational Intelligence and Data Mining.https://doi.org/10.1109/CIDM.2007.368917
DOI

Schlechtingen, M. and Santos, I.F. (2011), “Comparative analysis of neural network and regression based condition monitoring approaches for wind turbine fault detection”, Mechanical Systems and Signal Processing, Vol. 25 No. 5, pp. 1849-1875.

Wang, S., Cui, L., Liu, L., Lu, X. and Li, Q. (2019), “Projecting real world into CrowdIntell network: a methodology”, International Journal of Crowd Science.

Zhang, C., Weng, N., Chang, J. and Zhou, A. (2009), “Detecting abnormal trend evolution over multiple data streams”, in Advances in Data and Web Management, Springer, Berlin, pp. 285-296.https://doi.org/10.1007/978-3-642-00672-2_26
DOI
Publication history
Copyright
Acknowledgements
Rights and permissions

Publication history

Received: 17 September 2019
Revised: 13 October 2019
Accepted: 16 October 2019
Published: 09 December 2019
Issue date: December 2019

Copyright

© The author(s)

Acknowledgements

Acknowledgements

This work is partially supported by National Key R&D Program No.2017YFB1400100.

Rights and permissions

Xudong Lu, Shipeng Wang, Fengjian Kang, Shijun Liu, Hui Li, Xiangzhen Xu and Lizhen Cui. Published in International Journal of Crowd Science. Published by Emerald Publishing Limited. This article is published under the Creative Commons Attribution (CC BY 4.0) licence. Anyone may reproduce, distribute, translate and create derivative works of this article (for both commercial and non-commercial purposes), subject to full attribution to the original publication and authors. The full terms of this licence may be seen at http://creativecommons.org/licences/by/4.0/legalcode

Return