Volume 1, Issue 2

A Generic Data Analytics System for Manufacturing Production

Hao Zhang, Hongzhi Wang, Jianzhong Li, Hong Gao
Department of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China.

Abstract

The growing volume of manufacturing information means that big data can be collected and, with appropriate in-depth analysis, could be of great value to manufacturers. However, most small manufacturers cannot afford the overhead of a professional data analytics team. To address this problem, this paper proposes a generic data analytics system, the Generic Manufacturing Data Analytics (GMDA) system. The system can perform most manufacturing data analytics tasks, and users can easily carry out data analysis even without prior knowledge or experience of data analytics. To build the system, we designed an abstract language, GMDL, to describe manufacturing data analytics tasks. Several algorithms aimed at factory data analytics were selected, tuned, optimized, and integrated into the system. GMDA also introduces several noteworthy techniques, such as a strategy for selecting a suitable algorithm and an algorithm for determining optimal parameters. Case studies demonstrate the practicability and reliability of the system.

Keywords: optimization, data mining, data analytics, manufactory


Publication history

Received: 12 January 2018
Accepted: 17 January 2018
Published: 12 April 2018
Issue date: June 2018

Copyright

© The author(s) 2018

Acknowledgements

This paper was partially supported by the National Natural Science Foundation of China (Nos. U1509216, 61472099, and 61602129), the National Key Research and Development Program of China (No. 2016YFB1000703), the National Sci-Tech Support Plan (No. 2015BAH10F01), and the Scientific Research Foundation for the Returned Overseas Chinese Scholars of Heilongjiang Province (No. LC2016026).
