AI Chat Paper
Note: Please note that the following content is generated by AMiner AI. SciOpen does not take any responsibility related to this content.
{{lang === 'zh_CN' ? '文章概述' : 'Summary'}}
{{lang === 'en_US' ? '中' : 'Eng'}}
Chat more with AI
PDF (2.3 MB)
Collect
Submit Manuscript AI Chat Paper
Show Outline
Outline
Show full outline
Hide outline
Outline
Show full outline
Hide outline
Open Access

Resource Time Series Analysis and Forecasting in Large-Scale Virtual Clusters

Department of Computer Science and Technology, University of Science and Technology Beijing, Beijing 100083, China
41st Institute of CETC, Qingdao 266555, China
BGP Inc., China National Petroleum Corporation, Zhuozhou 072751, China
National Engineering Research Center of Oil & Gas Exploration Computer Software, Zhuozhou 072751, China
Show Author Information

Abstract

In today’s rapidly evolving internet landscape, prominent companies across various industries face increasingly complex business operations, leading to significant cluster-scale growth. However, this growth brings about challenges in cluster management and the inefficient utilization of vast amounts of data due to its low value density. This paper, based on the large-scale cluster virtualization and monitoring system of the data center of the Bureau of Geophysical Prospecting (BGP), utilizes time series data of host resources from the monitoring system’s time series database to propose a multivariate multi-step time series forecasting model, MUL-CNN-BiGRU-Attention, for forecasting CPU load on virtual cluster hosts. The model undergoes extensive offline training using a large volume of time series data, followed by deployment using TensorFlow Serving. Recent small-batch data are employed for fine-tuning model parameters to better adapt to current data patterns. Comparative experiments are conducted between the proposed model and other baseline models, demonstrating notable improvements in Mean Absolute Error (MAE), Mean Squared Error (MSE), Root Mean Squared Error (RMSE), and R2 metrics by up to 35.2%, 56.1%, 32.5%, and 10.3%, respectively. Additionally, ablation experiments are designed to investigate the impact of different factors on the performance of the forecasting model, providing valuable insights for parameter optimization based on experimental results.

References

【1】
【1】
 
 
Big Data Mining and Analytics
Pages 592-605

{{item.num}}

Comments on this article

Go to comment

< Back to all reports

Review Status: {{reviewData.commendedNum}} Commended , {{reviewData.revisionRequiredNum}} Revision Required , {{reviewData.notCommendedNum}} Not Commended Under Peer Review

Review Comment

Close
Close
Cite this article:
Lin Y, Wen J, Zhang X, et al. Resource Time Series Analysis and Forecasting in Large-Scale Virtual Clusters. Big Data Mining and Analytics, 2025, 8(3): 592-605. https://doi.org/10.26599/BDMA.2024.9020085

2287

Views

180

Downloads

1

Crossref

1

Web of Science

1

Scopus

0

CSCD

Received: 13 September 2024
Revised: 22 October 2024
Accepted: 24 October 2024
Published: 04 April 2025
© The author(s) 2025.

The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).