AI Chat Paper
Note: Please note that the following content is generated by AMiner AI. SciOpen does not take any responsibility related to this content.
{{lang === 'zh_CN' ? '文章概述' : 'Summary'}}
{{lang === 'en_US' ? '中' : 'Eng'}}
Chat more with AI
PDF (2.9 MB)
Collect
Submit Manuscript AI Chat Paper
Show Outline
Outline
Show full outline
Hide outline
Outline
Show full outline
Hide outline
Open Access | Just Accepted

CAE-IF: An Anomaly Detection Approach Based on Temporal Representation of the Reconstruction Error

Haijun Geng1,2Zhi Zhang4Yuhua Qian3( )Bo Yang5Qi Ma4Haotian Chi4Jing Yang4Xia Yin6

1 School of Automation and Software Engineering, Shanxi University, Taiyuan and 030006, China

2 Shanxi Qingzhong Technology Co.Ltd Taiyuan and 030000, China

3 Institute of Big Data Science and Industry, Shanxi University, Taiyuan 030006, China

4 School of Automation and Software Engineering, Shanxi University, Taiyuan and 030006, China

5 Department of Urban & Regional Planning San Jos´e State University, San Jos´e, CA, 95192, USA

6 Department of Computer Science and Technology, Tsinghua University, Beijing and 100084, China

Show Author Information

Abstract

In the field of network traffic anomaly detection, unsupervised learning plays a critical role yet encounters significant challenges, including accurately determining anomaly thresholds and modeling the intricate temporal dynamics of network traffic. To address these challenges, we present a novel approach, termed Convolutional Autoencoder-Isolation Forest (CAE-IF). By leveraging packet-level reconstruction errors with contextual information, our approach obviates the need for manual threshold setting and effectively captures temporal dynamics. The process commences with the application of the damped incremental statistics algorithm to extract statistical features from network traffic with temporal information. Subsequently, the Convolutional Autoencoder (CAE) is employed to compute the Root Mean Square Error (RMSE), offering detailed insights into the temporal correlations in network traffic. This RMSE is then refined through an aggregation mechanism based on source IP addresses, yielding a fine-grained temporal representation. Finally, the Isolation Forest (IF) algorithm is applied to establish an anomaly detection framework. Our comprehensive experimental evaluation, using three datasets: Mirai, OS Scan, and SSDP Flood, demonstrates the superior efficacy of the CAE-IF method. It achieves remarkable F1 scores of 96.14%, 99.81%, and 99.98% on these datasets, respectively. These results not only signify substantial improvements over existing methods for the Mirai and OS Scan datasets but also match the highest F1 score obtained on the SSDP Flood dataset.

Tsinghua Science and Technology
Cite this article:
Geng H, Zhang Z, Qian Y, et al. CAE-IF: An Anomaly Detection Approach Based on Temporal Representation of the Reconstruction Error. Tsinghua Science and Technology, 2025, https://doi.org/10.26599/TST.2024.9010215

88

Views

23

Downloads

0

Crossref

0

Web of Science

0

Scopus

0

CSCD

Altmetrics

Received: 16 February 2024
Revised: 08 May 2024
Accepted: 29 October 2024
Available online: 29 April 2025

© The author(s) 2025

The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).

Return