Open Access Issue
SmartEagleEye: A Cloud-Oriented Webshell Detection System Based on Dynamic Gray-Box and Deep Learning
Tsinghua Science and Technology 2024, 29 (3): 766-783
Published: 04 December 2023
Downloads:43

Compared with traditional environments, the cloud environment exposes online services to additional vulnerabilities and cyber-attack threats, and the security of cloud platforms is becoming an increasingly prominent concern. A piece of code known as a Webshell is usually uploaded to a target server to carry out multiple attacks, and preventing Webshell attacks has become a hot spot in current research. However, traditional Webshell detectors are not built for the cloud, which makes it difficult for them to play a defensive role in the cloud environment. This paper proposes SmartEagleEye, a Webshell detection system based on deep learning that has been successfully applied in various scenarios. The system contains two important components: a gray-box analyzer and a neural network analyzer. The gray-box analyzer defines a series of rules and algorithms that extract static and dynamic behaviors from the code and make the decision jointly. The neural network analyzer transforms suspicious code into Operation Code (OPCODE) sequences, turning the detection task into a classification problem. Comprehensive experimental results show that SmartEagleEye achieves an encouragingly high detection rate and an acceptable false-positive rate, which indicates its capability to provide good protection for the cloud environment.
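
The abstract describes turning code into OPCODE sequences so that detection becomes a classification problem. The following is a minimal sketch of the usual preprocessing step for such a classifier, not the paper's implementation: it maps each opcode to an integer id and pads the sequences to a fixed length. The opcode names, the reserved ids, and the helper names `build_vocab` and `encode` are all assumptions made for illustration.

```python
from typing import Dict, List

PAD, UNK = 0, 1  # reserved ids: padding and opcodes not seen during training


def build_vocab(sequences: List[List[str]]) -> Dict[str, int]:
    """Assign an integer id to every opcode seen in the training sequences."""
    vocab: Dict[str, int] = {}
    for seq in sequences:
        for op in seq:
            if op not in vocab:
                vocab[op] = len(vocab) + 2  # ids 0 and 1 are reserved
    return vocab


def encode(seq: List[str], vocab: Dict[str, int], max_len: int) -> List[int]:
    """Map opcodes to ids, truncate to max_len, and right-pad with PAD."""
    ids = [vocab.get(op, UNK) for op in seq[:max_len]]
    return ids + [PAD] * (max_len - len(ids))


# Hypothetical opcode traces; the names are illustrative only.
train = [
    ["ASSIGN", "SEND_VAL", "DO_FCALL", "RETURN"],
    ["FETCH_R", "INCLUDE_OR_EVAL", "RETURN"],
]
vocab = build_vocab(train)
encoded = [encode(s, vocab, max_len=5) for s in train]
```

The fixed-length integer sequences in `encoded` are the kind of input that an embedding layer followed by a neural classifier would typically consume.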

Open Access Issue
Multi-features fusion for short-term photovoltaic power prediction
Intelligent and Converged Networks 2022, 3 (4): 311-324
Published: 30 December 2022
Downloads:79

In recent years, to achieve the goal of “carbon peaking and carbon neutrality”, many countries have focused on developing clean energy, and the prediction of photovoltaic power generation has become a hot research topic. However, many traditional methods use only meteorological factors such as temperature and irradiance as features of photovoltaic power generation, and they rarely consider multi-feature fusion for power prediction. This paper first preprocesses abnormal data points and missing values in the data from 18 power stations in Northwest China, and then carries out correlation analysis to select the 8 meteorological features most relevant to power generation. Next, the historical generated power and the 8 meteorological features are fused in different ways to construct three types of experimental datasets. Finally, traditional time series prediction methods, such as Recurrent Neural Network (RNN) and Convolutional Neural Network (CNN), combined with eXtreme Gradient Boosting (XGBoost), are applied to study the impact of different feature fusion methods on power prediction. The results show that the prediction accuracy of Long Short-Term Memory (LSTM), stacked Long Short-Term Memory (stacked LSTM), Bi-directional LSTM (Bi-LSTM), Temporal Convolutional Network (TCN), and XGBoost algorithms can be greatly improved by integrating historical generated power with meteorological features. Therefore, the feature-fusion-based photovoltaic power prediction method proposed in this paper is of great significance to the development of the photovoltaic power generation industry.
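
The abstract does not spell out its three fusion schemes, so the sketch below shows only one plausible variant as an assumption: fusion by concatenation, where historical power is added as an extra feature next to the meteorological features in each sliding window. The function name `make_windows`, the `lookback` parameter, and the toy values are hypothetical.

```python
from typing import List, Sequence, Tuple


def make_windows(
    power: Sequence[float],
    weather: Sequence[Sequence[float]],
    lookback: int,
    fuse: bool = True,
) -> Tuple[List[List[List[float]]], List[float]]:
    """Build (X, y) samples for one-step-ahead power prediction.

    Each sample holds the previous `lookback` time steps; the target is
    the power at the step that follows the window.
    """
    X, y = [], []
    for t in range(lookback, len(power)):
        window = []
        for s in range(t - lookback, t):
            row = list(weather[s])
            if fuse:
                # Assumed fusion: prepend historical power to the weather row.
                row = [power[s]] + row
            window.append(row)
        X.append(window)
        y.append(power[t])
    return X, y


# Toy data: 5 time steps, 2 meteorological features (e.g. temperature, irradiance).
power = [0.0, 1.0, 2.0, 3.0, 4.0]
weather = [[20.0, 500.0], [21.0, 550.0], [22.0, 600.0], [23.0, 650.0], [24.0, 700.0]]

X_fused, y = make_windows(power, weather, lookback=2)
X_met, _ = make_windows(power, weather, lookback=2, fuse=False)
```

Training the same model on `X_met` and on `X_fused` is one simple way to compare a meteorology-only dataset against a fused one.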

Open Access Issue
From computer vision to short text understanding: Applying similar approaches into different disciplines
Intelligent and Converged Networks 2022, 3 (2): 161-172
Published: 06 September 2022
Downloads:57

With the development of IoT and 5G technologies, more and more online resources are presented in multimodal data forms over the Internet. Hence, effectively processing multimodal information is significant to the development of various online applications, including e-learning and digital health, to name just a few. However, most AI-driven systems or models can only handle limited forms of information. In this study, we investigate the correlation between natural language processing (NLP) and pattern recognition, applying the mainstream approaches and models used in computer vision (CV) to NLP tasks. Based on two different Twitter datasets, we propose a convolutional neural network based model to interpret the content of short text with different goals and application backgrounds. The experiments demonstrate that our proposed model shows fairly competitive performance compared with mainstream recurrent neural network based NLP models such as bidirectional long short-term memory (Bi-LSTM) and bidirectional gated recurrent unit (Bi-GRU). Moreover, the experimental results also demonstrate that the proposed model can precisely locate the key information in the given text.
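
As a rough illustration of how a convolutional model can both score a short text and point at the span that drove the score, the sketch below slides a single filter over token embeddings and applies max-over-time pooling, keeping the position of the maximum. This is a generic text-CNN building block, not the paper's architecture; the embeddings, the filter weights, and the function name `conv1d_max` are made up for the example.

```python
from typing import List, Tuple


def conv1d_max(
    embeddings: List[List[float]], kernel: List[List[float]]
) -> Tuple[float, int]:
    """Slide one filter over the token embeddings and max-pool over time.

    Returns the pooled activation and the start index of the window that
    produced it. Assumes the text has at least as many tokens as the filter
    is wide.
    """
    width = len(kernel)
    best_val, best_pos = float("-inf"), -1
    for i in range(len(embeddings) - width + 1):
        act = sum(
            w * x
            for k_row, e_row in zip(kernel, embeddings[i:i + width])
            for w, x in zip(k_row, e_row)
        )
        act = max(act, 0.0)  # ReLU
        if act > best_val:
            best_val, best_pos = act, i
    return best_val, best_pos


# Hypothetical 2-dimensional embeddings for a four-token text.
tokens = ["flood", "warning", "in", "town"]
emb = [[1.0, 0.0], [0.5, 1.0], [0.0, 0.0], [0.0, 0.5]]
kernel = [[1.0, 0.0], [0.0, 1.0]]  # one filter spanning two tokens

value, pos = conv1d_max(emb, kernel)
key_span = tokens[pos:pos + len(kernel)]
```

Here `key_span` is the two-token window with the strongest filter response, which is one simple way to read off which part of the text the filter reacted to.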
