Scholar - SciOpen

Issue

Residual network-based stacked vector quantized autoencoder

Hui WANG, Xiaojun YE, Zewei DONG

Journal of Tsinghua University (Science and Technology) 2025, 65(11): 2259-2268

Published: 15 November 2025

Abstract

PDF (7.4 MB) Collect Collected

Downloads：0

Objective

Deep learning technologies have achieved remarkable progress in the field of personalized recommendation services. However, recommendation systems based on deep neural networks still face the challenge of data sparsity, which limits the ability of a model to accurately capture subtle differences in user preferences, thereby affecting the robustness of model training. This problem is specifically prominent in scenarios with limited user interaction data. Therefore, this paper aims to propose a recommendation system model that can effectively address the data sparsity issue to enhance the capability of a model in user behavior modeling and overall performance.

Methods

To tackle the data sparsity issue, this paper proposes a residual network-based stacked vector-quantized autoencoder (RSVQ-AE). This model fully utilizes the advantages of residual connections by directly passing the continuous latent vector output from the multiple layers of encoders to the corresponding layers of the decoder. This effectively reduces the loss of high-value continuous information that is common in encoders, which is crucial for maintaining the fidelity of data representation. Meanwhile, by introducing vector quantization technology, we discretize the latent space to ensure that the model can accurately capture and represent the data. In addition, this paper constructs multiple stacked codebooks using vector quantization technology, enabling the model to learn multidimensional discrete vector quantization feature representations and capture the discretized interest representations of user behavior across multiple dimensions through stacked codebooks. To further enhance the stability and generative capabilities of the model, an adversarial network is introduced as a regularizer during the training process to promote rapid convergence.

Results

To verify the effectiveness of the model, experiments were conducted on several public datasets widely used in recommendation systems. The experimental results revealed that the RSVQ-AE model exhibits excellent reconstruction performance across multiple datasets. Based on the ML-1M (MovieLens-1M) dataset, when the sequence length is 20, the reconstruction loss of RSVQ-AE is only 0.1525, with an accuracy rate of as high as 70.69%; when the sequence length increases to 100, the reconstruction loss further decreases to 0.0039, with an accuracy rate of 50.58%. Based on the Retail Rocket dataset, when the sequence length is 20, the reconstruction loss is as low as 2.42×10^-4, with an accuracy rate of 81.26%; when the sequence length is 100, the reconstruction loss is 0.0019, with an accuracy rate of 74.21%. These results fully demonstrate that RSVQ-AE can maintain low reconstruction loss and high accuracy when processing sequences with different lengths. Its performance is only second to the autoencoder model, which cannot perform sampling generation.

Conclusions

The proposed RSVQ-AE offers a powerful solution for the generation of discrete sequence data in recommendation systems. By addressing the limitations of existing generative models and introducing innovative technologies such as stacked codebooks, this model has achieved remarkable improvements in reconstruction accuracy and data generation quality. This method not only enhances the capability of the model in user behavior modeling but also provides new ideas and approaches for the development of personalized recommendation services, holding the potential to drive the future development of more efficient and user-behavior-centered recommendation systems. In addition, the flexibility and robustness of model data generation make it applicable to a variety of recommendation system model architectures.

Issue

Unsupervised network traffic anomaly detection based on score iterations

Guolou PING, Tingyu ZENG, Xiaojun YE

Journal of Tsinghua University (Science and Technology) 2022, 62(5): 819-824

Published: 15 May 2022

Abstract

PDF (3.2 MB) Collect Collected

Downloads：17

Network traffic anomaly detection is limited by the lack of annotation information in the traffic. This paper presents an unsupervised anomaly detection method based on score iterations that overcomes this limitation. An autoencoder based anomaly score iteration process was designed to learn generic anomaly features to determine an initial anomaly score. A deep ordinal regression model based anomaly score iteration process was then designed to learn discriminative anomaly features to further improve the anomaly score accuracy. Deep models, multi-view features and ensemble learning are also used to improve the detection accuracy. Tests on several datasets show that this method has significant advantages over other methods in the absence of annotation information and can be effectively applied to network traffic anomaly detection.

Open Access Issue

PrivBV: Distance-Aware Encoding for Distributed Data with Local Differential Privacy

Lin Sun, Guolou Ping, Xiaojun Ye

Tsinghua Science and Technology 2022, 27(2): 412-421

Published: 29 September 2021

Abstract

PDF (4.9 MB) Collect Collected

Downloads：126

Recently, local differential privacy (LDP) has been used as the de facto standard for data sharing and analyzing with high-level privacy guarantees. Existing LDP-based mechanisms mainly focus on learning statistical information about the entire population from sensitive data. For the first time in the literature, we use LDP for distance estimation between distributed data to support more complicated data analysis. Specifically, we propose PrivBV—a locally differentially private bit vector mechanism with a distance-aware property in the anonymized space. We also present an optimization strategy for reducing privacy leakage in the high-dimensional space. The distance-aware property of PrivBV brings new insights into complicated data analysis in distributed environments. As study cases, we show the feasibility of applying PrivBV to privacy-preserving record linkage and non-interactive clustering. Theoretical analysis and experimental results demonstrate the effectiveness of the proposed scheme.

Open Access Issue

Propagation History Ranking in Social Networks: A Causality-Based Approach

Zheng Wang, Chaokun Wang, Xiaojun Ye, Jisheng Pei, Bin Li

Tsinghua Science and Technology 2020, 25(2): 161-179

Published: 02 September 2019

Abstract

PDF (1.8 MB) Collect Collected

Downloads：140

Information diffusion is one of the most important issues in social network analysis. Unlike most existing works, which either rely on network topology or node profiles, this study focuses on the diffusion itself, i.e., the recorded propagation histories. These histories are the evidence of diffusion and can be used to explain to users what happened in their networks. However, these histories can quickly grow in size and complexity, limiting their capacity to be intuitively understood. To reduce this information overload, in this paper we present the problem of propagation history ranking. The goal is to rank participant edges/nodes by their contribution to the diffusion. We first discuss and adapt a causal measure, Difference of Causal Effects (DCE), as the ranking criterion. Then, to avoid the complex calculation of DCE, we propose two integrated ranking strategies by adopting two indicators. One is responsibility, which captures the necessity aspect of causal effects. We further give an approximate algorithm, which could guarantee a feasible solution, for this indicator. The other is capability, which captures the sufficiency aspect of causal effects. Finally, promising experimental results are presented to verify the feasibility of the proposed ranking strategies.

Open Access Issue

Performance Prediction for Performance-Sensitive Queries Based on Algorithmic Complexity

Chihung Chi, Ye Zhou, Xiaojun Ye

Tsinghua Science and Technology 2013, 18(6): 618-628

Published: 06 December 2013

Abstract

PDF (507.2 KB) Collect Collected

Downloads：56

Performance predictions for database queries allow service providers to determine what resources are needed to ensure their performance. Cost-based or rule-based approaches have been proposed to optimize database query execution plans. However, Virtual Machine (VM)-based database services have little or no sharing of resources or interactions between applications hosted on shared infrastructures. Neither providers nor users have the right combination of visibility/access/expertise to perform proper tuning and provisioning. This paper presents a performance prediction model for query execution time estimates based on the query complexity for various data sizes. The user query execution time is a combination of five basic operator complexities: $O (1)$ , $O (\log (n))$ , $O (n)$ , $O (n \log (n))$ , and $O (n^{2})$ . Moreover, tests indicate that not all queries are equally important for performance prediction. As such, this paper illustrates a performance-sensitive query locating process on three benchmarks: RUBiS, RUBBoS, and TPC-W. A key observation is that performance-sensitive queries are only a small proportion ( $20 %$ ) of the application query set. Evaluation of the performance model on the TPC-W benchmark shows that the query complexity in a real life scenario has an average prediction error rate of less than $10 %$ which demonstrates the effectiveness of this predictive model.