Privacy-sensitive data face significant security and usability challenges during processing, analysis, and sharing. Meanwhile, traditional desensitization methods for private data tend to produce low-quality output with limited usability. Therefore, a text data desensitization model that combines a Transformer with a Wasserstein Text convolutional Generative Adversarial Network (Trans-WTGAN) is proposed. The Transformer serves as the generator, and its self-attention mechanism can handle long-range dependencies, enabling the generation of higher-quality text. A Text Convolutional Neural Network (TextCNN) that incorporates the Wasserstein distance serves as the discriminator to enhance the stability of model training. The policy gradient scheme of reinforcement learning is employed to update the generator parameters, ensuring that the generated data retain the original key features and maintain a certain level of usability. The experimental results indicate that the proposed model outperforms existing methods in terms of text quality and structural consistency, guarantees the desensitization effect, and preserves the usability of the privacy-sensitive data to a certain extent after desensitization. This facilitates simulating a development environment that uses real data, as well as the analysis and sharing of data.
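The interplay between a Wasserstein-style discriminator score and a policy gradient update of the generator can be illustrated with a toy sketch. This is not the authors' implementation: the three-token vocabulary, the fixed `rewards` list standing in for TextCNN critic scores, and the function names `critic_loss` and `policy_gradient_step` are all hypothetical. To keep the example deterministic, it uses the exact expected policy gradient over the tiny vocabulary instead of a sampled estimate.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def critic_loss(real_scores, fake_scores):
    """Standard Wasserstein critic loss: mean score on generated text
    minus mean score on real text (the critic minimizes this)."""
    mean_fake = sum(fake_scores) / len(fake_scores)
    mean_real = sum(real_scores) / len(real_scores)
    return mean_fake - mean_real


def policy_gradient_step(logits, rewards, lr=0.4):
    """One gradient-ascent step on the expected reward E[r] of a softmax policy.

    For a softmax policy, dE[r]/dlogit_j = p_j * (r_j - E[r]).
    """
    probs = softmax(logits)
    baseline = sum(p * r for p, r in zip(probs, rewards))
    return [x + lr * p * (r - baseline) for x, p, r in zip(logits, probs, rewards)]


# Assumption: made-up critic scores for three candidate tokens,
# standing in for the output of a trained discriminator.
rewards = [0.1, 0.9, 0.3]
logits = [0.0, 0.0, 0.0]  # start from a uniform policy

for _ in range(200):
    logits = policy_gradient_step(logits, rewards)

probs = softmax(logits)
print([round(p, 3) for p in probs])
```

After the updates, the policy concentrates its probability on the token the critic scores highest, which is the mechanism by which discriminator feedback steers a generator whose discrete outputs cannot be differentiated directly.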
The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).