In recent years, the rapid development of deepfake technology has led to a large number of deepfake videos appearing on the Internet, posing a serious threat to national politics, social stability, and personal privacy. Although many existing deepfake detection methods perform well on known manipulations, their detection capability drops when they face unknown manipulations. To obtain better generalization ability, this paper analyzes global and local inter-frame dynamic inconsistencies in both the spatial and frequency domains, and proposes a Local region Frequency Guided Dynamic Inconsistency Network (LFGDIN). The network has two parts: a Global SpatioTemporal Network (GSTN) and a Local Region Frequency Guided Module (LRFGM). The GSTN captures the dynamic information of the entire face, while the LRFGM extracts the frequency dynamic information of the eyes and mouth. Through local region alignment, the LRFGM guides the GSTN to concentrate on dynamic inconsistency in these significant local regions, which improves the model's detection performance. Experiments on three public datasets (FF++, DFDC, and Celeb-DF) show that, compared with many recent advanced methods, the proposed method achieves better results when detecting deepfake videos of unknown manipulation types.
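To make the idea of frequency dynamic information in a local region more concrete, the following is a minimal sketch and not the paper's LRFGM. It assumes grayscale frames, a fixed crop box standing in for an aligned eye or mouth region, and a simple inter-frame difference of log-amplitude spectra as the dynamic signal. The function names `region_frequency_dynamics` and `inconsistency_score`, and the crop coordinates, are hypothetical and chosen only for illustration.

```python
import numpy as np


def region_frequency_dynamics(frames, box):
    """Inter-frame change of the log-amplitude spectrum in one local region.

    frames: array of shape (T, H, W), grayscale video frames.
    box: (top, left, height, width), a hypothetical crop for an eye or mouth region.
    Returns an array of shape (T - 1, height, width).
    """
    top, left, h, w = box
    crops = frames[:, top:top + h, left:left + w].astype(np.float64)
    # 2D FFT of every cropped frame, with the zero frequency shifted to the center.
    spectra = np.fft.fftshift(np.fft.fft2(crops, axes=(-2, -1)), axes=(-2, -1))
    log_amp = np.log1p(np.abs(spectra))
    # Difference between consecutive frames: the frequency "dynamics".
    return np.diff(log_amp, axis=0)


def inconsistency_score(dynamics):
    """Mean absolute inter-frame spectral change (an assumed, illustrative statistic)."""
    return float(np.mean(np.abs(dynamics)))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # A clip whose frames are all identical has no frequency dynamics.
    static_clip = np.tile(rng.random((1, 64, 64)), (8, 1, 1))
    # A clip of independent random frames changes strongly from frame to frame.
    noisy_clip = rng.random((8, 64, 64))
    mouth_box = (40, 16, 16, 32)  # hypothetical mouth region
    print(inconsistency_score(region_frequency_dynamics(static_clip, mouth_box)))
    print(inconsistency_score(region_frequency_dynamics(noisy_clip, mouth_box)))
```

In a learned model such as the one described above, a signal of this kind would be fed to a network rather than averaged into a single score. The scalar here only shows that temporally unstable regions produce larger spectral differences than stable ones.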
The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).