Discover the SciOpen Platform and Achieve Your Research Goals with Ease.
Search articles, authors, keywords, DOl and etc.
Finding more specific subcategories within a larger category is the goal of fine-grained image classification (FGIC), and the key is to find local discriminative regions of visual features. Most existing methods use traditional convolutional operations to achieve fine-grained image classification. However, traditional convolution cannot extract multi-scale features of an image and existing methods are susceptible to interference from image background information. Therefore, to address the above problems, this paper proposes an FGIC model (Attention-PCNN) based on hybrid attention mechanism and pyramidal convolution. The model feeds the multi-scale features extracted by the pyramidal convolutional neural network into two branches capturing global and local information respectively. In particular, a hybrid attention mechanism is added to the branch capturing global information in order to reduce the interference of image background information and make the model pay more attention to the target region with fine-grained features. In addition, the mutual-channel loss (MC-LOSS) is introduced in the local information branch to capture fine-grained features. We evaluated the model on three publicly available datasets CUB-200-2011, Stanford Cars, FGVC-Aircraft, etc. Compared to the state-of-the-art methods, the results show that Attention-PCNN performs better.
L. Kong, G. Li, W. Rafique, S. Shen, Q. He, M. R. Khosravi, R. Wang, and L. Qi, Time-aware missing healthcare data prediction based on ARIMA model, IEEE/ACM Trans. Comput. Biol. Bioinf., vol. 21, no. 4, pp. 1042–1050, 2024.
H. Zhang, M. Hao, H. Wu, H.-F. Ting, Y. Tang, W. Xi, and Y. Wei, Protein residue contact prediction based on deep learning and massive statistical features from multi-sequence alignment, Tsinghua Science and Technology, vol. 27, no. 5, pp. 843–854, 2022.
X.-S. Wei, C.-W. Xie, J. Wu, and C. Shen, Mask-CNN: Localizing parts and selecting descriptors for fine-grained bird species categorization, Pattern Recognit., vol. 76, pp. 704–714, 2018.
S. Min, H. Yao, H. Xie, Z.-J. Zha, and Y. Zhang, Multi-objective matrix normalization for fine-grained visual recognition, IEEE Trans. Image Process., vol. 29, pp. 4996–5009, 2020.
T. Shen, C. Gao, and D. Xu, The analysis of intelligent real-time image recognition technology based on mobile edge computing and deep learning, J. Real Time Image Process., vol. 18, no. 4, pp. 1157–1166, 2021.
X. Luo, Z. Yu, Z. Zhao, W. Zhao, and J.-H. Wang, Effective short text classification via the fusion of hybrid features for IoT social data, Digit. Commun. Netw., vol. 8, no. 6, pp. 942–954, 2022.
W. Batayneh, E. Abdulhay, and M. Alothman, Comparing the efficiency of artificial neural networks in sEMG-based simultaneous and continuous estimation of hand kinematics, Digit. Commun. Netw., vol. 8, no. 2, pp. 162–173, 2022.
P. Zhuang, Y. Wang, and Y. Qiao, Learning attentive pairwise interaction for fine-grained classification, Proc. AAAI Conf. Artif. Intell., vol. 34, no. 7, pp. 13130–13137, 2020.
F. Wang, G. Li, Y. Wang, W. Rafique, M. R. Khosravi, G. Liu, Y. Liu, and L. Qi, Privacy-aware traffic flow prediction based on multi-party sensor data with zero trust in smart city, ACM Trans. Internet Technol., vol. 23, no. 3, pp. 1–19, 2023.
Y. Zhang, J. Pan, L. Qi, and Q. He, Privacy-preserving quality prediction for edge-based IoT services, Future Gener. Comput. Syst., vol. 114, pp. 336–348, 2021.
Q. Hua, L. Chen, P. Li, S. Zhao, and Y. Li, A pixel–channel hybrid attention model for image processing, Tsinghua Science and Technology, vol. 27, no. 5, pp. 804–816, 2022.
J. Ye, S. Xue, and A. Jiang, Attention-based spatio-temporal graph convolutional network considering external factors for multi-step traffic flow prediction, Digit. Commun. Netw., vol. 8, no. 3, pp. 343–350, 2022.
Y. Peng, X. He, and J. Zhao, Object-part attention model for fine-grained image classification, IEEE Trans. Image Process., vol. 27, no. 3, pp. 1487–1500, 2018.
P. Jiang, H. Wu, and C. Xin, DeepPOSE: Detecting GPS spoofing attack via deep recurrent neural network, Digit. Commun. Netw., vol. 8, no. 5, pp. 791–803, 2022.
Q. Zhang, X. Zhang, H. Hu, C. Li, Y. Lin, and R. Ma, Sports match prediction model for training and exercise using attention-based LSTM network, Digit. Commun. Netw., vol. 8, no. 4, pp. 508–515, 2022.
L. Wang, K. He, X. Feng, and X. Ma, Multilayer feature fusion with parallel convolutional block for fine-grained image classification, Appl. Intell., vol. 52, no. 3, pp. 2872–2883, 2022.
H. Liu, L. Qi, S. Shen, A. Ali Khan, S. Meng, and Q. Li, Microservice-driven privacy-aware cross-platform social relationship prediction based on sequential information, Softw. Pract. Exp., vol. 54, no. 1, pp. 85–105, 2024.
The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).