Discover the SciOpen Platform and Achieve Your Research Goals with Ease.
Search articles, authors, keywords, DOl and etc.
Cross-modal retrieval for rice leaf diseases is crucial for prevention, providing agricultural experts with data-driven decision support to address disease threats and safeguard rice production. To overcome the limitations of current crop leaf disease retrieval frameworks, we focused on four common rice leaf diseases and established the first cross-modal rice leaf disease retrieval dataset (CRLDRD). We introduced cross-modal retrieval to the domain of rice leaf disease retrieval and introduced FHTW-Net, a framework for rice leaf disease image–text retrieval. To address the challenge of matching diverse image categories with complex text descriptions during the retrieval process, we initially employed ViT and BERT to extract fine-grained image and text feature sequences enriched with contextual information. Subsequently, two-way mixed self-attention (TMS) was introduced to enhance both image and text feature sequences, with the aim of uncovering important semantic information in both modalities. Then, we developed false-negative elimination–hard negative mining (FNE-HNM) strategy to facilitate in-depth exploration of semantic connections between different modalities. This strategy aids in selecting challenging negative samples for elimination to constrain the model within the triplet loss function. Finally, we introduced warm-up bat algorithm (WBA) for learning rate optimization, which improves the model’s convergence speed and accuracy. Experimental results demonstrated that FHTW-Net outperforms state-of-the-art models. In image-to-text retrieval, it achieved R@1, R@5, and R@10 accuracies of 83.5%, 92%, and 94%, respectively, while in text-to-image retrieval, it achieved accuracies of 82.5%, 98%, and 98.5%, respectively. FHTW-Net offers advanced technical support and algorithmic guidance for cross-modal retrieval of rice leaf diseases.
Rai A, Maharjan MR, Harris Fry HA, Chhetri PK, Wasti PC, Saville NM. Consumption of rice, acceptability and sensory qualities of fortified rice amongst consumers of social safety net rice in Nepal. PLOS ONE. 2019;14(10):Article e0222903.
Kwon TH, Kim JY, Lee C, Park GH, Ashtiani-Araghi A, Baek SH, Rhee JY. Survey on informatization status of farmers for introducing ubiquitous agriculture information system. J Biosyst Eng. 2014;39(1):57–67.
Zhen Y, Yeung DY. Active hashing and its application to image and text retrieval. Data Min Knowl Disc. 2013;26:255–274.
Yilmaz T, Yazici A, Kitsuregawa M. RELIEF-MM: Effective modality weighting for multimedia information retrieval. Multimedia Syst. 2014;20(4):389–413.
Li M, Zhou G, Chen A, Yi J, Lu C, He M, Hu Y. FWDGAN-based data augmentation for tomato leaf disease identification. Comput Electron Agric. 2022;194:Article 106779.
Deng Y, Xi H, Zhou G, Chen A, Wang Y, Li L, Hu Y. An effective image-based tomato leaf disease segmentation method using MC-UNet. Plant Phenom. 2023;5:0049.
Tang Z, He X, Zhou G, Chen A, Wang Y, Li L, Hu Y. A precise image-based tomato leaf disease detection approach using PLPNet. Plant Phenom. 2023;5:0042.
Dong Y, Xu F, Liu L, du X, Ren B, Guo A, Geng Y, Ruan C, Ye H, Huang W, et al. Automatic system for crop pest and disease dynamic monitoring and early forecasting. IEEE J Sel Top Appl Earth Observ Remote Sens. 2020;13:4410–4418.
Xin M, Wang Y. Image recognition of crop diseases and insect pests based on deep learning. Wirel Commun Mob Comput. 2021;2021:1–15.
Frome A, Corrado GS, Shlens J, Begio S, Dean J, Ranzato MA, Mikolov T. Devise: A deep visual-semantic embedding model. Adv Neural Inf Proces Syst. 2013;26:2121–2129.
Sethy PK, Barpanda NK, Rath AK, Behera SK. Deep feature based rice leaf disease identification using support vector machine. Comput Electron Agric. 2020;175:Article 105527.
Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser L, Polosukhin I. Attention is all you need. Adv Neural Inf Proces Syst. 2017;30.
Fushiki T. Estimation of prediction error by using K-fold cross-validation. Stat Comput. 2011;21:137–146.
Distributed under a Creative Commons Attribution License 4.0 (CC BY 4.0).