
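Cross-modal Image and Text Retrieval Method for Lycium barbarum Pests by Integrating Attention Mechanism
Cite this article:LIU Libo,ZHAO Feifei.Cross-modal Image and Text Retrieval Method for Lycium barbarum Pests by Integrating Attention Mechanism[J].Transactions of the Chinese Society for Agricultural Machinery,2022,53(2):299-308.
Authors:LIU Libo  ZHAO Feifei
Affiliation:School of Information Engineering, Ningxia University
Funding:National Natural Science Foundation of China (61862050) and Natural Science Foundation of Ningxia (2020AAC03031)
Abstract (translated from the Chinese):Existing retrieval of crop pests and diseases relies on a single modality. Taking images and text descriptions of 17 common Lycium barbarum pests as the research object, this work introduces cross-modal retrieval into the field of Lycium barbarum pest retrieval and proposes a cross-modal image and text retrieval method that integrates an attention mechanism. First, a Transformer model and a recurrent neural network are used to obtain fine-grained image and text feature sequences with contextual information, respectively; then, an attention mechanism aggregates the feature sequences to mine...

Keywords:Lycium barbarum pests  attention mechanism  image and text retrieval  cross-modal
Received:2020/12/28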

Cross-modal Image and Text Retrieval Method for Lycium barbarum Pests by Integrating Attention Mechanism
LIU Libo,ZHAO Feifei.Cross-modal Image and Text Retrieval Method for Lycium barbarum Pests by Integrating Attention Mechanism[J].Transactions of the Chinese Society for Agricultural Machinery,2022,53(2):299-308.
Authors:LIU Libo  ZHAO Feifei
Institution:Ningxia University
Abstract:In recent years, with changing climatic conditions and the introduction of cultivation techniques, the planting area of Lycium has gradually expanded, and it has become one of the important economic crops in Ningxia and the wider northwestern region. Lycium hosts many insect species and has poor resistance to them. It is highly susceptible to infestation, which reduces yield and quality and causes serious economic losses. Quickly and accurately retrieving information about Lycium pests, so that timely and accurate control can be applied, is therefore important for the development of the industry. To address the problem that existing crop pest retrieval systems support only a single modality, cross-modal retrieval of images and texts was introduced on a Lycium pest dataset covering 17 kinds of common pests, and a cross-modal image and text retrieval method with an attention mechanism was proposed. Firstly, the Transformer and the LSTM were used to obtain fine-grained image and text feature sequences with context information, respectively. Then, the attention mechanism was used to aggregate the feature sequences and capture the salient semantic information in images and texts. Finally, to explore the semantic correlation between modalities, a cross-media joint loss was used to constrain the proposed model. Experiments showed that the averaged MAP of the proposed method on the self-built Lycium pest dataset reached 0.458. Compared with eight existing methods, the averaged MAP was higher by 0.011 to 0.195, outperforming all of them. The proposed method can provide technical support and an algorithmic reference for diversified retrieval requirements for crop pests.
Keywords:Lycium barbarum pests  attention mechanism  image and text retrieval  cross-modal
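
The abstract names two steps that a small example can make concrete: aggregating a feature sequence with attention, and scoring retrieval with mean average precision (MAP). The sketch below is illustrative only and is not the authors' implementation. It assumes dot-product attention against a single query vector (the abstract does not say which attention form is used), cosine similarity for ranking, and class labels as the relevance criterion. All function names and the toy vectors are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(features, query):
    """Aggregate a feature sequence into one vector.

    Assumption: weights come from dot-product scores against `query`,
    a stand-in for whatever learned parameters the real model uses.
    """
    scores = [sum(f * q for f, q in zip(feat, query)) for feat in features]
    weights = softmax(scores)
    dim = len(features[0])
    pooled = [sum(w * feat[d] for w, feat in zip(weights, features))
              for d in range(dim)]
    return pooled, weights


def cosine(a, b):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def average_precision(ranked_labels, query_label):
    """Average precision of one ranked list; an item is relevant
    when its label equals the query's label."""
    hits = 0
    precision_sum = 0.0
    for rank, label in enumerate(ranked_labels, start=1):
        if label == query_label:
            hits += 1
            precision_sum += hits / rank
    return precision_sum / hits if hits else 0.0


def mean_average_precision(query_vecs, query_labels,
                           gallery_vecs, gallery_labels):
    """MAP for one retrieval direction (e.g. image queries, text gallery)."""
    aps = []
    for q_vec, q_label in zip(query_vecs, query_labels):
        order = sorted(range(len(gallery_vecs)),
                       key=lambda i: cosine(q_vec, gallery_vecs[i]),
                       reverse=True)
        ranked = [gallery_labels[i] for i in order]
        aps.append(average_precision(ranked, q_label))
    return sum(aps) / len(aps)


# Toy usage: pool a two-step feature sequence, then rank a tiny gallery.
pooled, weights = attention_pool([[1.0, 0.0], [0.0, 1.0]], query=[0.0, 0.0])
score = mean_average_precision(
    [[1.0, 0.0]], ["aphid"],
    [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]], ["aphid", "mite", "aphid"],
)
```

In this toy run the zero query gives both sequence steps equal weight, so the pooled vector is their mean. An averaged MAP of the kind the abstract reports would presumably average this score over the image-to-text and text-to-image directions, though the abstract does not spell that out.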