首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于改进型YOLOv4-LITE轻量级神经网络的密集圣女果识别
引用本文:张伏,陈自均,鲍若飞,张朝臣,王治豪.基于改进型YOLOv4-LITE轻量级神经网络的密集圣女果识别[J].农业工程学报,2021,37(16):270-278.
作者姓名:张伏  陈自均  鲍若飞  张朝臣  王治豪
作者单位:1. 河南科技大学农业装备工程学院,洛阳 471003;2. 江苏大学现代农业装备与技术教育部重点实验室,镇江 212013;3. 机械装备先进制造河南省协同创新中心,洛阳 471003
摘    要:对密集圣女果遮挡、粘连等情况下的果实进行快速识别定位,是提高设施农业环境下圣女果采摘机器人工作效率和产量预测的关键技术之一,该研究提出了一种基于改进YOLOv4-LITE轻量级神经网络的圣女果识别定位方法。为便于迁移到移动终端,该方法使用MobileNet-v3作为模型的特征提取网络构建YOLOv4-LITE网络,以提高圣女果果实目标检测速度;为避免替换骨干网络降低检测精度,通过修改特征金字塔网络(Feature Pyramid Networks,FPN)+路径聚合网络(Path Aggregation Network,PANet)的结构,引入有利于小目标检测的104×104尺度特征层,实现细粒度检测,在PANet结构中使用深度可分离卷积代替普通卷积降低模型运算量,使网络更加轻量化;并通过载入预训练权重和冻结部分层训练方式提高模型的泛化能力。通过与YOLOv4在相同遮挡或粘连程度的测试集上的识别效果进行对比,用调和均值、平均精度、准确率评价模型之间的差异。试验结果表明:在重叠度IOU为0.50时所提出的密集圣女果识别模型在全部测试集上调和均值、平均精度和准确率分别为0.99、99.74%和99.15%,同比YOLOv4分别提升了0.15、8.29个百分点、6.54个百分点,权重大小为45.3 MB,约为YOLOv4的1/5,对单幅416×416像素图像的检测,在图形处理器(Graphics Processing Unit,GPU)上速度可达3.01 ms/张。因此,该研究提出的密集圣女果识别模型具有识别速度快、识别准确率高、轻量化等特点,可为设施农业环境下圣女果采摘机器人高效工作以及圣女果产量预测提供有力的保障。

关 键 词:机器视觉  模型  YOLO  深度学习  图像识别  目标检测
收稿时间:2021/8/15 0:00:00
修稿时间:2021/8/15 0:00:00

Recognition of dense cherry tomatoes based on improved YOLOv4-LITE lightweight neural network
Zhang Fu,Chen Zijun,Bao Ruofei,Zhang Chaochen,Wang Zhihao.Recognition of dense cherry tomatoes based on improved YOLOv4-LITE lightweight neural network[J].Transactions of the Chinese Society of Agricultural Engineering,2021,37(16):270-278.
Authors:Zhang Fu  Chen Zijun  Bao Ruofei  Zhang Chaochen  Wang Zhihao
Institution:1. College of Agricultural Equipment Engineering, Henan University of Science and Technology, Luoyang 471003, China; 2. Key Laboratory of Modern Agricultural Equipment and Technology of Ministry of Education, Jiangsu University, Zhenjiang 212013,China; 3. Collaborative Innovation Center of Advanced Manufacturing of Machinery and Equipment of Henan Province, Luoyang 471003, China
Abstract:Abstract: Small and hidden conditions of dense cherry tomatoes have posed a great challenge on the rapid identification and positioning of fruits. A new key technology with a strong robustness is highly demanding to improve the efficiency and yield prediction of cherry tomatoes in the facility agriculture environment. In this study, a novel recognition method was proposed to locate the dense cherry tomatoes using improved YOLOv4-LITE lightweight neutral network. A mobileNet-v3 easy migration to mobile terminals was selected as the feature extraction network of model to construct a YOLOv4-LITE for a higher detection speed of cherry tomatoes. A feature pyramid network was set as the modified (FPN) + Path Aggregation Network (PANet) structure, in order to avoid replacing the backbone network to reduce the detection accuracy. Specifically, a 104×104 Future map was introduced to achieve fine-grained detection for the small targets. More importantly, a deep separable convolution was used in the PANet structure to reduce the amount of model calculations. The new network was more lightweight, where the generalization ability of model was improved by loading pre-training weights and freezing partial layer training. A comparison was made on the recognition effect of YOLOv4, F1 and AP on the test set with the same degree of occlusion or adhesion, further to evaluate the difference between the models. The test results show that the improved FPN structure on the basis of YOLOv4 was higher than the AP50 of the original YOLOv4 AP75 increased by 15 percentage points, and the F1 increased by 0.14 and 0.24 under the corresponding IOU threshold. However, the weight increased by 4MB, while the detection speed increased by 5.84%, and the amount of network parameters increased by 14.85%. The improved FPN structure on the basis of YOLOv4+MobiletNet-V3, AP50 increased by 0.14%, AP75 increased by 23.25 percentage points, F1 value increased by 0.14 and 0.49 under the corresponding IOU threshold, indicating that YOLOv4 and YOLOv4+MobiletNet-V3 lacked small goals. Fortunately, the Future map of small targets was added to improve the fine-grained detection of model, but the amount of model parameters and weights increased accordingly. As such, the PANet structure was improved to introduce a deep separable convolutional network, while ensuring high F1, AP, Recall and Precision. An optimal performance was achieved, where the model weight was compressed to 45.3MB, the detection speed was 3.01ms/sheet, and the network parameters were 12026685. Specifically, the new network was reduced by 198.7MB, increased by 34.85%, and reduced by 81.34%, respectively, compared with the original YOLOv4. It was also reduced by 55.9%, slower by 0.23 s, and reduced by 69.97%, respectively, compared with YOLOv4+MobiletNet-V3. The data indicated that the improved PANet strategy presented the similar accuracy under such circumstances, while effectively reduced memory consumption, and the amount of model parameters, but accelerated the speed of model recognition. The F1, AP50, and recall of the proposed recognition model for the dense cherry tree on all test sets were 0.99, 99.74% and 99.15%, respectively. The improved YOLOv4 increased by 0.15, 8.29, and 6.54 percentage points, respectively, and the weight size was 45.3MB, about 1/5 of YOLOv4. Additionally, the detection of a single 416×416 image reached a speed of 3.01ms/frame on the GPU. Therefore, the recognition model of dense cherry tomatoes behaved a higher speed of recognition, a higher accuracy, and lighter weight than before. The finding can provide a strong support to the efficient production forecast of cherry tomatoes in the facility agriculture environment.
Keywords:computer vision  models  YOLO  deep learning  image recognition  target detection
点击此处可从《农业工程学报》浏览原始摘要信息
点击此处可从《农业工程学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号