
Joint Extraction Model of Multi-entity Relations for Poultry Diagnosis and Treatment Text
Cite this article: HU Bin, TANG Baohu, JIANG Haiyan, HUO Ao, HAN Wenxiao. Joint Extraction Model of Multi-entity Relations for Poultry Diagnosis and Treatment Text[J]. Transactions of the Chinese Society of Agricultural Machinery, 2021, 52(6): 268-276.
Authors: HU Bin  TANG Baohu  JIANG Haiyan  HUO Ao  HAN Wenxiao
Institution: Nanjing Agricultural University
Funding: National Key Research and Development Program of China (2016YFD0300607)
Abstract: Traditional entity relation extraction methods struggle to fuse subject features with sentence vectors effectively, and the existing BIO tagging scheme handles overlapping relations poorly. To address both problems, this paper proposes JEER_PD (Joint extraction of entity relationship of poultry disease diagnosis and treatment text), a joint entity relation extraction model based on BERT and dual-pointer labeling. JEER_PD uses a dual-pointer labeling (DPL) strategy that builds two pointer taggers, one for heads and one for tails, to mark the start and end positions of all entities in a single pass. It introduces a conditional layer normalization (CLN) layer to strengthen the link between the subject extraction task and the joint object-relation extraction task, and it applies a probability balance strategy (PBS) to counter the imbalance between positive and negative labels and thereby speed up model convergence. Experiments showed that the precision, recall, and F1 score of JEER_PD were 97.69%, 97.59%, and 97.64%, respectively, all three significantly higher than those of existing methods, which indicates that JEER_PD can quickly and accurately extract entity relation triples from complex knowledge text on poultry disease diagnosis and treatment.
Keywords: poultry disease diagnosis and treatment text; entity relationship extraction; relationship overlap; BERT language model; dual-pointer labeling
Received: 2020-09-02
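
The dual-pointer labeling idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names `encode_spans` and `decode_spans`, the 0.5 threshold, the nearest-tail pairing rule used for decoding, and the example sentence are all assumptions made for illustration. The abstract states only that a head tagger and a tail tagger mark the start and end positions of all entities at once.

```python
from typing import List, Sequence, Tuple

Span = Tuple[int, int]  # inclusive (start, end) token indices


def encode_spans(seq_len: int, spans: Sequence[Span]) -> Tuple[List[int], List[int]]:
    """Build the two 0/1 label sequences used by a head tagger and a tail tagger."""
    head = [0] * seq_len
    tail = [0] * seq_len
    for start, end in spans:
        head[start] = 1  # the entity begins at this token
        tail[end] = 1    # the entity ends at this token
    return head, tail


def decode_spans(head_probs: Sequence[float],
                 tail_probs: Sequence[float],
                 threshold: float = 0.5) -> List[Span]:
    """Pair each predicted head with the nearest tail at or after it.

    Nearest-tail pairing is an assumed heuristic, not taken from the paper.
    """
    spans: List[Span] = []
    for i, h in enumerate(head_probs):
        if h < threshold:
            continue
        for j in range(i, len(tail_probs)):
            if tail_probs[j] >= threshold:
                spans.append((i, j))
                break  # keep only the closest tail for this head
    return spans


# Hypothetical example sentence, not drawn from the paper's corpus.
tokens = ["Newcastle", "disease", "causes", "respiratory", "distress", "in", "chickens"]
gold = [(0, 1), (3, 4), (6, 6)]
head, tail = encode_spans(len(tokens), gold)
for start, end in decode_spans(head, tail):
    print(" ".join(tokens[start:end + 1]))
```

Because each token carries two independent binary labels instead of one BIO tag, a model can keep a separate head/tail pair per relation type, so the same token span can be marked under several relations. That is one common way pointer tagging accommodates overlapping relations; whether JEER_PD arranges its taggers exactly this way is not stated in the abstract.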