GENERATIVE AUTOMATIC abstracting METHOD BASED ON BERT AND EXTERNAL KNOWLES

A technology of automatic summarization and external knowledge, applied in knowledge expression, neural learning method, text database query, etc., can solve the problem of inability to accurately express the subject of the document, without considering external prior knowledge, and difficult to ensure the consistency and consistency of the generated summary and other problems to achieve the effect of improving integrity and fluency and improving quality

Pending Publication Date: 2022-04-26
CHONGQING UNIV OF POSTS & TELECOMM
View PDF0 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The above technologies are based on the original document to directly generate abstracts. Compared with manually writing abstracts, external prior knowledge is not considered, resulting in the generated abstracts being unable to accurately express the purpose of the document, and it is difficult to ensure the coherence and consistency of the generated abstracts.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • GENERATIVE AUTOMATIC abstracting METHOD BASED ON BERT AND EXTERNAL KNOWLES
  • GENERATIVE AUTOMATIC abstracting METHOD BASED ON BERT AND EXTERNAL KNOWLES
  • GENERATIVE AUTOMATIC abstracting METHOD BASED ON BERT AND EXTERNAL KNOWLES

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0047] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0048] A generative automatic summarization method based on BERT and external knowledge, such as figure 2 As shown, a generative automatic summarization model is constructed. The generative automatic summarization model includes a TextRank module, a BERT model, an external knowledge module and a Transformer model. The method includes the following steps:

[0049] 101. Acquire document data, and acquire keywords corresponding to the document data through the T...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the field of natural language processing, and particularly relates to a BERT and external knowledge-based generative automatic abstracting method, which comprises the following steps of: acquiring document data, and acquiring keywords corresponding to the document data through a TextRank module; inputting the document data into the BERT model for encoding to obtain encoded document information; external knowledge is retrieved from an external knowledge module through keywords, and the external knowledge and document information are fused through a gating mechanism; the fused information is input into a Transform model to be decoded, and an abstract is generated; according to the method, the BERT model is used for encoding the document data to capture more context information and internal information, the encoding quality is improved, the keyword is used for obtaining external knowledge to be fused with the document information, the Transform model is used for enriching the semantics of the generated abstract, the smoothness and integrity of the generated abstract are improved, and the high-quality abstract is generated.

Description

technical field [0001] The invention belongs to the field of natural language processing, and in particular relates to a generative automatic summarization method based on BERT and external knowledge. Background technique [0002] With the advancement of technology and the vigorous development of the mobile Internet industry, every netizen and even every terminal has become a producer of Internet information. In the face of massive amounts of information, the phenomenon of information overload is becoming more and more serious. How to enable people to efficiently obtain the information they need has become a great challenge in today's era. In order to obtain the required information more efficiently, automatic text summarization has gradually become an indispensable technology. [0003] Automatic text summarization can be divided into extractive automatic text summarization and generative automatic text summarization. Extractive summarization generates summaries by selecti...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/34G06F16/33G06F40/216G06F40/289G06N3/04G06N3/08G06N5/02
CPCG06F16/345G06F16/3334G06F40/289G06F40/216G06N3/082G06N5/022G06N3/047G06N3/045
Inventor 张璞尘勇谢传威
Owner CHONGQING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products