Method for generating guided text abstract based on Transformer

A text and summarization technology, applied in the field of guided text summarization generation based on Transformer, can solve the problems of poor dependence and low computational efficiency of long texts

Active Publication Date: 2020-11-06
BEIJING UNIV OF TECH
View PDF5 Cites 22 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The traditional Seq2Seq (Sequence to Sequence) framework based on Recurrent Neural Networks (RNN) can only input one word at a time, has low computational efficiency, and is less dependent on long texts

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for generating guided text abstract based on Transformer
  • Method for generating guided text abstract based on Transformer
  • Method for generating guided text abstract based on Transformer

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0052] Combining with specific embodiments of the present invention figure 2 For further detailed description, the following examples are used to illustrate the present invention, but not to limit the scope of the present invention.

[0053] Its specific implementation steps are as follows:

[0054] 1. A method for generating a Transformer-based introductory text summary, comprising the following steps:

[0055] Data preprocessing stage

[0056] Step 1 Key Semantic Feature Extraction

[0057] Step 1.1 Keyword acquisition: first, word segmentation is performed on the text in the data set, and the original text after word segmentation is recorded as Test=(w 1 ,w 2 ,...,w n ), n represents the number of words in the text, and delete the text with the number of words i The TextRank value of v i with TF-IDF value u i , and finally use the following formula to calculate the importance k of each word in the text i , and arranged in descending order of importance, extract the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a method for generating a guided text abstract based on Transformer, and belongs to the technical field of information processing. According to the invention, a deep learningalgorithm and a machine learning algorithm are combined to solve the problem of automatically obtaining the text abstract under the big data condition. Firstly, a text key semantic feature extractionmethod is constructed, and key semantic features of a text are obtained through the method; the method also includes, secondly, converting the long text into a key short text in combination with an extraction type abstract method to serve as input of an abstract model; and finally, constructing a text abstract generation model based on Transformer by utilizing the extracted text key semantic features. In the abstract generation model, the attention mechanism is corrected by using the key semantic features, so that the generation model can generate more abstract contents rich in key information, and a pointer and a coverage mechanism are added, so that the abstract generation model can better solve the OOV problem and the repeated fragment generation problem in the abstract generation process.

Description

technical field [0001] The invention belongs to the technical field of information processing, and relates to a Transformer-based guiding text summary generation method. Background technique [0002] Automatic text summarization is the process of extracting the most critical information from the original text, and then constructing the important content required by users. The automatic text summarization method refers to the use of machines to automatically summarize a concise and readable summary sequence rich in key information from a large amount of text data. At present, from the perspective of construction methods, automatic text summarization mainly includes two types: generative summarization and extractive summarization. Extractive summarization uses some statistical methods to sort the importance of all sentences in the original text, and then extracts several important sentences with the highest importance as the summary of the text. Generative summarization refe...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/34G06F40/126G06F40/295G06F40/30G06N3/04G06N3/08
CPCG06F16/345G06F40/126G06F40/295G06F40/30G06N3/049G06N3/08G06N3/047G06N3/044G06N3/045
Inventor 刘磊孙应红侯良文李静
Owner BEIJING UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products