Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Hybrid text abstract generation method and system, terminal and storage medium

A hybrid, text-based technology, applied in neural learning methods, biological neural network models, instruments, etc., can solve problems such as no description or report found, training method is not end-to-end, information loss, etc., to achieve both information and simplicity performance, rich model information, and enhanced collaboration

Active Publication Date: 2021-03-09
SHANGHAI JIAO TONG UNIV
View PDF9 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, since all sentences are indiscriminately compressed and truncated, they suffer from information loss during the abstraction stage
When the whole sentence is crucial, some important content will be deleted by mistake, causing serious information loss
Furthermore, the training methods for these techniques are not end-to-end due to the lack of an effective reinforcement learning framework to connect the two modules
[0003] At present, there is no description or report of the similar technology of the present invention, and no similar data at home and abroad have been collected yet.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Hybrid text abstract generation method and system, terminal and storage medium
  • Hybrid text abstract generation method and system, terminal and storage medium
  • Hybrid text abstract generation method and system, terminal and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0057] The following is a detailed description of the embodiments of the present invention: this embodiment is implemented on the premise of the technical solution of the present invention, and provides detailed implementation methods and specific operation processes. It should be noted that those skilled in the art can make several modifications and improvements without departing from the concept of the present invention, and these all belong to the protection scope of the present invention.

[0058] figure 1 It is a flowchart of a method for generating a hybrid text summarization in an embodiment of the present invention.

[0059] like figure 1 As shown, the method for generating a hybrid text summary provided in this embodiment may include the following steps:

[0060] S100, segmenting the input text into sentences and words, and recording the index of the sentence where each word is located;

[0061] S200, sequentially perform word and sentence representation on the res...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a hybrid text abstract generation method and system, and the method comprises the steps: carrying out the sentence segmentation and word segmentation of an input text, and recording the index of a sentence where each word is located; successively representing words and sentences for the clause and word segmentation results to obtain sentence vectors; copying each sentence vector, and respectively marking copied and rewritten vector marks on the two vectors; extracting important sentence vectors, and making a copy or rewrite decision according to vector marks; editing andmodifying sentences needing to be rewritten to obtain text abstracts; and training the adopted neural network to complete gradient updating of the parameters. Meanwhile, the invention provides a corresponding terminal and a storage medium. According to the method, the extracted sentences and the abstract sentences are mixed in the abstract for the first time, and the sentences directly used for the abstract and the rewritten sentences are distinguished through a copying or rewriting mechanism; according to the hierarchical reinforcement learning method training method, the extracted sentencesare used as tasks from managers to workers, and the collaboration between two networks is improved.

Description

technical field [0001] The present invention relates to the technical field of natural language processing, in particular to a hybrid text summarization method, system, terminal and storage medium based on layered reinforcement learning. Background technique [0002] The goal of a summary is to rewrite a long essay into a short, fluent version while keeping the most salient bits. With the successful application of neural networks in natural language processing (NLP) tasks, two data-driven branch extraction and abstract summarization stand out from various methods. The extraction method generally selects the most prominent sentence from the source article as the abstract, the content selection is accurate, and the result has a large amount of information, but because the sentence is not rewritten, the redundancy is high. On the contrary, abstract methods can generate more concise summaries by compressing and paraphrasing, but existing models are weak in content selection and...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/284G06F40/211G06N3/04G06N3/08
CPCG06F40/284G06F40/211G06N3/08G06N3/045
Inventor 金耀辉何浩肖力强陈文清田济东
Owner SHANGHAI JIAO TONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products