Chinese abstract generation method and system and storage medium

A summary and Chinese technology, applied in the field of Chinese summary generation, can solve the problems of low quality and poor readability of Chinese text summary generation, and achieve the effect of improving generation quality and readability

Active Publication Date: 2019-12-03
NANJING COLLEGE OF INFORMATION TECH
View PDF2 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Currently commonly used summarization methods generally have the technical problems of low quality and poor readability of Chinese text summarization

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chinese abstract generation method and system and storage medium
  • Chinese abstract generation method and system and storage medium
  • Chinese abstract generation method and system and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] The present invention will be further described below in conjunction with the accompanying drawings. The following examples are only used to illustrate the technical solution of the present invention more clearly, but not to limit the protection scope of the present invention.

[0044] Such as figure 1 Shown is a flow chart of a method for generating a Chinese abstract according to an embodiment of the present invention, which specifically includes the following steps:

[0045] a) Text preprocessing: After the target text is segmented, word vectorization processing is performed, and a corresponding vocabulary is constructed, and the formed word vector sequence is used as the input of the next stage.

[0046] b) Semantic understanding: the memory function of the cyclic neural network, the word vector sequence of the first stage is input into the encoder once (using a bidirectional long-term short-term memory neural network (Bi-LSTM)), the encoder generates the semantic ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a Chinese abstract generation method and system and a storage medium, and the method comprises the steps: obtaining a target text, and determining a Chinese word vector sequence of the target text; inputting the Chinese word vector sequence into a pre-trained encoder to generate a semantic vector; reconstructing full-text semantics most suitable for the current moment according to the semantic vector, and transmitting intermediate semantics summarizing the full-text semantics after reconstruction to a pre-trained decoder; and enabling the decoder to deduce the distribution of the words at the next moment according to the words predicted at the previous moment and the intermediate semantics summarizing the full-text semantics, wherein the finally generated word sequence is the abstract of the target text. The generation quality and readability of the Chinese text abstract can be improved.

Description

technical field [0001] The invention relates to a Chinese abstract generation method, system and storage medium, belonging to the technical field of text information processing. Background technique [0002] Automatic summarization is a technology that uses computers to realize automatic text analysis, content summary and abstract generation. It is an auxiliary means to solve the current problem of information surplus. It can help humans further understand natural language texts and obtain key information more quickly, accurately and comprehensively. , has important practical significance in both industry and commerce. [0003] Currently, the commonly used summarization methods generally have the technical problems of low quality and poor readability of Chinese text summarization. Contents of the invention [0004] The purpose of the present invention is to overcome the deficiencies in the prior art, and provide a Chinese abstract generation method, system and storage med...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06F16/35G06F16/34G06N3/04G06N3/08
CPCG06F16/35G06F16/345G06N3/08G06N3/044G06N3/045Y02D10/00
Inventor 李维勇柳斌张伟李建林李方方
Owner NANJING COLLEGE OF INFORMATION TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products