Multilingual automatic abstract method

A multilingual automatic summarization technology, applied in neural learning methods, special data processing applications, instruments, etc., that addresses problems such as weak sentence coherence, dependence on machine translation results, and short generative summaries.

Active Publication Date: 2019-05-31
YANBIAN UNIV
7 Cites, 19 Cited by

AI Technical Summary

Problems solved by technology

[0010] (1) Sentences extracted by traditional extractive summarization contain a lot of redundant information, cohere weakly with one another, and read poorly, whereas generative (abstractive) summaries are short, have low redundancy, and generalize the content well;
[0011] (2) Multilingual automatic summarization based on machine translation is relatively simple: texts in different languages are translated into one language and then summarized automatically. This approach depends heavily on the quality of the machine translation output and executes inefficiently.



Examples


Embodiment Construction

[0036] To make the purpose, technical solutions, and advantages of the embodiments of the present invention clearer, the technical solutions in these embodiments are described clearly and completely below with reference to the accompanying drawings. The described embodiments are some, but not all, of the embodiments of the present invention. All other embodiments obtained by persons of ordinary skill in the art from these embodiments without creative effort fall within the protection scope of the present invention.

[0037] This automatic summarization system targets scientific and technical documents in three languages: Chinese, Korean, and English. For a single text, it generates a natural-language abstract that describes the general content of the text, in the same language as the source text. A text set in...


Abstract

The invention relates to the technical field of text generation in natural language processing, and in particular to a multilingual automatic summarization method. The method comprises a complete automatic summarization system divided into a model training module, a single-document summarization module, and a multi-document summarization module. The model training module is divided into a text preprocessing module and a training module; the single-document summarization module is divided into a text preprocessing module and a summary generation module; and the multi-document summarization module is divided into a text preprocessing module, a multilingual sentence clustering module, and a summary generation module. The model in the model training module is a seq2seq neural network model, and a training text is obtained through summary-summary generation. The invention designs and implements a multilingual generative automatic summarization system that uses bilingual word embeddings and deep learning to generate a brief summary of a text or text set specified by the user, helping the user grasp the gist of the original text and quickly find the most needed information.
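The multilingual sentence clustering step named in the abstract can be pictured with a small sketch. It is only an illustration under stated assumptions: the embedding table `EMB`, the function names, the averaging of word vectors, and the greedy threshold clustering are all hypothetical and are not taken from the patent, which does not describe the clustering algorithm in the text shown here. The point is only that, once words from different languages share one embedding space, sentences in Chinese, Korean, and English can be compared directly without translating them first.

```python
import math

# Hypothetical toy "bilingual" embeddings: words from different languages
# that mean the same thing sit close together in one shared vector space.
EMB = {
    "network": (1.0, 0.0), "网络": (0.9, 0.1), "신경망": (0.95, 0.05),
    "summary": (0.0, 1.0), "摘要": (0.1, 0.9), "요약": (0.05, 0.95),
}

def sentence_vector(tokens):
    """Average the vectors of the known tokens (assumed sentence representation)."""
    vecs = [EMB[t] for t in tokens if t in EMB]
    if not vecs:
        return (0.0, 0.0)
    return tuple(sum(component) / len(vecs) for component in zip(*vecs))

def cosine(a, b):
    """Cosine similarity; returns 0.0 when either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def cluster_sentences(sentences, threshold=0.8):
    """Greedy clustering (an assumption): a sentence joins the first cluster
    whose first member is similar enough, otherwise it starts a new cluster."""
    clusters = []
    for tokens in sentences:
        vec = sentence_vector(tokens)
        for cluster in clusters:
            if cosine(vec, sentence_vector(cluster[0])) >= threshold:
                cluster.append(tokens)
                break
        else:
            clusters.append([tokens])
    return clusters

# Pre-tokenized sentences in English, Chinese, and Korean.
sentences = [["network"], ["摘要"], ["신경망"], ["summary"]]
clusters = cluster_sentences(sentences)
```

With these toy vectors, the English and Korean "network" sentences fall into one cluster and the Chinese and English "summary" sentences into another. A real system would use trained cross-lingual embeddings and a proper tokenizer for each language.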

Description

Technical field

[0001] The invention relates to the technical field of text generation in natural language processing, in particular to a multilingual automatic summarization method.

Background

[0002] Text summarization usually refers to generating, from one or more documents, a piece of text that conveys the main information of the original while taking up no more than half of its length, and often much less. For example, summarizing a 1500-word text into a 150-word abstract saves readers a lot of reading time and also compresses the information.

[0003] According to how the summary is produced, automatic summarization can be divided into extractive summarization and generative summarization (abstractive summarization). The feature of the extractive summary is that the sentences in the summary are sentences from the original text, also called "sentence excerpts", while the feature of ...
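The extractive approach and the compression example from the background can be made concrete with a minimal sketch. This is a generic word-frequency baseline chosen for illustration, not the method of the invention (which is generative and seq2seq-based). The function names, the regex sentence splitter, and the English-only tokenization are assumptions.

```python
import re
from collections import Counter

def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]

def extractive_summary(text, max_sentences=1):
    """Pick the sentences whose words are, on average, most frequent in the text.
    Every output sentence is copied verbatim from the source ("sentence excerpts")."""
    sentences = split_sentences(text)
    freq = Counter(re.findall(r"[a-z]+", text.lower()))

    def score(sentence):
        tokens = re.findall(r"[a-z]+", sentence.lower())
        return sum(freq[t] for t in tokens) / len(tokens) if tokens else 0.0

    ranked = sorted(range(len(sentences)),
                    key=lambda i: score(sentences[i]), reverse=True)
    keep = sorted(ranked[:max_sentences])  # restore original sentence order
    return " ".join(sentences[i] for i in keep)

def compression_ratio(source, summary):
    """Summary length divided by source length, both counted in words."""
    return len(summary.split()) / len(source.split())

text = ("Summarization compresses text. "
        "Summarization keeps the main information of the text. "
        "Cats sleep.")
summary = extractive_summary(text, max_sentences=1)
```

Because the baseline only copies sentences, it shows the weaknesses listed for extractive summaries: it cannot merge or rephrase content. For the 1500-word to 150-word example above, `compression_ratio` would return 0.1.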


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/27, G06N3/04, G06N3/08
CPC: Y02D10/00
Inventor: 赵亚慧, 易志伟, 崔荣一, 孟先艳, 田明杰, 徐凯斌, 杨飞扬, 王琪, 黄政豪, 金国哲, 张振国, 胡荣, 王大千
Owner: YANBIAN UNIV