Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A text abstract generation method based on a K-means model and a neural network model

A neural network model and summary technology, applied in unstructured text data retrieval, text database clustering/classification, text database browsing/visualization, etc., can solve the problem of unrealistic manpower, poor content and language quality, and no logical correlation words, etc. question

Active Publication Date: 2019-06-14
桂林远望智能通信科技有限公司
View PDF9 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The writing of these abstracts requires reading the entire article and an in-depth understanding of the article, so the process of writing an abstract requires a lot of manpower and material resources. For some professional articles, editors with professional knowledge and industry experience are required, and those who can meet these requirements people are few
In today's society, the rapid development of the Internet and the sharp increase in the amount of information have led to an increasingly scarce manpower for manually compiling summaries. In order to obtain more important information, it is obviously unrealistic to invest a lot of manpower
[0003] The methods for automatically generating text summaries in the current technology are basically extractive, and the basic process is divided into two steps. First, the paragraphs, sentences, phrases, and keywords in the article are extracted through linguistic knowledge or statistical analysis; then the extracted The text is recombined to obtain a text summary. Although the extraction method is used to extract the summary to a certain extent, it can help people quickly understand the important information of the article, but the quality of the content and language is not satisfactory, because of Sentences are just a simple patchwork of important sentences in the original text, without logical correlative words, resulting in fragmented and ambiguous information, which can easily lead to inaccurate understanding by users

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A text abstract generation method based on a K-means model and a neural network model
  • A text abstract generation method based on a K-means model and a neural network model
  • A text abstract generation method based on a K-means model and a neural network model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] The principles and features of the present invention are described below in conjunction with the accompanying drawings, and the examples given are only used to explain the present invention, and are not intended to limit the scope of the present invention.

[0037] like figure 1 As shown, a method for generating text summaries based on K-means model and neural network model, including:

[0038] 110, the original text is preprocessed to be divided into individual sentences and words, and the sentences and words are input into the doc2vec model, and the sentence vector is obtained through training;

[0039] 120. Determine the number of cluster centers of the original text, and input the sentence vector into an unsupervised K-means model, and train to obtain a cluster center vector;

[0040] 130. Calculate the Euclidean distance between the cluster center vector and the sentence vector, and extract the sentence corresponding to the sentence vector closest to the cluster c...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a text abstract generation method based on a K-means model and a neural network model, and the method comprises the steps of preprocessing an original text, obtaining single sentences and words through segmentation, inputting the sentences and words into a doc2vec model, and carrying out the training, so as to obtain sentence vectors; determining the number of clustering centers of the original text, inputting the sentence vector into an unsupervised K-means model, and training to obtain a clustering center vector; calculating the Euclidean distance between the clustering center vector and the sentence vector, and extracting a sentence corresponding to the sentence vector closest to the clustering center to serve as a reference abstract; and inputting the original text, the reference abstract and the words into a generative neural network model to generate a text abstract. The method has the beneficial effects that the unsupervised model and the supervised neural network model are combined, so that the generated text abstract can be semantically coherent and is convenient for a user to understand.

Description

technical field [0001] The invention relates to the technical field of language processing, specifically, a method for generating text summaries based on a K-means model and a neural network model. Background technique [0002] We are in the era of information explosion. While enjoying the convenience brought by a variety of information, people are still eager to remove redundancy, refine and condense the core content of the obtained information, and use fewer sentences Replace the central idea of ​​information, thereby improving efficiency and saving search and reading time. A small number of scientific and technological articles contain abstracts, while news reports and social science articles do not contain abstracts, which requires readers to read the entire article to obtain the main information. The writing of these abstracts requires reading the entire article and an in-depth understanding of the article, so the process of writing an abstract requires a lot of manpow...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/35G06F16/34G06K9/62
Inventor 蔡晓东秦菲
Owner 桂林远望智能通信科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products