Graph model-based automatic abstracting method

An automatic summarization technique based on a graph model, applied in special data processing applications, instruments, electronic digital data processing, and related fields. It addresses the difficulty of accurately measuring the semantic correlation between sentences and the neglect of the attributes of text units.

Active Publication Date: 2016-01-13
TONGJI UNIV
2 Cites, 41 Cited by

AI Technical Summary

Problems solved by technology

[0007] The purpose of the present invention is to overcome two problems in the prior art, namely the difficulty of accurately measuring the semantic correlation between sentences and the neglect of some attributes of the text units themselves, and to provide an improved automatic summarization method based on the graph model.

Examples

Embodiment Construction

[0052] To make the purpose, technical solution, and advantages of the present invention clearer, the automatic summarization method implemented according to the present invention is described in further detail below. It should be understood that the specific embodiments described here only explain the present invention and do not limit it. The protection scope of the present invention is not limited to the following embodiments. On the contrary, those skilled in the art can make appropriate changes according to the inventive concept of the present invention, and these changes fall within the scope of the invention defined by the claims.

[0053] The automatic summarization method according to the specific embodiment of the present invention comprises the following steps:

[0054] 1) Document preprocessing module:

[0055] Select a specific document set, the document set shou...

Abstract

The invention relates to the field of automatic abstracting and discloses a graph model-based automatic abstracting method. In the technical scheme, an LDA probabilistic topic model is applied to measure the semantic correlation between sentences, which improves the measurement of sentence correlation. The scheme also introduces the ideas of topic correlation and position sensitivity of sentences, so that summary generation is more reasonable and effective. The method comprises the following steps: first, training the LDA topic model to obtain the topic probability distribution of the document and the word probability distribution of each topic, determining the topic probability distribution of each sentence, and thereby converting the measurement of semantic similarity between sentences into a measurement of similarity between the sentences' topic probability distributions; taking the sentences as nodes and building edges according to the semantic similarity between sentences, measured with cosine similarity, to generate a text graph representing the document; calculating the topic correlation of each sentence from the topic probability distribution of the sentence and the topic probability distribution of the document; and calculating the position sensitivity and similar attributes of each sentence from its position in the document.
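The steps listed in the abstract can be sketched roughly as follows. This is a minimal illustration and not the patented procedure: it assumes the sentence and document topic distributions have already been produced by a trained LDA model, and the edge threshold, the `1/position` weighting, the product used to combine topic correlation with position sensitivity, and the biased PageRank-style ranking are all assumptions introduced here, since the abstract does not specify them. All function names are hypothetical.

```python
import numpy as np

def cosine_matrix(X):
    """Pairwise cosine similarity between the rows of X."""
    norms = np.linalg.norm(X, axis=1, keepdims=True)
    unit = X / np.clip(norms, 1e-12, None)
    return unit @ unit.T

def build_text_graph(sent_topics, threshold=0.5):
    """Weighted adjacency matrix with sentences as nodes.
    Two sentences are linked when the cosine similarity of their topic
    distributions reaches `threshold` (an assumed cutoff)."""
    sim = cosine_matrix(sent_topics)
    np.fill_diagonal(sim, 0.0)  # no self-loops
    return np.where(sim >= threshold, sim, 0.0)

def topic_correlation(sent_topics, doc_topics):
    """Cosine similarity of each sentence's topic distribution to the
    document's topic distribution (assumed definition)."""
    norms = np.linalg.norm(sent_topics, axis=1) * np.linalg.norm(doc_topics)
    return (sent_topics @ doc_topics) / np.clip(norms, 1e-12, None)

def position_sensitivity(n):
    """Assumed weighting: earlier sentences count more, 1/position."""
    return 1.0 / np.arange(1, n + 1)

def rank_sentences(adj, prior, damping=0.85, iters=100, tol=1e-9):
    """Biased PageRank-style iteration (an assumption, not from the source).
    `prior` carries the sentence attributes into the random-jump term."""
    n = adj.shape[0]
    prior = prior / prior.sum()
    row_sums = adj.sum(axis=1, keepdims=True)
    # Rows without edges fall back to the prior distribution.
    trans = np.where(row_sums > 0,
                     adj / np.clip(row_sums, 1e-12, None),
                     prior[None, :])
    scores = np.full(n, 1.0 / n)
    for _ in range(iters):
        new = (1 - damping) * prior + damping * (trans.T @ scores)
        converged = np.abs(new - scores).sum() < tol
        scores = new
        if converged:
            break
    return scores

# Hypothetical topic distributions for 4 sentences over 3 topics.
sent_topics = np.array([[0.8, 0.1, 0.1],
                        [0.7, 0.2, 0.1],
                        [0.1, 0.8, 0.1],
                        [0.6, 0.3, 0.1]])
doc_topics = np.array([0.6, 0.3, 0.1])

adj = build_text_graph(sent_topics)
prior = topic_correlation(sent_topics, doc_topics) * position_sensitivity(len(sent_topics))
scores = rank_sentences(adj, prior)
# Pick the two highest-scoring sentences, reported in document order.
summary_idx = sorted(np.argsort(-scores)[:2].tolist())
```

Keeping the attribute weights in the jump term, rather than in the edge weights, leaves the graph a pure similarity structure. Other ways of combining the three signals are equally plausible readings of the abstract.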

Description

Technical field

[0001] The invention relates to the field of automatic summarization, in particular to an automatic summarization method based on a graph model.

Background art

[0002] Automatic summarization technology uses computers to process documents automatically and generate a summary containing the core content of the original document. By compressing documents in this way, it lets people find and obtain the information they need in less time, which can effectively alleviate the problem of information overload.

[0003] Although automatic summarization has a long research history dating back to 1960, the extraction-based summarization method, which directly extracts key sentences from the original text to generate a summary, is still the most mainstream method in this field. The core idea of extraction-based automatic summarization is: first, statistically analyze the various features of sentences in one or more documents, calculate the importance of sentences, ...
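The general extraction-based idea in paragraph [0003] can be illustrated with a small sketch. The source text is truncated after the scoring step, so everything past that point is an assumption made here: average word frequency stands in for the "features" being analyzed, and the summary is taken to be the `k` highest-scoring sentences in their original order. The function names are hypothetical.

```python
import re
from collections import Counter

def split_sentences(text):
    """Naive sentence splitter on ., ! and ? followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]

def sentence_scores(sentences):
    """Score each sentence by the average document frequency of its words
    (an assumed feature; the source does not name specific features)."""
    tokenized = [re.findall(r"[a-z']+", s.lower()) for s in sentences]
    freq = Counter(w for toks in tokenized for w in toks)
    return [sum(freq[w] for w in toks) / len(toks) if toks else 0.0
            for toks in tokenized]

def extract_summary(text, k=2):
    """Return the k highest-scoring sentences in document order."""
    sentences = split_sentences(text)
    scores = sentence_scores(sentences)
    top = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)[:k]
    return [sentences[i] for i in sorted(top)]

text = "Graphs model text. Graphs link sentences in text. Cats sleep."
summary = extract_summary(text, k=2)
# summary == ["Graphs model text.", "Graphs link sentences in text."]
```

A graph-based method replaces the per-sentence frequency score with a score derived from how sentences relate to each other, which is the direction the invention takes.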

Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G06F17/30
CPC: G06F16/345
Inventor: 王俊丽, 魏绍臣, 管敏
Owner: TONGJI UNIV