Generation method of single document summaries

A technology of document summarization and summarization, applied in the field of single-document summarization generation, can solve the problems of low summarization accuracy and underutilization of a single document, and achieve the effects of accurate extraction, improved extraction accuracy, and high summarization accuracy.

Inactive Publication Date: 2013-06-05
NINGBO CHENGDIAN TAIKE ELECTRONICS INFORMATION TECH DEV
View PDF2 Cites 35 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] Existing technologies either extend a single document into multiple documents and use multi-document summarization for single-document summarization, or only use a single document for summarization, but still do not make full use of the content of a single document, resulting in low extraction accuracy of the summaries

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Generation method of single document summaries
  • Generation method of single document summaries
  • Generation method of single document summaries

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0017] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be described in further detail below with reference to the accompanying drawings and examples.

[0018] The single-document summarization method of the embodiment of the present invention, the flow chart is as follows figure 1 As shown, it specifically includes the following steps:

[0019] S1. Cluster the paragraphs of the document to be summarized, and each category is a semantic block;

[0020] S2. Calculate the similarity between two sentences in the semantic block, and use it as a score for one sentence to another sentence. The sentence with the highest score is the core sentence expressing the content of the part in each semantic block;

[0021] S3. According to the order of appearance of the core sentences, connect the sentences to generate a summary.

[0022] That is, cluster the paragraphs of the document to be summarized, divide the par...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a generation method of single document summaries. The method includes the steps of clustering paragraphs of a document to be summarized, and defining each class as a semantic block; calculating similarity of each sentence pair in the semantic blocks to score one sentence with the other sentence, and defining the sentence with highest score as a sentence expressing partial core content in each semantic block; and connecting the sentences to generate a summary according to emergency sequence of core sentences. Word similarity and named entity recognition are introduced to single document summaries, so that extracting precision of summaries is higher. Clustering speed is increased by means of single pass. Single document summaries can be extracted accurately. The generation method is high in accuracy of extracting news and announcement documents.

Description

technical field [0001] The invention belongs to the technical field of computer applications, and in particular relates to a method for generating single-document summaries. Background technique [0002] With the rapid increase in the number of electronic texts, the demand for fast access to text information is becoming stronger and stronger. As a technique to condense textual information, automatic summarization can play an important role. The purpose of automatic summarization is to provide users with short text representations. Form the shortest possible abstract while retaining as much information from the original text as possible. For an ideal extractive summary, it has three basic characteristics: it is derived from the text, retains important information, and is short in length. According to the number of texts the abstract comes from, it can be divided into single-text abstract and multi-text abstract. According to the way of summarization, it is divided into ge...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 薛世帅郭成林彭春林刘红玉高云棋刘丹
Owner NINGBO CHENGDIAN TAIKE ELECTRONICS INFORMATION TECH DEV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products