A method and device for extracting abstracts

An abstract-extraction technology, applied in the field of Internet news, that addresses the time-consuming manual extraction of abstracts and improves readability and reading efficiency.

Active Publication Date: 2019-08-27
NEUSOFT CORP

AI Technical Summary

Problems solved by technology

[0005] To solve the technical problem in the prior art that manually extracting summaries is time-consuming, a method and device for extracting summaries are provided. They can automatically extract summaries of news and other manuscripts and improve the efficiency of summary extraction.



Examples


Embodiment 1

[0060] See Figure 1, which is a flow chart of Embodiment 1 of the method for extracting abstracts provided by the present invention.

[0061] The method for extracting the abstract provided in this embodiment includes the following steps:

[0062] S101: Split the manuscript to be extracted into paragraphs, and split each of the split paragraphs into sentences;

[0063] It can be understood that the manuscript to be extracted may be any manuscript on the Internet or any other electronic manuscript, for example, news or biographies.

[0064] It should be noted that splitting a manuscript into paragraphs and splitting a paragraph into sentences are relatively mature techniques and are not described in detail here.
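
Step S101 can be illustrated with a minimal Python sketch. The source does not specify how the splitting is done, so everything here is an assumption: the function names `split_paragraphs` and `split_sentences` are hypothetical, paragraphs are assumed to be separated by line breaks, and sentences are assumed to end with Western or Chinese terminal punctuation. This naive rule mis-splits abbreviations and decimal numbers.

```python
import re

def split_paragraphs(text):
    """Split a manuscript into paragraphs on line breaks (assumed delimiter)."""
    return [p.strip() for p in re.split(r"\n+", text) if p.strip()]

# A run of non-terminal characters followed by any terminal punctuation.
# Covers both Western (. ! ?) and Chinese (。！？) sentence endings.
_SENTENCE = re.compile(r"[^.!?。！？]+[.!?。！？]*")

def split_sentences(paragraph):
    """Split one paragraph into sentences, keeping the end punctuation."""
    return [s.strip() for s in _SENTENCE.findall(paragraph) if s.strip()]

def split_manuscript(text):
    """Return a list of paragraphs, each a list of sentences (step S101)."""
    return [split_sentences(p) for p in split_paragraphs(text)]
```

For example, `split_manuscript("First one. Second one!\n\nThird one? Fourth.")` yields two paragraphs with two sentences each.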

[0065] S102: Screen candidate abstract sentences according to the correlation between the sentence and the title of the manuscript;

[0066] The title of a manuscript generally summarizes its gist. Therefore, the relevance of t...
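
The screening in S102 can be sketched as follows. The visible text does not say how the correlation between a sentence and the title is computed, so this sketch swaps in a simple stand-in: the fraction of title words that also appear in the sentence. The names `title_relevance` and `screen_candidates` and the `threshold` parameter are hypothetical. Chinese text would first need word segmentation, which is not shown.

```python
import re

def tokens(text):
    """Lowercased set of word tokens (assumed tokenization, not from the source)."""
    return set(re.findall(r"\w+", text.lower()))

def title_relevance(sentence, title):
    """Share of title words that occur in the sentence, from 0.0 to 1.0."""
    title_tokens = tokens(title)
    if not title_tokens:
        return 0.0
    return len(tokens(sentence) & title_tokens) / len(title_tokens)

def screen_candidates(sentences, title, threshold=0.5):
    """Keep sentences whose relevance to the title meets the assumed threshold."""
    return [s for s in sentences if title_relevance(s, title) >= threshold]
```

With the title "City opens new library", the sentence "The city opens a new library downtown." is kept and "Rain is expected tomorrow." is dropped.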

Embodiment 2

[0077] See Figure 2, which is a flow chart of Embodiment 2 of the method for extracting abstracts provided by the present invention.

[0078] In the method for extracting an abstract provided in this embodiment, S201 is the same as S101 and is not repeated here. This embodiment describes only the case in which the total word count of all abstract candidate sentences is judged to be greater than the predetermined word count of the abstract.

[0079] The screening of candidate abstract sentences according to the correlation between the sentence and the title of the manuscript specifically includes:

[0080] S202a: Use the first sentence in each paragraph as the first type of summary candidate sentence.

[0081] It should be noted that, since the first sentence of a paragraph is generally a generalization and summary of the entire paragraph, directly using the first sentence of a paragraph as a candidate sentence for the abstract can also be...
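
Step S202a can be sketched in a few lines of Python. The input format is an assumption: `paragraph_sentences` is a list of paragraphs, each already split into a list of sentences, and the function name `first_sentence_candidates` is hypothetical.

```python
def first_sentence_candidates(paragraph_sentences):
    """Return the first sentence of every non-empty paragraph (step S202a)."""
    return [sentences[0] for sentences in paragraph_sentences if sentences]

# Usage: two paragraphs plus one empty paragraph, which is skipped.
paragraphs = [
    ["The council approved the budget.", "Debate lasted two hours."],
    [],
    ["Work starts in May.", "It should finish by autumn."],
]
first_type = first_sentence_candidates(paragraphs)
```

Here `first_type` holds the opening sentences of the first and third paragraphs.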



Abstract

The invention provides a method and apparatus for extracting an abstract. The method includes: dividing the manuscript whose abstract is to be extracted into paragraphs and dividing each paragraph into sentences; selecting candidate abstract sentences according to the degree of correlation between each sentence and the title of the manuscript; if the total character count of all candidate abstract sentences is less than or equal to the preset character count of the abstract, taking all candidate abstract sentences as abstract sentences; and if the total character count is greater than the preset character count, obtaining the weight value of each candidate abstract sentence, ranking the candidates by weight value, and selecting the top-ranked candidates as abstract sentences according to the preset character count. With this method, an abstract is formed automatically from the content of a manuscript, which allows a reader to quickly understand the main content and improves reading efficiency. Because the abstract sentences are complete sentences selected directly from the manuscript, incomplete sentence fragments are not generated, which improves the readability of the abstract.
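
The length check and ranking described in this abstract can be sketched in Python. This is an illustration under stated assumptions, not the patented implementation: the weights are taken as given inputs because the visible text does not say how they are computed, the function name `select_abstract` is hypothetical, selection stops at the first ranked sentence that would exceed the preset character count, and the chosen sentences are returned in manuscript order.

```python
def select_abstract(candidates, weights, max_chars):
    """Pick abstract sentences from candidates within a character budget."""
    if len(candidates) != len(weights):
        raise ValueError("each candidate needs exactly one weight")

    # Case 1: everything fits, so all candidates become abstract sentences.
    if sum(len(s) for s in candidates) <= max_chars:
        return list(candidates)

    # Case 2: rank candidate indices by weight, highest first.
    ranked = sorted(range(len(candidates)),
                    key=lambda i: weights[i], reverse=True)

    chosen, used = [], 0
    for i in ranked:
        # Assumption: stop at the first sentence that exceeds the budget.
        if used + len(candidates[i]) > max_chars:
            break
        chosen.append(i)
        used += len(candidates[i])

    # Assumption: present the selected sentences in manuscript order.
    return [candidates[i] for i in sorted(chosen)]
```

Because only whole sentences are ever selected, the result contains no truncated fragments, which matches the readability point made above.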

Description

Technical field

[0001] The invention relates to the technical field of Internet news, in particular to a method and device for extracting abstracts.

Background technique

[0002] With the rapid development of the Internet, the amount of news information released on the Internet is increasing. Many TV stations need to aggregate Internet news and provide summaries for users. Users browse the abstract and then decide whether to read the corresponding full-text content in detail.

[0003] However, the amount of Internet news information is too large and the update frequency is very fast. It takes a long time to manually browse the news content and then extract the summary, and the labor cost is too high.

[0004] Therefore, those skilled in the art need to provide a method and device for extracting abstracts, which can automatically extract abstracts of news and other manuscripts.

Contents of the invention

[0005] In order to solve the technical problem in the prior art th...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F16/34
Inventor: 王磊张明亮张旭麦涛徐超
Owner: NEUSOFT CORP