Method and device for automatic generation of multi-document abstract of industrial safety topics

A technology for automatically generating multi-document abstracts on industrial safety topics, applied in natural language data processing, special data processing applications, instruments, etc. It addresses problems such as redundancy and repetition in generated summaries, reducing redundancy and improving readability and correctness.

Inactive Publication Date: 2017-11-24
BEIHANG UNIV

AI Technical Summary

Problems solved by technology

A summary generated by the existing method must select entire sentences from the original text, which can lead to repeated and redundant information between sentences.

Embodiment Construction

[0043] In order to understand the characteristics and technical contents of the embodiments of the present invention in more detail, the implementation of the embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings. The attached drawings are only for reference and description, and are not intended to limit the embodiments of the present invention.

[0044] The following is an explanation of the key terms involved in the embodiments of the present invention:

[0045] Submodular function: for a set function f(), if f(A+e)-f(A)≥f(B+e)-f(B) holds for every set A that is a subset of B and every element e not in B, then f() is a submodular function. In general, a submodular function exhibits diminishing marginal returns: the increment contributed by a single element decreases as the base set under consideration grows.
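The inequality in [0045] can be checked by brute force on a small ground set. The following is a minimal sketch, not taken from the patent: the `coverage` function, the `COVERS` data, and the counter-example `f_square` are illustrative assumptions, chosen because coverage functions are a well-known submodular family.

```python
from itertools import combinations

def coverage(selected, covers):
    """Number of distinct concepts covered by the selected elements."""
    covered = set()
    for element in selected:
        covered |= covers[element]
    return len(covered)

def is_submodular(f, ground):
    """Brute-force test of f(A+e)-f(A) >= f(B+e)-f(B) for all A ⊆ B, e ∉ B."""
    ground = list(ground)
    subsets = [frozenset(c) for r in range(len(ground) + 1)
               for c in combinations(ground, r)]
    for b in subsets:
        for a in subsets:
            if not a <= b:  # only pairs with A ⊆ B are relevant
                continue
            for e in ground:
                if e in b:
                    continue
                gain_a = f(a | {e}) - f(a)
                gain_b = f(b | {e}) - f(b)
                if gain_a < gain_b:
                    return False
    return True

# Hypothetical data: each phrase id maps to the concepts it mentions.
COVERS = {"p1": {"gas", "leak"}, "p2": {"leak", "alarm"}, "p3": {"alarm"}}

def f_cov(s):
    return coverage(s, COVERS)

def f_square(s):
    return len(s) ** 2  # marginal gains grow with the set, so not submodular

print(is_submodular(f_cov, COVERS))     # coverage passes the check
print(is_submodular(f_square, COVERS))  # the squared-size function fails it
```

In this toy data, adding `p3` to the empty set covers one new concept, while adding it to `{p2}` covers none, which is the diminishing marginal effect the definition describes.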

[0046] The technical solution of the embodiment of the present inv...

Abstract

The invention discloses a method and device for automatically generating a multi-document abstract on industrial safety topics. The method includes the following steps: input keywords are acquired, multiple documents corresponding to the keywords are retrieved, and the documents form a document set; for each document in the document set, its sentences are decomposed into grammatical components to obtain multiple phrases, attribute information is set for each phrase, and the phrases form a phrase set; the phrase set is input into a submodular function, and the submodular function is optimized; a target phrase subset, which is a subset of the phrase set, is determined according to the optimization result; multiple sentences are formed from the phrases of the target phrase subset, and the priority of each sentence is determined according to the attribute information of the phrases it contains; the sentences are then spliced in order of priority to form the abstract.
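The selection and splicing steps of the abstract can be sketched roughly as follows. This is an illustration under stated assumptions, not the patented implementation: the excerpt does not give the actual submodular objective, the phrase attributes, or the priority rule, so the concept-coverage objective, the word budget `max_words`, the `Phrase` fields (`concepts`, `group`, `order`), and the earliest-position priority are all hypothetical stand-ins. The greedy gain-per-cost rule is a common heuristic for budgeted submodular maximization.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Phrase:
    text: str
    concepts: frozenset  # hypothetical attribute: concepts the phrase mentions
    group: int           # hypothetical attribute: output sentence it belongs to
    order: int           # hypothetical attribute: position in the source text

def objective(selected):
    """Assumed objective: number of distinct concepts covered (submodular)."""
    covered = set()
    for p in selected:
        covered |= p.concepts
    return len(covered)

def greedy_select(phrases, max_words):
    """Repeatedly add the phrase with the best marginal gain per word."""
    selected, used = [], 0
    remaining = list(phrases)
    while remaining:
        base = objective(selected)
        best, best_ratio = None, 0.0
        for p in remaining:
            cost = len(p.text.split())
            if used + cost > max_words:
                continue  # phrase does not fit in the remaining budget
            ratio = (objective(selected + [p]) - base) / cost
            if ratio > best_ratio:
                best, best_ratio = p, ratio
        if best is None:  # nothing fits, or nothing adds new information
            break
        selected.append(best)
        used += len(best.text.split())
        remaining.remove(best)
    return selected

def assemble(selected):
    """Group phrases into sentences, then splice sentences by priority."""
    groups = {}
    for p in selected:
        groups.setdefault(p.group, []).append(p)
    # Assumed priority: the sentence whose earliest phrase appears first wins.
    ordered = sorted(groups.values(), key=lambda ps: min(p.order for p in ps))
    sentences = []
    for ps in ordered:
        words = " ".join(p.text for p in sorted(ps, key=lambda p: p.order))
        sentences.append(words[0].upper() + words[1:] + ".")
    return " ".join(sentences)

# Hypothetical phrase set; the third phrase repeats the first one's content.
PHRASES = [
    Phrase("a gas leak", frozenset({"gas", "leak"}), 0, 0),
    Phrase("triggered the alarm", frozenset({"alarm"}), 0, 1),
    Phrase("the leak of gas", frozenset({"gas", "leak"}), 1, 2),
    Phrase("workers were evacuated", frozenset({"evacuation"}), 1, 3),
]

summary = assemble(greedy_select(PHRASES, max_words=10))
print(summary)
```

Because the redundant phrase adds no new concepts once "a gas leak" is selected, its marginal gain is zero and the greedy step never picks it, which is how phrase-level selection avoids the sentence-level repetition noted above.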

Description

Technical field

[0001] The present invention relates to document summarization technology, and in particular to a method and device for automatically generating multi-document summaries of industrial safety topics based on a submodular optimization method.

Background

[0002] With the rapid development of information technology and the advent of the mobile network era, the amount of data on the Internet has grown explosively in the past few years. On the one hand, this vast amount of data makes it possible for people to obtain almost any information; on the other hand, the sheer volume of content makes it harder to locate the information that is really needed, even with the help of search engines. When people use a search engine to look up a topic, a large number of related web pages are returned, and it is difficult to gain a comprehensive understanding of the topic in a short time. Theref...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/22, G06F17/27
CPC: G06F40/131, G06F40/253, G06F40/289, G06F40/30
Inventor: 李博, 冯岩, 陈汉腾, 符式定, 李建欣
Owner: BEIHANG UNIV