Method for automatically generating unsupervised science and technology intelligence abstract based on multi-sentence compression

An automatic generation, unsupervised technology, applied in natural language data processing, unstructured text data retrieval, instruments, etc., can solve the problems of high data timeliness and authority, difficulty in generating reports, manual collection and screening, etc. The effect of improving performance, improving relevance, and optimizing efficiency

Pending Publication Date: 2022-07-05
BEIJING INSTITUTE OF TECHNOLOGYGY
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The purpose of the present invention is to solve the technical problems of manual collection, screening and report generation difficulties in the field of scientific and technological information, and creatively propose an automatic scientific and technological information summary generation method that runs through data collection, data screening and intelligence generation
Th...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for automatically generating unsupervised science and technology intelligence abstract based on multi-sentence compression
  • Method for automatically generating unsupervised science and technology intelligence abstract based on multi-sentence compression
  • Method for automatically generating unsupervised science and technology intelligence abstract based on multi-sentence compression

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0121] This embodiment describes a specific embodiment of the method of the present invention.

[0122] The implementation diagram is as follows figure 1 The overall process is shown. The invention provides a complete process from text data acquisition, data processing, and abstract text generation in the process of generating scientific and technological information abstracts. During the specific implementation of the present invention, firstly, the theme crawler module starts to work, obtains the data required for analysis according to the keyword library provided by the user, then the text information value evaluation module analyzes and sorts the obtained data, and finally uses the sorting result as The input of the summary generation module is brought into the model to get the final result.

[0123] First, according to the keywords provided by the user, the topic crawler module is used to obtain data in Google Scholar, DARPA, IARPA, and RAND Think Tank. figure 2 It is...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to an unsupervised scientific and technological intelligence abstract automatic generation method based on multi-sentence compression, and belongs to the technical field of natural language generation. Aiming at multi-document text generation in the field of science and technology intelligence, firstly, source data are acquired based on a topic crawler of an LDA topic similarity word library extension method; and sorting all text paragraphs through a text information value evaluation model of three indexes of authority, timeliness and content correlation of the text information. And selecting a paragraph with a higher score as an original text for generating the final science and technology intelligence. Finally, an unsupervised multi-document abstract method based on spectral clustering and multi-sentence compression is adopted, and a science and technology intelligence abstract is automatically generated. According to the method, the problem that in the data screening process, scientific and technological information generation has high requirements for data timeliness and authority is effectively solved, and the problem that a traditional multi-document generation method based on a neural network cannot be applied due to lack of a data set in the field of scientific and technological information is effectively solved.

Description

technical field [0001] The invention relates to an automatic generation method of unsupervised scientific and technological information abstracts, in particular to an automatic generation of unsupervised scientific and technological information abstracts based on multi-sentence compression, and belongs to the technical field of natural language generation. Background technique [0002] Science and technology intelligence work has played a key role in the formulation of the country's huge science and technology strategy, the deployment of huge science and technology plans, and economic and social development, and contributed to the development of society, economy, and science and technology. It is a national science and technology plan deployment and economic and social development. key components of key functions. [0003] In the field of scientific and technological intelligence, in the face of the big data environment, the use of manual collection, sorting and screening of...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/34G06F16/35G06F40/30G06F40/216G06F40/211G06K9/62
CPCG06F16/345G06F16/35G06F40/30G06F40/216G06F40/211G06F18/2155G06F18/23213
Inventor 张隽驰张华平商建云
Owner BEIJING INSTITUTE OF TECHNOLOGYGY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products