
Method for extracting problems and methods from scientific and technical text based on a minimalist summary strategy

A text summarization technique applied to the extraction of problems and methods from scientific and technical text based on a minimalist summary strategy. It addresses the high cost of acquiring training data, the resulting limits on model performance, and the difficulty of distinguishing the correspondence between problems and methods, and it enables large-scale automatic processing of scientific and technical text.

Status: Inactive; Publication Date: 2021-03-12
WUHAN UNIV
5 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

Problem and method extraction under the "manually labeled corpus + machine learning algorithm" paradigm relies on a large-scale, high-quality labeled corpus, and the high cost of acquiring training data severely constrains model performance.
In addition, for scientific and technical texts that involve multiple problems and methods, existing approaches have difficulty distinguishing which method corresponds to which problem.



Examples


Detailed Description of the Embodiments

[0022] The technical solutions in the embodiments of the present invention are described clearly and completely below. Obviously, the described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by persons of ordinary skill in the art on the basis of these embodiments, without creative effort, fall within the protection scope of the present invention.

[0023] It should be noted that, in the case of no conflict, the embodiments of the present invention and the features in the embodiments can be combined with each other.

[0024] The present invention will be further described below in conjunction with specific examples, but not as a limitation of the present invention.

[0025] This embodiment adopts the minimalist summary strategy to extract problems and methods from scientific and technical texts, applies the neural network...



Abstract

The invention relates to computer technology, and in particular to a method for extracting problems and methods from scientific and technical text based on a minimalist summary strategy. The method comprises the following steps: obtaining a data set of scientific and technical documents; preprocessing the unstructured text to obtain training corpus labels; using a BERT pre-trained model to produce vector representations of the preprocessed text; constructing a deep neural network with a seq2seq architecture, using Transformer models as the encoder and decoder, to generate a minimalist summary whose content and style are constrained; and applying part-of-speech analysis and syntactic analysis algorithms to extract the problem and method words from the generated summary. The method combines data crawling, natural language processing, and deep learning. It enables large-scale automatic processing of scientific and technical text and extracts from that text problem words and method words together with their correspondence.
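
The final step, extracting a problem and its corresponding method from a summary whose content and style are constrained, can be illustrated with a minimal sketch. The source does not specify the summary template or the parsing algorithms, so everything named here is an assumption: the two sentence templates, the `extract_pair` function, and the `ProblemMethodPair` type are hypothetical, and regular expressions stand in for the part-of-speech and syntactic analysis described in the abstract.

```python
import re
from typing import NamedTuple, Optional


class ProblemMethodPair(NamedTuple):
    """A problem phrase and the method phrase that corresponds to it."""
    problem: str
    method: str


# Hypothetical fixed-style templates for a generated summary (assumption:
# the patent does not state the actual template). Regexes are a stand-in
# for part-of-speech and syntactic analysis.
_PATTERNS = [
    # Active voice: "We use <method> to solve <problem>."
    re.compile(
        r"^(?:we\s+)?(?:use|apply|propose)\s+(?P<method>.+?)\s+to\s+"
        r"(?:solve|address|tackle)\s+(?P<problem>.+?)\.?$",
        re.IGNORECASE,
    ),
    # Passive voice: "<problem> is addressed by <method>."
    re.compile(
        r"^(?P<problem>.+?)\s+is\s+(?:solved|addressed|tackled)\s+"
        r"(?:by|with|using)\s+(?P<method>.+?)\.?$",
        re.IGNORECASE,
    ),
]


def _strip_article(phrase: str) -> str:
    """Drop a leading English article so only the core phrase remains."""
    return re.sub(r"^(?:the|a|an)\s+", "", phrase.strip(), flags=re.IGNORECASE)


def extract_pair(summary: str) -> Optional[ProblemMethodPair]:
    """Return the (problem, method) pair from a one-sentence summary,
    or None when the sentence matches neither template."""
    text = summary.strip()
    for pattern in _PATTERNS:
        match = pattern.match(text)
        if match:
            return ProblemMethodPair(
                problem=_strip_article(match.group("problem")),
                method=_strip_article(match.group("method")),
            )
    return None


if __name__ == "__main__":
    print(extract_pair(
        "We use a BERT-based sequence labeler to solve named entity recognition."
    ))
```

Because each summary sentence ties one method to one problem, the pair is recovered together, which is the point of constraining the summary style. A real implementation would replace the regular expressions with a tagger and a dependency parser.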

Description

Technical field

[0001] The invention belongs to the field of computer technology, and in particular relates to a method for extracting problems and methods from scientific and technical text based on a minimalist summary strategy.

Background

[0002] The growing number of accessible digital book resources has made it increasingly difficult to retrieve information accurately and acquire knowledge quickly. To facilitate document indexing and knowledge acquisition, the existing symbol system has established a wide range of classification and indexing frameworks that improve retrieval efficiency. However, a retrieval strategy that treats the document as the unit of granularity cannot meet readers' needs for fine-grained, targeted knowledge acquisition. Studies have shown that researchers' information-seeking behavior is often driven by goals and tasks, and that they pay more attention to specific content such as the problems, methods, or results in the literature. Theref...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/31, G06F16/34, G06F40/211, G06F40/268, G06F40/289
CPC: G06F16/313, G06F16/345, G06F40/211, G06F40/268, G06F40/289
Inventor: 陆伟, 李鹏程, 张国标, 程齐凯
Owner: WUHAN UNIV