Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Classification method of patent texts in security field

A classification method and patented technology, applied in the security field, can solve problems such as time-consuming, gradient disappearance, etc., to save storage space and improve retrieval efficiency.

Inactive Publication Date: 2018-12-18
SHANGHAI INST OF TECH
View PDF1 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

For example, convolutional neural network performs text classification. When constructing text semantics, it is necessary to manually set a window to capture context information, and the size of the window has an important impact on the classification results, and it takes a lot of time during the training process; cyclic neural network When performing text classification, because the recurrent neural network has a deep memory of the last input signal and a shallow memory of the early input signal, this will lead to the problem of "gradient disappearance"

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Classification method of patent texts in security field
  • Classification method of patent texts in security field
  • Classification method of patent texts in security field

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] The present invention provides a method for classifying patent texts in the security field, including:

[0044] Step S1, during the text preprocessing process, adding the frequently appearing words in the patent text to the stop vocabulary list;

[0045] Step S2, introducing a pre-trained Word2Vec model;

[0046] Step S3, by training the LSTM classification model, extracting text features, classifying patent texts in the security field, and obtaining classification results;

[0047] Step S4, using the accuracy rate and the ROC curve evaluation model to evaluate the classification result.

[0048] Here, for patent text classification, traditional methods such as convolutional neural network for text classification need to manually set a window to capture context information when constructing text semantics, and the window size has an important impact on the classification results. It takes a long time; although the recurrent neural network (RNN) can complete the task o...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a classification method of patent texts in the security field. The invention provides a classification method of patent texts in the security field. The method comprises the steps of: 1, adding words and expressions occurring frequently into a using-stop word list in the text processing process to save the storage space and improve retrieval efficiency; 2. introducing the pre-trained Word2Vec model to solve the dimension disaster problem caused by the traditional method; 3, extracting text features by train a Long short Term memory (LSTM) classification model to classifypatent texts in that security field; 4, evaluating the classification result by using an accuracy rate and a ROC curve evaluation model. Experiments show that this method can classify the patent texts in the field of security, train and test 50,000 patent texts, and the accuracy of the test set reaches 93.48%.

Description

technical field [0001] The invention relates to a method for classifying patent texts in the security field. Background technique [0002] With the rapid development of information technology and knowledge economy, the number of patent applications in my country is increasing day by day. As an intangible asset, patent has huge commercial and research value, and has become an important indicator to measure the comprehensive strength of various countries. How to obtain cutting-edge and innovative achievements from patent texts, transform them into products, and realize industrialization has become the focus of research by experts and scholars. As a basic work, patent text classification plays an important role in patent retrieval, patent mining, and strategic decision-making. Therefore, patent text classification has very important research significance and research value. At present, there are few researches on patents in the field of security. Since patents in each field ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06K9/62
CPCG06F18/214
Inventor 肖立中王广仲刘源夏坤
Owner SHANGHAI INST OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products