Unlock instant, AI-driven research and patent intelligence for your innovation.

Classification method of multi-label scientific papers based on hierarchical discriminant tree

A classification method and multi-label technology, applied in the field of classification of multi-label scientific research papers, can solve problems such as heavy workload, difficult classification of scientific research papers, and high work difficulty, and achieve the effects of less error, improved readiness, and improved accuracy

Active Publication Date: 2022-06-21
NAT COMP NETWORK & INFORMATION SECURITY MANAGEMENT CENT
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the multi-layer and complex structure of the existing category label system brings difficulties to the classification of scientific research papers. Reasonably and comprehensively form its classification labels in the system, the workload is heavy and the work is difficult

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Classification method of multi-label scientific papers based on hierarchical discriminant tree
  • Classification method of multi-label scientific papers based on hierarchical discriminant tree
  • Classification method of multi-label scientific papers based on hierarchical discriminant tree

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0040] The present invention will be further described in detail below with reference to the accompanying drawings, so that those skilled in the art can implement it with reference to the description.

[0041] like figure 1 As shown, the present invention provides a method for classifying multi-label scientific research papers based on a hierarchical discriminant tree, including:

[0042] Step 1. Build a binary discriminant model:

[0043] Obtain the labels of all papers and papers whose labels are known in the multi-level label system, use text segmentation technology to obtain the text representations of all papers, and obtain the feature word set of each label from the text representation, and each label is associated with the label. The corresponding relationship of the feature word set is constructed to form a binary discriminant model; the discriminant model adopts traditional data mining methods, such as support vector product, naive Bayes, logistic regression, etc., t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for classifying multi-label scientific research papers based on a hierarchical discriminant tree, comprising: step 1, obtaining papers and tags with known tags, extracting a set of characteristic words of the tags, and constructing a binary discriminant model; step 2, combining The label is updated to a binary discriminant model to obtain a hierarchical discriminant tree model; Step 3: Obtain the text representation of a paper with an unknown label, input it into all binary discriminant models of the root node in the hierarchical discriminant tree model, and calculate the probability of having the label corresponding to the node, If it is greater than the threshold, output the label corresponding to the root node; input it to all binary discriminant models of the child nodes of the node corresponding to the label, calculate the probability of having the label represented by the node, if it is greater than the threshold, output the child node corresponding to The label of the paper is judged step by step until the leaf node; all the output labels are the labels of the paper. The invention has the beneficial effects of fully mining the characteristic words of the papers and quickly and accurately classifying the papers hierarchically.

Description

technical field [0001] The invention relates to the field of classification of scientific research papers. More specifically, the present invention relates to a classification method for multi-label scientific research papers based on a hierarchical discriminant tree. Background technique [0002] The organization and management of scientific research papers have always been the focus of publishing institutions, scientific research institutions, and scientific research workers. In the field of organization and management of scientific papers, the classification of scientific papers is an important basic task. This task is to classify scientific papers into hierarchical labels according to the existing category labeling system, which is of great significance for the rapid retrieval, induction and summary of scientific papers. On the one hand, the classification of scientific papers can help publishers to quickly locate the categories of the latest scientific papers, and add...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/35G06F16/33
CPCG06F16/35G06F16/3347
Inventor 刘玮吴俊杰李超左源纪玉春袁石
Owner NAT COMP NETWORK & INFORMATION SECURITY MANAGEMENT CENT
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More