Text classification method, terminal equipment and computer readable storage medium

A text classification technology, applied in the fields of text database clustering/classification, computer components and computing, that improves classification accuracy.

Active Publication Date: 2020-06-19
SHENZHEN GRADUATE SCHOOL TSINGHUA UNIV

AI Technical Summary

Problems solved by technology

[0005] To solve the existing problems, the present invention provides a text classification method, a terminal device and a computer-readable storage medium.

Examples

Embodiment Construction

[0025] To make the technical problems to be solved, the technical solutions and the beneficial effects of the embodiments of the present invention clearer, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described here only explain the present invention and do not limit it.

[0026] It should be noted that when an element is referred to as being "fixed" or "disposed on" another element, it may be directly or indirectly on the other element. When an element is referred to as being "connected to" another element, it may be directly or indirectly connected to the other element. In addition, a connection may serve both a fixing function and a circuit communication function.

[0027] It is to be understood that the terms "length", "width", "top", "bottom", "front"...

Abstract

The invention provides a text classification method, a terminal device and a computer-readable storage medium. The method comprises: adding a fully connected layer on top of a first model and training the parameters of the fully connected layer on a supervised data set to obtain a fine-tuned pre-trained model; obtaining, through a text augmentation algorithm applied to the supervised data set, an enhanced data set with doubled data volume, and classifying the enhanced data set with the fine-tuned pre-trained model to obtain a first data distribution of the enhanced data set; training a second model on the supervised data set and classifying the enhanced data set with the second model to obtain a second data distribution of the enhanced data set, wherein the first model has more than ten times as many parameters as the second model; and further training the second model by maximizing the similarity between the first data distribution and the second data distribution, then storing all parameters for classifying the text to be classified. Accuracy is thereby improved while the speed of the second model remains unchanged.
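The final training step, in which the small second model is pulled toward the class distribution produced by the large fine-tuned model, can be illustrated with a minimal sketch. The abstract does not name the similarity measure, so the use of KL divergence here is an assumption, as are the function names and the toy logits; minimizing this loss corresponds to maximizing the similarity between the two distributions.

```python
import math


def softmax(logits):
    """Convert raw class scores into a probability distribution."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q); zero when the two distributions are identical."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))


def distillation_loss(first_model_logits, second_model_logits):
    """Mean KL divergence between the first (large) model's distribution
    and the second (small) model's distribution over a batch.

    Assumption: KL divergence stands in for the unspecified
    similarity measure; lower loss means higher similarity.
    """
    losses = [
        kl_divergence(softmax(t), softmax(s))
        for t, s in zip(first_model_logits, second_model_logits)
    ]
    return sum(losses) / len(losses)


# Hypothetical logits for two augmented samples and three classes.
first_logits = [[2.0, 0.5, -1.0], [0.1, 1.5, 0.3]]
untrained_second = [[0.0, 0.0, 0.0], [0.0, 0.0, 0.0]]
trained_second = [[1.9, 0.6, -0.9], [0.2, 1.4, 0.3]]

loss_before = distillation_loss(first_logits, untrained_second)
loss_after = distillation_loss(first_logits, trained_second)
```

In this toy batch `loss_after` is smaller than `loss_before`, because the second model's outputs have moved closer to the first model's distribution. In a real setup the loss would be minimized by gradient descent over the second model's parameters.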

Description

Technical field

[0001] The present invention relates to the technical field of natural language processing, in particular to a text classification method, a terminal device and a computer-readable storage medium.

Background

[0002] As an important task in Chinese natural language processing, Chinese text classification is being researched and applied more and more widely. At present, Chinese text classification methods include methods based on traditional machine learning, methods based on deep learning, and some methods that combine traditional machine learning with deep learning. One of the simpler text classification models is the algorithm based on the bag-of-words model. The bag-of-words model can be traced back to Zellig Harris's 1954 article (Distributional Structure. Word, 1954, 10(2/3): 146–62). If the bag-of-words model is used to represent the text, the sentences "I like playin...
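As a minimal sketch of the bag-of-words representation mentioned in the background, the snippet below counts word occurrences over a shared vocabulary. The example sentences, the function names and the whitespace tokenization are all assumptions made for illustration; Chinese text would first need a word segmentation step.

```python
from collections import Counter


def build_vocabulary(documents):
    """Collect every distinct token across the documents, sorted for a stable order."""
    return sorted({token for doc in documents for token in doc.lower().split()})


def bag_of_words(document, vocabulary):
    """Represent a document as a vector of token counts over the vocabulary."""
    counts = Counter(document.lower().split())
    return [counts[word] for word in vocabulary]


# Hypothetical sentences: same words, different order.
documents = ["the cat chased the dog", "the dog chased the cat"]
vocabulary = build_vocabulary(documents)
vectors = [bag_of_words(doc, vocabulary) for doc in documents]
```

Both sentences map to the same vector because the representation keeps only word counts and discards word order, which is the main limitation of this simple model.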

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/35, G06K9/62
CPC: G06F16/35, G06F18/214, G06F18/241
Inventors: 李晨辉, 于苗苗, 袁博
Owner SHENZHEN GRADUATE SCHOOL TSINGHUA UNIV