Supercharge Your Innovation With Domain-Expert AI Agents!

Short text classification method and device

A classification method and short text technology, applied in the field of data processing, can solve the problems of difficult implementation, occupying a lot of computing resources, and long training period, and achieve the effect of reducing the difficulty of implementation, reducing the amount of calculation, and solving the long training period.

Active Publication Date: 2019-02-26
TENCENT TECH (SHENZHEN) CO LTD
View PDF5 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005]In order to solve the problems of long training period, difficult implementation and high computing resource usage of classification algorithms based on feature analysis such as artificial neural networks to classify short texts, The embodiment of the present application provides a short text classification method and device

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Short text classification method and device
  • Short text classification method and device
  • Short text classification method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] In order to make the purpose, technical solution and advantages of the present application clearer, the implementation manners of the present application will be further described in detail below in conjunction with the accompanying drawings.

[0037] The tree structure is a structure that organizes data elements according to the branch relationship. It is usually used to describe objects with hierarchical relationships. The data elements in the tree structure become nodes. Among them, the topmost node of the tree structure is called the root node. The bottom node in each branch is called a leaf node, and the rest of the nodes except the leaf node are called intermediate nodes (including the root node). Each intermediate node contains at least one child node, and the leaf node does not contain child nodes. Child nodes is the next layer node belonging to the intermediate node (the next layer node may be an intermediate node or a leaf node).

[0038] A binary tree is a sp...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a short text classification method and device, belonging to the technical field of data processing. The method comprises the following steps: obtaining short text to be classified; According to the short text traversing Huffman tree, determining the target main classification matching the short text. Wherein The Huffman tree consists of m layers of nodes, with each corresponding to a primary classification of the standard classification sample. According to the standard classification samples, the n subordinate classifications corresponding to the target master classification are determined. Acquiring short text samples corresponding to each slave classification; Calculating similarity between short text and each short text sample; According to the calculated similarities, the target classification of short text is determined. The present application solves the problems of long training period, great difficulty in implementation and more computational resourcesoccupied when the classification algorithm based on the feature analysis classifies short texts, and achieves the effects of removing the training process, reducing the difficulty in implementation and reducing the amount of computation.

Description

technical field [0001] The present application relates to the technical field of data processing, in particular to a short text classification method and device. Background technique [0002] A short text usually refers to a text consisting of 1 to 3 words, such as the name of a hospital department, the department name of a government agency, the department name of a school, etc. [0003] For classification of short texts, classification algorithms based on feature analysis, such as artificial neural networks, are mainly used in related technologies. The artificial neural network needs to be trained before it is used. The training process is as follows: Obtain a large number of training samples of known categories, input each training sample into the artificial neural network, and the artificial neural network extracts features from the training samples. According to the extracted features Predict the probability of being classified into the category of the training sample,...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/35G06F17/27G06K9/62G06N3/04G06N3/08
CPCG06N3/084G06F40/289G06N3/045G06F18/241
Inventor 阮航
Owner TENCENT TECH (SHENZHEN) CO LTD
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More